The Following 5 Users Say Thank You to mautz For This Useful Post: | ||
|
2017-07-02
, 19:51
|
Posts: 959 |
Thanked: 3,427 times |
Joined on Apr 2012
|
#352
|
The Following 3 Users Say Thank You to taixzo For This Useful Post: | ||
|
2017-07-03
, 06:44
|
|
Posts: 654 |
Thanked: 2,368 times |
Joined on Jul 2014
@ UK
|
#353
|
|
2017-07-04
, 02:29
|
Posts: 959 |
Thanked: 3,427 times |
Joined on Apr 2012
|
#354
|
The Following 3 Users Say Thank You to taixzo For This Useful Post: | ||
|
2017-08-18
, 13:09
|
Posts: 959 |
Thanked: 3,427 times |
Joined on Apr 2012
|
#355
|
The Following 2 Users Say Thank You to taixzo For This Useful Post: | ||
|
2018-07-04
, 07:35
|
Posts: 1,414 |
Thanked: 7,547 times |
Joined on Aug 2016
@ Estonia
|
#357
|
how many words should the input dictionary have? and has anyone of you tried doing this for Hungarian?
The Following 3 Users Say Thank You to rinigus For This Useful Post: | ||
|
2018-07-04
, 09:25
|
Posts: 58 |
Thanked: 65 times |
Joined on Oct 2009
@ Finland
|
#358
|
The Following 2 Users Say Thank You to mattiviljanen For This Useful Post: | ||
|
2018-07-04
, 11:33
|
Posts: 1,414 |
Thanked: 7,547 times |
Joined on Aug 2016
@ Estonia
|
#359
|
First, huge thanks for @eber42 for making OKboard!
I'm working on a Finnish dictionary, but it's really hard to find quality corpuses. I'm currently experimenting with Wikipedia-based and news based, but it seems I need bigger and better corpora... Does anyone know any good sources?
I did manage to get the thing to build (by cruely skipping the very last step that causes the build to fail and suggesting a bigger corpora - I wanted a proof of concept, won't be skipping the test in release version) but there are problems. I cut the original word list in half, but I'm still getting kinda huge (12MB...30MB) fi.tre file, predict-fi.db is 26kB and predict-fi.ng is 813kB. In comparison the English en.tre is below two megabytes... As a result, the delay between the gesture and the word appearing is...very noticable to be modest. What would be a good size to aim at?
Thanks all!
The Following 6 Users Say Thank You to rinigus For This Useful Post: | ||
|
2018-11-09
, 16:56
|
Posts: 105 |
Thanked: 205 times |
Joined on Dec 2015
@ Spain
|
#360
|
Tags |
bettertxtentry, huntnpeck sucks, okboard, sailfish, swype |
|
EDIT: Ok, i ran into some problems, my old corpora files seems to include some incompatible expressions and i can not get the clean_corpus.py script working. Any ideas on that problem?
EDIT2: Solved by lbzip2 -d < corpus.txt.bz2 | ./clean_corpus.py | lbzip2 > new_corpus.txt.bz2
EDIT3: Update published on openrepos.
Last edited by mautz; 2017-03-08 at 11:24.