Posts: 106 | Thanked: 205 times | Joined on Dec 2015 @ Spain
#228
Originally Posted by spidernik84
Hello Ferlanero, thanks for all the efforts. I was working as well on the Italian dictionary since I was unaware of this thread. Eric, the author of okboard, pointed me here! Good to know so we don't duplicate the efforts.

I had other issues that prevented me from generating the dict, possibly related to memory errors.

Is it possible for you to get more verbosity by running bash in debug mode? I run the script like this:
bash -x db/build.sh it

Thanks.

BTW: what corpus are you using? I found a bunch of them, the most complete being PAISA http://www.corpusitaliano.it/it/cont...scription.html

But also this http://www.corpora.heliohost.org/download.html
Hi spidernik84. First of all, thank you very much for adding support for OKBoard.

For the Italian language I'm using these corpus files:
http://corpora2.informatik.uni-leipzig.de/download.html
http://opus.lingfil.uu.se/OpenSubtitles2016.php

They are perfect for colloquial language.

I already have the whole file ready to process, so if you are interested in using it to generate the predictive keyboard for Italian, I can send you the download link. And if you are working on Italian, I can focus on other languages. Do you agree?
 
