maemo.org - Talk (https://talk.maemo.org/index.php)

[Announcement] Open source text prediction input plugin (https://talk.maemo.org/showthread.php?t=100266)

rinigus 2018-12-12 10:55

Re: [Announcement]Open source text prediction input plugin
 
Quote:

Originally Posted by FlyingAntero (Post 1551690)
I can try to find that kind of list or make it myself. Should the list also include every conjugation of a specific word? Finnish words have dozens of conjugation forms. Here are a few examples:
Word: run = juosta
  • I run = Minä juoksen
  • You run = Sinä juokset
  • He/she runs = Hän juoksee
Word: box = laatikko
  • The color of a box = Laatikon väri
  • Look at that box = Katso tuota laatikkoa
  • The cat went inside the box = Kissa meni laatikkoon

Assuming that ljo will do the filtering, I don't know what the preference is. Most probably filters based on the SQL LIKE operator should be OK (http://www.sqlitetutorial.net/sqlite-like/). But please wait for confirmation...
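For illustration, here is a minimal sketch of what a LIKE-based filter could look like with Python's built-in sqlite3 module. The table name `unigrams`, its columns, and the helper `delete_matching` are assumptions made up for this example and are not taken from the actual prediction database. The harmless word "laatikko" from the quoted post stands in for an entry on the real list.

```python
import sqlite3


def delete_matching(conn, patterns):
    """Delete rows whose word matches any LIKE pattern; return rows removed."""
    removed = 0
    for pattern in patterns:
        # Parameter binding keeps the pattern out of the SQL string itself.
        cur = conn.execute("DELETE FROM unigrams WHERE word LIKE ?", (pattern,))
        removed += cur.rowcount
    conn.commit()
    return removed


# Hypothetical schema: one row per word with its corpus count.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE unigrams (word TEXT PRIMARY KEY, count INTEGER)")
conn.executemany(
    "INSERT INTO unigrams VALUES (?, ?)",
    [("laatikko", 40), ("laatikon", 12), ("laatikkoa", 9),
     ("laatikkoon", 7), ("kissa", 55), ("väri", 21)],
)

# One stem pattern covers several inflected forms at once.
removed = delete_matching(conn, ["laatik%"])
remaining = [row[0] for row in conn.execute("SELECT word FROM unigrams ORDER BY word")]
```

A single stem pattern keeps the list short, but it can also match unrelated words that share the stem, so the patterns would need to be reviewed before being applied to a real database.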

FlyingAntero 2018-12-17 08:24

Re: [Announcement]Open source text prediction input plugin
 
I made a list of profane words. It is just a simple CSV file, and every conjugation form is listed as a separate word. I also uploaded the National Library's Finnish journal n-grams (1820-2000) to the cloud. If someone is willing to help, you can find the link below.
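As a rough sketch of how a list in that form could be used, the snippet below loads words from CSV text into a set and drops every n-gram that contains one of them. The function names, the n-gram layout (a tuple of tokens plus a count), and the sample data are assumptions for this example only. Placeholder words from earlier in the thread stand in for the real list.

```python
import csv
import io


def load_blocklist(csv_text):
    """Return the set of lower-cased words found in any cell of the CSV text."""
    reader = csv.reader(io.StringIO(csv_text))
    return {cell.strip().lower() for row in reader for cell in row if cell.strip()}


def filter_ngrams(ngrams, blocklist):
    """Keep only n-grams in which no token is on the blocklist."""
    return [
        (tokens, count)
        for tokens, count in ngrams
        if not any(token.lower() in blocklist for token in tokens)
    ]


# Every inflected form is its own entry, so exact matching is enough.
blocklist = load_blocklist("laatikko\nlaatikon\nlaatikkoa\nlaatikkoon\n")

ngrams = [
    (("kissa", "meni", "laatikkoon"), 12),
    (("katso", "tuota", "kissaa"), 7),
    (("Laatikon", "väri"), 3),
]
kept = filter_ngrams(ngrams, blocklist)
```

Exact matching against a full list of forms avoids the over-matching risk of pattern filters, at the cost of a longer list.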

rinigus 2018-12-18 21:10

Re: [Announcement]Open source text prediction input plugin
 
Quote:

Originally Posted by FlyingAntero (Post 1551807)
I made a list of profane words. It is just a simple CSV file, and every conjugation form is listed as a separate word. I also uploaded the National Library's Finnish journal n-grams (1820-2000) to the cloud. If someone is willing to help, you can find the link below.

I can look into it, probably next week, if ljo does not beat me to it.

ljo 2018-12-19 18:06

Re: [Announcement]Open source text prediction input plugin
 
Quote:

Originally Posted by rinigus (Post 1551862)
I can look into it, probably next week, if ljo does not beat me to it.

I have been super busy for the last week, but I will take a look after tomorrow.

taixzo 2019-01-15 21:00

Re: [Announcement]Open source text prediction input plugin
 
It seems that libpresage has been removed from OpenRepos. Was this intentional? If so, how do I install the plugin?

ljo 2019-01-15 21:44

Re: [Announcement]Open source text prediction input plugin
 
Quote:

Originally Posted by taixzo (Post 1552797)
... how do I install the plugin?

You install one of the localized keyboards from sailfish_keyboard. All dependencies should be brought in.

rinigus 2020-03-15 13:04

Re: [Announcement]Open source text prediction input plugin
 
I have added the first version of the German Presage predictive keyboard. It was made together with @matzgewinn, who found the corpus and processed it. The corpus is relatively small (300 MB), but let's hope it works. A Hunspell dictionary was added as well.

As usual, just install https://openrepos.net/content/sailfi...ext-prediction and everything else should be pulled in. The easiest way is to install it, enable it in settings, and reboot.

Corresponding issue: https://github.com/sailfish-keyboard/presage/issues/26

My German is non-existent, so it's up to the users to test and improve it.

taixzo 2020-03-16 15:07

Re: [Announcement]Open source text prediction input plugin
 
I've noticed that it seems to have issues predicting words with apostrophes (e.g. it predicts "aren" instead of "aren't"). Is there a way to fix this?

rinigus 2020-03-16 15:57

Re: [Announcement]Open source text prediction input plugin
 
Quote:

Originally Posted by taixzo (Post 1566091)
I've noticed that it seems to have issues predicting words with apostrophes (e.g. it predicts "aren" instead of "aren't"). Is there a way to fix this?

It's probably an issue with the tokenizer. I'm not sure where exactly, as the prediction databases seem to have "aren't" in them, so it would require some investigation. Unfortunately, I don't have time for it, so someone else would have to take a look if it is going to be fixed.
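To illustrate the suspected kind of problem, here is a small sketch of how a tokenizer that treats the apostrophe as a separator ends up with "aren" rather than "aren't". This is not Presage's actual tokenizer. Both functions and their regular expressions are assumptions written only to show the effect.

```python
import re


def tokenize_splitting_apostrophes(text):
    """Treat every non-letter, including the apostrophe, as a separator."""
    return re.findall(r"[A-Za-z]+", text)


def tokenize_keeping_apostrophes(text):
    """Allow apostrophes inside a word, so contractions stay in one piece."""
    return re.findall(r"[A-Za-z]+(?:'[A-Za-z]+)*", text)


sample = "They aren't here"
split_tokens = tokenize_splitting_apostrophes(sample)
kept_tokens = tokenize_keeping_apostrophes(sample)
```

With the first function the contraction breaks into "aren" and "t", so a lookup for the typed prefix would never reach the "aren't" entry even if the database contains it. With the second function the word survives intact.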

