Maemo5 speech recognition - maemo.org

Active Topics

Camera phone competition March 2023 "Endless" (35)
to General by Maemish - 3 days, 1 hr ago
TLS 1.2 for N9 is ready (28)
to MeeGo / Harmattan by smartblu9 - 3 days, 2 hrs ago
Maemo repository (3)
to Community by torsen - 3 days, 18 hrs ago
Error after installing Leste on sdcard (11)
to Maemo 7 / Leste by Maemish - 4 days, 15 hrs ago
Introducing ubiboot N9 (multiboot OS loader) (1,393)
to Alternatives by smatkovi - 5 days, 22 hrs ago
more...

Thread Tools

zolakt	2010-01-25 , 16:19
Posts: 58 \| Thanked: 10 times \| Joined on Dec 2009	#1

Hi,

can anyone tell me if N900, or better to say Maemo5 has a built in speech recognition system? Available by some API?

I know I can port some linux projects, but that's a paint.
I hope it has something built in.
If not can someone tell me which projects are among the best in this field, and is there any easy way to port them to Maemo?

Note: I'm looking for a speech-to-text STT, not the other way around (TTS)

Thanx in advace

Quote & Reply |

conny	2010-01-25 , 16:54
Posts: 654 \| Thanked: 664 times \| Joined on Feb 2009 @ Germany	#2

Nothing build in. Sorry.

Quote & Reply |

zolakt	2010-01-25 , 17:53
Posts: 58 \| Thanked: 10 times \| Joined on Dec 2009	#3

Originally Posted by conny

Nothing build in. Sorry.

That's a shame...
The more I look into this system the more disappointed I get.

Even 3310 had a speech recognition algorithm for speed dialing. I really can't see, why after 10 (or more) years and a few huge technological steps, they don't include these basic features in their top of the line phones.

So, anyway do you have any suggestions for porting linux libraries to Memo? Which ones are the best by performance? And are there any already ported?

Quote & Reply |

tattergreis	2010-01-25 , 17:53
Posts: 13 \| Thanked: 5 times \| Joined on Dec 2009 @ Europe	#4

did some research on ocr and tts-engines. while google provides us with a perfect open-source-ocr (did set it up in easx-debian) i couldn't find a state-of-the-art speech-recognition-system.

the following open-source-project was my starting-point:
http://www.simon-listens.org/index.php?id=122&L=1

it uses a japanese continuous-stt-engine, which seems to be equal/better than most commercial ones.

however it is difficult to get all the necessary sounds/statistical data. though there are open-source-projects which are collecting these.

please keep us up-to-date if you have something working on desktop or N900. i use windows vista's speech-recognition within a virtual machine in ubuntu (just out of interest for stt).

i never really used any stt-system on my desktop/mobile for productive purposes. even on my phone long-pressing a key was far superiour. but thats just my personal experience..

Last edited by tattergreis; 2010-01-25 at 18:27.

Quote & Reply |

zolakt	2010-01-25 , 18:01
Posts: 58 \| Thanked: 10 times \| Joined on Dec 2009	#5

tts (text-to-speech) is not such a big problem
there are some web services that can do it pretty good
like: Google translate (has English pronunciation now), Ispeech, loquendo engine ...

speech-to-text is my problem
i can't exactly tell why aren't there any online services that can do this, except because of potential slow upload of audio files to server
obviously i have to implement it on the client, which is in this case N900 and Maemo

Quote & Reply |

tattergreis	2010-01-25 , 18:04
Posts: 13 \| Thanked: 5 times \| Joined on Dec 2009 @ Europe	#6

sorry i meant stt (quite confused as i am typing this while bringinhg my kids to bed) have a look at simon listens. there are some impressive youtube-videos for this speech-recognition-system.

Last edited by tattergreis; 2010-01-25 at 18:26.

Quote & Reply |

dba	2010-01-26 , 00:57
Posts: 50 \| Thanked: 56 times \| Joined on Oct 2009 @ Oviedo, Florida	#7

See Carnegie Mellon's PocketSphinx. (Some assembly required...)

Quote & Reply |

pawpawyoung	2010-01-26 , 02:44
Posts: 13 \| Thanked: 4 times \| Joined on Feb 2008 @ Beijing, China	#8

Check gnome-voice-control of GNOME, I've ported to maemo 4, and it work NOT so good just like gnome-voice-control, further more it's CPU consuming, from my point, it can only be regarded as a prototype SW.

Quote & Reply |

« Previous Thread | Next Thread »

Forum Jump

All times are GMT. The time now is 11:55.

Active Topics

Camera phone competition March 2023 "Endless" (35)

TLS 1.2 for N9 is ready (28)

Maemo repository (3)

Error after installing Leste on sdcard (11)

Introducing ubiboot N9 (multiboot OS loader) (1,393)