Active Topics

 


Reply
Thread Tools
Posts: 58 | Thanked: 10 times | Joined on Dec 2009
#1
Hi,

can anyone tell me if N900, or better to say Maemo5 has a built in speech recognition system? Available by some API?

I know I can port some linux projects, but that's a paint.
I hope it has something built in.
If not can someone tell me which projects are among the best in this field, and is there any easy way to port them to Maemo?

Note: I'm looking for a speech-to-text STT, not the other way around (TTS)

Thanx in advace
 
Posts: 654 | Thanked: 664 times | Joined on Feb 2009 @ Germany
#2
Nothing build in. Sorry.
 
Posts: 58 | Thanked: 10 times | Joined on Dec 2009
#3
Originally Posted by conny View Post
Nothing build in. Sorry.
That's a shame...
The more I look into this system the more disappointed I get.

Even 3310 had a speech recognition algorithm for speed dialing. I really can't see, why after 10 (or more) years and a few huge technological steps, they don't include these basic features in their top of the line phones.

So, anyway do you have any suggestions for porting linux libraries to Memo? Which ones are the best by performance? And are there any already ported?
 
Posts: 13 | Thanked: 5 times | Joined on Dec 2009 @ Europe
#4
did some research on ocr and tts-engines. while google provides us with a perfect open-source-ocr (did set it up in easx-debian) i couldn't find a state-of-the-art speech-recognition-system.

the following open-source-project was my starting-point:
http://www.simon-listens.org/index.php?id=122&L=1

it uses a japanese continuous-stt-engine, which seems to be equal/better than most commercial ones.

however it is difficult to get all the necessary sounds/statistical data. though there are open-source-projects which are collecting these.

please keep us up-to-date if you have something working on desktop or N900. i use windows vista's speech-recognition within a virtual machine in ubuntu (just out of interest for stt).

i never really used any stt-system on my desktop/mobile for productive purposes. even on my phone long-pressing a key was far superiour. but thats just my personal experience..

Last edited by tattergreis; 2010-01-25 at 18:27.
 
Posts: 58 | Thanked: 10 times | Joined on Dec 2009
#5
tts (text-to-speech) is not such a big problem
there are some web services that can do it pretty good
like: Google translate (has English pronunciation now), Ispeech, loquendo engine ...

speech-to-text is my problem
i can't exactly tell why aren't there any online services that can do this, except because of potential slow upload of audio files to server
obviously i have to implement it on the client, which is in this case N900 and Maemo
 
Posts: 13 | Thanked: 5 times | Joined on Dec 2009 @ Europe
#6
sorry i meant stt (quite confused as i am typing this while bringinhg my kids to bed) have a look at simon listens. there are some impressive youtube-videos for this speech-recognition-system.

Last edited by tattergreis; 2010-01-25 at 18:26.
 
Posts: 50 | Thanked: 56 times | Joined on Oct 2009 @ Oviedo, Florida
#7
See Carnegie Mellon's PocketSphinx. (Some assembly required...)
 
Posts: 13 | Thanked: 4 times | Joined on Feb 2008 @ Beijing, China
#8
Check gnome-voice-control of GNOME, I've ported to maemo 4, and it work NOT so good just like gnome-voice-control, further more it's CPU consuming, from my point, it can only be regarded as a prototype SW.
 
Reply


 
Forum Jump


All times are GMT. The time now is 11:55.