Posts: 39 | Thanked: 7 times | Joined on Dec 2009 @ Stockholm
#1
Hi,

I would like to see OCR support on the N900. I used it a lot on my SE P1i. Being able to take a picture of a business card, or to grab an article, recipe, timetable or price list, and convert it to text is just so powerful and useful!

What do you say?
 
Apoc's Avatar
Posts: 73 | Thanked: 79 times | Joined on May 2009 @ Virginia
#2
+1, with support for tables, so I could easily grab my work schedule and stick it on my server for my other lazy co-workers.
 
Posts: 39 | Thanked: 7 times | Joined on Dec 2009 @ Stockholm
#3
Let's not get too hasty here... I don't know if any OCR exists that can read tables, especially if we are talking open source. There is, though, one open source program that could be of help here:
http://en.wikipedia.org/wiki/Tesseract_(software)

In this case it's mostly just the GUI that's missing, and of course JPG support.

Last edited by illemann; 2010-02-14 at 09:08.
 
Bec's Avatar
Posts: 876 | Thanked: 396 times | Joined on Dec 2009
#4
Here's a brainstorm for it, some progress has been made: http://maemo.org/community/brainstorm/view/ocr_for_n900
 
RevdKathy's Avatar
Posts: 2,173 | Thanked: 2,678 times | Joined on Oct 2009 @ Cornwall, UK
#5
I would imagine this should be possible: this app is doing something similar and then taking it a step further, I would assume.
__________________
Hi! I'm Kathy and I'm a Maemo Greeter! Welcome.
Useful links for newcomers: New members say hello , New users start here, Community subforum, Beginners' wiki page, Maemo5 101, Frequently Asked Questions (FAQ)
Did you know Meego.com has forums too?
 

The Following User Says Thank You to RevdKathy For This Useful Post:
dwould's Avatar
Posts: 529 | Thanked: 262 times | Joined on Dec 2008 @ Eastleigh, Hampshire, UK
#6
So I was playing today and I compiled tesseract for the N900. I also compiled ImageMagick so I could convert from JPG to TIF for tesseract.

Provided your picture is black text on a white background, it seems to work pretty well.

I'm wondering how hard it would be to write a 'sharing' plugin for the photo manager, which would just call a script to process the image through convert and tesseract and spit out the text.

If I get the time I'll play more. It might be cool if someone with a setup for packaging would consider uploading tesseract and ImageMagick to extras-devel...
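
For anyone wanting to try the convert-then-tesseract script idea, here is a rough sketch of what such a script could look like. It is not the script from this post: the function names, the /tmp working directory and the use of Python 3 are all assumptions, and it only relies on ImageMagick's `convert` and the plain `tesseract <image> <outputbase>` invocation, which writes `<outputbase>.txt`.

```python
import subprocess
from pathlib import Path


def build_commands(jpg_path, work_dir):
    """Return (tif_path, txt_path, commands) for one photo.

    Hypothetical helper: the names and file layout are illustrative only.
    """
    jpg = Path(jpg_path)
    work = Path(work_dir)
    tif = work / (jpg.stem + ".tif")
    out_base = work / jpg.stem            # tesseract appends ".txt" itself
    txt = work / (jpg.stem + ".txt")
    commands = [
        # Grayscale, uncompressed TIFF: older tesseract builds are picky
        # about the TIFF flavour they accept.
        ["convert", str(jpg), "-colorspace", "Gray",
         "-compress", "None", str(tif)],
        ["tesseract", str(tif), str(out_base)],
    ]
    return tif, txt, commands


def ocr_photo(jpg_path, work_dir="/tmp"):
    """Run both tools and return the recognised text."""
    tif, txt, commands = build_commands(jpg_path, work_dir)
    for cmd in commands:
        # check=True raises CalledProcessError if a tool exits non-zero
        subprocess.run(cmd, check=True)
    return txt.read_text()
```

Calling `ocr_photo("/path/to/photo.jpg")` needs both tools on the PATH. A sharing plugin would presumably just hand the selected image path to something like this.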
__________________
----------
N900
http://danielwould.wordpress.com
Check out Witter, a twitter client for N900
http://danielwould.wordpress.com/witter

If Witter isn't working for you, eg crashes/doesn't start, gives errors etc etc. Please run it from x-term using:
run-standalone.sh python2.5 /opt/witter/witter.py

This will generate diagnostic output. Without this I cannot help you.
 

The Following User Says Thank You to dwould For This Useful Post:
Posts: 1,096 | Thanked: 760 times | Joined on Dec 2008
#7
I was going to do the same thing, but have not got around to it yet.
I also use unpaper when automatically processing scans on our server.

It is a small program, but it helps a good bit with tesseract OCR.

I tie together imagemagick -> unpaper -> tesseract with pretty great results, and insert the raw text output into a database as metadata for searching docs on our server.

Works pretty well.
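
A minimal sketch of how an imagemagick -> unpaper -> tesseract chain with a searchable text store could be wired up. This is not the setup described above: the database is not named in the post, so SQLite is an assumption, as are the table layout, the function names and the intermediate file names. It assumes an unpaper build that reads and writes PNM files (`unpaper <input> <output>`).

```python
import sqlite3
import subprocess
from pathlib import Path


def pipeline_commands(scan_path, work_dir):
    """Build the four commands for one scan; returns (commands, txt_path)."""
    scan = Path(scan_path)
    work = Path(work_dir)
    raw = work / (scan.stem + ".pgm")          # grayscale PNM for unpaper
    clean = work / (scan.stem + "-clean.pgm")  # unpaper output
    tif = work / (scan.stem + "-clean.tif")    # TIFF for tesseract
    out_base = work / (scan.stem + "-clean")   # tesseract appends ".txt"
    commands = [
        ["convert", str(scan), "-colorspace", "Gray", str(raw)],
        ["unpaper", str(raw), str(clean)],
        ["convert", str(clean), "-compress", "None", str(tif)],
        ["tesseract", str(tif), str(out_base)],
    ]
    return commands, work / (scan.stem + "-clean.txt")


def open_index(db_path=":memory:"):
    """Open (or create) the hypothetical docs table."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS docs (name TEXT PRIMARY KEY, body TEXT)")
    return conn


def store_text(conn, name, text):
    """Save the raw OCR text as searchable metadata for one document."""
    conn.execute(
        "INSERT OR REPLACE INTO docs (name, body) VALUES (?, ?)", (name, text))
    conn.commit()


def search(conn, term):
    """Return names of documents whose OCR text contains term."""
    rows = conn.execute(
        "SELECT name FROM docs WHERE body LIKE ? ORDER BY name",
        ("%" + term + "%",))
    return [row[0] for row in rows]


def index_scan(conn, scan_path, work_dir="/tmp"):
    """Run the whole chain on one scan and store the resulting text."""
    commands, txt = pipeline_commands(scan_path, work_dir)
    for cmd in commands:
        subprocess.run(cmd, check=True)  # raises if any tool fails
    store_text(conn, Path(scan_path).name, txt.read_text())
```

The plain `LIKE` match keeps the sketch short; a real document server would likely want proper full-text indexing instead.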
 

The Following User Says Thank You to quipper8 For This Useful Post:
dwould's Avatar
Posts: 529 | Thanked: 262 times | Joined on Dec 2008 @ Eastleigh, Hampshire, UK
#8
Thanks for the tip. I will check it out today.
 
dwould's Avatar
Posts: 529 | Thanked: 262 times | Joined on Dec 2008 @ Eastleigh, Hampshire, UK
#9
Sadly, unpaper compiles fine but segfaults when run under the ARMEL target. It appears to be fine under the X86 target. I think I'm probably at the limits of my understanding when it comes to figuring out what it doesn't like.

Does anyone who understands C want to take a look?
 