Linux - HardwareThis forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
I have tested several software to use the OCR with my HP printer. Unfortunately the software that comes with it is only available for Mac OS and Windows. As I said I installed several software without success.
In my search I found that the Tesseract is better OCR application for Linux.
However I found two problems:
He does not have a GUI, a graphical interface, but it is possible to be done by commands, which is very boring when you want to scan several pages.
The results were very unsatisfactory, at least in my language "Portuguese", in a text with 1000 words he recognized only two or three which is very little, i text with several letters, magazines, books and folders.
So, i want help to install and use some OCR on Linux.
I usually use ocrad and it actually produces decent results at least for English, but you should probably apply some image filters, and maybe use something like unpaper as well before you run ocrad on it, or the output will not be as good. The better the input image, the better the OCR translation.
It's true I haven't seen a truly professional OCR for Linux, but try some out, maybe there is one out there that you might find acceptable.
Last edited by H_TeXMeX_H; 01-27-2010 at 01:38 PM.
I have tested several software to use the OCR with my HP printer. Unfortunately the software that comes with it is only available for Mac OS and Windows. As I said I installed several software without success.
In my search I found that the Tesseract is better OCR application for Linux.
However I found two problems:
He does not have a GUI, a graphical interface, but it is possible to be done by commands, which is very boring when you want to scan several pages.
The results were very unsatisfactory, at least in my language "Portuguese", in a text with 1000 words he recognized only two or three which is very little, i text with several letters, magazines, books and folders.
So, i want help to install and use some OCR on Linux.
You can install GOCR, as it has a GUI, but Tesseract is much more accurate, providing you use it correctly. A quick Google search turns up:
which has examples, instructions, and basic shell scripts to 'automate' OCR of a bunch of pages. Note that if you don't load the right language (German, English, etc.), accuracy is always going to be bad.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.