LinuxQuestions.org
Visit Jeremy's Blog.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 09-13-2020, 03:22 PM   #1
JZL240I-U
Senior Member
 
Registered: Apr 2003
Location: Germany
Distribution: openSuSE Tumbleweed-KDE, Mint 21, MX-21, Manjaro
Posts: 4,629

Rep: Reputation: Disabled
gscan2pdf + tesseract error message not helping / applicable


This is on tumbleweed. Error after scanning a page is this:

Code:
[DS] Profile read from file (tesseract_opencl_profile_devices.dat).
[DS] Device[1] 0:(null) score is 0.358755
[DS] Selected Device[1]: "(null)" (Native)
Error opening data file /usr/share/tessdata/[DS] Device[1] 0:(null) score is 0.358755.traineddata
Please make sure the TESSDATA_PREFIX environment variable is set to your "tessdata" directory.
Failed loading language '[DS] Device[1] 0:(null) score is 0.358755'
Tesseract couldn't load any languages!
Could not initialize tesseract.
I did
Code:
export TESSDATA_PREFIX=/usr/share/tessdata
which is where all the language files reside. I even downloaded the newest eng.traineddata, all to no avail. There is a ton of complaints in the net with exactly this error, but none had solutions other than the two I already tried. Anyone here with ideas?

P.S.: The scanner is a brother MFC-L2710DW 4 in one.

Last edited by JZL240I-U; 09-13-2020 at 03:23 PM.
 
Old 09-14-2020, 08:47 AM   #2
business_kid
LQ Guru
 
Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 16,281

Rep: Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322
I had my best success without gscan2pdf
  • tesseract file.jpg >> something.txt
  • Open txt in word processor, correct & format
  • Export to pdf if you want to treble the size.

Tesseract sucks imo. But the options (gscan2pdf, gocr) suck much worse. Get tesseract 4.0+. If you have one or two projects where tesseract fails and you need the best ocr, Abbyy (A proprietary program) did a linux version with a one month free trial. It's probably the best option performance wise (only).
 
Old 09-14-2020, 10:58 AM   #3
JZL240I-U
Senior Member
 
Registered: Apr 2003
Location: Germany
Distribution: openSuSE Tumbleweed-KDE, Mint 21, MX-21, Manjaro
Posts: 4,629

Original Poster
Rep: Reputation: Disabled
I have already tesseract 4.1.1 (AFAIR the last minor digit alright). Tesseract itself can't be too bad, Google uses it for its book scanning thingy. The allure of tesseract integrated into gscan2pdf is that one can (ahmm, could, in my case) convert a scan on the fly. And I hate it, when software throws a spoke into my wheels. It should work, darn it all (including misleading error messages since years and years).

Last edited by JZL240I-U; 09-14-2020 at 10:59 AM.
 
Old 09-14-2020, 12:37 PM   #4
business_kid
LQ Guru
 
Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 16,281

Rep: Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322
Don't try anything fancy, I'd advise. And don't expect it to work, except 1 page at a time.
As for the scanner, scan big. 600 dpi or bigger.I wrote a script to give it one page at a time.

I had one project recently, a play set in the 1950s in rural Ireland. There were hand edits over pale typewritten pages. For me, the word processor stage was essential. The error messages age a bit like hieroglyphics - you figure them, then forget.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Upgraded to 14.04 lts and no longer finds my Gscan2pdf scanner FlyerDan69 Linux - Hardware 5 08-18-2014 03:04 PM
LXer: gscan2pdf 1.1.2 Brings Various Improvements LXer Syndicated Linux News 0 02-12-2013 10:20 PM
gscan2pdf won't multi-page scan doxieman40228 Linux - Software 0 09-14-2011 12:30 AM
(error) fsck: operation not applicable to FSType nfs hebeles Solaris / OpenSolaris 1 12-07-2010 04:13 AM
LXer: gscan2pdf - Scan multiple Documents, import images to PDF & DjVu LXer Syndicated Linux News 0 08-28-2008 07:41 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 10:33 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration