LinuxQuestions.org
Review your favorite Linux distribution.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 09-15-2010, 12:32 PM   #1
Alexvader
Member
 
Registered: Oct 2009
Location: Japan
Distribution: Arch, Debian, Slackware
Posts: 994

Rep: Reputation: 94
OCR from PDF to ascii...


Hi

I have this technical report...

http://www.calculix.de/auslegung.pdf

It pertains the use of an FOSS finite element package do size an

arcraft engine, well, sort of... this is a scale engine for model

aircraft.

I would like to convert this to english, but the PDF has been

scanned... so no text layer inside...

What can I do to have the text converetd to Ascii, so as to translate

it in a translator ?

BRGDS

Alex
 
Old 09-15-2010, 12:54 PM   #2
TB0ne
LQ Guru
 
Registered: Jul 2003
Location: Birmingham, Alabama
Distribution: SuSE, RedHat, Slack,CentOS
Posts: 26,634

Rep: Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965
Quote:
Originally Posted by Alexvader View Post
Hi

I have this technical report...
http://www.calculix.de/auslegung.pdf

It pertains the use of an FOSS finite element package do size an
arcraft engine, well, sort of... this is a scale engine for model
aircraft.

I would like to convert this to english, but the PDF has been
scanned... so no text layer inside...
What can I do to have the text converetd to Ascii, so as to translate
it in a translator ?
There are several OCR programs available for Linux. A recent benchmark was done between several of them here:
http://www.splitbrain.org/blog/2010-...are_comparison

Grab them, and find out which one works best for you. Getting the image scanned is the first step, which you've already done.
 
1 members found this post helpful.
Old 09-15-2010, 01:02 PM   #3
Alexvader
Member
 
Registered: Oct 2009
Location: Japan
Distribution: Arch, Debian, Slackware
Posts: 994

Original Poster
Rep: Reputation: 94
Hi TB0ne

Thx..

BTW do you know of any FOSS translator, German-English...?

My sister has Systran in some windows mobo, but she lives quite far from me, and besides Systran is not FOSS...
 
Old 09-15-2010, 01:10 PM   #4
John VV
LQ Muse
 
Registered: Aug 2005
Location: A2 area Mi.
Posts: 17,624

Rep: Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651
i just normally use "google/translate "
http://translate.google.com/
 
1 members found this post helpful.
Old 09-15-2010, 01:14 PM   #5
Alexvader
Member
 
Registered: Oct 2009
Location: Japan
Distribution: Arch, Debian, Slackware
Posts: 994

Original Poster
Rep: Reputation: 94
Hi John_VV

Google translate is kewl, but it only translates tiny ammounts of text...

I am talking a 70s pages document here...,

I bet Google translate cannot hold a single page...

I was refering to sum program...

Last edited by Alexvader; 09-15-2010 at 01:31 PM.
 
Old 09-15-2010, 01:40 PM   #6
H_TeXMeX_H
LQ Guru
 
Registered: Oct 2005
Location: $RANDOM
Distribution: slackware64
Posts: 12,928
Blog Entries: 2

Rep: Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301
For translating, it may be harder to get a good program, but here are some:
http://freshmeat.net/search/?q=machi...&Go.x=0&Go.y=0
http://logos-os.dfki.de/
 
1 members found this post helpful.
Old 09-30-2010, 03:54 PM   #7
ilnli
Member
 
Registered: Jul 2004
Location: Pakistan
Distribution: Slackware 10.0, SUSE 9.1, RH 7, 7.3, 8, 9, FC2
Posts: 413

Rep: Reputation: 32
Quote:
Originally Posted by Alexvader View Post
Hi

arcraft engine, well, sort of... this is a scale engine for model

aircraft.

I would like to convert this to english, but the PDF has been

scanned... so no text layer inside...
Hi,

You can use online pdf to text converter at http://www.ocrconvert.com, it lets you convert 5 files at same time and its very quick.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
OCR Pedroski Linux - Software 5 02-06-2010 11:56 PM
OCR abdoh Linux - Newbie 3 06-27-2009 11:41 PM
ocr John Master Linux - Software 7 06-12-2005 05:56 PM
Ocr apffal Linux - Software 1 06-12-2005 05:01 AM
OCR initialization failed accessing OCR device: PROC-26 cheeku Linux - Software 0 09-19-2004 08:36 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 07:27 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration