I only ever installed it from the Ubuntu repositories, and it 'just worked'.
To use the command line tool, you need to convert input images into TIFF format first (or "MDR", whatever that is). I used the ImageMagick convert program to do this, e.g. using a page of text from the Distributed Proofreaders project:
Code:
wget http://www.pgdp.net/projects/projectID47d3b81d1228b/005.png
convert 005.png 005.tiff
tesseract 005.tiff 005
This produces the file 005.txt containing the OCR'd text.
I don't know how easy it would be to use it from within a program, rather than through the command line tool.
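One way to drive it from a program without any library interface is to run the command line tool as a subprocess. Here is a rough Python sketch of that idea. It assumes the tesseract binary is on your PATH and that it writes its result to the output base name plus ".txt", as in the commands above. The function names tesseract_command and ocr_image are made up for illustration and are not part of tesseract.

```python
import shutil
import subprocess
from pathlib import Path


def tesseract_command(image_path, output_base):
    """Build the same command line as above: tesseract <image> <output base>."""
    return ["tesseract", str(image_path), str(output_base)]


def ocr_image(image_path):
    """Run the tesseract command line tool on an image and return the text.

    Assumes tesseract is installed and writes <output base>.txt.
    """
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract was not found on PATH")

    image_path = Path(image_path)
    # Strip the extension to get the output base, e.g. 005.tiff -> 005
    output_base = image_path.with_suffix("")

    # check=True raises CalledProcessError if tesseract exits with an error
    subprocess.run(tesseract_command(image_path, output_base), check=True)

    # tesseract appends ".txt" to the output base itself
    return Path(f"{output_base}.txt").read_text(encoding="utf-8")
```

With the converted file from above, ocr_image("005.tiff") should run the same tesseract command and hand back the contents of 005.txt as a string. This only wraps the command line program, so it is no faster or more flexible than running it by hand.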
Like most OCR programs, it's not perfect, but it's pretty good compared to other free OCR software.