Linux - NewbieThis Linux forum is for members that are new to Linux.
Just starting out and have a question?
If it is not in the man pages or the how-to's this is the place!
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Introduction to Linux - A Hands on Guide
This guide was created as an overview of the Linux Operating System, geared toward new users as an exploration tour and getting started guide, with exercises at the end of each chapter.
For more advanced trainees it can be a desktop reference, and a collection of the base knowledge needed to proceed with system and network administration. This book contains many real life examples derived from the author's experience as a Linux system and network administrator, trainer and consultant. They hope these examples will help you to get a better understanding of the Linux system and that you feel encouraged to try out things on your own.
Click Here to receive this Complete Guide absolutely free.
I have a PC that I wish to retire as my windows machine (I have to have one for work) I thought it would be really useful to me to make this PC a dedicated document management PC i.e. for scanning, optically recognising the documents and then turning them int searchable PDFs.
This is something I do now on WinXP I have been messing with a few live-CDs (completely new to linux) to see whihc ones I like the feel of.
I have tried the following so far
OpenSUSE - runs very slow
Ubuntu - Seems quite nice
Kubuntu - Failed to load
PuppyLinux - Too Cluttered and confusing
I will give others a go but to reduce the time I spend trying to get a system working fully I thought it would be a good idea to post my Spec and ask for suggestions on both best distro for my hardware and software to accomplish my document management needs.
Pentium 4 2.8GHZ
Graphics AGPx4 512MB dedicated memory
Internet connection currently via LAN but may switch to WiFi but that shouldn't be an issue as I will do that post set up and specifically seek compatible hardware.
If you need any other info please let me know
And thanks very much for your help
Scanner is a Plustek PS256 Smartoffice ADF Scanner.
Yeah the scanner does seem to be a sticking point, also finding there is little in the way of decent document management and OCR software available on LINUX which is another major issue for me.
I'm a one man business and was interested in setting up a "mirror" PC to do everything I do for my work on my windows box now to see if there would be a feasible point to switch over, but it is looking very unlikely to happen any time in the next few years I think.
Debian can easily be installed in a minimal configuration without a lot of extra junk you don't need on a server, and it runs well on slightly older spec hardware since it lacks a lot of the Bloat found in some distros.
gscan2pdf gscan2pdf - attaches metadata to PDF upon creation to allow indexing for searches.
OCR (Optical Character Recognition)
The gocr or tesseract utilities are used to produce text from an image.
There is an OCR output buffer for each page and is embedded both as an annotation (pop-up note) and as plain text behind the scanned image in the PDF produced. This way, Beagle can index (i.e. search) the plain text, and the contents of the annotations can be viewed in Acrobat Reader.
In DjVu files, the OCR output buffer is embedded in the hidden text layer. Thus these can also be indexed by Beagle.
There is an interesting review of OCR software at http://groundstate.ca/ocr. An important conclusion was that 400dpi is necessary for decent results.
Let us know how you put this all together and what software you choose.
Thanks for the advice it is very much appreciated.
I think my best course of action is to build a much better windows box as previously planned.
Tune this machine in to a LInux box in the corner of the room still not sure which distro and whether a GNOME or KDE environment is better for my needs.
Keep doing my scanning etc on the windows box while I get a good book on LINUX and learn as much about my distro as possible and take a trial and error approach to getting the machine to do what I want. (Might take months but i like learning new things)
Not sure if I should as this on another thread but, is it true i only need a firewall on a linux machine and no virus software or is this just an old wives tale?