I am a data guru and have been hoarding information since the ripe old age of 7.
I would like to see if I can use computers to help me as much as possible, especially since I am now undertaking my master's degree, and soon a Ph.D.
I want to be able to scan articles, papers and even whole books into digital form, then process them with OCR software. I want the result to go into my database as an intact file. I am OK with attaching keywords to a document, but would like a (pretty) fast search engine that can go through the full text of every stored document and find the words I am looking for as they appear in the actual document.
I also need to be able to store webpages, JPEGs, sound and even video. I am OK with adding keywords to these by hand (not having the software scan them), but would still like the same search engine to include them when searching. I anticipate that within five years this database will be over 20 GB and will continue to grow. It will be a part of my brain.
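To make the idea concrete, here is a rough Python sketch of the behaviour I am after. It is only a toy word-to-document lookup table held in memory, not a real solution. The class name `DocumentStore`, its methods and the file paths are all made up for illustration, and the OCR step is assumed to have already produced plain text.

```python
import re
from collections import defaultdict


class DocumentStore:
    """Toy index: maps each word to the set of documents that contain it."""

    def __init__(self):
        self.index = defaultdict(set)  # word -> {doc_id, ...}
        self.files = {}                # doc_id -> path of the intact stored file

    @staticmethod
    def _words(text):
        # Lowercase and split on anything that is not a letter or digit.
        return re.findall(r"[a-z0-9]+", text.lower())

    def add(self, doc_id, path, text="", keywords=()):
        """Register a file. `text` is the OCR output (may be empty for
        images, sound or video); `keywords` are added by hand."""
        self.files[doc_id] = path
        for word in self._words(text):
            self.index[word].add(doc_id)
        for keyword in keywords:
            for word in self._words(keyword):
                self.index[word].add(doc_id)

    def search(self, query):
        """Return the ids of documents containing every word in the query."""
        words = self._words(query)
        if not words:
            return []
        hits = set.intersection(*(self.index.get(w, set()) for w in words))
        return sorted(hits)


store = DocumentStore()
# A scanned paper: the full OCR text is searchable.
store.add("paper1", "scans/paper1.pdf", text="Neural networks learn representations")
# A sound file: only the hand-entered keywords are searchable.
store.add("talk1", "media/talk1.mp3", keywords=["neural", "lecture"])

print(store.search("neural"))
print(store.search("neural networks"))
```

The point is that one search covers both kinds of item: the first query should find the paper and the sound file, the second only the paper. A real system would also need to keep the index on disk and cope with 20 GB or more of files.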
First of all, can someone give me some more specific key words to help me describe what I am looking for? Most databases that I have run across are very structured, in that the user has to define fields and insert the data into the field before it can be looked up.
Second, can anyone suggest some Linux software that will do this? Right now I only have a dual 650 MHz P3 to play with, and I am thinking of getting some 10K rpm SCSI or SATA hard drives to put in it. I will have enough money to upgrade in about a year if I need to. BTW, it will have a single user.
Thanks in advance for any help you can provide.