LinuxQuestions.org
Latest LQ Deal: Latest LQ Deals
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 07-09-2009, 09:56 AM   #1
zerobane
Member
 
Registered: Jan 2006
Posts: 47

Rep: Reputation: 16
Search engine for Linux?


Hello,

Looking for a complete search engine for linux to crawl and index a company intranet...

Company is mostly microsquish products, so it would need to parse msword, excel, pdf...

Any recommendations?

Has google desktop search wiped the linux world of search projects?

So far i've tried

htidg (looks like a zombie'd project)

easy to setup
searching in minutes

LFS was a pain to figure out
project is dead
Database is wiped out and fully re-indexed; no partial indexing
search fuzzy algorithms are er special...
spider takes forever
parsers lock up on excel files; memory leak somewhere (ate up 10 gigs of ram on a 2 megabyte file)
IF the spider / pareser locks up you lose an entire nights work of indexing...

Nutch

Appears to have support for parsing word documents / excel; cannot get to work

meant to be more of an api?
ugh, tomcat and sun java
hard to get working; still cannot receive search results
documentation is bit hard to dig up
 
Old 07-10-2009, 07:39 PM   #2
XavierP
Moderator
 
Registered: Nov 2002
Location: Kent, England
Distribution: Debian Testing
Posts: 19,192
Blog Entries: 4

Rep: Reputation: 475Reputation: 475Reputation: 475Reputation: 475Reputation: 475
What about Google? They provide Google for intranets.
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Google has Linux only Search Engine CouchMaster Linux - General 2 06-05-2005 10:07 AM
linux desktop search engine mohd_rish Programming 1 04-04-2005 12:36 PM
LinuxBazis :: Linux links base (search engine) BTamas Linux - General 1 05-06-2003 04:40 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 08:48 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration