Web Hosting Talk







View Full Version : PHP search script needed


netfreak
09-27-2006, 08:23 AM
I'm in need of a PHP search script that will index a collection of approximately 150,000 files (mostly text, some PDF) into a mysql database. prefer one that will read a portion of the file into the index for better searching (for PDF too if possible, not but required). I've already tried looking around the free script places and i'm quite sure that nothing free will work for this size of collection. Any recommendations on commercial scripts that can handle this?

sasha
09-27-2006, 10:09 AM
That does not sound all that big. I worked on php/mysql application indexing initially over 10 millions of pdf files and adding one million pdfs monthly (I used non-free vividata for OCR - width speed and quality it beats all OS alternatives). That should not have to be huge project but without more information it is hard to offer good advice.

dgeorge
09-27-2006, 10:18 AM
Might want to check this out?

http://cs.ioc.ee/~ando/sphider/

netfreak
09-27-2006, 11:26 AM
That does not sound all that big. I worked on php/mysql application indexing initially over 10 millions of pdf files and adding one million pdfs monthly (I used non-free vividata for OCR - width speed and quality it beats all OS alternatives). That should not have to be huge project but without more information it is hard to offer good advice.

Ah, i'm just going by the poor results of some random free scripts I tried from a few sources. A lot of them only claimed to work with indexes up to 10,000 entries or so, and would crash while trying to index.

Thanks for the link dgeorge. I'll give that one a shot.

mikey1090
09-28-2006, 02:55 AM
have you checked hotscripts.com?

netfreak
09-28-2006, 04:43 AM
Yeah that was the first place I checked. I've tried a few different ones from there over the years without a lot of luck.

The script posted by dgeorge seems to be working pretty good for me so far.

azn0
09-29-2006, 11:13 AM
Thanks for that link, i lost that script ages ago i have been looking for it for ages.