Website PDF Search function

Soldato
Joined
19 Oct 2002
Posts
8,075
Location
Swindoniantown
Hi Guys,

I've been asked to investigate how to put a .pdf document search facility onto our company website.

What we want to do is have a 'Secure Company Logon' area (which is the easy part!) with a search box, so that we here in the UK and our US office can search our drawing register for specific drawings. We'd also like it so that when part of a drawing number is entered, it brings up a selection of matching drawings to choose from.

e.g. if the drawing you want is 25345D4.pdf but you only know the 25345 part, the search will bring up a page with links to 25345D1, 25345D2, 25345D3, 25345D4, 25345D5 etc., so you can then pick from that list rather than from a massive list of drawings!

Any help would be great. I've looked on t'interweb and found some options, but they're not perfect, so I thought I'd ask the h'experts on here too!

What sort of things would I need? Scripts, programs, etc.? How easy would it be to implement?

Cheers Doods.
Sim :)
 
If you are just searching on the file name and not the contents of the PDFs, it is easy.

Personally, I would use a regular expression to create the search string, and then search the directory. If there are many files, though, it might be worth storing the file names in a database, but that would require you to keep the database in sync as files are moved, removed and added.

The language you use is probably determined by your server setup. It could be PHP or ASP.NET or many other languages, all of which should have facilities for regex searching and file/directory operations.

For a competent programmer (and assuming you are only searching file names), I would say it is moderately easy.
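
To give a rough idea of the file-name approach, here is a minimal sketch in Python (your server may well dictate PHP or ASP.NET instead, but the idea carries over). The function name find_drawings and the directory path are made up for illustration, and it assumes all the drawings sit in one flat directory and that users type the start of the drawing number.

```python
import re
from pathlib import Path


def find_drawings(directory, partial_number):
    """Return the PDF file names in `directory` that start with `partial_number`."""
    # re.escape stops anything the user types from being treated as regex syntax.
    pattern = re.compile(re.escape(partial_number) + r".*\.pdf$", re.IGNORECASE)
    return sorted(
        entry.name
        for entry in Path(directory).iterdir()
        # pattern.match anchors at the start of the name, so this is a prefix search.
        if entry.is_file() and pattern.match(entry.name)
    )


if __name__ == "__main__":
    # Hypothetical location of the drawing register on the server.
    for name in find_drawings("/path/to/drawings", "25345"):
        print(name)
```

The web page would then turn each returned name into a link. If you want matches anywhere in the name rather than only at the start, swapping pattern.match for pattern.search should do it.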
 
Use something like Lucene to do the searching.

If you don't want to do anything that complicated, open each PDF, read the text content into a database field and search on that.
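
A rough sketch of the database-field idea, in Python with SQLite purely as an example. It assumes the text has already been pulled out of each PDF by some PDF library (that step is not shown), and the table name drawings and the function names are invented for illustration.

```python
import sqlite3


def build_index(conn, documents):
    """Store (file_name, extracted_text) pairs in a simple lookup table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS drawings "
        "(file_name TEXT PRIMARY KEY, content TEXT)"
    )
    # INSERT OR REPLACE lets you re-index a drawing after it has been revised.
    conn.executemany(
        "INSERT OR REPLACE INTO drawings (file_name, content) VALUES (?, ?)",
        documents,
    )
    conn.commit()


def search_content(conn, term):
    """Return the file names whose stored text contains `term` (ASCII case-insensitive)."""
    rows = conn.execute(
        "SELECT file_name FROM drawings "
        "WHERE instr(lower(content), lower(?)) > 0 "
        "ORDER BY file_name",
        (term,),
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    # The text here is placeholder data standing in for real extracted PDF text.
    build_index(conn, [("25345D1.pdf", "Flange bracket assembly")])
    print(search_content(conn, "bracket"))
```

A plain substring match like this is fine for a modest drawing register. Something like Lucene starts to earn its keep when you need ranking, stemming or speed over a lot of documents.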
 