Hi all,
I'm looking at creating another web portal, only this time I want it automated, so I'm looking into web crawlers. I've checked out a few, such as DataparkSearch, but they're all a little heavy. I was hoping to find a simple open source option that I could modify slightly.
It will be a targeted crawl of specific URLs that I choose. I only need simple details found on each page, stored in a local DB for querying. Without giving too much away, think along the lines of property. Yes, it has been done, but there is a niche that I want to cover.
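To make it concrete, here is roughly the shape I have in mind, as a minimal Python sketch using only the standard library. Everything specific in it is a placeholder I made up for illustration: the `pages` table, the choice of the page `<title>` as the "simple detail", and the `fetch_page` hook. It is not a finished crawler (no politeness delays, no robots.txt handling, no retries).

```python
import sqlite3
import urllib.request
from html.parser import HTMLParser


class TitleParser(HTMLParser):
    """Collects the text inside the first <title> element of a page."""

    def __init__(self):
        super().__init__()
        self.in_title = False
        self.done = False
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "title" and not self.done:
            self.in_title = True

    def handle_endtag(self, tag):
        if tag == "title" and self.in_title:
            self.in_title = False
            self.done = True  # ignore any later <title> elements

    def handle_data(self, data):
        if self.in_title:
            self.parts.append(data)


def extract_title(html):
    """Return the stripped title text, or an empty string if none is found."""
    parser = TitleParser()
    parser.feed(html)
    parser.close()
    return "".join(parser.parts).strip()


def fetch(url):
    """Download one page and decode it as text (assumes UTF-8, replaces bad bytes)."""
    with urllib.request.urlopen(url, timeout=10) as resp:
        return resp.read().decode("utf-8", errors="replace")


def crawl(urls, conn, fetch_page=fetch):
    """Visit only the given URLs and store one row per page in a local SQLite DB.

    fetch_page is a hypothetical hook so the download step can be swapped out.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY, title TEXT)"
    )
    for url in urls:
        html = fetch_page(url)
        conn.execute(
            "INSERT OR REPLACE INTO pages (url, title) VALUES (?, ?)",
            (url, extract_title(html)),
        )
    conn.commit()
```

Usage would be something like `crawl(my_urls, sqlite3.connect("portal.db"))`, where `my_urls` is the fixed list of pages I care about, followed by ordinary SQL queries against the `pages` table. The real version would pull out more fields than the title, but the fetch, extract, store loop is the part I'm hoping an existing tool already covers.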
Has anyone got any ideas on the best way to tackle this?