What Is TurkCrawler?
TurkCrawler is a powerful web crawler engine and library that makes web data
extraction easier and safer. TurkCrawler uses
NCrawler
as its core engine and is an advanced version of NCrawler. TurkCrawler uses
four different types of criteria to collect data from the web:
RegularExpressionItem, BasicHtmlItem, HtmlNodeItem and HtmlNodeCollectionItem.
Please click here for more information
about how TurkCrawler works.
Please visit
http://ncrawler.codeplex.com/ for more information about NCrawler.
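The criteria names above suggest three broad extraction styles: matching a regular expression, selecting a single HTML node, and selecting a collection of nodes. The sketch below illustrates those styles in Python using only the standard library. It is not TurkCrawler's API: the page content and the helpers `regex_first`, `single_node` and `node_collection` are hypothetical names chosen for illustration.

```python
import re
from html.parser import HTMLParser

# Hypothetical sample page used only for this illustration.
PAGE = """
<html><body>
<h1 id="title">Sample Listing</h1>
<ul>
  <li class="item">Alpha</li>
  <li class="item">Beta</li>
</ul>
<p>Contact: info@example.com</p>
</body></html>
"""


def regex_first(pattern, html):
    """Regex-style criterion: return the first capture group, or None."""
    match = re.search(pattern, html)
    return match.group(1) if match else None


class TagTextCollector(HTMLParser):
    """Collects the text content of every element with a given tag name."""

    def __init__(self, tag):
        super().__init__()
        self.tag = tag
        self.depth = 0    # > 0 while inside a matching element
        self.texts = []   # one entry per matching element

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self.depth += 1
            self.texts.append("")

    def handle_endtag(self, tag):
        if tag == self.tag and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth:
            self.texts[-1] += data


def node_collection(tag, html):
    """Collection-style criterion: text of all elements named `tag`."""
    parser = TagTextCollector(tag)
    parser.feed(html)
    parser.close()
    return [text.strip() for text in parser.texts]


def single_node(tag, html):
    """Single-node criterion: text of the first element named `tag`, or None."""
    nodes = node_collection(tag, html)
    return nodes[0] if nodes else None


if __name__ == "__main__":
    print(regex_first(r"([\w.]+@[\w.]+)", PAGE))
    print(single_node("h1", PAGE))
    print(node_collection("li", PAGE))
```

A regular expression suits values with a recognizable shape, such as an e-mail address, while node-based selection follows the page structure and tends to survive small changes in the surrounding text.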
Web Data Extraction
Web Data Extraction is a kind of information retrieval whose goal is to
automatically extract structured information from unstructured or
semi-structured web data sources. Some examples are:
- Financial Data
- Real Estate Data
- Product Pricing Data
- Duplicate an Online Database
- Dynamic Web Content
- Create Innovative New Services
- Sales Leads
- Capture Dating Site Info
- Capture Auction Info
- Capture Job Postings from Online Job Websites
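
To make the idea of turning semi-structured web data into structured information concrete, here is a minimal Python sketch for the product pricing case. The HTML snippet, the class names `name` and `price`, and the function `extract_products` are assumptions made up for this example; a real site will use different markup, and a regular expression this strict breaks as soon as the markup changes.

```python
import re

# Hypothetical product listing; real pages will use different markup.
LISTING = """
<div class="product"><span class="name">Widget</span><span class="price">$9.99</span></div>
<div class="product"><span class="name">Gadget</span><span class="price">$24.50</span></div>
"""

# One match per product: a name span followed by a price span.
ROW = re.compile(
    r'<span class="name">(?P<name>[^<]+)</span>\s*'
    r'<span class="price">\$(?P<price>\d+(?:\.\d+)?)</span>'
)


def extract_products(html):
    """Return one structured record (a dict) per product found in `html`."""
    return [
        {"name": m.group("name").strip(), "price": float(m.group("price"))}
        for m in ROW.finditer(html)
    ]


if __name__ == "__main__":
    for product in extract_products(LISTING):
        print(product["name"], product["price"])
```

Each record comes out as a plain dictionary with a typed price, so it can be written to a database or a CSV file without further parsing.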