See: Description
Class | Description |
---|---|
ACrawler |
The main functionalities of a crawler thread.
|
Crawler |
Controls the whole publication searching procedure.
|
Extract |
Specific tool for extracting basic data of a publication from text block.
|
HTTPRequest |
Executes a HTTP request and downloads the response.
|
HTTPRequestEx |
Extends HTTPRequest to fit the programs requirements: retries download on error,
sets up random proxy for every try, and stores downloaded pages in a cache.
|
PubListCrawler |
Crawls a result list.
|
PubListHTMLCrawler |
Parses a HTML block as a publication list.
|
PubPageCrawler |
Processes a pubpage (page which contains details of a publication): extracts
basic information and starts the list crawler for referring publications if
needed (if transitivity level is above 0).
|
ThreadScheduler |
ThreadScheduler can help you limit the count of actually running threads.
|