Spidering
Spidering is the process of a spider downloading pages. Most modern spiders used by search engines are responsible only for downloading pages and storing them raw in a temporary database. An indexer then processes each page for inclusion in the search engine's database. Spiders can be set up to follow a wide range of variables and guidelines, including: the speed at which the spider downloads pages, whether it walks or crawls through a website, whether it goes only after index pages, what time of day it is active, which domains it will connect to, and how many pages it will accept from one domain.
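
The settings listed above could be grouped into a single policy object that a spider consults before each download. The following is only a minimal sketch: the names `SpiderPolicy`, `is_index_page`, and `may_fetch`, all field names, and the rule used to recognize an index page are hypothetical choices made for illustration, not part of any particular spider.

```python
from dataclasses import dataclass
from urllib.parse import urlsplit


@dataclass
class SpiderPolicy:
    """Hypothetical bundle of spider settings; every name here is illustrative."""
    delay_seconds: float = 1.0        # pause between downloads (controls speed)
    follow_links: bool = True         # crawl through a site vs. fetch listed URLs only
    index_pages_only: bool = False    # restrict downloads to index pages
    active_from: int = 0              # first hour of the day the spider may run
    active_until: int = 24            # hour at which the spider stops (exclusive)
    allowed_domains: frozenset = frozenset()  # empty set means any domain
    max_pages_per_domain: int = 100   # cap on pages accepted from one domain


def is_index_page(url: str) -> bool:
    # Assumption: an index page is a directory root or a file named index.*
    last_segment = urlsplit(url).path.rsplit("/", 1)[-1]
    return last_segment == "" or last_segment.startswith("index.")


def may_fetch(url: str, policy: SpiderPolicy, fetched_per_domain: dict, hour: int) -> bool:
    """Return True if the policy allows downloading `url` at the given hour."""
    domain = urlsplit(url).hostname or ""
    if not (policy.active_from <= hour < policy.active_until):
        return False  # outside the active time of day
    if policy.allowed_domains and domain not in policy.allowed_domains:
        return False  # domain is not on the allowed list
    if fetched_per_domain.get(domain, 0) >= policy.max_pages_per_domain:
        return False  # per-domain page limit already reached
    if policy.index_pages_only and not is_index_page(url):
        return False  # only index pages are wanted
    return True
```

In this sketch, `delay_seconds` and `follow_links` are not checked by `may_fetch`. They would be read by the download loop itself, which is left out here because it depends on how the spider stores the raw pages.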