Crawling via google's cache
December 29th, 2006
Google’s cache is rather extensive, especially when it comes to popular sites. Therefor a neat trick to keep in mind if one every has to crawl a slow site or don’t want to hog the poor server’s bandwidth is to go through google instead. If you want to fetch google’s cache for a page with url http://foo.tld/bar then just the google search for cache:foo.tld/bar (e.g. http://www.google.com/search?q=cache%3Afoo.tld/bar) . It will display google’s cache of its latest crawl, ready to be reused.
This is of course not to be recommended for sites that don’t rank well in google as they are rarely thoroughly crawled.
Sorry, comments are closed for this article.