Quote:
Originally posted by fnet
Nice theory, but wrong. Show me a counter example to what's actually been discussed here, on the exclusive basis of update frequency.
You don't have to believe me.
Google uses a really simple way to determine whether a page might have been updated.
How often it is willing to check depends on PageRank, though, so you can't get it to check every hour.
Here's an example from a subpage that is linked only from the index:
64.68.82.46 - - [23/Feb/2003:07:12:16 +0100] "GET /links.php HTTP/1.0" 200 1394 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
64.68.82.36 - - [27/Feb/2003:22:22:17 +0100] "GET /links.php HTTP/1.0" 200 1394 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
64.68.82.34 - - [28/Feb/2003:14:08:28 +0100] "GET /links.php HTTP/1.0" 200 1394 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
64.68.82.51 - - [02/Mar/2003:17:03:37 +0100] "GET /links.php HTTP/1.0" 200 1394 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
This is the index:
64.68.82.14 - - [20/Dec/2002:10:27:35 +0100] "GET / HTTP/1.0" 200 2713 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
64.68.82.66 - - [22/Dec/2002:13:07:54 +0100] "GET / HTTP/1.0" 200 2713 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
64.68.82.28 - - [23/Dec/2002:11:23:02 +0100] "GET / HTTP/1.0" 200 2713 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
64.68.82.39 - - [24/Dec/2002:09:21:15 +0100] "GET / HTTP/1.0" 200 2713 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
64.68.82.14 - - [25/Dec/2002:09:22:47 +0100] "GET / HTTP/1.0" 200 2713 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
64.68.82.38 - - [26/Dec/2002:12:56:21 +0100] "GET / HTTP/1.0" 200 2713 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
64.68.82.38 - - [31/Dec/2002:13:58:03 +0100] "GET / HTTP/1.0" 200 2713 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
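If you want to check the visit intervals on your own logs instead of eyeballing them, here is a rough Python sketch. It assumes Apache "combined" log format with each entry on a single line and an English/C locale for the month names. The names LOG_RE, googlebot_gaps and SAMPLE_LINES are made up for this example, and the sample data is just a few of the entries quoted above. It only measures the gaps between Googlebot requests per URL; it says nothing about why Google picks those intervals.

```python
import re
from collections import defaultdict
from datetime import datetime

# Apache "combined" log format, one entry per line (assumption).
LOG_RE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<ts>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) (?P<size>\d+|-) '
    r'"[^"]*" "(?P<agent>[^"]*)"$'
)


def googlebot_gaps(lines):
    """Return {path: [gap_in_days, ...]} between successive Googlebot hits."""
    visits = defaultdict(list)
    for line in lines:
        m = LOG_RE.match(line.strip())
        # Skip lines that do not parse or do not come from Googlebot.
        if not m or "Googlebot" not in m.group("agent"):
            continue
        ts = datetime.strptime(m.group("ts"), "%d/%b/%Y:%H:%M:%S %z")
        visits[m.group("path")].append(ts)

    gaps = {}
    for path, stamps in visits.items():
        stamps.sort()
        # Difference between each visit and the next one, in days.
        gaps[path] = [
            (later - earlier).total_seconds() / 86400
            for earlier, later in zip(stamps, stamps[1:])
        ]
    return gaps


UA = '"-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"'
SAMPLE_LINES = [
    f'64.68.82.46 - - [23/Feb/2003:07:12:16 +0100] "GET /links.php HTTP/1.0" 200 1394 {UA}',
    f'64.68.82.36 - - [27/Feb/2003:22:22:17 +0100] "GET /links.php HTTP/1.0" 200 1394 {UA}',
    f'64.68.82.34 - - [28/Feb/2003:14:08:28 +0100] "GET /links.php HTTP/1.0" 200 1394 {UA}',
    f'64.68.82.51 - - [02/Mar/2003:17:03:37 +0100] "GET /links.php HTTP/1.0" 200 1394 {UA}',
    f'64.68.82.14 - - [20/Dec/2002:10:27:35 +0100] "GET / HTTP/1.0" 200 2713 {UA}',
    f'64.68.82.66 - - [22/Dec/2002:13:07:54 +0100] "GET / HTTP/1.0" 200 2713 {UA}',
    f'64.68.82.28 - - [23/Dec/2002:11:23:02 +0100] "GET / HTTP/1.0" 200 2713 {UA}',
]

if __name__ == "__main__":
    for path, days in googlebot_gaps(SAMPLE_LINES).items():
        print(path, [round(d, 2) for d in days])
```

To run it on a real log, pass the open file instead of the sample list, e.g. googlebot_gaps(open("access.log")), where access.log stands for whatever your server's log file is called.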