We’d like to remind Forumites to please avoid political debate on the Forum.
This is to keep it a safe and useful space for MoneySaving discussions. Threads that are – or become – political in nature may be removed in line with the Forum’s rules. Thank you for your understanding.
📨 Have you signed up to the Forum's new Email Digest yet? Get a selection of trending threads sent straight to your inbox daily, weekly or monthly!
The Forum now has a brand new text editor, adding a bunch of handy features to use when creating posts. Read more in our how-to guide
Online cache of web site
Undervalued
Posts: 9,860 Forumite
I wonder if anybody can help please.....
I am trying to find out when a particular page on a publicly accessible website changed (in the last couple of months). The site is cached on Wayback Machine around twenty times in the last seven years at seemingly random intervals. The most recent is late November last year, then on two consecutive days in June. The November cache is useful up to a point in that it is very different to the information that appears now. However I really need to know more accurately when it changed, which may of course have happened in stages on various days between then and now.
Can anybody suggest anywhere else I can try? Also, it would be interesting to know how these cache dates are picked? Are they triggered by a change?
Thanks
I am trying to find out when a particular page on a publicly accessible website changed (in the last couple of months). The site is cached on Wayback Machine around twenty times in the last seven years at seemingly random intervals. The most recent is late November last year, then on two consecutive days in June. The November cache is useful up to a point in that it is very different to the information that appears now. However I really need to know more accurately when it changed, which may of course have happened in stages on various days between then and now.
Can anybody suggest anywhere else I can try? Also, it would be interesting to know how these cache dates are picked? Are they triggered by a change?
Thanks
0
Comments
-
For Wayback somewhat randomly. The only way to get the data as accurately as you wish would be from the change logs on the server itself, or from Google's servers if it was data that they retained, but they do not retain old copies of most sites.Undervalued said:I wonder if anybody can help please.....
I am trying to find out when a particular page on a publicly accessible website changed (in the last couple of months). The site is cached on Wayback Machine around twenty times in the last seven years at seemingly random intervals. The most recent is late November last year, then on two consecutive days in June. The November cache is useful up to a point in that it is very different to the information that appears now. However I really need to know more accurately when it changed, which may of course have happened in stages on various days between then and now.
Can anybody suggest anywhere else I can try? Also, it would be interesting to know how these cache dates are picked? Are they triggered by a change?
Thanks
0 -
The likes of Wayback Machine respect (where available) what's called the Robots meta Tag.https://yoast.com/robots-meta-tags/ for more informationWhat this basically means is back in the day if the website creator says don't come back after so many days, then the web spiders won't (although these days that's largely ignored anyway, but its still valid for older archive copies if it existed), otherwise it'll just visit it in the next major crawl.There is a pattern to the Wayback Machine and how it works, it isn't just totally random as to how often a website gets crawled: There are so-called "Worldwide Web Crawls" that happen on occasion (which may be broken down into other crawls and happen at the same time) and this is what determines how often a website gets archived, though it can look as if its random, its more by luck than anything else.0
Confirm your email address to Create Threads and Reply
Categories
- All Categories
- 354K Banking & Borrowing
- 254.3K Reduce Debt & Boost Income
- 455.3K Spending & Discounts
- 247K Work, Benefits & Business
- 603.6K Mortgages, Homes & Bills
- 178.3K Life & Family
- 261.1K Travel & Transport
- 1.5M Hobbies & Leisure
- 16.1K Discuss & Feedback
- 37.7K Read-Only Boards