"Our Web Site Checker" is the new (less technical) name for Web Speed Cache Crawler. The service will still be available separately from @OurWebHostingUk plans. Updated site will be coming soon.
In other news, work continues this week to identify orphaned pages. #buildinpublic
Fingers crossed, we are now feature complete for our early Jan initial release.
Orphaned pages found via the sitemap are now crawled and tagged - Your landing pages deserve attention too. Email content tweaked to match correct names.
The log email now includes the duration of the crawl and the amount of content checked (pages, images and downloads). We have also been tweaking the logic for cached content percentage.
Next up is to review 301 directs. Should they be excluded to reach 100%. #buildinpublic
You can now keep an eye on the bandwidth used per crawl.
Pages = Downloaded in full to find page links. Images / CSS / JavaScript = HTTP Header information only, reducing bandwidth used and keeping it speedy.