robots.txt & Sitemap Monitor

KeyValue
StatusActive
OwnerQA Automation
Updated2026-03-26
ScopeDaily crawl-policy and sitemap health monitoring across CNC sites

This monitor watches a part of the stack that is easy to overlook and expensive to get wrong. A homepage can look fine while robots.txt silently blocks crawling or a sitemap starts returning errors.

What The Monitor Checks

CheckWhy It Matters
robots.txt change detectioncatches accidental crawl-policy changes
sitemap healthcatches broken discovery endpoints

Current Operating Model

MetricCurrent Snapshot
monitored sites16 CNC sites
frequencydaily
alert stylededuplicated Slack alerts
state trackingstored baseline/history file

Why This Is High-Stakes

ProblemReal Risk
bad robots.txt changesearch engines stop crawling important surfaces
broken sitemap endpointnew content discovery degrades
silent removal of sitemap URLsSEO and content freshness suffer before anyone notices

Why A Dedicated Monitor Is Better Than A Generic Test

This is not a classic user-flow problem. It is a publishing and discoverability problem. The focused monitor is better because it:

  • runs cheaply
  • tracks change over time
  • alerts only when something meaningful changes
  • speaks to SEO and platform health, not just frontend rendering

Slack Behavior

The monitor is designed to avoid spam.

Reporting PrincipleWhat It Means
deduplicationone issue should not flood the channel repeatedly
change-aware alertsno alert when nothing changed
operator-readable outputthe alert should explain what changed, not just say “failed”

Good Use Cases

Use CaseWhy The Monitor Helps
accidental deploy changecatches policy drift before it becomes a long-running SEO issue
backend issue in sitemap generationcatches failures even when the homepage still works
silent SEO regressionssurfaces issues that UI tests will not catch

[EXPAND: What to do when it alerts]

  1. confirm whether the change was intentional
  2. check whether the impact is one site or many
  3. decide whether the issue belongs to SEO, platform, or release engineering
  4. keep the thread open until the next healthy run confirms recovery

[END EXPAND]

NeedGo To
general monitoring overviewMonitoring
command catalogCLI Reference