- What does it check?
- XML schema compliance, URL format (absolute, reachable), lastmod date format, limits (50,000 URLs max per file, 50MB uncompressed), encoding.
- Does it verify URLs return 200?
- No — URL reachability requires live HTTP requests, which the browser can't do cross-origin. Use a crawler (Screaming Frog, Sitebulb) for full checks.
- Common issues?
- HTTP URLs when site is HTTPS, trailing slash inconsistencies, URLs from other domains (sitemap should only list same-domain URLs), and invalid dates.
- What about sitemap indexes?
- Yes — validates the index structure (list of sitemap URLs) and each referenced sitemap in turn. Common for large sites.