01 // URL EXTRACTOR

Sitemap URL extractor.

Dump every URL out of any sitemap — XML, HTML, sitemap-index or text. Deduped, validated, exportable in five formats.

Sitemap URL

Supports XML sitemaps, sitemap-index files (auto-followed), HTML site indexes and plain-text URL lists.

Export format

How extraction handles edge cases

Frequently asked

How many URLs can it handle?

No fixed cap, but each fetched sitemap is bounded to 25 MB. A 50,000-URL sitemap typically lands around 8–12 MB.

Can I extract URLs from a sitemap index in one call?

Yes — point it at the index and it'll fan out, fetch every child, and return one flat deduplicated list.