Static vs dynamic, Google Search Central limits, GSC checklist — LaunchGPT sitemap finder utility to probe sitemap.xml paths.
LaunchGPT Team
Product & research
Published
Searches for free XML sitemap generator spike after site migrations, framework upgrades, or when Google Search Console shows "Couldn't fetch sitemap." A valid sitemap is straightforward XML that lists URLs you want crawled and indexed. But having a valid XML file is only part of the job. Crawl budget, canonical tags, what you include or exclude, and how you submit all determine whether a sitemap actually helps your site rank.
Google Search Central documents sitemap guidelines and limits — including the 50,000 URL maximum per sitemap file and the requirements for encoding, format, and submission (Learn about sitemaps). This guide explains when and why to use a sitemap, the difference between static and dynamic approaches, what to include and exclude, submission steps, and where LaunchGPT Sitemap Finder helps you verify your sitemap is accessible after deploy.
For most websites, yes. Google can discover pages through internal links, but a sitemap makes crawling faster and more reliable — especially for new sites, large sites, sites with deep content, or sites updated frequently. Sitemaps do not guarantee indexing, but they help search engines understand your site structure and prioritize crawl time on your important pages.
Small static sites with clear internal linking sometimes rank fine without a sitemap. Large e-commerce sites, content sites with hundreds of posts, multi-locale sites, and sites that have just migrated all benefit significantly from a well-maintained sitemap.
Include pages that represent the canonical, publicly accessible content you want Google to index and rank. For most sites this means:
This is where many teams go wrong. Including low-quality or duplicate pages does not help and can waste crawl budget on pages that should not rank.
Exclude:
/blog/page/2, /products?page=3)The goal is to tell search engines what your best pages are, not to submit every URL your site can render.
A minimal valid sitemap looks like this:
<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
<url>
<loc>https://example.com/</loc>
<lastmod>2026-05-01</lastmod>
<changefreq>monthly</changefreq>
<priority>1.0</priority>
</url>
<url>
<loc>https://example.com/blog/example-post/</loc>
<lastmod>2026-04-15</lastmod>
<changefreq>weekly</changefreq>
<priority>0.8</priority>
</url>
</urlset>
Key rules:
<lastmod> should reflect actual content changes, not today's date on every page.<changefreq> and <priority> are hints, not commands — Google may ignore them.& becomes &, spaces become %20.Framework sites (Next.js, Nuxt, Astro): Use the built-in sitemap module or a plugin. Verify output by visiting /sitemap.xml on your live domain.
WordPress: Install a plugin like Yoast SEO or Rank Math. Sitemap is generated automatically.
Custom or static sites: Use a command-line tool, a browser-based generator, or write the XML manually for small sites. Tools like sitemap-generator-cli (Node.js) can crawl your site and output a valid XML file.
For very small sites: A free online XML sitemap generator can crawl your public pages and output a downloadable file. These work for sites with fewer than a few hundred pages and no authentication.
Add this line to your robots.txt file:
Sitemap: https://yourdomain.com/sitemap.xml
This helps all search engines — not only Google — discover the sitemap automatically.
Before submitting to Google, confirm the sitemap loads without errors. It should return HTTP 200 at a stable HTTPS URL. No login required, no redirect to the homepage, no server error.
Use LaunchGPT Sitemap Finder to probe common sitemap paths (sitemap.xml, sitemap_index.xml, WordPress plugin patterns) and confirm expected status codes after deploy.
Open Google Search Console → select your property → Sitemaps in the sidebar → enter the sitemap URL → click Submit.
After submission, return to Sitemaps and check the coverage report. Look for:
Submit to Bing Webmaster Tools separately. Bing also respects the Sitemap: line in robots.txt for automatic discovery.
After deploy, run Sitemap Finder & Checker to confirm common paths return expected HTTP status codes. This catches misconfigurations like a 301 redirect chain on the sitemap URL, a cached 404 from a previous migration, or a sitemap that only works on www but not the naked domain.
Open sitemap finder
Browse all developer utilities on the Tools utilities hub.
Mistake 1: Including every URL the site generates. Paginated pages, session-ID URLs, filter pages, and thin content pollute your sitemap and waste crawl budget.
Mistake 2: Submitting a sitemap on a site with canonical conflicts. If pages have conflicting canonical tags, submitting them in the sitemap does not override the canonical. Fix canonical tags first.
Mistake 3: Using today's date on every <lastmod> field.
Google tracks this. If every page shows today's date regardless of actual updates, the signal becomes meaningless and may be ignored.
Mistake 4: Not excluding staging or preview URLs. Check that your staging environment does not end up in the generated sitemap for your production domain.
Mistake 5: Forgetting the sitemap index for large sites. Once you exceed 50,000 URLs, you need a sitemap index file that points to individual child sitemaps.
Mistake 6: Failing to update after migrations. After a domain change, URL structure change, or CMS switch, regenerate and resubmit the sitemap. Old URLs in the sitemap trigger crawl errors that GSC tracks.
Next.js: Use the built-in sitemap.ts export (Next.js 13+) or next-sitemap package. Exclude draft routes and preview URLs.
WordPress: Yoast SEO, Rank Math, or All in One SEO handle sitemap generation. Verify exclusions include tag/category pages with thin content if applicable.
Shopify: Shopify generates sitemap.xml automatically. Add it to GSC. You cannot fully control which product pages appear, but most standard Shopify sitemaps are clean.
Webflow: Sitemap is auto-generated. Toggle per-page SEO settings to exclude specific pages.
Custom builds: Use a build-step script that reads your route manifest and outputs clean XML. Version-control the sitemap generator config.
Submitting a sitemap once is not enough. Search engines crawl it on their schedule, and your site changes over time. Set a recurring monthly task to:
For fast-changing sites like e-commerce or news, automate sitemap regeneration as part of your deploy pipeline so the live sitemap always matches current published content.
Site migrations are the highest-risk moment for sitemaps. When you change domains, restructure URLs, or switch platforms, your old sitemap becomes inaccurate immediately. Steps for migration:
Missing any step can cost weeks of ranking recovery. The sitemap tells Google where your canonical pages are; the redirects and canonicals enforce it.
Free XML sitemap generator tools work when the URLs inside are clean, canonical, and worth indexing. Generate from your CMS or framework, exclude thin and duplicate content, probe with LaunchGPT Sitemap Finder, submit to Google Search Console, and monitor coverage regularly.
Browse utilities
Related: Sitemap Finder & Checker · Free URL shortener with analytics · Discover
Was this useful?
0 reactions · Comments coming soon
LaunchGPT Team
Product & research
We build AI-powered SaaS discovery so buyers can shortlist, compare, and validate tools in days instead of weeks. Our comparisons blend public pricing signals, integration coverage, and real-world rollout patterns—always with transparent methodology. Follow the blog for stack blueprints, category teardowns, and vendor-neutral buying guides.
More guides and comparisons from the LaunchGPT blog.