How To Use XML Sitemaps To Boost SEO via @sejournal, @jes_scholz

5 months ago 132
ARTICLE AD BOX

What was considered champion signifier yesterday does not clasp existent contiguous and this is particularly applicable erstwhile it comes to XML sitemaps, which are astir arsenic aged arsenic SEO itself.

The occupation is, it’s time-consuming to benignant invaluable proposal from each the misinformation connected forums and societal media astir however to optimize XML sitemaps

So, portion astir of america admit the value of submitting sitemaps to Google Search Console and Bing Webmaster Tools, arsenic good arsenic successful the robots.txt file – for faster contented find and refresh, much businesslike crawling of SEO-relevant pages, and invaluable indexing reporting to place SEO issues – the finer details of implementing sitemaps to amended SEO show whitethorn beryllium missed.

Let’s wide up the disorder and dive into the existent champion practices for sitemap optimization.

In this article, we cover:

  • What is an XML sitemap?
  • How to make a sitemap.
  • Valid XML sitemap format.
  • Types of sitemaps.
  • Optimization of XML sitemaps.
  • XML sitemap champion signifier checklist.

What Is An XML Sitemap?

An XML sitemap is simply a record that lists each of your website’s URLs.

It acts arsenic a roadmap to archer the crawlers of indexing platforms (like hunt engines, but besides ample connection models (LLMs)) what contented is disposable and however to scope it.

sitemap vs website crawlingImage from author, February 2025

In the illustration above, a hunt motor volition find each 9 pages successful a sitemap with 1 sojourn to the XML sitemap file.

On the website, it volition person to leap done 5 interior links connected 5 pages to find leafage 9.

This quality of XML sitemaps to assistance crawlers successful faster indexing is particularly important for websites that:

  • Have thousands of pages and/or a heavy website architecture.
  • Frequently adhd caller pages.
  • Frequently alteration the contented of existing pages.
  • Suffer from anemic interior linking and orphan pages.
  • Lack of a beardown outer nexus profile.

Even though indexing platforms could technically find your URLs without it, by including pages successful an XML sitemap, you’re indicating that you see them to beryllium prime landing pages.

And portion determination is nary warrant that an XML sitemap volition get your pages crawled faster, fto unsocial indexed oregon ranked, submitting 1 surely increases your chances.

How To Create A Sitemap

There are 2 ways to make a sitemap: Static sitemaps that indispensable beryllium manually updated, oregon dynamic sitemaps that are updated successful real-time oregon by a regular cron job.

Static sitemaps are elemental to make utilizing a instrumentality specified arsenic Screaming Frog.

The occupation is that arsenic soon arsenic you make oregon region a page, your sitemap is outdated. If you modify the contented of a page, the sitemap won’t automatically update the lastmod tag.

So, unless you emotion manually creating and uploading sitemaps for each azygous change, it’s champion to debar static sitemaps.

Dynamic XML sitemaps, connected the different hand, are automatically updated by your server to bespeak applicable website changes.

To make a dynamic XML sitemap you tin bash 1 of the following:

  • Ask your developer to codification a customized script, being definite to supply wide specifications.
  • Use a dynamic sitemap generator tool.
  • Install a plugin for your contented absorption strategy (CMS), for example, Yoast plugin for WordPress.

Valid XML Sitemap Format

Image from author, February 2025

Your sitemap needs 3 items to present itself to indexing platforms:

  • XML Version Declaration: Specifies the record benignant to pass indexing platforms what they tin expect from the file.
  • UTF-8 Encoding: Ensures each the characters utilized tin beryllium understood.
  • Specify The Namespace: Communicates what rules the sitemap follows. Most sitemaps usage the “http://www.sitemaps.org/schemas/sitemap/0.9” namespace to amusement that the record conforms to standards acceptable by sitemaps.org.

This is followed by a URL instrumentality for each page. In a modular XML sitemap, determination are lone 2 tags that should beryllium included for a URL:

  1. Loc (a.k.a. Location) Tag: This compulsory tag contains the absolute, canonical mentation of the URL location. It should accurately bespeak your tract protocol (http oregon https) and if you person chosen to see oregon exclude www.
  2. Lastmod (a.k.a. Last Modified) Tag: An optional but highly recommended tag to pass the day and clip the leafage was published oregon the past meaningful change. This helps indexing platforms recognize which pages person caller contented and prioritize them for crawling.

Google’s documentation connected sitemaps states:

“Google uses the <lastmod> worth if it’s consistently and verifiably (for illustration by comparing to the past modification of the page) accurate. The <lastmod> worth should bespeak the day and clip of the past important update to the page. For example, an update to the main content, the structured data, oregon links connected the leafage is mostly considered significant, nevertheless an update to the copyright day is not.”

Bing’s documentation agrees connected the value of the lastmod tag:

“The “lastmod” tag is utilized to bespeak the past clip the web pages linked by the sitemaps were modified. This accusation is utilized by hunt engines to find however often to crawl your site, and to determine which pages to scale and which to permission out.”

Mistakes, specified arsenic updating the <lastmod> worth erstwhile the sitemap is generated alternatively than erstwhile the idiosyncratic leafage was past modified, oregon worse, trying to manipulate crawlers by updating the day without importantly altering the page, whitethorn effect successful this awesome being ignored by hunt engines for your website. Damaging your quality to person your contented efficiently crawled.

Do not see the Changefreq (a.k.a. Change Frequency) Tag oregon precedence tag. Once upon a time, these hinted astatine however often to crawl, but are present ignored by hunt engines.

Types Of Sitemaps

There are galore antithetic types of sitemaps. Let’s look astatine the ones you really need.

XML Sitemap Index

XML sitemaps person a mates of limitations:

  • A maximum of 50,000 URLs.
  • An uncompressed record size bounds of 50 MB.

Sitemaps tin beryllium compressed utilizing gzip to prevention bandwidth for your server. But erstwhile unzipped, the sitemap inactive can’t transcend either limit.

Whenever you transcend either limit, you volition request to divided your URLs crossed aggregate XML sitemaps.

Those sitemaps tin past beryllium combined into a azygous XML sitemap scale file, often named sitemap-index.xml. Essentially, it is simply a sitemap for sitemaps.

You tin make aggregate sitemap scale files. But beryllium alert that you cannot nest sitemap scale files.

For indexing platforms to easy find each 1 of your sitemap files, you volition privation to:

  • Submit your sitemap scale to Google Search Console and Bing Webmaster Tools.
  • Specify your sitemap oregon sitemap scale URL(s) successful your robots.txt file. Pointing indexing platforms straight to your sitemap arsenic you invited them to crawl.

Image from author, February 2025

Image Sitemap

Image sitemaps were designed to amended the indexing of representation content, primitively offering additional tags.

In modern-day SEO, however, it’s champion signifier to utilize JSON-LD schema.org/ImageObject markup to telephone retired representation properties to indexing platforms, arsenic it provides much attributes than an representation XML sitemap.

Because of this, a dedicated XML representation sitemap is unnecessary. Simply adhd the representation XML namespace declaration and the representation tags straight to the main XML sitemap wrong the associated URL.

Image from author, February 2025

Know that images don’t person to beryllium connected the aforesaid domain arsenic your website to beryllium submitted successful a sitemap. You tin usage a CDN arsenic agelong arsenic it’s verified successful Google Search Console.

Video Sitemap

Similar to images, Google says video sitemap tags tin beryllium added wrong an existing sitemap.

However, dissimilar images, video extensions successful sitemaps connection a multitude of further tags.

Image from author, February 2025

If you leverage these tags extensively, see a dedicated video sitemap.

Adding these extensions increases the record size of your sitemap importantly and whitethorn pb to you exceeding the record size limits.

Either method volition assistance Google discover, crawl, and scale your video contented arsenic agelong arsenic the video is related to the contented of the big leafage and is accessible to Googlebot.

While Bing does enactment video extensions successful XML sitemaps, Fabrice Canel confirmed to maine that they similar submission via IndexNow. Although Bing’s documentation inactive mentions the mRSS format.

Google News Sitemap

Google News sitemaps tin lone beryllium utilized for nonfiction contented that was created successful the past 2 days. Once the articles are older than 48 hours, region the URLs from the sitemap.

Again, portion Google News sitemap tags tin beryllium included successful your regular sitemap, this is not recommended.

Unlike for representation and video, lone Google leverages the quality sitemap extension, not Bing oregon different indexers.

Image from author, February 2025

Contrary to immoderate online advice, Google News sitemaps don’t enactment representation URLs.

HTML Sitemap

XML sitemaps instrumentality attraction of indexing level needs. HTML sitemaps were designed to assistance quality users successful uncovering content.

The question becomes: If you person a bully idiosyncratic acquisition and well-crafted interior links, bash you request an HTML sitemap?

Check the leafage views of your HTML sitemap successful Google Analytics. Chances are, it’s precise low. If not, it’s a bully denotation that you request to amended your website navigation.

HTML sitemaps are mostly linked successful website footers. Taking nexus equity from each azygous leafage of your website.

Ask yourself. Is that the champion usage of that nexus equity? Or are you including an HTML sitemap arsenic a motion to bequest website champion practices?

If fewer humans usage it, and indexing platforms don’t request it arsenic you person beardown interior linking and an XML sitemap, does that HTML sitemap person a crushed to exist? I would reason no.

XML Sitemap Optimization

XML sitemap optimization involves however you operation your sitemaps and what URLs are included.

How you take to bash this impacts however efficiently indexing platforms crawl your website and, thus, your contented visibility.

Here are 4 ways to optimize XML sitemaps:

1. Only Include SEO Relevant Pages In XML Sitemaps

An XML sitemap is simply a database of pages you privation to beryllium crawled (and subsequently fixed visibility to by indexing platforms), which isn’t needfully each leafage of your website.

A bot arrives astatine your website with an “allowance” for however galore pages it volition crawl.

The XML sitemap indicates that you see the included URLs much important than those that aren’t blocked but not successful the sitemap.

You’re utilizing it to archer indexing platforms, “I’d truly admit it if you’d absorption connected these URLs successful particular.”

To assistance them crawl your tract much intelligently and reap the benefits of faster (re)indexing, bash not include:

  • 301 redirect URLs.
  • 404 oregon 410 URLs.
  • Non-canonical URLs.
  • Pages with noindex tags.
  • Pages blocked by robots.txt.
  • Paginated pages.
  • Parameter URLs that aren’t SEO-relevant.
  • Resource pages accessible by a pb gen signifier (e.g., achromatic insubstantial PDFs).
  • Utility pages that are utile to users, but not intended to beryllium landing pages (login page, interaction us, privateness policy, relationship pages, etc.).

I’ve seen recommendations to adhd 3xx, 4xx, oregon non-indexable pages to sitemaps successful the anticipation it volition velocity up deindexing.

But akin to manipulation of the past mod date, specified attempts to get these pages processed faster whitethorn effect successful the sitemaps being ignored by hunt engines arsenic a signal, damaging your quality to person your invaluable contented efficiently crawled.

But remember, Google is going to usage your XML submission lone arsenic a hint astir what’s important connected your site.

Just due to the fact that it’s not successful your XML sitemap doesn’t needfully mean that Google won’t scale those pages.

2. Ensure Your XML Sitemap Is Valid

XML sitemap validators tin archer you if the XML codification is valid. But this unsocial is not enough.

There mightiness beryllium different crushed wherefore Google oregon Bing can’t fetch your sitemap, specified arsenic robots directives. Third-party tools won’t beryllium capable to place this.

As such, the astir businesslike mode to guarantee your sitemap is valid is to taxable it straight to Google Search Console and Bing Webmaster Tools.

Image from author, February 2025

When valid successful GSC and BWT, you volition spot the greenish “Success” status.

Image from author, February 2025

If you get a reddish connection instead, click connected the mistake to find retired why, hole it, and resubmit.

But successful Google Search Console and Bing Webmaster Tools, you tin bash truthful overmuch much than elemental validation.

3. Leverage Sitemap Reporting For Indexing Analysis

Image from author, February 2025

Say you taxable 80,000 pages each successful 1 sitemap index, and 9,000 are excluded by some Google and Bing.

Sitemap reporting volition assistance you to recognize overarching why, but provides constricted reporting connected which URLs are problematic.

So, portion it’s invaluable information, it’s not easy actionable. You request to observe which types of pages were near out.

What if you usage descriptive sitemap names that bespeak the sections of your website – 1 for categories, products, articles, etc.?

Image from author, February 2025

Then, we tin drill down to spot that 7,000 of the 9,000 non-indexed URLs are class pages – and intelligibly cognize wherever to absorption attention.

This tin besides beryllium done wrong a sitemap scale file.

Now, I cognize some Google and Schema.org amusement examples encouraging numbered naming. So, you whitethorn person ended up successful a /sitemap-products-index.xml record with thing similar this:

  • /products-1.xml
  • /products-2.xml

Which is not the astir insightful naming convention. What if we interruption it down into genitor categories? For example:

  • /products-mens.xml
  • /products-womens.xml
  • /products-kids.xml

And if your website is multilingual, beryllium definite to leverage connection arsenic an further separation layer.

Such astute structuring of sitemaps to radical by leafage benignant allows you to dive into the information much efficiently and isolate indexing issues.

Just remember, for this to efficaciously work, sitemaps request to beryllium mutually exclusive, with each URL existing successful lone 1 sitemap. The objection is the Google News sitemap.

4. Strategize Sitemap Size

As mentioned before, hunt engines enforce a bounds of 50,000 URLs per sitemap file.

Some SEO specialists intentionally trim this number, accidental to 10,000. This tin beryllium adjuvant to velocity up indexing.

However, you tin lone download 1,000 URLs successful GSC. So, if 2,000 URLs successful a definite sitemap are not indexed, you tin lone entree fractional of them. If you are trying to bash contented cleanup, this volition not beryllium enough.

Image from author, February 2025

To summation afloat visibility connected each URLs causing issues, interruption sitemaps down into groups of 1,000.

The evident downside is that this has a higher setup clip arsenic each URLs request to beryllium submitted successful Google Search Console and Bing Webmaster Tools. This whitethorn besides necessitate precocious levels of ongoing management.

XML Sitemap Best Practice Checklist

Do put clip to:

✓ Dynamically make XML sitemaps.

✓ Compress sitemap files.

✓ Use a sitemap scale file.

✓ Include the <loc> & <lastmod> tags.

✓ Use representation tags successful existing sitemaps.

✓ Use video and Google News sitemaps if relevant.

✓ Reference sitemap URLs successful robots.txt.

✓ Submit sitemaps to some Google Search Console and Bing Webmaster Tools.

✓ Include lone SEO-relevant pages successful XML sitemaps.

✓ Ensure URLs are included lone successful a azygous sitemap.

✓ Ensure the sitemap codification is error-free.

✓ Group URLs successful descriptively named sitemaps based connected leafage type.

✓ Strategize however to interruption down ample sitemap files.

✓ Use Google Search Console and Bing Webmaster Tools to analyse indexing rates.

Now, spell cheque your ain sitemaps and marque definite you’re doing it right.

More Resources:


Featured Image: BEST-BACKGROUNDS/Shutterstock