Mistake #1: Including Videos from YouTube, Vimeo, and Other Platforms in Your Sitemap
Videos are excellent for attracting visitors and boosting your website’s ranking in search results. However, your sitemap should only include videos hosted on your own server or CDN. A common error is attempting to include externally hosted videos (e.g., from YouTube or Vimeo) in your XML sitemap.
What happens if you make this mistake?
Google will completely ignore these videos in your sitemap. You'll lose valuable additional traffic and the opportunity to attract new customers through video search.
How to avoid this mistake:
Use Schema.org markup (VideoObject) directly on your webpage for externally hosted videos:
See the Pen Schema.org by LoginovNet (@LoginovNet) on CodePen.
For videos hosted on your own server:
<video:video>
<!-- Your video details here -->
</video:video>
Mistake #2: Irrelevant Images in the Sitemap
Many website owners try to include every image from their pages in the sitemap, which is incorrect. Only images clearly representing the main content of the page should be included.
- The product itself;
- Packaging and accessories;
- Product in real-life settings.
- Manufacturer logos;
- Comparison tables;
- Technical diagrams.
Why is this important?
Irrelevant images increase bounce rates. High bounce rates decrease trust in your website, negatively affect user signals, and ultimately result in lower Google rankings.
Correct XML example:
See the Pen Untitled by LoginovNet (@LoginovNet) on CodePen.
Mistake #3: Exceeding Sitemap Size and URL Limits
An XML sitemap can contain up to 50,000 URLs and must not exceed 50 MB. Exceeding these limits results in Google ignoring some pages.
- Important pages won't be indexed;
- Traffic loss and reduced SEO performance.
- Split your sitemap into multiple files;
- Use a sitemap index:
See the Pen Untitled by LoginovNet (@LoginovNet) on CodePen.
Include the sitemap in robots.txt:
Sitemap: https://yoursite.com/sitemap.xml
Compress large sitemaps using gzip:
sitemap.xml.gz
Mistake #4: Mismatch Between Canonical URLs in Sitemap and Website
Canonical URLs are the primary addresses of a page defined via <link rel="canonical">. Your sitemap should precisely match these canonical URLs.
What is a canonical URL?
A canonical URL is the main address for a webpage explicitly indicated in HTML to avoid duplicate content and consolidate SEO value.
- Pages are considered duplicates;
- SEO value spreads incorrectly, causing rankings to drop.
Proper usage:
On the webpage:
<link rel="canonical" href="https://yoursite.com/product" />
In the sitemap:
See the Pen Untitled by LoginovNet (@LoginovNet) on CodePen.
Mistake #5: Neglecting Regular Sitemap Updates
Creating a sitemap once and neglecting regular updates leads to outdated or broken links.
- Loss of trust from Google;
- Decline in current rankings and traffic.
- Update your sitemap regularly (at least monthly);
- Use CMS plugins like Yoast SEO or Rank Math.
Mistake #6: Conflicts Between robots.txt and Sitemap.xml
If a page is blocked in robots.txt, it shouldn't be listed in the sitemap.
- Google receives conflicting instructions, reducing site trust;
- Indexing issues and loss of important indexed pages.
Example mistake:
robots.txt:
User-agent: *
Disallow: /admin/
Incorrectly in sitemap:
See the Pen Untitled by LoginovNet (@LoginovNet) on CodePen.
Final Recommendations
- Speed up indexing of relevant pages;
- Clearly communicate site structure to Google;
- Strengthen your website's trust and authority;
- Generate additional traffic from image and video searches;
- Avoid duplicate content and other critical SEO mistakes.
At our agency, we pay close attention to every detail. We offer comprehensive support for your website, including thorough audits, technical optimization, and strategic SEO consulting.