/Canonical Tag
📘Concept⭐️ Pillar

Canonical Tag

최종 업데이트:

Definition

A canonical tag is a meta tag inserted in the HTML <head> in the form <link rel="canonical" href="URL"/>. When identical or very similar content exists across multiple URLs, it instructs search engines to "treat this URL as the representative version."

If duplicate content is left unmanaged, search engines struggle to decide which URL to index among multiple URLs, and PageRank dispersed across URLs is not consolidated, harming rankings. Canonical tags prevent this dispersion and concentrate SEO signals on one URL—a core on-page tool.


Summary

Canonical essentials: ①<link rel="canonical" href="canonicalURL"/> → insert in <head> → ②Consolidate duplicate URLs such as www/non-www, HTTP/HTTPS, URL parameters, mobile versions → ③PageRank concentrates on canonical URL for ranking advantage → ④Insert self-referencing canonical on every page. Google treats canonical as a hint, so consistent trust signals matter.


5 Situations Requiring Canonical Tags

1. URL Parameter Duplicates

When parameter combinations like /products?sort=price, /products?sort=name, /products?color=blue display identical content. Specify /products canonical on all parameter variant URLs. See URL Parameters for details.

2. www vs non-www, HTTP vs HTTPS

When http://example.com, https://example.com, http://www.example.com, and https://www.example.com return identical content. Specify HTTPS version as canonical URL and use 301 redirects in parallel.

3. Pagination

Pagination URLs like /blog, /blog?page=2, /blog?page=3. Each page is independent content, so use self-referencing canonical, with /blog representing the first page. See Pagination for details.

4. Separate Mobile/Desktop URLs

Legacy structure with separate m.example.com and example.com. Specify desktop URL as canonical on mobile URL and add <link rel="alternate" media="only screen and (max-width: 640px)" href="mobileURL"/> on desktop URL. See Mobile-First Indexing for details.

5. Content Syndication

When contributing to or providing content to other sites. Requesting syndicated copies on external sites to specify original site URL as canonical protects original SEO signals.


Canonical Tag Implementation Methods

HTML Implementation

<head>
  <link rel="canonical" href="https://example.com/products/" />
</head>

Insert in every page's <head>. Use absolute URLs (including https://). Relative URLs risk implementation errors and are not recommended.

HTTP Header Implementation

For non-HTML resources like PDF:

Link: <https://example.com/document.pdf>; rel="canonical"

Self-Referencing Canonical

Insert canonical pointing to the canonical URL itself. Protects against Google recognizing unexpected URL variants (automatic parameter addition, etc.).

<!-- on https://example.com/products/ page -->
<link rel="canonical" href="https://example.com/products/" />

Canonical vs Other Duplicate Handling Tools

[COMPARISON_TABLE: canonical vs 301 redirect vs noindex vs robots.txt comparison]

canonical tag

  • Effect: Specifies canonical URL, concentrates PageRank
  • Original URL maintained: ✅ (accessible)
  • Google handling: Hint — may be ignored
  • Suitable situation: Duplicate content consolidation, URL parameters

301 redirect

  • Effect: Permanent move, PageRank transfer
  • Original URL maintained: ❌ (automatic redirect)
  • Google handling: Directive — always followed
  • Suitable situation: Permanent URL change, domain migration

noindex

  • Effect: Excludes from indexing
  • Original URL maintained: ✅ (crawling allowed)
  • Google handling: Directive — almost always followed
  • Suitable situation: Excluding pages not needing indexing

See 301 Redirect, noindex for details.


5 Common Canonical Mistakes

Mistake 1: Using Relative URLs

Use absolute URLs <link rel="canonical" href="https://example.com/products/"/> instead of <link rel="canonical" href="/products/"/>.

Mistake 2: Canonical Chain

Chain structure where A → B → C. Google may not follow intermediate steps. Both A and B must point directly to C.

Mistake 3: Incorrect Canonical Settings

CMS default settings pointing all page canonicals to homepage (/). Modify so each page points to its own URL.

Mistake 4: Canonical and noindex Applied Together

Applying canonical and noindex on the same page causes conflict. Use noindex only on pages you do not want indexed; use canonical only on pages where you want to specify canonical URL.

Mistake 5: Setting All Pagination to First Page Canonical

Setting /blog?page=2 canonical to /blog causes Google to ignore page 2's unique content. Self-referencing canonical on each pagination page is recommended.


Application in the Korean Market

Korean/English Multilingual Sites

When operating Korean versions of global services, use hreflang and canonical together. Korean pages specify Korean URL as canonical; English pages specify English URL as canonical. Cross-reference language-specific canonical URLs with hreflang tags. See hreflang for details.

Canonical and Naver SEO

Naver Search does not support rel="canonical" as strongly as Google. To fully resolve duplicate content on Naver, maintaining only one canonical URL with 301 redirects is more reliable. See Naver SEO for details.

Implementation by CMS

WordPress widely used in Korea supports automatic canonical insertion via Yoast SEO or RankMath plugins. Next.js sets via alternates.canonical in generateMetadata().


Frequently Asked Questions

Q. Does Google ignore canonical tags in some cases?
A. Yes. Google treats canonical as a "hint" and may choose a different URL as canonical based on its own judgment. When other signals (internal links, sitemap, 301 redirects) conflict with canonical, Google tends to ignore canonical. Maintaining all signals consistently pointing to the canonical URL is important.

Q. Can I specify an external domain URL as canonical?
A. Yes. In content syndication, copies published on external sites pointing to original site URL as canonical is a pattern Google recognizes. However, cross-domain canonical for abuse purposes may not be trusted by Google.

Q. Should I prioritize canonical or 301 redirect?
A. Use 301 redirect to permanently change or remove URLs. Use canonical to normalize duplicate content while keeping old URLs. For example, use canonical for URL parameter variants kept for user experience; use 301 redirect for domain migration and URL structure changes.

Q. How do I verify canonical effectiveness?
A. Google Search Console's "URL Inspection Tool" shows the canonical URL Google recognizes when you enter a specific URL. If "Google-selected canonical" matches your specified canonical, it is working correctly. If mismatched, check conflicts with other signals (internal links, sitemap).

Q. What happens if a site has no canonical?
A. Google selects canonical URL based on its own judgment. Unexpected URLs may be selected, dispersing PageRank and wasting crawl budget. Sites with many URL parameters or CMS generating multiple URLs may suffer degraded indexing quality without explicit canonical.


Sources

  • Google Search Central (2024). Consolidate duplicate URLs with canonical tags. Google Developers.
  • Moz (2024). Canonicalization: What It Is and How to Use It. Moz Blog.
  • John Mueller, Google (2023). Canonical tag best practices. Google Search Central Blog.

이 페이지를 참조하는 항목

관련 항목

📘Concept
Crawl Budget
Crawl budget is the number of pages Googlebot can and wants to crawl on your site within a given period — relevant for large sites where crawl allocation affects indexing speed and coverage.
📙How-to
Indexing Coverage Diagnosis
Indexing coverage diagnosis uses the GSC indexing report to check overall site indexing status, identify causes of unindexed pages, and fix them — a core SEO task.
📘ConceptPillar
What Is AEO?
AEO is the practice of optimizing content so AI answer engines cite it.
📘ConceptPillar
Duplicate Content
Duplicate content is a state where identical or very similar content exists on multiple URLs, causing authority dilution and indexing confusion—a common technical SEO problem.
📘Concept
Noindex
noindex is an on-page crawl control directive that tells search engine bots not to include a page in search results via robots meta tags or HTTP headers. It excludes pages that do not need or should not appear in search from the index, saving crawl budget and improving site quality signals.
📘Concept
Pagination
Pagination is a technique for splitting long content or product listings across multiple pages. Since rel=prev/next was deprecated in 2019, it is now managed through canonical tags, infinite scroll, and load more approaches.
📘Concept
301 Redirect
A 301 redirect is an HTTP status code that tells browsers and search engines a URL has permanently moved. It transfers PageRank and backlink authority from the old URL to the new one, enabling URL structure changes without SEO loss — a core technical SEO tool.
📘Concept
Crawling vs Indexing
Crawling is the process where search engine bots follow links across the web and collect pages. Indexing is the process of analyzing collected pages and storing them in a search database. These are the first two stages of SEO’s three stages: crawling → indexing → ranking.
📘ConceptPillar
hreflang Tag
hreflang is an HTML attribute that tells Google about multilingual and multi-regional versions of the same content, showing the correct language and regional page to appropriate users and preventing duplicate content signals.
📘ConceptPillar
Mobile-First Indexing
Mobile-first indexing is Google’s system for crawling, indexing, and ranking based on a site’s mobile version. With full rollout completed in 2024, it is now the default premise of SEO.
📙How-to
How to Allow AI Bots in robots.txt
Allowing AI bots means explicitly permitting major AI crawlers such as GPTBot, ClaudeBot, and PerplexityBot to access your site in robots.txt, exposing your content for citation in generative AI answers.
📘ConceptPillar
Site Architecture
Site architecture is the overall design of page hierarchy, URL structure, and internal linking on a website. It simultaneously determines crawl efficiency, indexing quality, and user navigation experience — a foundational SEO element.
📘Concept
URL Parameters
URL parameters are query strings appended to URLs in ?key=value form. They expose the same content across many URL variants, causing duplicate content problems and crawl budget waste — a major technical SEO management target.

이런 항목도 있어요

이 페이지가 도움이 됐나요?