/URL Parameters
📘Concept

URL Parameters

최종 업데이트:

Definition

URL parameters are query strings starting with ? as key=value pairs after a URL. Multiple parameters are joined with &.

https://example.com/products?category=shoes&sort=price&color=black

From an SEO perspective, URL parameters become problematic because they expose identical or very similar content across countless URL variants, causing duplicate content and crawl budget waste. See Duplicate Content for details.


Summary

URL parameter SEO essentials: ①Parameter variant URLs showing the same content are recognized as duplicate content → ②Crawl budget exhaustion and PageRank dilution → ③Safest fix: canonical tag pointing to canonical URL → ④Normalize tracking parameters (UTM) on-site → ⑤Monitor excessive parameter URL crawling in GSC 'Crawl stats' report.


4 Types of URL Parameters

1. Filtering Parameters

Parameters for filtering products by category, color, size, or price range in ecommerce.

Example: /products?category=shoes&color=black&size=260

The same product list appears across thousands of filter combination URLs. This is the most common parameter problem in SEO.

2. Sorting Parameters

Parameters for sorting products or results.

Example: /products?sort=price_asc, /products?sort=newest, /products?sort=popular

Only content order differs; the product list is the same.

3. Pagination Parameters

Page numbers for paginated lists.

Example: /blog?page=2, /products?offset=20

Pagination parameters are not full duplicates because each page has unique content, but handling requires care.

4. Tracking Parameters (UTM, Click ID)

Parameters for marketing performance tracking. SEO-irrelevant but create duplicate URLs when crawled by bots.

Example: /blog/post?utm_source=newsletter&utm_medium=email&utm_campaign=may2026


URL Parameter Handling Methods

[COMPARISON_TABLE: URL parameter handling method comparison]

Method 1: Canonical Tag (Recommended)

Specify canonical URL from parameter variant URLs. Maintains user experience (filter/sort features) while concentrating PageRank on the canonical URL.

<!-- On /products?sort=price page -->
<link rel="canonical" href="https://example.com/products/" />

Safest method satisfying both user experience and SEO. See Canonical Tag for details.

Method 2: robots.txt Disallow

Completely block parameter URLs. Saves crawl budget but users can still access (only crawling is blocked). Cannot be used when parameter URLs need noindex directives read.

User-agent: *
Disallow: /*?sort=
Disallow: /*?color=

See robots.txt and AI Bots for details.

Method 3: noindex Tag

Insert <meta name="robots" content="noindex"/> on parameter URLs. Crawling occurs but indexing is blocked. Less crawl budget savings than robots.txt Disallow but allows precise control.

Method 4: Remove URL Parameters (Clean URLs)

Handle filter state with clean URLs without parameters on the server side.

Example: /products/shoes/black/ (path instead of parameters)

Cleanest long term but higher development cost.


Special Handling for Tracking Parameters (UTM)

Google Analytics 4 automatically recognizes and processes UTM parameters (utm_source, utm_medium, utm_campaign, etc.). From an SEO perspective, UTM parameter URLs should not be crawled by Google.

Handling methods:

  • Canonical tag from UTM URL to canonical URL
  • Remove UTM parameters from URL with JavaScript History API (history.replaceState)
  • Handle UTM parameter redirects in Next.js next.config.js

Korea Market Application

Parameter Structure in Korean Ecommerce

Korean ecommerce (Coupang, Gmarket, Lotte ON, etc.) widely uses URL parameters for product filters. Parameter URL handling is one of the highest-priority technical tasks in ecommerce SEO for self-operated stores.

Common parameter structures:

  • Category filter: ?cat=1001&subcat=2001
  • Brand filter: ?brand=samsung&brand=lg
  • Price range: ?price_min=10000&price_max=50000
  • Sort: ?order=price_asc

Naver Shopping Integration

When connecting product feeds to Naver Shopping, register the canonical URL — not parameter-included URLs — as the product URL. Parameter URLs exposed on Naver may be treated as duplicate products.

Google Search Console Parameter Monitoring

Check the GSC "Crawl stats" report for excessive parameter URL crawling. If parameter URL crawl ratio exceeds 30% of total crawls, crawl budget handling is needed. See Crawl Budget for details.


Frequently Asked Questions

Q. If parameter URLs are indexed by Google, is ranking disadvantaged?
A. Not necessarily disadvantaged, but PageRank and backlink signals that should concentrate on the canonical URL are diluted, lowering overall SEO efficiency. On large ecommerce sites with hundreds of thousands of parameter URLs, crawl budget exhaustion and PageRank dilution become serious.

Q. Is it realistic to set canonical on every parameter URL?
A. Rule-based automatic handling at CMS or framework level is standard. For example, apply site-wide logic: "All parameter URLs starting with ?sort= or ?filter= automatically canonicalize to the base URL without parameters."

Q. If parameter URLs are blocked in robots.txt, can users still access those URLs?
A. Yes. robots.txt blocks only bot crawling; user browser access is unaffected. Filter and sort features therefore work normally for users while bot crawling is blocked.

Q. What should I do if session IDs are included in URLs?
A. Session IDs in URLs (?session_id=abc123) create unique URLs per user and extremely exhaust crawl budget. Switch session management to cookie-based. Short-term fixes include canonical tags or robots.txt Disallow, but the fundamental fix is server-side session handling.

Q. UTM parameter URLs are indexed by Google. How should I handle this?
A. Set canonical to the canonical URL on indexed UTM URLs; Google updates the index on next crawl. Additionally use GSC "URL removal" as a temporary measure. Going forward, apply robots.txt or canonical site-wide so UTM parameter URLs are not crawled.


Related Sources

  • Google Search Central (2024). URL parameters. Google Search Console Help.
  • Moz (2024). URL Parameters: An SEO Best Practice Guide. Moz Blog.
  • Ahrefs (2024). URL Parameters and SEO: How to Handle Them. Ahrefs Blog.

이 페이지를 참조하는 항목

관련 항목

📘Concept
Crawl Budget
Crawl budget is the number of pages Googlebot can and wants to crawl on your site within a given period — relevant for large sites where crawl allocation affects indexing speed and coverage.
📘Concept
Google Search Console
Google Search Console (GSC) is a free tool from Google for monitoring site search performance, diagnosing indexing issues, and submitting sitemaps — the essential foundation for SEO measurement.
📘ConceptPillar
What Is AEO?
AEO is the practice of optimizing content so AI answer engines cite it.
📘ConceptPillar
Duplicate Content
Duplicate content is a state where identical or very similar content exists on multiple URLs, causing authority dilution and indexing confusion—a common technical SEO problem.
📘ConceptPillar
Canonical Tag
A canonical tag is an HTML meta tag that tells search engines 'this URL is the representative version' when duplicate or similar content exists across multiple URLs. It resolves duplicate content problems and concentrates PageRank on the canonical URL—a core on-page SEO tool.
📘Concept
Noindex
noindex is an on-page crawl control directive that tells search engine bots not to include a page in search results via robots meta tags or HTTP headers. It excludes pages that do not need or should not appear in search from the index, saving crawl budget and improving site quality signals.
📘Concept
Crawling vs Indexing
Crawling is the process where search engine bots follow links across the web and collect pages. Indexing is the process of analyzing collected pages and storing them in a search database. These are the first two stages of SEO’s three stages: crawling → indexing → ranking.
📙How-to
How to Allow AI Bots in robots.txt
Allowing AI bots means explicitly permitting major AI crawlers such as GPTBot, ClaudeBot, and PerplexityBot to access your site in robots.txt, exposing your content for citation in generative AI answers.
📘ConceptPillar
Site Architecture
Site architecture is the overall design of page hierarchy, URL structure, and internal linking on a website. It simultaneously determines crawl efficiency, indexing quality, and user navigation experience — a foundational SEO element.

이런 항목도 있어요

이 페이지가 도움이 됐나요?