/Site Architecture
📘Concept⭐️ Pillar

Site Architecture

최종 업데이트:

Definition

Site architecture encompasses page hierarchy structure, URL design, and internal link patterns on a website. It defines how users and search engine crawlers find and navigate information across the site.

Good architecture is designed so Google discovers and indexes all important pages, efficiently transfers authority between pages, and users reach desired information quickly.


Summary

Site architecture core principles: ①All important pages within 3–4 clicks from home → ②Logical, predictable URL hierarchy → ③Pillar–Cluster internal link structure → ④XML sitemap + breadcrumb as dual safety net. When these four are satisfied, crawl budget waste and authority dispersion are naturally prevented.


4 Site Architecture Models

1. Flat Structure

All pages sit within 1–2 clicks from home. Suitable for small sites (50 pages or fewer).

Home
├── Page A
├── Page B
└── Page C

Pros: Simple, highest crawl efficiency
Cons: Hard to build topic authority as content grows

2. Silo Structure

Content is isolated by topic category (silo). Internal links occur only within each silo.

Home
├── SEO Silo
│   ├── Keyword Research
│   └── On-Page SEO
└── AEO Silo
    ├── What is AEO?
    └── AI Optimization

Pros: Topic authority concentration, clear topic clusters
Cons: Limited connection of related content across silos

3. Pyramid Structure

Home → category → subcategory → individual page hierarchy. Common for large content sites and ecommerce.

Home
└── Category (Level 2)
    └── Subcategory (Level 3)
        └── Individual post/product (Level 4)

Pros: Suitable for large content management, logical hierarchy
Cons: Crawl budget and authority transfer efficiency decrease with depth

4. Graph/Hub Structure

Mutual linking between Pillar pages and Cluster pages. Most recommended approach in current content marketing.

Pillar: Complete SEO Guide
├── Cluster: Keyword Research Methods
├── Cluster: On-Page Optimization
├── Cluster: Link Building
└── Cluster: Technical SEO
    (all link back to Pillar)

Pros: Topic authority building, AEO-friendly, internal link authority circulation
Cons: Complex design and maintenance


5 Principles of Good Site Architecture

Principle 1: Limit Depth to 3–4 Clicks

Any important page should be reachable within 3–4 clicks from the homepage. Pages deeper than 5 clicks are hard for Googlebot to reach and receive little authority transfer.

Diagnosis: Crawl with Screaming Frog and check the "Crawl Depth" report. If depth 5+ pages are important content, structural redesign is needed.

Principle 2: Logical URL Hierarchy

URLs should reflect page location in a predictable way.

✅ Good: /seo/keyword-research/long-tail-keywords
❌ Bad: /post?id=4829&cat=12&lang=ko

URLs alone should indicate location. See URL Slug for details.

Principle 3: Consistent Internal Link Structure

Internal links determine authority flow within the site. Pillar pages should receive links from all related Clusters, and Pillars should link down to Clusters.

[FLOW_DIAGRAM: Pillar → Cluster → Pillar internal link authority circulation]

See Internal Linking for details.

Principle 4: XML Sitemap + Breadcrumb

Sitemaps tell Googlebot "these pages are important." Breadcrumbs explicitly show page hierarchy to both users and bots.

See Breadcrumb for details.

Principle 5: Prevent Duplicates with Canonical

Use canonical tags to specify canonical URLs so URL parameters, sort options, and pagination do not create duplicate URLs.

See Canonical Tag and Pagination for details.


5 Steps to Design Site Architecture

Step 1: Topic Cluster Design

Derive core topics (Pillars) and supporting content (Clusters) the site will build authority on, based on keyword research. See Keyword Mapping for details.

Step 2: Decide URL Structure

Decide subdomain vs subdirectory, category inclusion, and URL depth. See Subdomain vs Subdirectory for details.

Step 3: Navigation Design

Design global navigation (main menu), local navigation (sidebar/related posts), and breadcrumbs.

Step 4: Internal Link Plan

Create a Pillar-Cluster matrix planning which pages should link to which.

Step 5: Set Crawl Priority

Limit areas not to crawl with robots.txt and prioritize important pages in the sitemap. See Crawl Budget for details.


Site Architecture and Topic Authority

Entity Authority Concentration

When Google evaluates a site’s authority on a topic, it looks at connectivity among pages on that topic. Well-designed Pillar-Cluster structure strongly signals to Google that "this site is an expert on this topic." See Entity SEO for details.

Site Architecture in AEO

When AI answer engines query a topic, they prioritize content from sites with high topic authority. Pillar-centric architecture increases AI citation potential. See Query Fan-Out for details.


Common Site Architecture Mistakes

Error 1: Orphan Pages

Pages with no internal links. Googlebot cannot reach them without a sitemap. Discover in regular crawl audits and add internal links.

Error 2: Excessive Hierarchy Depth

URL paths with 5+ levels (/a/b/c/d/e/page). Simplify category structure or remove intermediate levels.

Error 3: Duplicate Structure

Multiple URL paths for the same content. Common on WordPress sites where tags and categories both include the same post. Resolve with canonical.

Error 4: Missing Internal Links

Publishing content without linking to it from existing content. Publishing without an internal link plan accumulates orphan pages.


Korea Market Application

Architecture Considerations in Korean CMS Platforms

WordPress: Category, tag, and archive pages auto-generate and easily create duplicate structure. noindex archive pages or apply canonical in Yoast/Rank Math.

Cafe24/Godo Mall: Ecommerce CMS filter parameters (?color=red&size=M) mass-generate duplicate URLs. Set clear URL parameter handling policy.

Imweb: Default structure is relatively simple on small sites, but hreflang handling is needed for multilingual settings.

Naver vs Google Architecture Strategy

Naver tends to evaluate individual posts directly rather than whole domains, so for Naver-centric sites individual post optimization matters more. For Google-centric sites, whole-site authority and architecture are core.


Frequently Asked Questions

Q. Will changing site architecture drop rankings?
A. Large URL structure changes can cause short-term ranking fluctuations. Always set 301 redirects, update XML sitemap, and change all internal link URLs to new URLs. Changing architecture without sufficient preparation causes serious traffic loss.

Q. How many pages before I need to think about architecture?
A. Above 50 pages, structure should be clearly designed. Sites with 10 pages or fewer are most efficient with flat structure. Once blogs exceed 100 posts, authority dispersion worsens without Pillar-Cluster structure.

Q. Can I use only tags without categories?
A. Categories create URL hierarchy for clear structure; tags mainly connect related content. If using only one, categories are more SEO-favorable. Tag archive pages often become thin content — noindex is recommended.

Q. Which is better for architecture, subdomain or subdirectory?
A. In most cases subdirectories (example.com/blog/) share domain authority and are more favorable for architecture. Subdomains (blog.example.com) are treated as separate sites and authority is split. See Subdomain vs Subdirectory for details.

Q. How should architecture be designed for single-page applications (SPA)?
A. SPAs render content with JavaScript, so Googlebot takes extra time to crawl. Use server-side rendering (SSR) or static generation (SSG) to improve crawlability; each page needs a unique URL. Manage URLs correctly with the HTML History API.


Related Sources

이 페이지를 참조하는 항목

관련 항목

📘Concept
Crawl Budget
Crawl budget is the number of pages Googlebot can and wants to crawl on your site within a given period — relevant for large sites where crawl allocation affects indexing speed and coverage.
📙How-to
Indexing Coverage Diagnosis
Indexing coverage diagnosis uses the GSC indexing report to check overall site indexing status, identify causes of unindexed pages, and fix them — a core SEO task.
📘Concept
Entity SEO: From Keywords to Concepts in Search
Entity SEO is an optimization strategy that helps Google recognize your site and content as real-world entities rather than isolated keywords, so you become a trusted presence in AI-based search and the Knowledge Graph.
📘ConceptPillar
GEO Master Guide: 5-Area Checklist
An execution guide for Generative AI Optimization covering GEO's five areas: content, structure, technical, off-site, and measurement.
📘ConceptPillar
What Is AEO?
AEO is the practice of optimizing content so AI answer engines cite it.
📙How-to
Keyword Mapping
Keyword mapping is a strategic document that assigns one primary keyword and related secondary keywords to each URL, serving as a core tool for preventing cannibalization and discovering content gaps.
📘Concept
Breadcrumb
A breadcrumb is a navigation element that hierarchically displays a page's location within a site. Through BreadcrumbList JSON-LD schema, it appears as a structured path in the SERP.
📘ConceptPillar
Canonical Tag
A canonical tag is an HTML meta tag that tells search engines 'this URL is the representative version' when duplicate or similar content exists across multiple URLs. It resolves duplicate content problems and concentrates PageRank on the canonical URL—a core on-page SEO tool.
📘ConceptPillar
Internal Linking Strategy
Internal linking strategy is the practice of semantically connecting pages within your own site to optimize topic authority and bot and user navigation.
📘Concept
Pagination
Pagination is a technique for splitting long content or product listings across multiple pages. Since rel=prev/next was deprecated in 2019, it is now managed through canonical tags, infinite scroll, and load more approaches.
📘ConceptPillar
URL Slug
A URL slug is the string forming the path after the domain in a web page address. SEO best practices include keyword inclusion, hyphen use, lowercase, and short, clear structure.
📘Concept
301 Redirect
A 301 redirect is an HTTP status code that tells browsers and search engines a URL has permanently moved. It transfers PageRank and backlink authority from the old URL to the new one, enabling URL structure changes without SEO loss — a core technical SEO tool.
📘ConceptPillar
Core Web Vitals
Core Web Vitals are the three core user experience metrics defined by Google.
📘Concept
Crawl Depth
Crawl depth (click depth) is the number of clicks required to reach a page from the homepage. It is a core site structure metric that determines page discovery priority for search engine and AI bots and PageRank transfer efficiency.
📘ConceptPillar
Crawlability
Crawlability is the ability of search engine and AI bots to access website pages and read content. It is the most basic condition for SEO and AEO, a required step that precedes indexing and ranking.
📘Concept
Crawling vs Indexing
Crawling is the process where search engine bots follow links across the web and collect pages. Indexing is the process of analyzing collected pages and storing them in a search database. These are the first two stages of SEO’s three stages: crawling → indexing → ranking.
📘ConceptPillar
Mobile-First Indexing
Mobile-first indexing is Google’s system for crawling, indexing, and ranking based on a site’s mobile version. With full rollout completed in 2024, it is now the default premise of SEO.
📘ConceptPillar
Rendering
Rendering is the process of processing HTML, CSS, and JavaScript to produce the final screen seen by users and bots. The choice among CSR, SSR, SSG, and ISR determines SEO and AEO feasibility.
📙How-to
How to Allow AI Bots in robots.txt
Allowing AI bots means explicitly permitting major AI crawlers such as GPTBot, ClaudeBot, and PerplexityBot to access your site in robots.txt, exposing your content for citation in generative AI answers.
📙How-to
Sitemap (XML Sitemap)
An XML sitemap is an XML file listing a website’s URLs along with last-modified dates, update frequency, and priority information. It helps search engine bots understand site structure and improves crawling efficiency and indexing speed as a technical SEO foundation tool.
📘ConceptPillar
Subdomain vs Subdirectory
Subdomain (subdomain.example.com) and subdirectory (example.com/path/) are two URL structure choices for content. Subdirectories share domain authority and are advantageous in most SEO cases.

이런 항목도 있어요

이 페이지가 도움이 됐나요?