Content
Duplicate Content
Duplicate content refers to blocks of content that appear at multiple URLs — either within the same site (internal duplication) or across different websites (external duplication). While Google typically handles duplicate content by selecting a canonical version to index, excessive duplication can dilute link equity, confuse ranking signals, and in extreme cases trigger manual penalties.
Duplicate content is identified using Screaming Frog's "Near Duplicates" report, Siteliner for internal duplication analysis, and SEMrush or Ahrefs site audits. Canonical tags, 301 redirects, and the Google Search Console Coverage report are the primary tools for resolving and monitoring duplicate content issues.
Authority Links
Related Terms
Technical SEO
Canonical Tag
An HTML `<link rel="canonical">` element in the page head that tells search engines which URL is the authoritative version when duplicate or near-duplicate content exists across multiple URLs.
Content
Thin Content
Pages with insufficient unique value relative to user expectations — often very short, templated, auto-generated, scraped, or thinly differentiated from existing pages — and a primary target of Google's Helpful Content System.
Content
Keyword Cannibalization
When two or more pages on the same site compete for the same target keyword, causing search engines to split ranking signals across the duplicates instead of consolidating them on one strong page.
Content
Meta Keywords
Refer to keywords in the source code that give information to the search engines about the page.

