Free · No account needed

Free keyword clustering tool

Upload a CSV export or paste raw text. Then adjust the settings and hit "Generate clusters" to see your topic map.

Add your data: Upload CSV or Paste text

Adjust settings: Similarity threshold (0.7), Minimum cluster size, Maximum cluster size, Labels on chart, Vectorization mode

Upload a CSV or paste some text above, then click "Generate clusters" to see your topic map here. Click a cluster to zoom in.

Workflow

How to cluster keywords from a CSV or pasted list

Upload a spreadsheet or paste lines directly. Lexical mode runs locally in your browser; Semantic mode sends text to Gemini for embeddings. Nothing is stored unless you choose to share.

Step 1
Upload a CSV from your crawler or CMS, or paste one title, keyword, or URL per line.
Step 2
Pick Lexical to group items that share the same words, or Semantic when similar intent uses different wording.
Step 3
Adjust similarity, cluster size limits, and optional AI labels, then run Generate clusters.
Step 4
Swap visualizations, export CSV, or publish a sanitized share link for your team.

Controls

Settings available in the keyword clustering tool

Fine-tune grouping tightness, cluster sizes, and how results appear on screen.

Similarity threshold

Controls how tight each keyword group is: higher values require closer matches. Applies in Lexical mode and in Semantic mode when Custom threshold is selected.

Min / max cluster size

Prevents tiny noise clusters or oversized catch-all buckets.
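As a rough illustration of what size limits do, here is a minimal Python sketch. The tool does not document how it treats clusters outside the limits, so the policy below is an assumption: undersized clusters are dissolved into an "unclustered" list, and oversized ones are split into consecutive chunks. The function name `apply_size_limits` is hypothetical.

```python
from typing import List, Tuple


def apply_size_limits(
    clusters: List[List[str]], min_size: int, max_size: int
) -> Tuple[List[List[str]], List[str]]:
    """Return (kept_clusters, unclustered_items).

    Assumed policy (not confirmed by the tool): clusters below min_size are
    dissolved into an 'unclustered' list; clusters above max_size are split
    into consecutive chunks of at most max_size items.
    """
    if min_size < 1 or max_size < min_size:
        raise ValueError("need 1 <= min_size <= max_size")
    kept: List[List[str]] = []
    unclustered: List[str] = []
    for cluster in clusters:
        if len(cluster) < min_size:
            unclustered.extend(cluster)
            continue
        for start in range(0, len(cluster), max_size):
            chunk = cluster[start:start + max_size]
            # A trailing chunk can fall below min_size; treat it as noise too.
            if len(chunk) < min_size:
                unclustered.extend(chunk)
            else:
                kept.append(chunk)
    return kept, unclustered
```

With a minimum of 2 and a maximum of 3, a one-item cluster would land in the unclustered list and a five-item cluster would become a group of three and a group of two.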

Lexical vs Semantic

Lexical runs TF-IDF locally; Semantic sends text to Gemini to compute embeddings. In either mode, nothing is stored unless you share.

Show cluster labels

Overlays group names on charts for screenshots and decks.

K-means & AI labels

In Semantic mode, pick a target cluster count and let the model name each group.

Visualization

Treemap, force tree, tidy tree, icicle, circle packing, or 3D — same clusters, different lens.

JavaScript required

The clustering and AI generation features run in the browser. Enable JavaScript to use the tool. Shared cluster pages are server-rendered and viewable without JavaScript.

Keyword clustering that respects confidentiality

Ideal when procurement blocks SaaS uploads but leadership still expects polished charts. Pair this flow with AI generation when you need net-new angles.

From URLs, titles & keywords

Cluster the content you already have

Paste in a list of titles and keywords, or export a CSV from your CMS or a crawler. Lexical TF-IDF clustering runs in your browser to group pages into topics; no data leaves your device.

Open this mode →

From scratch (with AI)

Describe your niche, get a full cluster map

No content to upload? No problem. Tell the AI your main topic, industry, and goals, and it generates a complete set of pillar pages and supporting content ideas in seconds.

Open this mode →

From internal links

Visualize your existing link structure

Export internal link data from Screaming Frog and drop it in. See every page as a node, every link as an edge, sized, colored, and weighted by link type and authority.
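To make the node-and-edge idea concrete, here is a minimal Python sketch that turns a link export into a graph. It assumes the CSV has `Source` and `Destination` columns (pass other names if your export differs), and it uses the count of unique inlinks as a stand-in for node size. How the tool actually weights link type and authority is not documented here, so that part is omitted. The function name `build_link_graph` is hypothetical.

```python
import csv
import io
from collections import Counter
from typing import Dict, List, Tuple


def build_link_graph(
    csv_text: str, source_col: str = "Source", target_col: str = "Destination"
) -> Tuple[Dict[str, int], List[Tuple[str, str]]]:
    """Return (nodes, edges): nodes maps URL -> unique inlink count."""
    reader = csv.DictReader(io.StringIO(csv_text))
    edges = set()
    for row in reader:
        src = (row.get(source_col) or "").strip()
        dst = (row.get(target_col) or "").strip()
        # Skip incomplete rows and self-links; the set drops duplicate links.
        if src and dst and src != dst:
            edges.add((src, dst))
    inlinks = Counter(dst for _, dst in edges)
    nodes = {url: inlinks.get(url, 0) for edge in edges for url in edge}
    return nodes, sorted(edges)
```

Each key in `nodes` is a page and each tuple in `edges` is a directed link, which is the shape most graph-drawing libraries expect.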

Open this mode →

From a sitemap

Turn any sitemap into a cluster map

Enter a sitemap URL or paste raw XML. The tool fetches, parses, and clusters all your pages in one step, no file export needed.
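The parsing step can be sketched in a few lines of Python, assuming a standard sitemaps.org `urlset` document that has already been fetched as text. The helper names `sitemap_urls` and `slug_text` are hypothetical, and turning URL paths into words is only one plausible way to get text for clustering.

```python
import xml.etree.ElementTree as ET
from typing import List
from urllib.parse import urlparse

# Namespace used by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def sitemap_urls(xml_text: str) -> List[str]:
    """Extract every <loc> value from sitemap XML."""
    root = ET.fromstring(xml_text)
    return [loc.text.strip() for loc in root.iter(f"{SITEMAP_NS}loc") if loc.text]


def slug_text(url: str) -> str:
    """Turn a URL path into lowercase words, e.g. for similarity scoring."""
    path = urlparse(url).path
    words = path.replace("-", " ").replace("_", " ").replace("/", " ")
    return " ".join(words.split()).lower()
```

A sitemap index file also uses `<loc>` elements, so for an index this sketch would return child sitemap URLs rather than page URLs.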

Open this mode →

Six chart styles for spreadsheet skeptics

Treemaps communicate weightings; trees expose overlap between buckets — toggle freely without recomputing Lexical scores.

Treemap

See cluster sizes at a glance. The nested rectangles make it immediately obvious which topic groups dominate your site.

Force-directed tree

Explore how pages connect to each other. Drag nodes and zoom in to trace relationships across the full content graph.

Tidy tree

A clean hierarchical view that shows the depth and branching of every cluster, ideal for presenting site architecture.

Icicle chart

Space-efficient stacked bars that let you click into any cluster and zoom through the hierarchy level by level.

Circle packing

Circles within circles. A visually rich layout that makes nested cluster relationships intuitive to read at a glance.

3D knowledge graph

An immersive three-dimensional network where nodes float in space and edges flow between them. Rotate, zoom, and orbit to navigate clusters from any angle.

Practical safeguards for enterprise keyword lists

Ship stakeholder-ready visuals without sacrificing compliance: Lexical clustering stays client-side until you opt into sharing.

Keyword clustering without handing data to a third party

Lexical TF-IDF vectors are computed inside a dedicated worker thread on your machine. Your spreadsheets stay offline unless you publish a sanitized snapshot.

Flexible inputs from CMS or crawler exports

Mix Title, Meta Description, URL, Keywords, or Unique Inlinks columns — only one textual column is required to seed similarity scoring.

Share cluster outlines without leaking URLs

Generate a time-bound share URL so teammates review grouping logic without exposing proprietary titles.

Frequently asked questions

Keyword uploads, CSV semantics, and collaboration guardrails.

What CSV columns do you accept?

Title, Meta Description, URL, Keywords, and Unique Inlinks may appear in any combination. Provide at least one text-bearing column.
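For readers preparing a file, this Python sketch shows one way the column rule could work: any mix of the accepted columns is read, and at least one text-bearing column must be present. Treating Unique Inlinks as numeric rather than text, and joining the text columns with spaces, are assumptions. The function name `rows_to_documents` is hypothetical.

```python
import csv
import io
from typing import List

# Columns assumed to carry text; Unique Inlinks is treated as numeric.
TEXT_COLUMNS = ("Title", "Meta Description", "URL", "Keywords")


def rows_to_documents(csv_text: str) -> List[str]:
    """Combine the text-bearing columns of each row into one string."""
    reader = csv.DictReader(io.StringIO(csv_text))
    present = [c for c in TEXT_COLUMNS if c in (reader.fieldnames or [])]
    if not present:
        raise ValueError("CSV needs at least one of: " + ", ".join(TEXT_COLUMNS))
    docs: List[str] = []
    for row in reader:
        parts = [(row.get(c) or "").strip() for c in present]
        text = " ".join(p for p in parts if p)
        if text:  # rows with no usable text are skipped
            docs.append(text)
    return docs
```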

How large can my keyword list be?

Lists of roughly ten thousand rows remain responsive because clustering runs in a worker thread. Larger lists still work but take longer to process.

What is the difference between Lexical and Semantic mode?

Lexical builds sparse TF-IDF vectors over text derived from the titles, keywords, meta descriptions, and URL paths in your CSV or pasted text. Items land in the same cluster when they literally share important words or stems, which works well for messy spreadsheets, overlapping product names, or URLs that encode topics in slugs.

Semantic sends combined text snippets to Gemini embeddings so similarity reflects meaning, not spelling. Different wording about the same intent can still merge.

Start with Lexical when you want deterministic, offline-friendly grouping; switch to Semantic when synonyms and paraphrases split the Lexical map too finely.
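The Lexical side of this comparison can be sketched in plain Python. This is an illustration of TF-IDF with cosine similarity in general, not the tool's actual code: the tokenizer, the smoothed IDF formula, and the function names are assumptions, and stemming is left out for brevity.

```python
import math
import re
from collections import Counter
from typing import Dict, List


def tokenize(text: str) -> List[str]:
    """Lowercase and split on anything that is not a letter or digit."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs: List[str]) -> List[Dict[str, float]]:
    """Build L2-normalized sparse TF-IDF vectors, one dict per document."""
    token_lists = [tokenize(d) for d in docs]
    doc_freq = Counter(t for tokens in token_lists for t in set(tokens))
    n = len(docs)
    vectors = []
    for tokens in token_lists:
        counts = Counter(tokens)
        # Smoothed IDF: rare terms weigh more, common terms weigh less.
        vec = {
            t: c * (math.log((1 + n) / (1 + doc_freq[t])) + 1.0)
            for t, c in counts.items()
        }
        norm = math.sqrt(sum(w * w for w in vec.values()))
        vectors.append({t: w / norm for t, w in vec.items()} if norm else {})
    return vectors


def cosine(a: Dict[str, float], b: Dict[str, float]) -> float:
    """Cosine similarity of two already-normalized sparse vectors."""
    if len(a) > len(b):
        a, b = b, a
    return sum(w * b.get(t, 0.0) for t, w in a.items())
```

In this sketch, "trail running shoes" and "running shoes for trails" score above zero because they share "running" and "shoes", while either of them scores exactly zero against "email marketing tips". Without stemming, "trail" and "trails" count as different words, which is the kind of split Semantic mode is meant to smooth over.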

What does K-means do in Semantic mode?

K-means partitions embedding vectors into groups automatically. Roughly speaking, cluster count scales with how many documents you have versus your minimum cluster size — raising minimum size yields fewer, broader themes without tuning a cosine cutoff by hand. Use it when you want the algorithm to infer group boundaries while embeddings stay fixed for your dataset.
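A minimal sketch of this idea in Python follows. The exact rule the tool uses to derive the cluster count is not stated, so `target_cluster_count` is a hypothetical heuristic (documents divided by minimum cluster size), and `kmeans` is a plain textbook Lloyd's iteration with seeded random initialization rather than the tool's implementation.

```python
import random
from typing import List, Sequence

Vector = Sequence[float]


def target_cluster_count(n_docs: int, min_cluster_size: int) -> int:
    """Hypothetical heuristic: larger minimum sizes give fewer clusters."""
    if n_docs < 1 or min_cluster_size < 1:
        raise ValueError("n_docs and min_cluster_size must be positive")
    return max(1, n_docs // min_cluster_size)


def _sq_dist(a: Vector, b: Vector) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points: List[Vector], k: int, iterations: int = 20, seed: int = 0) -> List[int]:
    """Return a cluster label (0..k-1) for each point."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: _sq_dist(p, centroids[c])) for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # an emptied cluster keeps its previous centroid
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    return labels
```

Under this heuristic, 100 documents with a minimum cluster size of 5 would target 20 clusters, and raising the minimum to 10 would halve that to 10.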

What does Custom threshold do in Semantic mode?

Custom threshold skips automatic K-means and instead merges pages only when their embedding cosine similarity meets your cutoff — you effectively dial how aggressively clusters fuse. Higher values demand tighter semantic matches (fewer merges); lower values join looser neighborhoods. Around 0.95 cosine similarity is a sensible starting point with Gemini embeddings before you widen or tighten by hand.
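The effect of the cutoff can be illustrated with a short Python sketch. It assumes single-linkage merging (any pair at or above the cutoff joins two groups), which is one plausible reading of the description above rather than a confirmed detail. The function names are hypothetical, and the toy two-dimensional vectors stand in for real embeddings.

```python
import math
from typing import Dict, List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def threshold_clusters(vectors: List[Sequence[float]], cutoff: float) -> List[List[int]]:
    """Group item indices whose similarity chains meet the cutoff."""
    parent = list(range(len(vectors)))

    def find(i: int) -> int:
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Pairwise comparison: fine for a sketch, quadratic in the list size.
    for i in range(len(vectors)):
        for j in range(i + 1, len(vectors)):
            if cosine_similarity(vectors[i], vectors[j]) >= cutoff:
                parent[find(j)] = find(i)

    groups: Dict[int, List[int]] = {}
    for i in range(len(vectors)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())
```

With the vectors `(1.0, 0.0)`, `(0.99, 0.1)`, `(0.0, 1.0)`, and `(0.7, 0.7)`, a cutoff of 0.95 merges only the first two, while a cutoff of 0.7 chains all four into a single cluster. That is the "looser neighborhoods" behavior in miniature.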
