Home
Softono
seo-keyword-clusters

seo-keyword-clusters

Open source Jupyter Notebook
56
Stars
9
Forks
0
Issues
3
Watchers
6 months
Last Commit

About seo-keyword-clusters

An application using Machine Learning to automatically create clusters of queries from Google Search Console.

Platforms

Web Self-hosted

Languages

Jupyter Notebook

Links

⭐ SEO Keyword Clusters (Legacy Project)

⚠️ This repository is archived and represents an early prototype of what eventually became SEOCluster.ai. It is preserved for educational and historical purposes only.


πŸš€ Try the New Production Platform: SEOCluster.ai

A modern AI-powered SaaS for keyword clustering, content briefs, and automated landing pages.

πŸ‘‰ https://seocluster.ai

Highlights of the new platform:

  • ⚑ FastAPI backend + Next.js frontend
  • πŸ”₯ SentenceTransformer embeddings (no more classic KMeans)
  • 🌍 Local-intent detection across 325k+ global locations
  • πŸ“ AI-generated content briefs
  • 🧱 Landing page generator
  • πŸ” Firebase authentication
  • πŸ’³ Stripe billing (Free, Pro, Agency tiers)
  • πŸ“Š Google Search Console OAuth integration
  • πŸš€ Cloud Run multi-worker deployment
  • 🧠 Smart caching & optimized UX

This legacy repo does not include these features β€” it represents the origins of the project.


πŸ“š About This Legacy Version

This codebase was originally created between 2021–2022 for a UWA Data Science Capstone project.

It uses:

  • Python + Flask
  • Pandas & scikit-learn
  • Traditional KMeans clustering
  • TF-IDF cluster labeling
  • Google Data Studio + Tableau for visualization
  • CSV export from Google Search Console
  • Basic SQLite storage

πŸ“Œ This version is not production-ready.


πŸ“¦ Repository Structure

.
β”œβ”€β”€ app.py                    # Legacy Flask app
β”œβ”€β”€ Keyword_Clustering.ipynb  # Main ML notebook
β”œβ”€β”€ Queries.csv               # Sample GSC query data
β”œβ”€β”€ static/                   # Static assets
β”œβ”€β”€ templates/                # Jinja2 templates
β”œβ”€β”€ keyword_clustering.sqlite # Example database
└── README.md

πŸ› License & Usage

This legacy version is open for learning and academic use only.

βœ”οΈ Allowed:

  • Personal study
  • Academic use
  • ML experimentation

❌ Not Allowed:

  • Commercial use
  • Using this code in SaaS products
  • Replicating SEOCluster.ai features
  • Redistributing modified versions for business use

For production use β†’ πŸ‘‰ https://seocluster.ai

πŸ“œ Project History

This repository represents the earliest foundation of SEOCluster.ai.

  • 2021 – Built as a Data Science ML project
  • 2022 – First UI deployed to Heroku
  • 2023–2024 – Rewritten using modern full-stack architecture
  • 2024–2025 – Became SEOCluster.ai, a full SaaS platform

The repository remains public because:

  • It already has ⭐ stars and forks
  • It helps others learn ML-based clustering
  • It documents the evolution of the project

⭐ Support the Journey

If you're interested in how this evolved into a real SaaS business, consider:

  • ⭐ Starring this repo
  • πŸš€ Trying SEOCluster.ai β†’ https://seocluster.ai
  • πŸ’¬ Connecting for collaboration

πŸ”— Useful Links