Home
Softono
vertex-ai-mlops

vertex-ai-mlops

Open source Apache-2.0 Jupyter Notebook
699
Stars
313
Forks
17
Issues
24
Watchers
2 weeks
Last Commit

About vertex-ai-mlops

![tracker](https://us-central1-vertex-ai-mlops-369716.cloudfunctions.net/pixel-tracking?path=statmike%2Fvertex-ai-mlops&file=readme.md) <!--- header table ---> <table> <tr> <td style="text-align: center"> <a href="https://github.com/statmike/vertex-ai-mlops/blob/main/readme.md"> <img width="32px" src="https://www.svgrepo.com/download/217753/github.svg" alt="GitHub logo"> <br>View on<br>GitHub </a> </td> </tr> <tr> <td style="text-align: right"> <b>Share On: </b> <a href="https://www.linkedin.com/sharing/share-offsite/?url=https://github.com/statmike/vertex-ai-mlops/blob/main/readme.md"><img src="https://upload.wikimedia.org/wikipedia/commons/8/81/LinkedIn_icon.svg" alt="Linkedin Logo" width="20px"></a> <a href="https://reddit.com/submit?url=https://github.com/statmike/vertex-ai-mlops/blob/main/readme.md"><img src="https://redditinc.com/hubfs/Reddit%20Inc/Brand/Reddit_Logo.png" alt="Reddit Logo" width="20px"></a> <a href="https://bsky.app/intent/compose?text=ht ...

Platforms

Web Self-hosted Cloud

Languages

Jupyter Notebook

Links

tracker

GitHub logo
View on
GitHub
Share On: Linkedin Logo Reddit Logo BlueSky Logo X (Twitter) Logo
Connect With Author On: Linkedin Logo GitHub Logo YouTube Logo BlueSky Logo X (Twitter) Logo
Download icon Download File (right-click and "Save As")



MLOps on GCP

Google Cloud's AI platform has evolved through several names — Cloud ML Engine, then AI Platform, then Vertex AI, and as of April 2026, Gemini Enterprise Agent Platform. Each renaming reflects an expansion in scope: from custom model training, to a unified ML platform with AutoML and managed notebooks, to the current vision that brings together model building, agent development, orchestration, and governance in a single platform. This repository has tracked that evolution, and its recent content — AI agents with ADK, agent deployment on Agent Engine (formerly Vertex AI Agent Builder), and agentic workflows — reflects exactly where the platform is heading.

While the AI platform is the anchor, this repository reaches across the broader Google Cloud ecosystem. You'll find workflows that integrate BigQuery and BigQuery ML for in-database analytics and AI functions, Dataflow (Apache Beam) and Dataproc (Managed Apache Spark) for batch and streaming inference, Cloud Composer (Airflow) for orchestration, and Cloud Run and GKE for model serving. Database services — Spanner, AlloyDB, Cloud SQL, Memorystore, Firestore, and Bigtable — appear throughout as feature stores, vector search backends, and SQL-based inference endpoints. The goal is to show how these services work together in real ML and AI workflows, not just in isolation.

A comprehensive collection of 470+ interactive notebooks covering custom ML, generative AI, and agent development on Google Cloud — from training and serving to pipelines, feature stores, and production deployment. Each notebook is a hands-on workflow you can learn from, adapt, and use as a starting point for your own projects.


MLOps — 74 notebooks

End-to-end machine learning operations on Vertex AI: everything between training a model and running it reliably in production.

  • Serving (32 notebooks) — Online endpoints (dedicated, shared, private with PSC), batch inference (Vertex AI, Dataflow, Dataproc, Airflow), SQL-based inference (BigQuery ML, AlloyDB, Spanner), multi-platform deployment (Cloud Run, GKE, Cloud Functions), Triton Inference Server, and vLLM for LLM serving
  • Feature Store (21 notebooks) — Vertex AI managed feature store and a 15-notebook deep dive into building a self-managed Bigtable feature store covering serialization, sync patterns, history, schema evolution, vector search, replication, and a recommendation engine capstone
  • Pipelines (13 notebooks) — Vertex AI Pipelines with KFP: components, I/O, control flow, scheduling, notifications, management, testing, and reusable modular patterns
  • Model Evaluation (3 notebooks) — Binary, multi-class, and multi-label classification evaluation with Vertex AI Model Registry
  • Model Monitoring (2 notebooks) — Feature skew and drift detection with BigQuery ML and Vertex AI
  • Experiment Tracking (1 notebook) — Logging parameters, metrics, and artifacts with Vertex AI Experiments

data+ai — 40 notebooks

Machine learning and AI capabilities across Google Cloud's data services — bring inference to where the data lives.

  • BigQuery AI Functions (32 notebooks) — 21 individual function guides covering all 20 built-in AI functions (AI.GENERATE, AI.EMBED, AI.FORECAST, AI.CLASSIFY, and more), plus 9 end-to-end workflows for RAG pipelines, semantic search, document intelligence, content moderation, and time series analysis
  • Dataflow (3 notebooks) — Streaming and batch ML inference with Apache Beam's RunInference API, including model hot-swap patterns and a GPU inference benchmarking study
  • Dataproc (4 notebooks) — Spark ML inference on managed Dataproc: serverless fundamentals, batch inference with Pandas UDFs, Structured Streaming, and Vertex AI Endpoint integration
  • Composer — Orchestrating batch ML inference with Airflow across Dataproc, Dataflow, KFP, and Vertex AI
  • AlloyDB & Spanner — SQL-based ML inference using ML.PREDICT() to call Vertex AI Endpoints
  • Tabular Data (1 notebook) — Efficient BigQuery read patterns for ML workflows with benchmarks and cost analysis

Applied GenAI — 53 notebooks

Practical generative AI workflows that go beyond simple prompting — grounding LLMs with your data through retrieval, ranking, and evaluation.

  • Generate (10 notebooks) — Google Gen AI SDK, Gemini API, Imagen, Veo, token management, long context retrieval, and multimodal prompting
  • Retrieval (11 notebooks) — Vector search across 11 Google Cloud databases (BigQuery, Vertex AI Vector Search, Feature Store, Spanner, AlloyDB, Cloud SQL, Memorystore, Firestore, Bigtable, and more) with cost and latency comparisons
  • Embeddings (7 notebooks) — Text, image, and multimodal embeddings with visualization, similarity math, and hierarchical classification
  • Chunking (3 notebooks) — Document processing with Document AI Layout Parser and PyMuPDF for structure-aware chunking
  • Ranking (1 notebook) — Cross-encoder re-ranking with the Vertex AI Ranking API
  • Evaluation (2 notebooks) — GenAI evaluation metrics and prompt optimization using LLM-as-judge approaches
  • Validate (1 notebook) — Grounding verification with the Vertex AI Check Grounding API
  • Solutions (8 notebooks) — Production-ready multi-format parsing with hybrid search (dense + sparse embeddings, BM25, RRF reranking)
  • RAG Engine — Vertex AI RAG Engine workflows with custom backends and hybrid search

Applied Forecasting — 20 notebooks

A complete learning path for time series forecasting on Google Cloud, from SQL-based methods to foundation models, all using NYC Citibike public data.

  • BigQuery ML — Univariate ARIMA+, multivariate ARIMA+ XREG, regression-based forecasting, and handling granularity/missing data
  • Vertex AI AutoML — Console (no-code), Python client, simultaneous training, and Tabular Workflows
  • Advanced Models — Seq2Seq+, Temporal Fusion Transformer, Time series Dense Encoder
  • Custom Models — Prophet in-notebook and with custom containers on Vertex AI
  • Foundation Models — TimesFM and TimesFM 2.5 for zero-shot forecasting
  • MLOps — Pipelines for BQML ARIMA+, Prophet, and forecasting tournaments with KFP; online serving with Vertex AI Prediction Endpoints

Framework Workflows — 34 notebooks

End-to-end ML workflows for specific frameworks, all training on the same BigQuery data source and sharing a common Vertex AI Model Registry and prediction endpoint.

  • PyTorch (20 notebooks) — Training autoencoders plus 19 serving notebooks covering Vertex AI Endpoints, BigQuery ML (ONNX), AlloyDB, Spanner, Dataflow, TorchServe, and scale testing
  • Keras (3 notebooks) — Multi-backend Keras with JAX and TensorFlow backends, autoencoder training, and BigQuery serving
  • CatBoost (3 notebooks) — Gradient boosted trees with custom FastAPI containers and Feature Store integration
  • Vertex AI AutoML (6 notebooks) — Tabular classification, architecture review, TabNet, Wide and Deep, and feature engineering
  • Flax (1 notebook) — Neural networks with JAX using Flax
  • R (1 notebook) — R on Vertex AI Pipelines with KFP container components

Applied ML — 26 notebooks

Applied machine learning patterns for real-world problems: agents, solution prototypes, forecasting, and optimization.

  • AI Agents (8 projects) — Data onboarding agents, conversational analytics, BigQuery context discovery, travel planning with A2A protocol, image-to-graph conversion, and Vertex AI Agent Engine deployment — built with ADK, MCP, and MCP Toolbox
  • Solution Prototypes — End-to-end solutions for document processing (8 notebooks: extraction, embeddings, anomaly detection, agent deployment), product hierarchy classification (9 approaches across 3 paradigms), and time series chat with interactive forecasting
  • Forecasting — Hierarchical forecasting with BigQuery ML
  • Optimization — Vertex AI Vizier for black-box optimization with multi-objective support and safety thresholds

Additional Content

This repository also contains framework-specific workflow series for TensorFlow (28 notebooks), BigQuery ML (26 notebooks), scikit-learn (12 notebooks), R (12 notebooks), and AutoML (6 notebooks), along with topics like Tips, Working With Document AI, and several other Applied topics. For a full description of all content, see readme-legacy.md.


Approach

These notebooks are designed to be readable, adaptable starting points — not production-hardened code or ad-hoc exploration, but the sweet spot between the two. The heavy lifting is done by Google Cloud services; notebooks orchestrate rather than compute, so most run on minimal machine sizes. Each notebook is self-contained with narrative, code, and visual explanations in one portable file.


More Resources