
Deploying GPT & Large Language Models
This repository contains code for the O'Reilly Live Online Training for Deploying GPT & LLMs
This course is designed to equip software engineers, data scientists, and machine learning professionals with the skills and knowledge needed to deploy AI models effectively in production environments. As AI continues to revolutionize industries, the ability to deploy, manage, and optimize AI applications at scale is becoming increasingly crucial. This course covers the full spectrum of deployment considerations, from leveraging cutting-edge tools like Kubernetes, llama.cpp, and GGUF, to mastering cost management, compute optimization, and model quantization.
Base Notebooks
Introduction to LLMs and Prompting
Cleaning Data and Monitoring Drift
Evaluating Agents
LangGraph and Agents
| Notebook |
Description |
| From Prompts to Workflows |
Why single LLM prompts break on multi-step tasks and how LangGraph provides structure, state, and control flow |
| LangGraph Basics |
Foundational LangGraph primitives: StateGraph, nodes, edges, conditional routing, memory, and visualization |
| Tools and ReAct Agents |
Tool integration and the ReAct pattern with LangChain, manual StateGraph, and MCP |
Advanced Deployment Techniques
More
Fine-Tuning LLMs
Prompt Engineering
Instructor
Sinan Ozdemir Sinan is a former lecturer of Data Science at Johns Hopkins University and the author of multiple textbooks on data science and machine learning. Additionally, he is the founder of the recently acquired Kylie.ai, an enterprise-grade conversational AI platform with RPA capabilities. He holds a master's degree in Pure Mathematics from Johns Hopkins University and is based in San Francisco, CA.