All Courses
Advanced18 Chapters

GenAI Data Pipelines

Master data pipeline engineering for GenAI applications using hosted LLM APIs (OpenAI, Anthropic, Gemini, DeepSeek, LLaMA via Groq/Together), modern retrieval patterns (Contextual Retrieval, Agentic RAG, LazyGraphRAG, ColBERT reranking), and production deployment on GKE Autopilot. Covers VLM-based document ingestion, NeMo Curator data cleaning, context engineering, multimodal embeddings, AlloyDB vector operations, hybrid search with multi-layer caching, knowledge graph construction with LightRAG, evaluation-driven quality engineering (RAGAS, DeepEval), PII detection with Presidio and NeMo Guardrails, agentic pipeline orchestration with Dagster and MCP, data flywheels, and a full production capstone with Crossplane IaC and OTel observability. All labs run on Kubernetes (GKE).

Document ProcessingEmbeddingsVector StoreKnowledge GraphRAG

Learning Path

8 phases • 18 chapters
Phase 10/10 chapters

Foundations

Python essentials and development environment for agent development

0/506 quiz questions
0/180 labs

Tools & Topics

Virtual environments, async programming, type hints, Pydantic, error handling, testing, debugging, logging, project structure

Goals

  • Set up professional development environments
  • Write async Python code fluently
  • Use type hints and Pydantic for robust data handling
  • Implement error handling, testing, logging, and debugging

Chapters

1. Document Ingestion with VLMs
2. Data Cleaning & Quality Agents
3. Chunking & Contextual Retrieval
4. Context Engineering & LLM Enrichment
5. Multi-Format & Multimodal Processing
6. Embedding Model Selection & Benchmarking
7. Embedding Pipelines with Cost Controls
8. Vector Store Operations on AlloyDB
9. Hybrid Search, Reranking & Caching
10. Knowledge Graph Construction with LightRAG
Phase 20/7 chapters

LLM Fundamentals

Core LLM concepts: API clients, token economics, caching, and function calling basics

0/353 quiz questions
0/126 labs

Tools & Topics

LLM APIs, OpenAI/Anthropic/Gemini clients, prompt caching, token economics, function calling basics

Goals

  • Call multiple LLM providers (OpenAI, Anthropic, Gemini)
  • Implement prompt caching and token cost management
  • Build function calling and tool definitions
  • Understand token economics and cost optimization

Chapters

11. Entity Resolution & Linking
12. Agentic Graph-RAG Pipelines
13. Knowledge Graph Maintenance
14. Evaluation-Driven Quality Engineering
15. PII Detection, Guardrails & Compliance
16. Agentic Pipeline Orchestration
17. Data Flywheels & Continuous Improvement
Phase 30/1 chapters

Agent Fundamentals

Agent patterns: ReAct, planning, tool execution, sandboxing, web navigation, and MCP protocol

0/53 quiz questions
0/18 labs

Tools & Topics

ReAct loop, planning patterns, tool execution, sandboxing, web navigation, MCP servers, MCP clients, tool routing

Goals

  • Create agent loops with ReAct and planning patterns
  • Build and consume MCP servers for tool integration
  • Implement sandboxing and web navigation
  • Design structured outputs and prompts

Chapters

18. Production Capstone on GKE
Phase 40/0 chapters

Agent State & Memory

Memory systems, RAG patterns, context optimization, and LangGraph state machines

0/0 quiz questions
0/0 labs

Tools & Topics

Short-term memory, long-term memory (RAG), agentic RAG patterns, semantic memory, context optimization, state graphs, conditional edges, checkpointing, human-in-the-loop, streaming, subgraphs

Goals

  • Implement short-term and long-term memory
  • Build RAG and agentic RAG systems
  • Create state machines with LangGraph
  • Implement checkpointing, streaming, and human-in-the-loop

Chapters

Phase 50/0 chapters

Multi-Agent Systems

Multi-agent patterns, guardrails, evaluations, and observability

0/0 quiz questions
0/0 labs

Tools & Topics

Supervisor pattern, hierarchical pattern, reflector pattern, input guardrails, output guardrails, prompt injection defense, evaluations, benchmarking, tracing, observability

Goals

  • Implement supervisor, hierarchical, and reflector patterns
  • Build input and output guardrails
  • Defend against prompt injection attacks
  • Evaluate agents with benchmarks

Chapters

Phase 60/0 chapters

Production & Operations

Production deployment: APIs, containers, databases, scaling, CI/CD, and monitoring

0/0 quiz questions
0/0 labs

Tools & Topics

FastAPI, Docker, production databases, scaling, CI/CD, monitoring, alerting, model routing, fallbacks, system design

Goals

  • Serve agents via FastAPI with Docker
  • Deploy to Kubernetes with CI/CD
  • Monitor with Prometheus/Grafana
  • Build multi-tenant agent platforms

Chapters

Phase 70/0 chapters

Advanced Topics

Alternative frameworks, protocols, specialized agents, autonomous workflows, and cutting-edge capabilities

0/0 quiz questions
0/0 labs

Tools & Topics

CrewAI/AutoGen, A2A protocols, GraphRAG, local models, vision agents, voice agents, code agents, autonomous workflows, streaming data, agent swarms

Goals

  • Use alternative frameworks (CrewAI, AutoGen)
  • Implement A2A protocol for agent communication
  • Build GraphRAG for complex knowledge
  • Build vision, computer use, and voice agents

Chapters

Phase 80/0 chapters

Agent Production Excellence

Production excellence: trajectory evaluation, safety, cost control, enterprise patterns, and governance

0/0 quiz questions
0/0 labs

Tools & Topics

Agent trajectory evaluation, safety boundaries, cost control, enterprise agent patterns, load testing, versioning, fleet dashboards, autonomous agent governance

Goals

  • Score multi-step agent reasoning with LLM-as-judge pipelines
  • Build safety boundaries with permissions and kill switches
  • Implement per-agent cost budgets and cost-aware routing
  • Deploy enterprise agent patterns for document processing and code review

Chapters

© 2026 GenBodha. All rights reserved.