LLM applications
OpenAI, Anthropic, and open-source models. Tool use, structured outputs, streaming UI, retries that don't loop forever.
RAG + vector search
Chunking strategy, hybrid search, rerankers, citations. I've debugged bad recall before.
Multi-agent orchestration
Planner/executor patterns, human-in-the-loop, traces you can actually read.
Evals + guardrails
Regression suites for non-deterministic output. Cost, latency, and quality tracked the same way you track p99.
Multimodal (vision, voice, video)
Document OCR, image generation pipelines, voice cloning, real-time transcription.
Cost + performance tuning
Caching, model routing, batching. The same answer for a quarter of the price.