ML Engineer · Building AI Agents that think, plan, and execute.
USC MS CS '25 · Ex-Cadence · Ex-Persistent
name: Deep Vivek Sheth
location: San Francisco, CA
education: MS Computer Science @ USC (2025)
current_role: Building AI systems that reason, plan, and act
interests:
- AI Agents & Multi-Step Reasoning
- GPU-Accelerated Computing (CUDA)
- Distributed Systems & Database Internals
- Full-Stack Product Development
fun_fact: I optimize everything, from code to travel routes to coffee brewing time
- Multi-Agent Systems: Coordinating multiple AI agents for complex tasks
- vLLM & Inference Optimization: Making LLMs run fast and cheap
- Advanced RAG Patterns: HyDE, RAPTOR, and Agentic RAG
Odyssey: AI-Powered Travel Planner with Zero Backtracking
GPT-4 Agents · Google Maps API · RAG · Next.js
"Never drive back and forth again." A multi-step AI agent that plans geographically optimized day trips using real-time distance calculations and review synthesis.
Odyssey: AI Travel Agent
Full-stack AI planner that generates optimized itineraries using GPT-4 agents, Google Places API, and geospatial analytics. Built with FastAPI + Next.js.
Highlights:
- Multi-step agent workflows using the ReAct pattern
- Real-time Distance Matrix integration for route optimization
- Review synthesis for "vibe-aware" recommendations
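
As an illustration of the "zero backtracking" idea, here is a minimal sketch of ordering stops from a pairwise travel-time matrix with a greedy nearest-neighbor heuristic. This is an assumption about one way such ordering could work, not Odyssey's actual implementation; the names `order_stops`, `route_length`, and `SAMPLE` are hypothetical, and the matrix values are made up rather than fetched from a real distance service.

```python
from typing import List, Sequence


def order_stops(dist: Sequence[Sequence[float]], start: int = 0) -> List[int]:
    """Order stops greedily: from each stop, go to the closest unvisited one.

    `dist[i][j]` is the travel cost from stop i to stop j. In a real planner
    this matrix would come from a distance service; here it is plain data.
    """
    n = len(dist)
    if any(len(row) != n for row in dist):
        raise ValueError("distance matrix must be square")
    if not 0 <= start < n:
        raise ValueError("start index out of range")

    route = [start]
    unvisited = set(range(n)) - {start}
    while unvisited:
        here = route[-1]
        # Ties are broken by the lower stop index so results are deterministic.
        nxt = min(unvisited, key=lambda j: (dist[here][j], j))
        route.append(nxt)
        unvisited.remove(nxt)
    return route


def route_length(dist: Sequence[Sequence[float]], route: Sequence[int]) -> float:
    """Total cost of visiting the stops in the given order (no return leg)."""
    return sum(dist[a][b] for a, b in zip(route, route[1:]))


# Hypothetical travel times in minutes between four stops.
SAMPLE = [
    [0, 12, 30, 25],
    [12, 0, 20, 9],
    [30, 20, 0, 14],
    [25, 9, 14, 0],
]

if __name__ == "__main__":
    route = order_stops(SAMPLE)
    print(route, route_length(SAMPLE, route))
```

On this sample the greedy order is `[0, 1, 3, 2]` at 35 minutes, against 46 minutes for visiting the stops in their listed order. Nearest-neighbor is a heuristic and does not guarantee the shortest route; it is shown only because it is short and easy to reason about.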
| Project | Description | Tech | Status |
|---|---|---|---|
| Automated_SRE | Agentic AI SRE with RAG + Gemini | Python, FastAPI, ChromaDB | |
| LogParser_LLM | 33,000x cost reduction log parsing | Python, Gemini, Prefix Tree | |
| unity-companion | AI Patient Results Assistant | Next.js, FastAPI, RAG | |
| Odyssey | AI-powered travel planner | TypeScript, FastAPI, GPT-4 | |
| Parallel PageRank | 300x GPU speedup | C++, CUDA, MPI | |
| ADS Buffer Manager | PostgreSQL buffer strategies | C, Spinlocks | |
| RAG-Search | RAG with FastAPI & Next.js | Python, LangChain | |
| LLM Teaching Assistant | Agentic course assistant | Python, OpenAI | |
| ChatGPT from Scratch | Built GPT from scratch | PyTorch, Jupyter | |
| Stock Prediction | News sentiment analysis | Python, LSTM | |
| LeetCode | 300+ solved problems | DSA | |
- MS Computer Science, University of Southern California (2025)
- MLE Intern @ Cadence: Built GPU-accelerated pipelines & RAG systems
- Published Research: ML-based intrusion detection (IEEE)
- 10M+ Logs Fine-tuned: Production LLM optimization
- LinkedIn
- deepsheth3@gmail.com
- Portfolio: Coming Soon
"Ship fast. Measure everything. Iterate."


