KRISHNA   VAMSI
Initializing neural world
Available for new opportunities · Dallas, TX

I build intelligent agentic systems.

> _

AI Engineer at Microsoft, shipping Generative AI, LLM, and multi-agent systems across cloud platforms. Ex Toyota, KeyBank, Google — Google Developer Expert (ML).

0
Years
in AI/ML
0%
Fewer LLM
hallucinations
0×
Faster team
adoption
0.0
GPA
MS Analytics
Scroll
LinkedIn Email
Dallas, TX → 40.7128°N
01 About

An engineer who turns
research into production.

I'm Krishna Vamsi — an AI Engineer specializing in Generative AI, Agentic AI, and large-scale ML systems. Currently at Microsoft, designing production-grade LLM and multi-agent workflows on Azure Foundry.

Over the last six years I've shipped AI across Microsoft, Toyota North America, KeyBank, and Google — orchestrating RAG pipelines, fine-tuning LLMs with LoRA/QLoRA, and standing up observable, governed AI platforms used by non-AI teams.

I'm a recognized Google Developer Expert (ML), and I hold a Master's in Business Analytics from Texas A&M University, Commerce — graduated with a 4.0 GPA.

Generative AI Agentic AI MCP RAG Azure Foundry AWS Bedrock LangGraph Fine-tuning
KV
Krishna Vamsi
AI Engineer · Microsoft
Online
  • LocationDallas, Texas
  • FocusGen AI · Agentic · MCP
  • StackPython · Azure · AWS · GCP
  • EducationMS, Texas A&M · 4.0
  • RecognitionGoogle Developer Expert
~/kv/profile.json
{
  "role": "AI Engineer",
  "company": "Microsoft",
  "building": [
    "multi-agent systems",
    "RAG pipelines",
    "MCP-standardized A2A"
  ],
  "open_to": "new opportunities"
}
02 Experience

A timeline of shipped AI.

Dec 2025 — Present Dallas, TX

AI Engineer

Microsoft

Building production multi-agent systems on Azure Foundry with the Microsoft Agent Framework — MCP-standardized A2A handoffs between Claude and Azure OpenAI agents.

  • Orchestrated multi-agent workflows with Microsoft Agent Framework, Azure Functions for stateful coordination and Cosmos DB vector search.
  • Integrated Azure SRE Agent (Preview) — KQL queries, dynamic thresholds, incident automation across AKS/App Service with human-in-loop approvals.
  • RAG pipelines with Cosmos DB vector search cut triage hallucinations by 95%.
  • Privacy guardrails via Azure Content Safety + Purview; PII redaction pipelines for compliant inference.
  • Standardized agent integration SDKs — lifted internal team adoption .
Microsoft Agent Framework Azure Foundry MCP · A2A Cosmos DB Vector Azure SRE GitHub Actions
Feb 2025 — Dec 2025 Plano, TX

AI / Machine Learning Engineer

Toyota North America

Stood up Gemini Enterprise + GPT-4/Claude 3 powered agents for real-world business workflows behind production SLAs.

  • Orchestrated autonomous agents with LangChain & LangGraph — prompt chains, RAG, multi-model LLM calls.
  • Modular FastAPI backend services with IAM, encryption and PII redaction aligned to enterprise governance.
  • Pinecone / Qdrant vector stores + semantic retrieval to sharpen context-aware agents.
  • Serverless inference (AWS Lambda, Bedrock) + CloudWatch SLO/SLI monitoring.
  • CloudFormation IaC versioned model + service rollouts with rollback.
LangGraphGeminiFastAPI AWS BedrockPineconeDocker
Jan 2024 — Jan 2025 Dallas, TX

ML Engineer

KeyBank

Drove AI-powered customer support and analytics — boosted efficiency and adoption across client-facing surfaces.

  • NLP (classifiers, NER, NLG) + GenAI modules — full lifecycle from build to retrain.
  • Multi-channel chatbots — Slack, WhatsApp, Facebook Messenger.
  • Google Dataflow ETL + real-time ingestion for training/inference at scale.
  • LangChain agents, RAG with LlamaIndex + vector search, AWS Guardrails.
  • Fine-tuned LLMs on domain data; delivered Vertex AI NLP + CV + predictive analytics.
LangChainLlamaIndexVertex AI AWS GuardrailsGCP Dataflow
Jul 2019 — Aug 2022 Hyderabad, India

Data Scientist

.efficiently

Computer vision, predictive models and time-series forecasting — shipped classifiers and OCR pipelines to production.

  • Image enhancement + SIFT feature extraction; VGG-based classifiers with continuous retraining.
  • LoRA / QLoRA fine-tuning — cut compute cost while keeping accuracy.
  • S3-backed training data pipeline, DevOps CI/CD for ML services.
  • Production monitoring + root-cause analysis for deployed models on AWS.
LoRA / QLoRAVGGOCR · SIFT AWS S3Python
May 2019 — Jul 2019 India

Machine Learning Intern

Google

Contributed to Search ranking + YouTube recommendation systems inside Google's internal research infrastructure.

  • TensorFlow neural nets on Google's internal infra; Python + SQL data pipelines.
  • A/B testing + dashboards to measure model lift and inference speed.
  • Cross-functional debugging of ML pipeline components.
TensorFlowA/B TestingRanking
03 Skills

A constellation of what I ship with.

Generative AI

Llama IndexRAGGemini Prompt Eng.LLM Fine-tuning GitHub CopilotSourcegraph Cody

Agentic AI

LangChainLangGraphLangSmith AutoGenAssistants API MS Agent FrameworkMCP · A2A

Languages & Libraries

PythonJavaC#SAS PandasNumPyScikit-Learn KerasTensorFlowHugging Face FastAPI

Cloud

AWSAzureGCP AWS BedrockSageMaker LambdaStep Functions CloudFormationOpenSearch Vertex AIBigQuery DataflowAzure OpenAI

NLP

RNNLSTM TransformersBERT · GPT spaCyNLTK TextBlobStanford NLP GensimASR

ML Ops & Techniques

LoRA / QLoRAModel Quantization Neural NetworksTime Series A/B TestingHypothesis Testing EDAPredictive Analytics STT · TTS
Certifications
Oracle Cloud Infra 2025 — Generative AI Professional Model Context Protocol — Anthropic Academy Accreditation — Generative AI Data Science Essentials with Python Machine Learning SQL Advanced Building RAG Apps Building AI Agents with MongoDB Oracle Cloud Infra 2025 — Generative AI Professional Model Context Protocol — Anthropic Academy Accreditation — Generative AI Data Science Essentials with Python Machine Learning SQL Advanced Building RAG Apps Building AI Agents with MongoDB
04 Impact

Selected wins & recognition.

GDE

Google Developer Expert — ML

Recognized by Google for exemplary expertise in ML and open-source. Part of a global community of ~1,000 experts worldwide.

Jun 2024 — Present
95%

Fewer hallucinations

Cosmos DB vector search + RAG on Microsoft Agent Framework — platform triage dropped hallucinated outputs by 95%.

3×

Adoption lift

Standardized agent integration SDKs — internal teams onboarded AI features 3× faster.

4.0

Texas A&M GPA

MS in Business Analytics — Texas A&M University, Commerce, Dec 2023.

10%

Tumor detection accuracy

GDSC AI hackathon — CNN-based tumor detection; improved accuracy through iterative testing.

4

Enterprises shipped for

Microsoft · Toyota · KeyBank · Google — production AI across finance, auto and Big Tech.

05 Contact

Let's build
something intelligent.

Always interested in Gen AI / Agentic AI roles, research collaborations, and speaking.