About Services Experience Projects Blog Contact Buy Me a Coffee Discuss a System
Rugved Chandekar — AI Systems Engineer from Chhatrapati Sambhajinagar, India
Open to engineering roles & projects
AI Systems Engineer

Rugved Chandekar

RAG Pipeline Architect Production ML on AWS IEEE Co-author

I design and deploy production-grade AI systems — RAG pipelines, agentic orchestration engines, and ML infrastructure — that automate complex workflows at scale. Shipped at Idyllic Services with ~99% token reduction and ~10× throughput improvement.

Currently building 
~99% token reduction On live agentic AI pipeline
~10× throughput Async orchestration at Idyllic
90% effort automated On a real, live production system

const engineer = {

focus: "AI Systems",

shipped: ["RAG", "Agents", "AWS"],

tokenReduction: "~99%",

throughput: "~10×",

research: "IEEE 2026"

};

// systems that scale ↗

Rugved Chandekar profile photo
Rugved Chandekar
AI Systems Engineer
Chhatrapati Sambhajinagar, India
Hackathon Winner LLM-based AI Resume Parser
IEEE Author — 2026 Bayesian Pneumonia Detection
CCAT — AIR 308 National CS Aptitude Exam

Rugved Chandekar —
I design systems that think.

I'm an AI Systems Engineer specializing in production RAG pipelines, agentic orchestration, and ML infrastructure on AWS. At Idyllic Services Pvt. Ltd, I architected a supervisor-led agentic AI system for JD-driven candidate sourcing — reducing LLM token costs by ~99% and improving throughput ~10× through async orchestration and targeted retrieval.

My work sits at the intersection of backend engineering and applied ML. I choose tools based on constraints, not familiarity — OpenSearch over Pinecone for operational control at AWS scale; XGBoost over deep learning when interpretability outweighs marginal accuracy gains; Bedrock over self-hosted LLMs when operational overhead matters more than per-token cost.

I think in pipelines: input → orchestration → retrieval → synthesis → output. I design for failure cases — rate limits, embedding drift, latency spikes — and I optimize for the constraints that matter in production.

What I architect
RAG & LLM pipelines — production-grade, AWS-native
Agentic AI orchestration & intelligent automation systems
Full-stack AI applications — from ML model to deployed API
AI Systems Engineer AWS Production Stack IEEE Co-author Remote-ready Production ML
0%
Token cost reduced
on live agentic AI pipeline
Throughput improvement
via async orchestration
0%
Manual effort automated
on live production system
0+
Students mentored
in algorithms & problem-solving

Systems I Design & Deploy

End-to-end engineering — from architecture decisions to production deployment. I own the system, not just the code.

Production-Grade

RAG & LLM Pipeline Engineering

Design and deploy production RAG systems — vector indexing, retrieval orchestration, LLM routing, and response synthesis. Built on AWS with OpenSearch and Bedrock.

  • Chunking strategy & embedding pipeline
  • OpenSearch / vector store integration
  • LLM orchestration & prompt engineering
  • AWS ECS Fargate deployment

Agentic AI & Workflow Automation

Architect supervisor-worker agentic systems that handle multi-step reasoning tasks end-to-end — with token-efficient orchestration and async execution. Proven: 90% effort automation on live systems.

  • Supervisor-led agent architecture
  • Token-efficient LLM orchestration
  • n8n / custom pipeline automation
  • 24/7 production uptime

Full-Stack AI Application

From ML model training to REST API design to deployed web application — I own the entire stack. Designed for real throughput requirements, not toy demos.

  • ML model training & evaluation
  • Flask REST API with auth & rate limiting
  • Database design & query optimization
  • Containerized cloud deployment

ML Model Deployment & API

Take an ML model from notebook to production API with monitoring, error handling, and scalable inference. Chose XGBoost over deep learning when interpretability and latency matter more than marginal accuracy.

  • Model serialization & versioning
  • REST inference API with latency SLAs
  • SHAP explainability layer
  • Docker & AWS deployment

AI & Systems Infrastructure

The production toolkit behind every system I ship.

🐍 Python
🧠 TensorFlow
Flask
🐳 Docker
🤖 LLMs / RAG
☁️ AWS
📊 Pandas
🗄️ SQL
🔬 XGBoost
🟡 JavaScript
🔌 WebSockets
🐙 Git / GitHub
🐍 Python
🧠 TensorFlow
Flask
🐳 Docker
🤖 LLMs / RAG
☁️ AWS
📊 Pandas
🗄️ SQL
🔬 XGBoost
🟡 JavaScript
🔌 WebSockets
🐙 Git / GitHub

Experience & Impact

Production systems built, scaled, and shipped.

Current Jan 2026 – Present

Associate Developer — AI Systems

Idyllic Services Pvt. Ltd

  • Architected a supervisor-led agentic AI system for JD-driven candidate sourcing — multi-step LLM orchestration with structured output parsing and retry logic
  • Reduced LLM token consumption by ~99% through pipeline redesign: replaced brute-force calls with targeted retrieval + focused, context-bounded prompts
  • Achieved ~10× throughput improvement via async orchestration — replaced sequential API calls with parallel execution and intelligent caching
Jun 2024 – Jun 2025

CP & DSA Lead

Hackslash Community

  • Conducted 5+ workshops, coding competitions & hackathons
  • Mentored 300+ students in DSA & problem-solving
  • Built structured curriculum for competitive programming
2022 – 2026

B.E. Information Technology

Govt. College of Engineering, Chhatrapati Sambhajinagar

  • CGPA: 7.27
  • Active in competitive programming & hackathons

Production Systems That Solve Real Problems

Each system is live, documented with architecture decisions, and built around real constraints — not portfolio demos.

AWS ECS · OpenSearch · Bedrock RAG

ResuGenie — Production AI Resume Intelligence Platform

Problem: HR teams manually scan resumes — a semantic matching problem that keyword search fails to solve.
Architecture: Flask API on AWS ECS Fargate → resumes chunked at sentence-level with overlap → embedded and indexed in OpenSearch → JD query triggers vector retrieval → Amazon Bedrock synthesizes match explanation. Chose OpenSearch over Pinecone for AWS-native operational control and lower egress cost at scale.
Pipeline: Upload → chunk → embed → index → query → retrieve → synthesize → score.

AWS ECSOpenSearch BedrockRAGFlask
R² = 0.88 · SHAP Explainability

Explainable House Price Predictor

Problem: Black-box price predictions are useless in real estate — users need to know WHY a property is valued as it is.
System: XGBoost regression served via Flask REST API with SHAP value computation per prediction. Chose XGBoost over neural networks — better performance on tabular data, natively compatible with SHAP, sub-10ms inference latency at this scale. Traded marginal accuracy for full interpretability.
Impact: R² = 0.88 on held-out test data. Users see exact feature contributions per prediction.

XGBoostSHAP FlaskREST APIPython
Sub-100ms Sync · Multi-user Concurrent

Collaborative Code Editor

Problem: Shared coding sessions demand sub-100ms state sync across concurrent users without edit conflicts or data loss.
Architecture: Flask-SocketIO backend orchestrates room-based sessions. Persistent WebSocket connections eliminate polling overhead (100ms vs 3–5s latency). Socket.IO chosen over raw WebSockets for cross-browser reliability and automatic reconnection on network failure.
Challenge: Handling concurrent edit conflicts and cursor position sync — operational transformation ensures all clients converge to the same state.

Flask-SocketIOWebSockets PythonJavaScript

Proof That Backs It Up

Research, competition wins, and published tools — signals that compound.

IEEE Conference 2026 — Co-author Uncertainty-Aware Pneumonia Detection using Bayesian Deep Learning
College Hackathon — 1st Place Built an LLM-based AI Resume Parser. Won over 50+ teams
CCAT 2026 — AIR 308 All-India Rank 308 in national competitive CS aptitude exam
Published PyPI Package "integration-smoke-test" — open-source API health check library, live on PyPI
AI-Powered VS Code Extension Natural language terminal command execution inside VS Code — built from scratch
Hackslash Lead — 300+ Students Head of Hackslash: 5+ workshops, hackathons, and coding competitions

Production Infrastructure I Build On

Chosen for constraints, not trends. Each tool justified by the systems it enables.

Python
Flask
JavaScript
AWS
Docker
SQL
REST APIs
LLMs / RAG
ML / XGBoost
OpenSearch
WebSockets
Git / GitHub

Building something that needs
serious engineering?

I work on production AI systems, RAG pipelines, agentic automation, and ML infrastructure. If you have a real problem that needs a real system — let's scope it together. Fast response, clear communication, no noise.

Typically responds within 2 hours
I'll message you on WhatsApp — no calls unless you say so

No spam. Your contact stays private.

Buy me a coffee
If my work helped or inspired you