AI Infrastructure Architect & Enterprise Solution Architect

Category: AI Infrastructure

Why RAG Systems Sometimes Answer Questions Nobody Asked

A Production Lesson Every AI Engineer Eventually Learns One of the most surprising moments when deploying a Retrieval-Augmented Generation (RAG) system to production is watching users become frustrated by an AI that appears highly intelligent but somehow feels socially unaware. The user says: Thank You The AI responds: According to the my knowledge…

Amit Patriwala

June 24, 2026
Loops: The Quiet Skill Behind Every AI System That Actually Scales

Why the Future of AI Isn’t About Better Models—It’s About Better Loops Every week a new AI model arrives. A larger context window. A better benchmark score. A more impressive demo. The industry conversation usually follows the same pattern: Is GPT-5 better than Claude? Is Claude better than Gemini? Is Gemini better than…

Amit Patriwala

June 23, 2026
Learn AI for Free: 10 Platforms From OpenAI, Google, Microsoft, NVIDIA & More

A few years ago, learning Artificial Intelligence felt expensive. People spent thousands of dollars on bootcamps, certifications, and online programs hoping to gain AI skills that could improve their careers. Today, something remarkable has happened. The companies building the world’s most advanced AI systems are teaching people for free. Not…

Amit Patriwala

June 22, 2026
The Hidden Context Window Problem in RAG Systems: A Real Production Incident with vLLM and Qwen3

When Your 32K Context LLM Fails at 4K Tokens: A Production vLLM Troubleshooting Guide One of the most common misconceptions in Generative AI systems is: “The model supports 32K context, so my application automatically supports 32K context.” In production, that assumption can lead to unexpected failures. Recently, we encountered a production issue…

Amit Patriwala

June 22, 2026
Want to Learn AI Without Spending Thousands on Courses?

Microsoft has made its entire AI for Beginners curriculum available for FREE on GitHub. 📚 12 Weeks | 24 Lessons | Hands-On Labs | Open Source This isn’t a marketing tutorial or a collection of random videos. It’s a structured learning path created by Microsoft Cloud Advocates that covers the fundamentals of…

Amit Patriwala

June 19, 2026
Google Gemma 4 12B: The Model That Signals a Bigger Shift in AI Infrastructure

Over the past year, I’ve spent a considerable amount of time working with both local and production AI environments. On one side, I’ve been experimenting with local LLMs using Ollama, testing quantized models, and exploring how much intelligence can realistically run on developer laptops. On the other side, I’ve been deploying production workloads…

Amit Patriwala

June 19, 2026