Tag: Local-First AI
-
Google Gemma 4 12B: The Model That Signals a Bigger Shift in AI Infrastructure
Over the past year, I’ve spent a considerable amount of time working with both local and production AI environments. On one side, I’ve been experimenting with local LLMs using Ollama, testing quantized models, and exploring how much intelligence can realistically run on developer laptops. On the other side, I’ve been deploying production workloads…