Building Transcription: Why Whisper Is Still the Best
I needed transcription for my app. I tried Google Speech-to-Text, AWS Transcribe, and Whisper. Whisper won. Here's why it's still the best choice.
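To give a sense of how little code local transcription takes, here's a minimal sketch using the open-source openai-whisper package; the model size and the audio path are placeholders, not the exact setup from my app.

```python
import whisper

# "base" is a placeholder; larger checkpoints ("small", "medium", "large")
# trade speed for accuracy.
model = whisper.load_model("base")

# The file name is a stand-in for whatever recording you need transcribed.
result = model.transcribe("meeting_recording.mp3")
print(result["text"])
```

No API keys, no per-minute billing, and the audio never leaves the machine.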
I monitor AI models in production. Most metrics are noise. Here are the metrics that actually matter, what I track, and what I ignore.
I wanted to build a RAG system for my documentation. Three attempts, three failures. Here's what went wrong each time, why it failed, and what finally worked.
I wanted to run multiple LLMs simultaneously on my GPU. Simple goal, right? Wrong. GPU memory management became a nightmare. Here's what I learned the hard way.
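One habit that saved me: check free VRAM before every load instead of trusting framework defaults. A minimal sketch with PyTorch; the 14 GiB figure and the 1 GiB headroom are illustrative assumptions, not measurements from the post.

```python
import torch

def free_vram_gib(device: int = 0) -> float:
    """Currently free memory on the given CUDA device, in GiB."""
    free_bytes, _total = torch.cuda.mem_get_info(device)
    return free_bytes / 1024**3

def can_load(model_size_gib: float, headroom_gib: float = 1.0) -> bool:
    """Only load another model if it fits with headroom left over for
    activations, the KV cache, and the CUDA context."""
    return free_vram_gib() >= model_size_gib + headroom_gib

# A 7B model in fp16 needs roughly 14 GiB for its weights alone.
if can_load(14.0):
    print("Enough room for another model.")
else:
    print("Not enough free VRAM; unload something first.")
```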
I spent a week testing Gemini 2.0 and GPT-4o side-by-side on real work tasks. Not benchmarks or demos: actual coding, writing, and analysis. Here's what I found, when to use which, and the real costs.
I wanted voice control for my home automation, but I didn't want to send my voice data to Google or Amazon. So I built a local solution using Whisper. Here's why I chose local, the challenges I faced, and what actually works.
I spent $2,400 fine-tuning a language model, thinking it would solve my problem. Three months later, I realized I could have achieved 90% of the results with prompt engineering for $0. Here's the expensive truth about fine-tuning that nobody tells you.
After trying three different approaches to build a local LLM chat interface, I finally found what works. Here's what failed, what succeeded, and the real performance numbers you won't find in tutorials.
Modern AI models are breaking on 12GB cards. After running local LLMs, training models, and deploying AI systems, I've learned that 24GB VRAM is now the practical minimum for serious AI work. Here's why, and what it means for your hardware choices, comparing the RTX 3090, 4090, and A6000.
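The arithmetic behind that claim is simple: weights alone cost parameter count times bytes per parameter, and the KV cache, activations, and CUDA context pile several more GiB on top. A rough back-of-the-envelope sketch (the model sizes are examples, not a benchmark):

```python
# Approximate VRAM needed for model weights alone; runtime overhead
# (KV cache, activations, CUDA context) comes on top of these numbers.
def weight_vram_gib(params_billion: float, bytes_per_param: float) -> float:
    return params_billion * 1e9 * bytes_per_param / 1024**3

for name, params in [("7B", 7), ("13B", 13), ("34B", 34)]:
    fp16 = weight_vram_gib(params, 2.0)   # fp16 / bf16
    int4 = weight_vram_gib(params, 0.5)   # 4-bit quantized
    print(f"{name}: ~{fp16:.0f} GiB fp16, ~{int4:.0f} GiB 4-bit")
```

A 7B model in fp16 already lands around 13 GiB before any overhead, which is exactly where a 12GB card gives up.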
Building real-time voice interfaces requires low-latency speech recognition and seamless audio streaming. I've integrated OpenAI Whisper with WebRTC to create production-ready voice transcription systems that work in browsers without plugins.
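The server side of that integration boils down to buffering PCM from the WebRTC audio track and feeding Whisper in rolling windows. Here's a stripped-down sketch using aiortc; the library choice, window length, and audio-format assumptions (16-bit mono at 48 kHz) are mine for illustration, and production code needs proper resampling and voice-activity detection.

```python
import numpy as np
import whisper
from aiortc import MediaStreamTrack

model = whisper.load_model("base")  # placeholder model size

async def transcribe_track(track: MediaStreamTrack, window_seconds: float = 5.0):
    """Buffer audio from a WebRTC track and transcribe it in rolling windows."""
    target_rate = 16_000                      # Whisper expects 16 kHz float32 mono
    buffer = np.empty(0, dtype=np.float32)

    while True:
        frame = await track.recv()            # av.AudioFrame from aiortc
        # Assumes 16-bit mono PCM; stereo tracks would need downmixing first.
        pcm = frame.to_ndarray().flatten().astype(np.float32) / 32768.0
        # Naive decimation from the WebRTC clock rate (typically 48 kHz).
        step = max(1, frame.sample_rate // target_rate)
        buffer = np.concatenate([buffer, pcm[::step]])

        if len(buffer) >= window_seconds * target_rate:
            result = model.transcribe(buffer, fp16=False)
            print(result["text"])
            buffer = np.empty(0, dtype=np.float32)
```

The coroutine gets started from the peer connection's track handler; signaling and the browser capture side are a separate story.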
Retrieval-Augmented Generation (RAG) has become the go-to approach for building AI applications that need accurate, contextual responses. I've built several RAG systems in production, and here's what I learned about making them reliable, fast, and maintainable.
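Under the buzzword, the retrieval half of a RAG system is short: embed the documents once, embed each question, pull the closest chunks, and stuff them into the prompt. A minimal sketch with sentence-transformers; the model name and toy corpus are placeholders, not what I run in production.

```python
import numpy as np
from sentence_transformers import SentenceTransformer

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # placeholder embedding model

docs = [
    "Whisper runs locally and supports dozens of languages.",
    "vLLM serves LLMs with paged attention for higher throughput.",
    "The RTX 3090 and 4090 both ship with 24GB of VRAM.",
]
# Normalized embeddings make the dot product a cosine similarity.
doc_vecs = embedder.encode(docs, normalize_embeddings=True)

def retrieve(question: str, k: int = 2) -> list[str]:
    q_vec = embedder.encode([question], normalize_embeddings=True)[0]
    scores = doc_vecs @ q_vec
    return [docs[i] for i in np.argsort(scores)[::-1][:k]]

question = "How much VRAM does a 4090 have?"
context = "\n".join(retrieve(question))
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
# `prompt` then goes to whatever LLM backs the application.
print(prompt)
```

The hard parts the post is about (reliability, speed, maintainability) live around this loop, not inside it.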
From experimenting with speech-to-text to training lightweight predictive models, I've created a personal AI lab at home powered by high-end consumer hardware. The goal? Run local LLMs, real-time voice agents, VLMs, and GPU-accelerated automation workflows without recurring cloud costs.