Tag: llm

All the articles with the tag "llm".

n8n + LLM: Building Automations That Actually Think

16 Apr, 2026

Traditional automation is just very fast copy-paste. When your email filter breaks because someone wrote "URGENT" in lowercase, you realize rule-based logic has limits. Connecting n8n to a local LLM turns "if this then that" into "figure this out and do the right thing."

Qdrant vs Weaviate vs Chroma: Vector DB Showdown

15 Apr, 2026

Every RAG tutorial says 'just use Chroma.' Then you hit production. Here's what Qdrant, Weaviate, and ChromaDB actually offer and when each one earns its place.

CUDA vs ROCm vs CPU: Running AI on Whatever GPU You've Got

15 Apr, 2026

Want to run AI locally but not sure if your GPU will cooperate? Whether you're rocking an NVIDIA card, an AMD GPU, or just a CPU and sheer determination, here's the honest breakdown of what works, what technically works, and what will make you question your life choices.

LangGraph vs CrewAI vs AutoGen: AI Agent Frameworks for Mere Mortals

14 Apr, 2026

Everyone's talking about AI agents like they'll solve world hunger by Tuesday. But which framework do you actually use? We compare LangGraph, CrewAI, and AutoGen — with working Python examples, brutal honesty, and a healthy dose of skepticism about your robot assistant booking flights to Reykjavik.

Whisper & Faster-Whisper: Self-Hosted Speech-to-Text That Actually Works

14 Apr, 2026

OpenAI Whisper is genuinely impressive speech-to-text — and you can run it entirely on your own hardware. Add Faster-Whisper into the mix and suddenly you've got transcription that's 4x quicker, uses less VRAM, and doesn't phone home to anyone. Here's how to set it all up without losing your mind.

LiteLLM & vLLM: One API to Rule All Your Models

13 Apr, 2026

Your app calls OpenAI, your side project calls Anthropic, your homelab whispers to Ollama — and your codebase looks like a crime scene. LiteLLM and vLLM are the dynamic duo that puts a single sane API in front of every model you'll ever run, local or cloud.

RAG on a Budget: Building a Knowledge Base with Ollama & ChromaDB

12 Apr, 2026

Stop paying per-token to ask questions about your own documents. This guide walks you through building a fully local RAG pipeline with Ollama and ChromaDB — from Docker Compose to Python code — so your AI can actually know things without hallucinating them.