Tag: llm

All the articles with the tag "llm".

RAG Evaluation with Ragas

Most RAG demos look great until you ship them. Ragas measures faithfulness, context precision, answer relevancy — the metrics that actually predict user trust.

LLM Distillation Explained

How tiny 7B and 8B models keep punching above their weight — knowledge distillation, the teacher-student trick that makes local AI actually usable on home hardware.

Self-Supervised Learning Explained

Self-supervised learning is the technique behind GPT, BERT, and other modern LLMs. Learn how models teach themselves from unlabeled data.

Ollama Model Management: Beyond ollama run

You can pull and run a model. Now learn Modelfiles, GPU layer tuning, the REST API, running multiple models without OOM-killing your server, and useful system prompts.

n8n + LLM: Building Automations That Actually Think

Traditional automation is just very fast copy-paste. When your email filter breaks because someone wrote "URGENT" in lowercase, you realize rule-based logic has limits. Connecting n8n to a local LLM turns "if this then that" into "figure this out and do the right thing."

CUDA vs ROCm vs CPU: Running AI on Whatever GPU You've Got

Want to run AI locally but not sure if your GPU will cooperate? Whether you're rocking an NVIDIA card, an AMD GPU, or just a CPU and sheer determination, here's the honest breakdown of what works, what technically works, and what will make you question your life choices.

LangGraph vs CrewAI vs AutoGen: AI Agent Frameworks for Mere Mortals

Everyone's talking about AI agents like they'll solve world hunger by Tuesday. But which framework do you actually use? We compare LangGraph, CrewAI, and AutoGen — with working Python examples, brutal honesty, and a healthy dose of skepticism about your robot assistant booking flights to Reykjavik.

Whisper & Faster-Whisper: Self-Hosted Speech-to-Text That Actually Works

OpenAI Whisper is genuinely impressive speech-to-text, and you can run it entirely on your own hardware. Add Faster-Whisper to the mix and you've got transcription that's up to 4x quicker, uses less VRAM, and doesn't phone home to anyone. Here's how to set it all up without losing your mind.