Tag: llm

All the articles with the tag "llm".

OpenRouter vs LiteLLM

8 Jun, 2026

Two ways to route LLM traffic across providers — OpenRouter as a hosted gateway, LiteLLM as a self-hosted proxy. Which one fits your home lab in 2026?

Function Calling in Local LLMs

7 Jun, 2026

Local LLMs can call tools, query APIs, and run code if you set them up right. Function calling on Ollama and llama.cpp explained — patterns that actually work.

Gemma 4 vs Qwen3.6

6 Jun, 2026

Gemma 4 vs Qwen3.6: sizes, reasoning, coding benchmarks, and which model you should actually pull for your home lab rig.

AnythingLLM as Knowledge Base

6 Jun, 2026

AnythingLLM is the closest thing to a real private NotebookLM you can self-host. Workspaces, RAG, agents, document chat — running locally on Ollama in 20 minutes.

MCP Servers: Tools for LLMs

4 Jun, 2026

Model Context Protocol turns your LLM into a tool-using agent — file access, APIs, your home lab. Build your first MCP server in under 50 lines of Python.

RAG Evaluation with Ragas

3 Jun, 2026

Most RAG demos look great until you ship them. Ragas measures faithfulness, context precision, and answer relevancy: the metrics that actually predict user trust.

LLM Distillation Explained

2 Jun, 2026

How tiny 7B and 8B models keep punching above their weight — knowledge distillation, the teacher-student trick that makes local AI actually usable on home hardware.

Open WebUI Tools, Functions & Pipelines: Extend Your Local LLM

11 May, 2026

Open WebUI Tools, Functions, and Pipelines do different things — and the names don't help. What each one actually does, when to use which, and working code for all three.

Self-Supervised Learning Explained

30 Apr, 2026

Self-supervised learning is the technique behind GPT, BERT, and modern LLMs. Learn how models teach themselves from unlabeled data.

Ollama Model Management: Beyond ollama run

26 Apr, 2026

You can pull and run a model. Now learn Modelfiles, GPU layer tuning, the REST API, running multiple models without OOM-killing your server, and useful system prompts.

Continue.dev vs Cody vs Tabby: AI Code Help Without the Cloud

24 Apr, 2026

GitHub Copilot is great until you read the ToS. Continue.dev, Cody, and Tabby bring AI code assistance to your editor with local or self-hosted models, so your code stays on hardware you control.

LangGraph vs CrewAI vs AutoGen: AI Agents Without the Hype

19 Apr, 2026

LangGraph gives you graph-level control. CrewAI gives your agents job titles. AutoGen makes them have a conversation. Here's which one to reach for when building real AI workflows.