Skip to content

Tag: ai

All the articles with the tag "ai".

Self-Host a Local AI Coding Workhorse

Self-Host a Local AI Coding Workhorse

Run Gemma 4 or Qwen3-Coder locally via Ollama or llama.cpp in Docker, then delegate mechanical coding tasks to it while Claude handles the thinking. Free tokens, zero leakage.

Give Your AI Agent a Cheap Intern

Give Your AI Agent a Cheap Intern

Stop burning expensive AI tokens on boring grunt work. The overseer/workhorse pattern routes mechanical tasks to a cheap model and saves more than you'd think.

Dify: Visual Agent Workflows

Dify: Visual Agent Workflows

Dify is an open-source LLM-app builder you can self-host. Visual workflow editor, RAG, agents, tool use — without writing 500 lines of LangChain glue.

Exploring the Diverse World of LLM Models

Exploring the Diverse World of LLM Models

· Updated:

LLaMA, Mistral, Falcon, GPT — the LLM landscape is crowded. Compare model families, sizes, licensing, and what each is actually good for.

Key Parameters of Large Language Models

Key Parameters of Large Language Models

· Updated:

Temperature, top-p, top-k, context length — LLM inference parameters explained so you stop guessing why the model gives weird output.

Local Vision LLMs Worth Running in 2026

Local Vision LLMs Worth Running in 2026

· Updated:

Pixtral, Qwen3-VL, and Gemma 4 compared for local multimodal use in 2026. LLaVA is dead; here's what to run in Ollama for OCR, screenshots, and vision tasks.