Tag: ollama

All the articles with the tag "ollama".

n8n + LLM: Building Automations That Actually Think

16 Apr, 2026

Traditional automation is just very fast copy-paste. When your email filter breaks because someone wrote "URGENT" in lowercase, you realize rule-based logic has limits. Connecting n8n to a local LLM turns "if this then that" into "figure this out and do the right thing."

RAG on a Budget: Building a Knowledge Base with Ollama & ChromaDB

12 Apr, 2026

Stop paying per-token to ask questions about your own documents. This guide walks you through building a fully local RAG pipeline with Ollama and ChromaDB, from Docker Compose to Python code, so your AI can actually know things without hallucinating them.

The Embedding Model Choice Nobody Explains

28 Jan, 2026 · Updated: 5 Apr, 2026

Most people use OpenAI's embeddings because they're the easy default, but local embedding models exist. Here's how to pick one and when the choice actually matters.

GPU Memory Math: Will This Model Actually Fit?

17 Mar, 2026 · Updated: 5 Apr, 2026

Before you download a 70B model, calculate whether it fits. The formulas, the gotchas, and a quick calculator you can actually use.

Running Gemma 4 Locally with Ollama

3 Apr, 2026

Google's Gemma 4 is the best open model they've shipped yet. Here's how to pull it, run it, and actually use it for real work with Ollama on your own hardware.

LLM Backends: vLLM vs llama.cpp vs Ollama

8 Mar, 2026

vLLM, llama.cpp, and Ollama all run local LLMs. Compare throughput, memory use, and GPU support to see which fits your hardware.

Running Multiple Ollama Models Without Running Out of RAM

9 Feb, 2026

On limited hardware, Ollama can often keep only one model loaded at a time. Here's how to switch between models, use CPU offloading, and manage VRAM intelligently.

n8n + LLM: Building Automations That Actually Think

6 Jan, 2026

Connect n8n to Ollama or any local LLM to build smart automations that classify, summarize, and triage, not just shuffle data around blindly.

Ollama: Powerful Language Models on Your Own Machine

6 Apr, 2024

Ollama makes running local LLMs dead simple: pull a model, start the server, and get a private ChatGPT running on your own hardware.