DEV Community

# ollama

Posts

Rio de Janeiro's "Own LLM" Looks Like a Merge: What to Read Between the Lines


Comments
8 min read
Hermes-Crew Hybrid: A Hybrid Architecture for Secure Multi-Agent AI Workflows


1
Comments
2 min read
How Much RAM Do You Really Need to Run LLMs Locally? 2026 Benchmarks


Comments
6 min read
I Replaced My $20/mo AI Tools With Local Models: My Full Stack


1
Comments 1
6 min read
Ollama Cloud Free vs Pro — Usage Limits, Pricing & What You Actually Get (2026)


Comments
3 min read
How I fixed silent Ollama failures in my local AI Assistant


Comments
2 min read
Structured Output From Local LLMs: JSON That Never Breaks (Ollama + Zod)


1
Comments 1
6 min read
I Built a Private AI Brain on My Laptop for $0


2
Comments
2 min read
Open Notebook Review: Self-Hosted NotebookLM Alternative


Comments
10 min read
Open-LLM-VTuber Review: Offline AI Companion with Live2D


Comments
10 min read
I Benchmarked 3 Local LLMs on My Laptop — Here's What the Numbers Actually Show


1
Comments 1
4 min read
Doubling Qwen3.6-27B on One RTX 3090: ollama llama.cpp + MTP, Lever by Lever (35.7 → ~75 tok/s)


Comments
7 min read
Running Brand-New Gemma 4 12B on an 8-Year-Old GTX 1080 Ti: Speed, 3 Gotchas, and Why Q8 Beat Q4 on My Own Field


Comments 1
5 min read
I built an 81-tool, fully local AI desktop assistant with PySide6 and Ollama (here is the architecture)


Comments 1
4 min read
Docker, Node, and Electron Walked Into My Terminal. So I Built a 3.5MB App to Kick Them All Out.


Comments
4 min read