v1.0 LIVE today ⚡ v2.0 next

Your AI has no memory. We fix that.

UPtrim gives your local LLM permanent memory — your name, preferences, projects, everything. Then it teaches local and cloud models to work hand-in-hand: draft with Llama on your own machine, review with Claude when you need the firepower, both sharing the same context. Your memory never leaves your box.

Download free → See v2.0 features

Free developer key, forever Paid tiers from $5/mo 100% local

uptrim chat · monday LIVE

jordan / memories

memory · empty — watch this

v2.0 ships with

34+

new modules

Shipping Now

Live on GitHub

v1.0 is out today.

Open-source, free forever, runs fully offline. Don't wait for v2.0 — everything below is downloadable right now.

Download v1.0 on GitHub →

Open WebUI• SillyTavern• llama.cpp• Ollama• vLLM• LM Studio• Claude OAuth• OpenAI OAuth• Stable Diffusion• Any OpenAI-compatible app• Open WebUI• SillyTavern• llama.cpp• Ollama• vLLM• LM Studio• Claude OAuth• OpenAI OAuth• Stable Diffusion• Any OpenAI-compatible app•

Close the chat. It still remembers.

(your model, your box — no cloud account needed)

Chapter 01 — Memory

Close a chat.
Still remembered.

UPtrim watches every conversation and quietly extracts facts — your name, your preferences, the project you're working on. When you come back tomorrow, the model already knows.

NLP fact extraction (spaCy + regex + structured triples)
Local SQLite storage with FTS5 keyword search
Semantic search via bundled embeddings (v2.0)
Every fact editable & deletable from the dashboard

fact extraction · live

Hey, I'm Jordan. Mostly code in Python, and I prefer concise answers — no fluff.

↓ UPTRIM EXTRACTS ↓

Identityname = "Jordan"

Preferenceprimary_language = "Python"

Styleresponse_format = "concise"

route decisions · last 10 min LIVE

"what time is it in tokyo"

local 3B

"refactor this 800-line Python module"

claude-opus

"generate a cyberpunk hero image"

sd.cpp

"summarize last thursday's notes"

local + RAG

"review PR #1247 & suggest fixes"

claude-sonnet

"debug this stack trace"

gpt-4o

Chapter 02 — Routing

One chat.
Many brains.

Small talk hits your free local LLM. Hard problems get Claude Opus. Images route to your local Stable Diffusion. UPtrim picks the right model for each request — automatically — using a Capability Matrix built from your hardware.

OAuth sign-in for Claude & OpenAI, no raw keys
Per-request cost tracking & session budget
Capability Matrix benchmarks your local models
Keep everything local when you want to

Chapter 03 — Agents

Agents you can
actually watch.

Spawn autonomous subagents for multi-step work — code review, research, data analysis. Each runs with a scoped memory slice, a persistent task ID, and a shared scratchpad. You can resume, replay, or kill any task.

Pre-built modes: reviewer, researcher, analyst, coder
Full audit trail & live execution view
Shared scratchpad between collaborating agents
Every tool call logged & inspectable

subagents · 2 running LIVE

code-reviewerrunning

Reads PR diff, spots issues, drafts review comments, tests edge cases.

researcherrunning

Web search + URL fetching. Summarizes sources, cites inline, builds a memo.

analystidle

Loads CSVs and JSON, runs sandboxed analyses, emits charts and tables.

Chapter 04 — Dashboard

Everything, visible.

Memories, users, files, agents, routing, settings, tunnels, logs — one browser tab, running locally on your machine. Click the sidebar below. It actually works.

Overview

System health and recent activity

Local ✓ Hybrid ✓

247

Memories

5

Users

12

Files

1.2k

Messages

Active Users

Alice

42 memories

Bob

28 memories

Carol

67 memories

Recent Memories

Name is Jordan, prefers Pythonidentity

Working on Project Atlas rewritework

Prefers concise technical answerscomm

Uses VS Code with Vim keybindingstools

Staging env deploys every Thursdayops

Click the sidebar to explore · every panel runs at localhost:9099/dashboard

straight from the app ↓

uptrim · dashboardlocalhost:9099● live

Overviewsystem health · last 24h

247memories+12

5users+1

12files—

1.2kmsgs+96

requests · 24h

Name is Jordan — prefers Pythonidentity

Working on Project Atlas rewritework

Uses VS Code + Vim keybindingstools

the dashboard — live at :9099

brain viewrem: idle

brain view, mid-thought

uptrim tui17:42:08

status ✓ memory · 247 facts ✓ qwen-14b :11434 ✓ bge-base · warm

req / min

mem pressure

71% · 3.2k ctx

17:42:01 INJECT 3 facts → alice 17:42:03 EXTRACT "prefers dark mode" 17:42:07 ROUTE claude-opus · $0.04 17:42:08 RECALL project atlas ctx

the TUI, for terminal people

Chapter 05 — v2.0 arriving soon

Thirty-four
new modules.
One release.

Hybrid cloud routing. Autonomous subagents. Semantic memory and document RAG. Image generation. And a real-time view of the whole pipeline.

Pro tier · v2.0

Hybrid cloud + local routing

Small talk hits your local LLM. Hard problems go to Claude or GPT. OAuth means no raw API keys. Per-request cost tracking means no surprise bills. Your memory and files stay on your box regardless.

Local LLM Claude OAuth GPT OAuth Smart Router Cost Meter

v2.0

Arriving soon · early subscribers get it free

1 / 7 or just click the card

v2.0 · just landed

Six new features just landed.

v2.0 keeps shipping. Here's what came down recently — the meatiest additions since the last preview.

🎼 Premium

LLM Conductor

Decomposes every turn into typed subtasks, routes each to the best brain, runs them in parallel, assembles. Per-turn ledger logs cost + latency for every subtask.

🏛️ Premium

Internal Parliament

5–8 topical personas auto-clustered from your memory. Debate in the background on idle GPU. Their transcript is decision-support context when you ask hard questions.

🧑‍💻 Premium

Codex Offload

OpenAI Codex subprocess from inside UPtrim, full memory passed through. Pairs with Claude Code — pick your coding agent, keep your context.

📫 Pro

Ghost Inbox

Unified feed for every async result, dream memory, search hit, and background task UPtrim ran for you. Like email for your AI's background work.

🧪 Pro

Capability Matrix v2

Live per-model scoring on cost, latency, quality, context window. The Conductor reads it — you see exactly why every call went where it did.

🗜️ Pro

Context Compression

Long chats roll into running summaries instead of getting truncated. Message #287 still remembers what happened on #14 — nothing falls off the back of the context window.

Chapter 06 — Pricing

Start local.
Upgrade into hybrid.

Monthly billing. No annual lock-in. Free tier gives any local LLM a proper memory. Paid tiers unlock Ghost Agent, hybrid cloud routing, sub-agents, and the visual knowledge graph as the ceiling lifts with you.

LIVE · v1.0

Developer

free forever

👥4 seats

Make your local LLM remember you. Runs fully offline.

files/mo

10 MB

per file

50 MB

total

5,000 persistent memories
File upload & RAG
Multi-backend routing
Basic knowledge graph
Agent mode & local image gen
Secret Shield (key redaction)

Download free

Standard

per month

👥5 seats

Full local experience. Ghost Agent, memory lifecycle, brain dashboard.

files/mo

25 MB

per file

100 MB

total

Ghost Agent · background enrichment
Memory Lifecycle (active/fading/dormant)
Persona Engine + intent classification
Semantic recall + conversation branching
Brain dashboard + remote HTTPS
Parental controls · encrypted backup

Join the waitlist →

Pro

$15

per month

👥10 seats

Hybrid coding. Local drafts, cloud reviews, same memory. Most land here.

100

files/mo

500 MB

per file

uncapped

total

Hybrid cloud+local routing
Ghost Pre-Search + sub-agent swarm
Ghost Inbox · Capability Matrix v2 new
Context Compression · rolling summaries new
Prompt enhancement + history search
Cost tracker · aliases · 100K facts

Join the waitlist →

Premium

$30

per month

👥25 seats

Production stack. Visual KG, n8n tools, Ghost Mesh, Claude Code.

∞

files/mo

1 GB

per file

uncapped

total

LLM Conductor · Internal Parliament new
Visual Knowledge Graph explorer
Claude Code + Codex offload new
n8n + MCP · Ghost Mesh
Staleness UI + Ambient Task Tracker
Unlimited everything · SLA

Join the waitlist →

Business

$100

per month

👥40 seats

Premium feature set scaled for a small team. Centralized admin, SSO, audit log.

Everything in Premium · for the whole office
Central admin console · group permissions
SSO-ready (SAML / OIDC)
Hardened audit trail (SIEM export)
Encrypted backups to your S3 / R2
99.5% uptime SLA · 24h response

Join the waitlist →

Enterprise

Let's talk.

priced to fit

🔗contact for seats

⚡2+ instances required

On-prem licensing, regulated-industry deployment, HA-ready, direct line to engineering.

2+ instances minimum · HA + failover
On-prem & air-gapped deployments
Multi-proxy federation across regions
BYO KMS / HSM · data-residency rules
SOC 2 / HIPAA / GDPR mappings
99.9% uptime SLA · dedicated CSM

Contact sales →

90-second install

Install it.
Point your chat at it.
Done.

No accounts. No credit card. No rewrites. Drop UPtrim on your machine, change one URL in your chat app, and start building a proper memory.

Download v1.0 free → See all tiers

Your AI has no memory. We fix that.

v1.0 is out today.

Close the chat. It still remembers.

Close a chat.Still remembered.

One chat.Many brains.

Agents you canactually watch.

Everything, visible.

Thirty-fournew modules.One release.

Hybrid cloud + local routing

Six new features just landed.

LLM Conductor

Internal Parliament

Codex Offload

Ghost Inbox

Capability Matrix v2

Context Compression

Start local.Upgrade into hybrid.

Install it.Point your chat at it.Done.

Close a chat.
Still remembered.

One chat.
Many brains.

Agents you can
actually watch.

Thirty-four
new modules.
One release.

Start local.
Upgrade into hybrid.

Install it.
Point your chat at it.
Done.