v2.0 →
Download
00
v1.0 LIVE today ⚡ v2.0 next

Your AI has no memory. We fix that.

UPtrim gives your local LLM permanent memory — your name, preferences, projects, everything. Then it teaches local and cloud models to work hand-in-hand: draft with Llama on your own machine, review with Claude when you need the firepower, both sharing the same context. Your memory never leaves your box.

Free developer key, forever Paid tiers from $5/mo 100% local
brain / hybrid router LIVE
you
GPT-5
brainstorm
Claude Opus
review
Qwen · local
drafting
CLOUD $0.067 claude + gpt
LOCAL $0.00 saved $0.18
alice / memories
Prefers Python, VS Codepref
Works on Project Atlaswork
Likes concise code answersstyle
v2.0 ships with
34+
new modules
Shipping Now
Live on GitHub

v1.0 is out today.

Open-source, free forever, runs fully offline. Don't wait for v2.0 — everything below is downloadable right now.

Download v1.0 on GitHub
Open WebUI SillyTavern llama.cpp Ollama vLLM LM Studio Claude OAuth OpenAI OAuth Stable Diffusion Any OpenAI-compatible app Open WebUI SillyTavern llama.cpp Ollama vLLM LM Studio Claude OAuth OpenAI OAuth Stable Diffusion Any OpenAI-compatible app
01
Chapter 01 — Memory

Close a chat.
Still remembered.

UPtrim watches every conversation and quietly extracts facts — your name, your preferences, the project you're working on. When you come back tomorrow, the model already knows.

  • NLP fact extraction (spaCy + regex + structured triples)
  • Local SQLite storage with FTS5 keyword search
  • Semantic search via bundled embeddings (v2.0)
  • Every fact editable & deletable from the dashboard
fact extraction · live
Hey, I'm Jordan. Mostly code in Python, and I prefer concise answers — no fluff.
↓   UPTRIM EXTRACTS   ↓
Identityname = "Jordan"
Preferenceprimary_language = "Python"
Styleresponse_format = "concise"
02
route decisions · last 10 min LIVE
"what time is it in tokyo"
local 3B
"refactor this 800-line Python module"
claude-opus
"generate a cyberpunk hero image"
sd.cpp
"summarize last thursday's notes"
local + RAG
"review PR #1247 & suggest fixes"
claude-sonnet
"debug this stack trace"
gpt-4o
Chapter 02 — Routing

One chat.
Many brains.

Small talk hits your free local LLM. Hard problems get Claude Opus. Images route to your local Stable Diffusion. UPtrim picks the right model for each request — automatically — using a Capability Matrix built from your hardware.

  • OAuth sign-in for Claude & OpenAI, no raw keys
  • Per-request cost tracking & session budget
  • Capability Matrix benchmarks your local models
  • Keep everything local when you want to
03
Chapter 03 — Agents

Agents you can
actually watch.

Spawn autonomous subagents for multi-step work — code review, research, data analysis. Each runs with a scoped memory slice, a persistent task ID, and a shared scratchpad. You can resume, replay, or kill any task.

  • Pre-built modes: reviewer, researcher, analyst, coder
  • Full audit trail & live execution view
  • Shared scratchpad between collaborating agents
  • Every tool call logged & inspectable
subagents · 2 running LIVE
code-reviewerrunning
Reads PR diff, spots issues, drafts review comments, tests edge cases.
researcherrunning
Web search + URL fetching. Summarizes sources, cites inline, builds a memo.
analystidle
Loads CSVs and JSON, runs sandboxed analyses, emits charts and tables.
04
Chapter 04 — Dashboard

Everything, visible.

Memories, users, files, agents, routing, settings, tunnels, logs — one browser tab, running locally on your machine. Click the sidebar below. It actually works.

U
UPtrim
v1.0 · v2.0 preview
Overview
Memories
Users
Files
Agents
Brain
Conversations
Images
Plugins
Settings
Tunnels
Logs
Online
Overview
System health and recent activity
Local ✓ Hybrid ✓
247
Memories
5
Users
12
Files
1.2k
Messages
Active Users
A
Alice
42 memories
B
Bob
28 memories
C
Carol
67 memories
Recent Memories
Name is Jordan, prefers Pythonidentity
Working on Project Atlas rewritework
Prefers concise technical answerscomm
Uses VS Code with Vim keybindingstools
Staging env deploys every Thursdayops
Click the sidebar to explore · every panel runs at localhost:9099/dashboard
05
Chapter 05 — v2.0 arriving soon

Thirty-four
new modules.
One release.

Hybrid cloud routing. Autonomous subagents. Semantic memory and document RAG. Image generation. A real-time view of the whole pipeline. And a real-time view of the whole pipeline.

Pro tier · v2.0

Hybrid cloud + local routing

Small talk hits your local LLM. Hard problems go to Claude or GPT. OAuth means no raw API keys. Per-request cost tracking means no surprise bills. Your memory and files stay on your box regardless.

Local LLM Claude OAuth GPT OAuth Smart Router Cost Meter
v2.0
Arriving soon · early subscribers get it free
Pro

Real-time brain

Watch every memory injection, agent step, embedding score, and cloud token tick by in a live blueprint view.

Pro

SubAgents

Spawn autonomous agents for multi-step work with scoped memory, persistent task IDs, and shared scratchpad.

Pro

Semantic memory & RAG

Bundled local embeddings catch paraphrases, not just keywords. Uploaded docs auto-inject with source attribution.

Pro

Conversation history

Every past chat, searchable by date, topic, or content. Exportable. Time-grouped. That thing from three weeks ago? Findable.

Pro

Prompt enhancement

A local model expands your short prompts into detailed specs before they hit the cloud. Same question, better answer.

Pro

Image generation

Ask for an image. UPtrim routes to your local Stable Diffusion, drops it inline, keeps a gallery. No third parties.

Premium

Knowledge graph

Memories as connected nodes. The LLM sees that Sarah leads Project Atlas — not just that both exist.

Premium

TrimScript plugins

Write .trim scripts that extract, inject, filter, and react. Hot-reload. Visual blueprint builder.

v2.0 · just landed

Six new features just landed.

v2.0 keeps shipping. Here's what came down recently — the meatiest additions since the last preview.

🎼 Premium

LLM Conductor

Decomposes every turn into typed subtasks, routes each to the best brain, runs them in parallel, assembles. Per-turn ledger logs cost + latency for every subtask.

🏛️ Premium

Internal Parliament

5–8 topical personas auto-clustered from your memory. Debate in the background on idle GPU. Their transcript is decision-support context when you ask hard questions.

🧑‍💻 Premium

Codex Offload

OpenAI Codex subprocess from inside UPtrim, full memory passed through. Pairs with Claude Code — pick your coding agent, keep your context.

📫 Pro

Ghost Inbox

Unified feed for every async result, dream memory, search hit, and background task UPtrim ran for you. Like email for your AI's background work.

🧪 Pro

Capability Matrix v2

Live per-model scoring on cost, latency, quality, context window. The Conductor reads it — you see exactly why every call went where it did.

🗜️ Pro

Context Compression

Long chats roll into running summaries instead of getting truncated. Message #287 still remembers what happened on #14 — nothing falls off the back of the context window.

06
Chapter 06 — Pricing

Start local.
Upgrade into hybrid.

Monthly billing. No annual lock-in. Free tier gives any local LLM a proper memory. Paid tiers unlock Ghost Agent, hybrid cloud routing, sub-agents, and the visual knowledge graph as the ceiling lifts with you.

LIVE · v1.0
Developer
$0
free forever
👥4 seats
Make your local LLM remember you. Runs fully offline.
5
files/mo
10 MB
per file
50 MB
total
  • 5,000 persistent memories
  • File upload & RAG
  • Multi-backend routing
  • Basic knowledge graph
  • Agent mode & local image gen
  • Secret Shield (key redaction)
Download free
Standard
$5
per month
👥5 seats
Full local experience. Ghost Agent, memory lifecycle, brain dashboard.
20
files/mo
25 MB
per file
100 MB
total
  • Ghost Agent · background enrichment
  • Memory Lifecycle (active/fading/dormant)
  • Persona Engine + intent classification
  • Semantic recall + conversation branching
  • Brain dashboard + remote HTTPS
  • Parental controls · encrypted backup
Join the waitlist →
Pro
$15
per month
👥10 seats
Hybrid coding. Local drafts, cloud reviews, same memory. Most land here.
100
files/mo
500 MB
per file
uncapped
total
  • Hybrid cloud+local routing
  • Ghost Pre-Search + sub-agent swarm
  • Ghost Inbox · Capability Matrix v2 new
  • Context Compression · rolling summaries new
  • Prompt enhancement + history search
  • Cost tracker · aliases · 100K facts
Join the waitlist →
Premium
$30
per month
👥25 seats
Production stack. Visual KG, n8n tools, Ghost Mesh, Claude Code.
files/mo
1 GB
per file
uncapped
total
  • LLM Conductor · Internal Parliament new
  • Visual Knowledge Graph explorer
  • Claude Code + Codex offload new
  • n8n + MCP · Ghost Mesh
  • Staleness UI + Ambient Task Tracker
  • Unlimited everything · SLA
Join the waitlist →
Business
$100
per month
👥40 seats
Premium feature set scaled for a small team. Centralized admin, SSO, audit log.
  • Everything in Premium · for the whole office
  • Central admin console · group permissions
  • SSO-ready (SAML / OIDC)
  • Hardened audit trail (SIEM export)
  • Encrypted backups to your S3 / R2
  • 99.5% uptime SLA · 24h response
Join the waitlist →
Enterprise
Let's talk.
priced to fit
🔗contact for seats
2+ instances required
On-prem licensing, regulated-industry deployment, HA-ready, direct line to engineering.
  • 2+ instances minimum · HA + failover
  • On-prem & air-gapped deployments
  • Multi-proxy federation across regions
  • BYO KMS / HSM · data-residency rules
  • SOC 2 / HIPAA / GDPR mappings
  • 99.9% uptime SLA · dedicated CSM
Contact sales →
GO
90-second install

Install it.
Point your chat at it.
Done.

No accounts. No credit card. No rewrites. Drop UPtrim on your machine, change one URL in your chat app, and start building a proper memory.