What It Does
What changes when UPtrim sits between your chat app and your LLM.
Context Trimming
Long chats degrade as older messages fall out of the context window. UPtrim compresses earlier turns and keeps the important parts visible.
Click to learn moreMulti-User, Isolated
Multiple users, one proxy. Each gets isolated memory and files — nothing crosses over.
Click to learn morePersistent Memory
NLP extracts facts from your conversations — name, preferences, projects — and stores them in SQLite. They show up again when relevant.
Click to learn moreFile Uploads
Upload PDFs, code, spreadsheets, docs. The LLM pulls relevant chunks mid-conversation. 33+ formats supported.
Click to learn moreWeb Dashboard
View and edit memories, manage users, change settings. Real-time stats at localhost:9099.
Click to learn moreDrop-In Compatible
Change your API URL to localhost:9099. Works with Open WebUI, SillyTavern, and anything OpenAI-compatible.
Click to learn moreSetup Takes 2 Minutes
Download, run, change one URL.
Download
Grab the release for your OS from GitHub. Run it.
Redirect
Change your chat app's API URL to localhost:9099. UPtrim forwards everything to your LLM.
Done
Chat normally. Memory builds in the background. Open the dashboard to see what it knows.
The Dashboard
Memories, users, stats, settings. All at localhost:9099.
psst... type “snake” anywhere
Paid Tiers
The free key gets you started. Paid licenses add background agents and higher limits.