Trajbl pilot

Generate a key. Copy a prompt. Let your agent verify Trajbl.

Trajbl is a post-retrieval context compressor for RAG, agent memory, and knowledge-base chat. It keeps the evidence sentences that matter and sends a smaller packet to the LLM.

The free pilot flow is intentionally simple: create a key, copy a setup prompt into your coding agent, and let the agent run safe synthetic checks before enabling anything permanent.

1. Generate API key
2. Copy setup prompt
3. Agent checks safety
4. Turn ON after approval


Four simple Trajbl setup steps: select your app type, generate your free API key, copy the setup prompt, and let your coding agent verify everything

Trajbl cuts retrieved context before the LLM bill while preserving answer quality.

+72% relative F1 lift vs LLMLingua2: controlled long-context benchmark, similar token budget

~88% context token reduction: measured against full-context RAG reference

68% RAG quality retained: while sending a much smaller context packet

0 GPU/model dependency: Trajbl compression is CPU-first and model-free

Controlled benchmark signal

Higher answer quality than LLMLingua2 at a similar token budget.

On controlled public benchmark suites including LongBench-style multi-hop QA and WixQA-style support workloads, Trajbl showed stronger quality-per-token than LLMLingua2 while remaining CPU-first and model-free.

Trajbl vs LLMLingua2 (higher quality and fewer tokens are better)

System              Quality (F1)   Tokens
Full RAG reference  0.587          10,593
Trajbl              0.400          1,313
LLMLingua2          0.233          1,405
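The headline percentages can be re-derived from the comparison above. The short Python sketch below does only that arithmetic; the dictionary keys and the helper name are illustrative choices, not part of any Trajbl API.

```python
# Figures copied from the comparison above (F1 score, context tokens).
results = {
    "full_rag": {"f1": 0.587, "tokens": 10_593},
    "trajbl": {"f1": 0.400, "tokens": 1_313},
    "llmlingua2": {"f1": 0.233, "tokens": 1_405},
}


def relative_lift(new: float, base: float) -> float:
    """Relative improvement of `new` over `base`, as a fraction."""
    return (new - base) / base


# F1 lift of Trajbl over LLMLingua2 at a similar token budget.
f1_lift = relative_lift(results["trajbl"]["f1"], results["llmlingua2"]["f1"])

# Share of context tokens removed relative to the full RAG reference.
token_reduction = 1 - results["trajbl"]["tokens"] / results["full_rag"]["tokens"]

# Share of the full RAG reference F1 that Trajbl keeps.
quality_retained = results["trajbl"]["f1"] / results["full_rag"]["f1"]

print(f"Relative F1 lift vs LLMLingua2: {f1_lift:.0%}")    # 72%
print(f"Context token reduction: {token_reduction:.0%}")   # 88%
print(f"RAG quality retained: {quality_retained:.0%}")     # 68%
```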

Plain English

Trajbl saves money by sending less text to the LLM.

Most AI apps charge, directly or indirectly, for tokens. Tokens are the pieces of text sent to and from the model. When Claw-like agents, coding assistants, Obsidian vault chat, Open WebUI, AnythingLLM, PrivateGPT, or enterprise RAG tools send huge context packets, you pay for that extra text. Trajbl reduces that packet before the expensive model call.

Claw, Claude Code, Codex-like agents

A coding agent often sends repo files, memory, tool notes, and conversation history to the LLM. Trajbl cuts that packet before the model call, so the same task can burn fewer paid input tokens.

Obsidian, vault chat, knowledge-base chat

These tools retrieve notes, PDFs, transcripts, and docs. Trajbl sits after retrieval and sends only the strongest evidence sentences instead of dumping every retrieved chunk.

Simple money example

If your app normally sends about 10,593 context tokens, the measured Trajbl path sent about 1,313. That is roughly 88% fewer input tokens before the model call.

Think of it like trimming a 20-page attachment down to the 4 pages that actually answer the question.

The LLM still gets the important evidence, but you stop paying it to reread irrelevant surrounding text on every request.
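To put a dollar figure on the example above, here is a rough Python sketch. The price per million input tokens and the monthly request volume are hypothetical placeholders, so substitute your own provider's rate and traffic. It counts context tokens only; the question, system prompt, and output tokens are billed separately and are not affected.

```python
def monthly_input_cost(tokens_per_request: int,
                       requests_per_month: int,
                       usd_per_million_tokens: float) -> float:
    """Input-token spend per month for the context packet alone."""
    total_tokens = tokens_per_request * requests_per_month
    return total_tokens * usd_per_million_tokens / 1_000_000


# Hypothetical assumptions, not measured values or real provider pricing.
USD_PER_MILLION_INPUT_TOKENS = 3.00
REQUESTS_PER_MONTH = 100_000

# Context sizes taken from the example above.
before = monthly_input_cost(10_593, REQUESTS_PER_MONTH, USD_PER_MILLION_INPUT_TOKENS)
after = monthly_input_cost(1_313, REQUESTS_PER_MONTH, USD_PER_MILLION_INPUT_TOKENS)

print(f"Full context: ${before:,.2f} per month")
print(f"Compressed:   ${after:,.2f} per month")
print(f"Difference:   ${before - after:,.2f} per month")
```

Under these assumed numbers the context portion of the bill drops from about $3,178 to about $394 per month. The ratio stays the same at any price, because it depends only on the token counts.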

Why Trajbl

A compressor that fits the way real RAG systems already work.

Post-retrieval compressor

Trajbl plugs in after your existing RAG, agent memory, or vault retrieval step and packs the retrieved top-k context before it reaches the LLM.

Whole evidence sentences

It keeps readable source sentences instead of broken token fragments, which makes the final packet easier to inspect and audit.
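To make the idea of post-retrieval, whole-sentence compression concrete, here is a deliberately naive Python sketch. It is not Trajbl's algorithm, which this page does not describe. The function names, the word-overlap scoring, and the use of word counts as a stand-in for tokens are all assumptions made for illustration.

```python
import re


def split_sentences(text: str) -> list[str]:
    """Naive splitter: break after ., ! or ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def terms(text: str) -> set[str]:
    """Lowercased alphanumeric words, used for overlap scoring."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def compress(query: str, chunks: list[str], max_words: int) -> str:
    """Keep whole sentences that share words with the query, within a budget.

    Illustrative only: scoring is plain word overlap, and the budget is
    counted in words rather than real model tokens.
    """
    query_terms = terms(query)
    sentences = [s for chunk in chunks for s in split_sentences(chunk)]

    # Rank sentences by overlap with the query, highest first.
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: len(query_terms & terms(sentences[i])),
        reverse=True,
    )

    kept, used = [], 0
    for i in ranked:
        overlap = len(query_terms & terms(sentences[i]))
        cost = len(sentences[i].split())
        if overlap == 0 or used + cost > max_words:
            continue  # skip irrelevant or over-budget sentences
        kept.append(i)
        used += cost

    # Emit the kept sentences in their original order so the packet stays readable.
    return " ".join(sentences[i] for i in sorted(kept))


retrieved = [
    "The billing API was launched in 2021. Refunds are processed within "
    "five business days. Our office is in Berlin.",
    "Customers can request refunds from the account page. "
    "The cafeteria serves lunch at noon.",
]
packet = compress("How are refunds processed?", retrieved, max_words=20)
print(packet)
```

The packet that reaches the LLM contains only the two refund sentences, each intact and traceable to its source chunk, which is the property that makes a compressed packet easy to inspect.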

CPU-first, no GPU

No model server, no GPU, no training loop. The benchmarked Trajbl path runs locally as a lightweight preprocessing layer.

Built for private workflows

Designed for knowledge-base chat, agent memory, Claw-like developer agents, and private RAG systems where context cost and evidence control matter.

Pilot and strategic collaboration

Bring Trajbl into a real RAG or agent workflow.

We are preparing plug-and-play pilots for AI agents, knowledge-base chat, and private memory systems. The public demo link may remain offline while benchmark validation continues.
