prompt-compression

Here are 61 public repositories matching this topic...

open-compress / claw-compactor

14-stage Fusion Pipeline for LLM token compression — reversible compression, AST-aware code analysis, intelligent content routing. Zero LLM inference cost. MIT licensed.

Updated Apr 1, 2026
Python

jia-gao / leanctx

Star

Drop-in prompt compression for production LLM apps. Cut your token bill 40-60% without changing your code. Python SDK, LLMLingua-2, MIT.

python gemini openai cost-optimization rag llm langchain anthropic llm-inference prompt-compression langgraph llmlingua

Updated Jun 8, 2026
Python

atjsh / llmlingua-2-js

Star

JavaScript/TypeScript implementation of LLMLingua-2 (Experimental)

nodejs javascript typescript web tensorflow transformers webgpu hf tensorflowjs prompt-engineering transformer-js prompt-compression llmlingua

Updated Sep 14, 2025
TypeScript

chappyasel / meta-kb

Star

A self-improving knowledge base about LLM agent infrastructure

markdown machine-learning ai artificial-intelligence multi-agent knowledge-graph knowledge-base self-learning ai-agents rag autonomous-research llm anthropic prompt-compression agent-skills agent-memory claude-code context-engineering openclaw

Updated Apr 9, 2026
TypeScript

centminmod / or-cli

Sponsor

Star

Python command-line tool for interacting with AI models through the OpenRouter API/Cloudflare AI Gateway, or local self-hosted Ollama. Optionally support Microsoft LLMLingua prompt token compression

openai linkup opik rag openai-api txtai llms llm-inference openrouter ollama cloudflare-ai ollama-api prompt-compression structured-outputs openai-api-client openrouter-api cloudflare-ai-gateway ai-rag llmlingua

Updated Dec 28, 2025

sriinnu / clipforge-PAKT

Sponsor

Star

Lossless-first prompt compression for JSON, YAML, CSV, and Markdown. Library, CLI, MCP server, desktop app, and browser extension.

markdown cli yaml json csv mcp developer-tools lossless-compression llm pakt prompt-compression token-compression coding-agent

Updated May 31, 2026
TypeScript

NodeNestor / claude-rolling-context

Star

Rolling context compression for Claude Code — never hit the context wall. Auto-compresses old messages while keeping recent context verbatim. Zero config, zero latency. Works as a Claude Code plugin.

claude ai-agent anthropic context-window context-management prompt-compression context-compression llm-context ai-coding claude-code claude-code-plugin claude-code-extension rolling-context

Updated Jun 2, 2026
Python

bladysh / exprompt

Star

Reverse T9 for LLMs. Free, open-source prompt compressor for your AI prompts and agents.

cli golang openai developer-tools agents codex text-compression claude llm prompt-engineering llms chatgpt anth prompt-compression

Updated May 17, 2026
Go

pleasedodisturb / awesome-llm-token-optimization

Star

A curated list of strategies, tools, papers, and resources for reducing LLM token costs and improving efficiency in production.

Updated Jun 7, 2026

napmany / cutia

Star

CUTIA: compress prompts while preserving quality

dspy prompt-engineering prompt-compression

Updated Feb 2, 2026
Python

g-akshay / ClaudeShrink

Sponsor

Star

A Claude Code skill that shrinks massive prompts and files using LLMLingua to save tokens.

skills developer-tools claude ai-tools context-window prompt-compression llmlingua claude-code token-optimization claude-skills

Updated Apr 25, 2026
Python

kaistAI / GenPI

Star

This repository is the official implementation of Generative Context Distillation.

agent distillation prompt-injection prompt-compression prompt-internalization context-distillation

Updated May 10, 2025
Python

therohanparmar / t3-toon

Star

TOON for TYPO3 — a compact, human-readable, and token-efficient data format for AI prompts & LLM contexts. Perfect for ChatGPT, Gemini, Claude, Mistral, and OpenAI integrations (JSON ⇄ TOON).

Updated Jun 6, 2026
PHP

gladehq / claude-shorthand

Star

LLMLingua-2 prompt compression hook for Claude Code — cut token usage by ~55%

macos linux cli developer-tools token claude prompt-tuning llm prompt-engineering prompt-compression llmlingua token-optimization claudecode claudecode-hooks claudecode-plugin

Updated Mar 16, 2026
Python

Kir93 / scrooge-mode

Star

Same answer, fewer tokens — KO-first LLM output-compression skill for Claude Code & Codex. A Korean-native caveman alternative, measured on real session output_tokens.

i18n skills korean codex ai-agents llm prompt-compression token-compression claude-code caveman-alternative llm-output-compression ko-first

Updated Jun 6, 2026
JavaScript

simanggu / llm-judgment-control-engine

Star

LLM judgment control layer for drift, memory loss, hallucination, and cost optimization.

ai-safety ai-agents llm long-context langchain llmops context-management prompt-compression llm-ai-agent-llmops-ai-control-orchestration runtime-controller

Updated Apr 24, 2026
Python

VDADev2022 / token-diet

Star

Advanced token reduction and prompt optimization framework for LLMs, featuring linguistic, algorithmic, and architectural patterns.

ai nlp-resources ai-development llm prompt-engineering generative-ai llm-tools token-reduction token-usage llm-optimization context-management prompt-compression agentic-ai llm-efficiency claude-skills claude-skill ai-cost-savings

Updated Apr 25, 2026

AybarsBarut / Nexus-APCP

Star

AI-assisted context management and prompt compression toolkit for developer productivity, ADR workflows, and LLM token optimization.

Updated May 24, 2026
Python

contextcrunch-ai / contextcrunch-python

Star

Compress LLM Prompts and save 80%+ on GPT-4 in Python

python api llm prompt-compression

Updated Jan 17, 2024
Python

PirateBao is a TypeScript/Bun agent-skill package for terse pirate-speak AI coding replies that preserve technical detail while cutting filler, with hooks, compressor CLI, OpenCode/Codex/Claude/Gemini cargo, .bao validation, npmjs gates, and token eval checks.

cli typescript ai opencode npm-package codex ai-agents bun bao prompt-compression gemini-cli agentic-ai ai-skills claude-code token-efficiency coding-agent

Updated Apr 13, 2026
TypeScript

Improve this page

Add a description, image, and links to the prompt-compression topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the prompt-compression topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

prompt-compression

Here are 61 public repositories matching this topic...

open-compress / claw-compactor

jia-gao / leanctx

atjsh / llmlingua-2-js

chappyasel / meta-kb

centminmod / or-cli

sriinnu / clipforge-PAKT

NodeNestor / claude-rolling-context

bladysh / exprompt

pleasedodisturb / awesome-llm-token-optimization

napmany / cutia

g-akshay / ClaudeShrink

kaistAI / GenPI

therohanparmar / t3-toon

gladehq / claude-shorthand

Kir93 / scrooge-mode

simanggu / llm-judgment-control-engine

VDADev2022 / token-diet

AybarsBarut / Nexus-APCP

contextcrunch-ai / contextcrunch-python

d4551 / piratebao

Improve this page

Add this topic to your repo