token-reduction

Here are 83 public repositories matching this topic...

ModelTC / LightCompress

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.

benchmark deployment tool evaluation pruning quantization wan awq large-language-models llm token-pruning vllm smoothquant token-reduction mixtral internlm2 token-merging deepseek-v3

Updated May 14, 2026
Python

manojmallick / sigmap

Star

97% token reduction for AI coding sessions — zero deps, 31 languages, MCP server

Updated Jun 9, 2026
JavaScript

edouard-claude / snip

Star

CLI proxy that reduces LLM token usage by 60-90%. Declarative YAML filters for Claude Code, Cursor, Copilot, Gemini. rtk alternative in Go.

Updated Jun 3, 2026
Go

CircleRadon / TokenPacker

Star

The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV2025

connector lmm mllm token-reduction visual-projector tokenpacker

Updated May 26, 2025
Python

jfrog / boost

Star

Less is more. Make your agents smarter and faster. It’s not just about saving time; it’s about the feeling of not wasting it.

acceleration ai cursor observability codex token-reduction context-management claude-code context-engineering token-savings

Updated Jun 10, 2026
Shell

fajarhide / omni

Sponsor

Star

A high-performance Semantic Signal Engine with Context OS for Agentic AI. Run your AI with zero noise, pure context, and 90% lower token costs.

rust cli homebrew hooks mcp ai-agents cost-reduction token-reduction efficiency-tools antigravity context-distillation claude-code token-optimization token-efficiency token-savings

Updated Jun 8, 2026
Rust

A discovery and compression tool for your Python codebase. Creates a knowledge graph for a LLM context window, efficiently outlining your project | Code structure visualization | LLM Context Window Efficiency | Static analysis for AI | Large Language Model tooling #LLM #AI #Python #CodeAnalysis #ContextWindow #DeveloperTools

Updated Dec 5, 2024
Python

Huzaifa785 / context-compressor

Star

AI-powered text compression library for RAG systems and API calls. Reduce token usage by up to 50-60% while preserving semantic meaning with advanced compression strategies.

Updated Aug 16, 2025
Python

brandondocusen / CntxtJS

Star

A lightweight tool to optimize your Javascript / Typescript project for LLM context windows by using a knowledge graph | AI code understanding | LLM context enhancement | Code structure visualization | Static analysis for AI | Large Language Model tooling #LLM #AI #JavaScript #TypeScript #CodeAnalysis #ContextWindow #DeveloperTools

Updated Dec 2, 2024
Python

orailix / PACT

Star

[CVPR 2025] PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Models

token-pruning token-reduction vision-language-models token-merging visual-token-reduction token-clustering positional-bias-migitation-in-pruning

Updated Jan 30, 2026
Python

ZON-Format / zon-TS

Star

ZON → 35-70% cheaper LLM prompts than JSON/TOON. Zero overhead.

json data tokenizer toon claude llm chatgpt token-reduction zon gemini-pro

Updated Apr 27, 2026
TypeScript

xuyang-liu16 / GlobalCom2

Star

[AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models

multi-modal model-compression large-language-models llm token-reduction mllms

Updated Jan 27, 2026
Python

brandondocusen / CntxtCS

Star

A lightweight tool to optimize your C# project for LLM context windows by using a knowledge graph | Code structure visualization | Static analysis for AI | Large Language Model tooling | .NET ecosystem support #LLM #AI #CSharp #DotNet #CodeAnalysis #ContextWindow #DeveloperTools

Updated Dec 3, 2024
Python

devlensio / devlensOSS

Star

An Open Source Intelligent Codebase Visualizer for javascript, reactjs, nextjs and nodejs for easy PR review, fast Onboarding and deep architectural understanding

visualization architecture code-analysis dependency-graph developer-tools dev-tool code-visualization graph-visualizer pr-review token-reduction codebase-visualization code-visualization-tool ai-summaries codebase-visualizer

Updated Jun 5, 2026
TypeScript

oanhduong / token-ninja

Star

token-ninja routes deterministic shell commands locally — zero LLM calls, ~19µs latency. Works silently inside AI tools via MCP.

mcp developer-tools cursor copilot codex ai-tools token-reduction antigravity claude-code token-optimization token-optimizer

Updated Apr 22, 2026
TypeScript

Madhan230205 / token-reducer

Star

⚡ Cut Claude token usage by 90%+ — free, open-source, local-first context compression for Claude Code. Hybrid RAG (BM25 + ONNX vectors), AST chunking, reranking. No API needed.

Updated May 2, 2026
Python

brandondocusen / CntxtJV

Star

A discovery and compression tool for your Java codebase. Creates a knowledge graph for a LLM context window, efficiently outlining your project #LLM #AI #Java #CodeAnalysis #ContextWindow #DeveloperTools #StaticAnalysis #CodeVisualization

Updated Dec 4, 2024
Python

SuppieRK / ccp

Star

CLI proxy for coding agents that cuts noisy terminal output while preserving command behavior

go cli productivity open-source terminal opencode developer-tools command-line-tool codex llm cost-reduction token-reduction ai-coding claude-code agentic-coding

Updated May 30, 2026
Go

AMD-AGI / DUET-VLM

Star

DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference

python training inference pytorch vision-language-model token-reduction brain-genai

Updated May 21, 2026
Python

shubhamV123 / crisp

Star

A terse-output skill for AI agents. Shorter replies without stripping technical details.

productivity skills terse llm token-reduction agent-skills claude-code

Updated May 21, 2026
Shell

Improve this page

Add a description, image, and links to the token-reduction topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the token-reduction topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

token-reduction

Here are 83 public repositories matching this topic...

ModelTC / LightCompress

manojmallick / sigmap

edouard-claude / snip

CircleRadon / TokenPacker

jfrog / boost

fajarhide / omni

brandondocusen / CntxtPY

Huzaifa785 / context-compressor

brandondocusen / CntxtJS

orailix / PACT

ZON-Format / zon-TS

xuyang-liu16 / GlobalCom2

brandondocusen / CntxtCS

devlensio / devlensOSS

oanhduong / token-ninja

Madhan230205 / token-reducer

brandondocusen / CntxtJV

SuppieRK / ccp

AMD-AGI / DUET-VLM

shubhamV123 / crisp

Improve this page

Add this topic to your repo