🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead at 5k RPS.
Unified AI Gateway for 30+ LLMs (OpenAI, Anthropic, Bedrock, Azure, etc.) with caching, guardrails, A/B testing, and cost controls. A fast, scalable, Go-native AI gateway and an alternative to LiteLLM and Kong AI Gateway.
Nadir is a Python package that dynamically chooses the best LLM for your prompt by balancing complexity, cost, and response time.
Real-time cost observability for Model Context Protocol (MCP) tool calls. Wraps any MCP server, attributes spend per tool/project/customer. Free tier 25K calls/mo. EU-hosted
Rails-native LLM cost ledger: track spend by provider, model, and feature with self-hosted storage and budget guardrails.
Stop overpaying to run your agents. Kalibr routes every request to lower-cost model and tool paths without degrading performance.
Open-source, FOCUS-aligned FinOps knowledge skill and MCP for AI coding assistants. 28 reference files spanning cloud cost (AWS/Azure/GCP/OCI), AI inference economics, Kubernetes, data platforms, allocation, chargeback, anomaly management, waste detection, and GreenOps. Installs into 11 AI tools. Refreshed bi-monthly. Built by OptimNow.
Know what your AI agents cost. API gateway with budget enforcement, session tracking, and MCP tools.
Tools, libraries, papers, and patterns for reducing the cost of running large language models in production.
A curated list of strategies, tools, papers, and resources for reducing LLM token costs and improving efficiency in production.
Free local CLI that estimates, hard-caps, and losslessly compresses the cost of AI coding agents. Delta-encodes re-read files (37.9% proven on a real OpenAI call). MIT, 100% local.
Real-time LLM token and cost monitor with TLS-intercepting proxy or HTTP relay; cross-platform with macOS status bar app and browser dashboard
Self-hosted spend firewall and gateway for LLMs (OpenAI / Anthropic / Gemini). Hard per-user and per-project budget caps that block runaway costs before the API call, plus cost-per-customer tracking, semantic caching, and failover. One line of code, single Go binary.
Open-source AI + data cost intelligence — 18 connectors (Claude, GPT, Gemini, dbt, warehouses, BI, cloud, CI/CD), cache-tier visibility, anomaly detection, MIT licensed
An LLM Cost Calculator for all the major services
Track, visualize, and optimize LLM API spending. Monitor OpenAI & Anthropic costs per feature, detect waste, suggest savings. Zero-config Python profiler.
Drop-in LLM cost-optimization proxy. Auto-route + cache + compress + batch. Flat monthly pricing by token volume, keep 100% of savings. Free 60M tokens/mo.
Local-first observability for Claude Code - drill into costs, prompts, and tool calls turn by turn. Zero instrumentation.