Explore our collection of open-source example and demo apps by the Together AI team.
Open-source video dubbing: translate any video into 33 languages with native-sounding voice-over and subtitles
A simple Next.js chatbot that uses Together AI LLMs for inference
Explore and use Flux LoRAs to generate images in different styles
A simple example app that shows how to use logprobs to get token probabilities from LLMs
Generate a personal website from your LinkedIn profile or resume
An easy way to split restaurant bills with OCR using vision models on Together AI
Summarize PDFs into beautiful sections with Llama 3.3 70B
Generate reports using our open-source Deep Research implementation
Edit any image with a simple prompt using Flux Kontext
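
The logprobs demo listed above is about turning log-probabilities into probabilities. The arithmetic behind that idea can be sketched in a few lines of Python. This is a minimal illustration, not the demo's actual code: the function names `logprobs_to_probs` and `sequence_probability` are hypothetical, and the sample tokens and values are made up rather than taken from any Together AI response.

```python
import math


def logprobs_to_probs(token_logprobs):
    """Convert (token, logprob) pairs into (token, probability) pairs.

    A log-probability is the natural log of a probability, so exp() undoes it.
    """
    return [(token, math.exp(lp)) for token, lp in token_logprobs]


def sequence_probability(token_logprobs):
    """Probability of the whole sequence: sum the logprobs, then exponentiate."""
    return math.exp(sum(lp for _, lp in token_logprobs))


# Hypothetical (token, logprob) pairs, invented for illustration only.
sample = [("Yes", -0.105), (".", -0.693)]

probs = logprobs_to_probs(sample)
for token, p in probs:
    print(f"{token!r}: {p:.3f}")

print(f"sequence: {sequence_probability(sample):.3f}")
```

Summing logprobs before exponentiating, rather than multiplying per-token probabilities, helps avoid numerical underflow on long sequences.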