llamacpp-python

Star

Here are 18 public repositories matching this topic...

HuiResearch / FlashTTS

Star

基于SparkTTS、OrpheusTTS等模型，提供高质量中文语音合成与声音克隆服务。

vllm sglang llamacpp-python sparktts spark-tts orpheus-tts megatts3 flashtts

Updated May 18, 2025
Python

lef-fan / aria

Star

A local and uncensored AI entity.

python bot text-to-speech ai deep-learning speech pytorch tts assistant vad speech-to-text voice-assistant large-language-models llm xttsv2 localllama llamacpp-python kokoro-tts

Updated Aug 1, 2025
Python

Belluxx / Perplex

Star

Inspect LLM's logprobs and perplexity over a piece of text, or compare two LLMs (like a git diff)

text-analysis text-processing textanalysis llamacpp local-llm gguf llamacpp-python gguf-models

Updated Mar 23, 2026
Rust

Fortyseven / ircawp

Star

A Slack bot for LLM/ImageGen responses using an OpenAI API backend (LlamaCPP, Ollama, etc)

python slack bot ai llm llms llamacpp llamacpp-python

Updated Jun 15, 2026
Python

sitammeur / Dolphin-llamacpp

Star

Dolphin 3.0 🐬: Versatile AI for coding, math, and more

python gradio gradio-interface huggingface-spaces huggingface-hub llamacpp chatml gguf llama3 llamacpp-python

Updated Mar 12, 2025
Python

Run Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1, and other state-of-the-art language models locally with scorching-fast performance. Inferno provides an intuitive CLI and an OpenAI/Ollama-compatible API, putting the inferno of AI innovation directly in your hands.

text-generation llama llm ollama llamacpp-python gguf-models

Updated Jan 5, 2026
Python

zeeb0tt / runpod-llm

Star

Runpod-LLM provides ready-to-use container scripts for running large language models (LLMs) easily on RunPod.

llm runpod llms llmops llm-serving llamacpp llama-cpp llm-training llm-inference runpod-worker ollama llama-cpp-python ollama-api runpod-serverless runpods llamacpp-python runpod-endpoint

Updated May 20, 2025
Shell

Dhyanesh18 / rag-enterprise-search

Star

A genral RAG Search chatbot, with SoTA RAG techniques such as HyDE, Hybrid retrieval with BM25 + RRF and Cross encoder reranking. Evaluated on the BEIR scifact dataset and compared all the different pipelines i tried along the way

hyde rag cross-encoder biencoder beir chromadb rag-chatbot scifact hybrid-rag llamacpp-python

Updated Aug 23, 2025
Python

sashvat-bharat / model-accelerator

Star

The fastest, most efficient library for running GGUF models with maximum throughput and zero-config hardware optimization.

python chatbot cli-app llama gemma edge-ai model-accelerator ai-assistant gguf llamacpp-python gguf-models qwen3 sashvat sashvat-bharat

Updated May 15, 2026
Python

nithish-ctrl / Agents-From-Scratch-Using-Langgraph

Star

This repository contains concept and code for building AI Agents using langgraph and langchain from absolute basics. These are basic agents built to work with local LLMs.

ai local langchain localllm langgraph aiagents llamacpp-python langgraph-agents

Updated Jun 14, 2026
Python

nandan2003 / edge-cpu-inference

Star

High-performance Local LLM benchmarking and inference toolkit for Edge CPUs. Features automated profiling for GGUF models, RAM/KV-cache footprint analysis, and optimized llama.cpp execution.

python benchmark quantization systems-engineering edge-ai llamacpp llm-inference qwen llama-cpp-python cpu-optimization llamacpp-python

Updated Mar 4, 2026
Python

nithish-ctrl / Coding-Agent-with-Search-Engine-and-Knowledge-Base

Star

This repository contains concept and code for building Daily use AI Agents using langgraph and langchain from absolute basics. From Straight up browsing the internet to coding applications and debugging code, these AI Agents can be used locally with privacy.

search-engine ai knowledge-base ai-agents langchain local-llm langgraph llamacpp-python coding-agent

Updated Jun 15, 2026
Python

sitammeur / Gemma-llamacpp

Star

Gemma 3: Google's multimodal, multilingual, long context LLM.

python gradio gradio-interface huggingface-spaces huggingface-hub llamacpp gguf llamacpp-python gemma3 gemma3-1b-it

Updated May 1, 2025
Python

AubinGil / Scene-narration-with-MetaQuest-3

Star

Using Large Language Vision Assistant(Llava) for scene understanding on MetaQuest 3(VR)

unity vr aiassistant vision-transformer llm llava llamacpp-python metaquest3

Updated Sep 23, 2025
C#

sidmohan0 / codexify-api

Star

FastAPI semantic search + custom entity detection platform.

embeddings named-entity-recognition semantic-search entity-detection embedding-similarity llama3 llamacpp-python

Updated Nov 12, 2024
Python

mixelpixx / terminal-llm_pyqt6

Star

Simple LLM interface based on terminal + GUI option

pyqt6 llamacpp llamacpp-python

Updated Sep 28, 2024
Python

hammerdirt-analyst / integrate_cuda_llamacpp_and_conda

Star

Setting up a local inference environment with llama.cpp and pytorch, with CUDA support . Using huggingface transformers and outlines for structured generation.

integration-testing cuda pytorch outlines llamacpp langchain-python llamacpp-python

Updated Mar 24, 2025
Python

testli-ai / outlines-llama-cpp-python-streaming-output

Star

This repository demonstrates how to use outlines and llama-cpp-python for structured JSON generation with streaming output, integrating llama.cpp for local model inference and outlines for schema-based text generation.

outlines llamacpp llama-cpp llama-cpp-python gguf llamacpp-python gguf-models

Updated Mar 4, 2025
Python

Improve this page

Add a description, image, and links to the llamacpp-python topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llamacpp-python topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llamacpp-python

Here are 18 public repositories matching this topic...

HuiResearch / FlashTTS

lef-fan / aria

Belluxx / Perplex

Fortyseven / ircawp

sitammeur / Dolphin-llamacpp

HelpingAI / inferno

zeeb0tt / runpod-llm

Dhyanesh18 / rag-enterprise-search

sashvat-bharat / model-accelerator

nithish-ctrl / Agents-From-Scratch-Using-Langgraph

nandan2003 / edge-cpu-inference

nithish-ctrl / Coding-Agent-with-Search-Engine-and-Knowledge-Base

sitammeur / Gemma-llamacpp

AubinGil / Scene-narration-with-MetaQuest-3

sidmohan0 / codexify-api

mixelpixx / terminal-llm_pyqt6

hammerdirt-analyst / integrate_cuda_llamacpp_and_conda

testli-ai / outlines-llama-cpp-python-streaming-output

Improve this page

Add this topic to your repo