local-ai-llm

Here are 31 public repositories matching this topic...

AVADSA25 / codec

Open-Source Intelligent Command Layer

open-source self-hosted mac-os mlx voice-assistant python-automation opensource-projects voice-assistant-ai llm-agent local-ai qwen voice-assistant-free self-hosted-ai local-ai-development llm-agent-framework local-ai-agents local-ai-llm

Updated Jun 9, 2026
Python

pythongiant / KVBoost

Star

Make local LLM inference faster with chunk-level KV cache reuse

kv-cache llm open-llm local-llm llm-inference local-ai llm-optimization local-ai-llm kv-cache-lp

Updated Jun 5, 2026
Python

wajason / Nano_Cinema_AI_Video_Studio

Star

🎬 Nano Cinema: An all-in-one local AI video production studio. Automatically orchestrates Llama-3 (Script), SDXL-Turbo (Visuals), EdgeTTS (Audio), and LTX-Video (Motion) into a seamless Python workflow. Create cinematic short films with no API fees, full privacy, and professional-grade editing logic included!!! 🚀

python automation ffmpeg gradio short-film text-to-video gradio-interface ai-video-generator local-ai text-to-video-generation sdxl-turbo llama-3 edgetts local-ai-development ltx-video local-ai-llm ai-video-generator-tool

Updated Feb 5, 2026
Jupyter Notebook

tomaszwi66 / AetherMind

Star

Local-first Personal AI Memory OS - RAG over your entire life. Git, notes, calendar, location. 100% offline. No cloud.

python open-source privacy ai self-hosted streamlit vector-database second-brain llm personal-productivity qdrant-vector-database ollama rag-chatbot local-ai-llm

Updated Apr 15, 2026
Python

abendrothj / LAO

Star

Cross-platform desktop tool for chaining local AI models and plugins into powerful, agentic workflows. It supports prompt-driven orchestration, visual DAG editing, and full offline execution.

rust modular offline-first plugins rust-lang rustlang expandable workflow-orchestrator agentic-ai local-ai-llm

Updated Feb 23, 2026
Rust

fsm-cpp / hybrid-ai-agent

Star

An intelligent local AI agent powered by open-source LLMs, featuring free web search, hybrid memory, and context-aware query rewriting for real-time, grounded answers.

nlp agent open-source search-engine memory chatbot rag llm local-ai-agent local-ai-llm

Updated Jan 10, 2026
Python

yongmmin / web-hwp-editor

Star

HWP / HWPX files are a web-based editor that can be opened and edited directly in the browser. You can modify Hangul documents without installing any separate program, and even use local AI (OLLAMA) to get Korean synonym suggestions.

web-editor llama hwp edtior hwp-viewer hwpx local-ai local-ai-llm

Updated Apr 21, 2026
TypeScript

AnuragGupta93 / LocalEcho

Star

**LocalEcho** is a fully local, open-source Text-to-Speech engine powered by **Qwen3 TTS** models

open-source tts mlx qwen3 qwen3-tts local-ai-llm

Updated Feb 22, 2026
TypeScript

CosmonautCode / Tiny-Local-LLM-System

Star

A lightweight, self-contained Python project for running a local large language model (LLM) with minimal dependencies. This system uses TinyLlama-1.1B-Chat-v1.0.0 and llama-cpp-python for inference, and Rich for a user-friendly console chat interface

ai large-language-models llm llms large-language-model local-llm local-ai-llm

Updated Feb 6, 2026
Python

arshawnarbabi / LocalNotch

Star

Local AI assistant that lives in your MacBook's notch. Powered by Ollama — chat, vision, web search, and an autonomous file-system Agent Mode (beta), all on-device.

macos swift privacy macbook vision notch menu-bar swiftui menu-bar-app ai-assistant ai-agent llm local-llm ollama ollama-api local-ai-llm

Updated Jun 9, 2026
Swift

Mr-DS-ML-85 / chimera-ai-gateway

Star

One API. 20+ AI Providers. Smart Routing. Strong Security. Self-hosted OpenAI-compatible gateway with intelligent fallback & battle-tested defenses.

ai mistral ai-agents ai-api ai-tools prompt-injection local-ai ollama llm-proxy ai-gateway deepseek llm-orchestration ai-infrastructure ai-proxy self-hosted-ai openai-compatible model-routing local-ai-llm

Updated May 28, 2026
Python

barek2k2 / local_llm

Star

Lightweight Ruby gem for interacting with locally running Ollama LLMs with streaming, chat, and full offline privacy.

ruby security machine-learning privacy ai chatbot artificial-intelligence ruby-on-rails private-chat data-security data-privacy-compliance llm local-ai offline-ai local-ai-development local-ai-llm

Updated Dec 6, 2025
Ruby

YASSER-27 / RUN-GGUF

Star

Run model GGUF gui esay ,faster,localy 100%

github ai llama on-device-ai llm llms llamacpp llama-cpp llm-inference local-ai gguf llama3 local-ai-llm gemma4 gguf-gui

Updated Apr 14, 2026
JavaScript

lufermalgo / voxa

Star

Free macOS dictation with local AI profiles — Wispr Flow alternative powered by Whisper + llama.cpp. No cloud, no subscription.

Updated Jun 5, 2026
Rust

ceylonai / layerrun

Star

LayerRun is a Rust-based local LLM runtime for memory-aware model execution, layer-wise loading, model inspection, and flexible inference serving.

local-first local-ai llm-runtime local-ai-llm local-ai-models

Updated Jun 8, 2026
Rust

swdit / Setup-Guide-Headless-Ubuntu-AI-Mini-PC-for-LM-Studio

Star

Setup guide for AI-Mini PC. For hosting local LLM's via LM-Studio as RDP/headless-GUI Setup. In this example we'll use a Minisforum AI X1 Pro, AMD Ryzen AI 9 HX 370 / 64GB RAM

ubuntu xrdp local-ai lm-studio local-ai-llm

Updated Dec 27, 2025

BorisHrzenjak / OllamaBrah

Star

Local AI desktop app built for a single user. No accounts. No teams. No telemetry. Just you and your models.

desktop-app chat privacy ai ai-agents llm llamacpp local-ai ollama local-ai-agents local-ai-llm at-tools

Updated Apr 20, 2026
JavaScript

olesxg / flap

Star

machine-learning automation gpu machine-learning-algorithms pytorch fine-tuning machine-learning-projects llm llms llm-training local-ai fine-tuning-llm fine-tuning-model local-ai-llm

Updated Mar 14, 2026

ChaitanyaParate / Deskai

Star

Local-first desktop AI daemon that runs fully offline. Tracks active desktop context, exposes a CLI, streams responses from local LLMs via Ollama, and runs as a systemd user service. Built for systems-level learning: IPC, daemons, streaming inference, OS integration.

linux streaming daemon ipc x11 python3 unix-socket systemd-service command-line-interface desktop-assistant ocr-python llm ollama local-ai-llm

Updated Jun 13, 2026
Python

Happynood / llm-inference-benchmark

Star

Reproducible LLM inference optimization lab for comparing backends, quantization, latency, VRAM, and output sanity on local hardware.

python benchmark machine-learning perfomance cuda inference nvidia reproducibility quantization llm llama-cpp llm-inference llama-cpp-python gguf local-ai-llm

Updated Jun 14, 2026
Python

Improve this page

Add a description, image, and links to the local-ai-llm topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the local-ai-llm topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

local-ai-llm

Here are 31 public repositories matching this topic...

AVADSA25 / codec

pythongiant / KVBoost

wajason / Nano_Cinema_AI_Video_Studio

tomaszwi66 / AetherMind

abendrothj / LAO

fsm-cpp / hybrid-ai-agent

yongmmin / web-hwp-editor

AnuragGupta93 / LocalEcho

CosmonautCode / Tiny-Local-LLM-System

arshawnarbabi / LocalNotch

Mr-DS-ML-85 / chimera-ai-gateway

barek2k2 / local_llm

YASSER-27 / RUN-GGUF

lufermalgo / voxa

ceylonai / layerrun

swdit / Setup-Guide-Headless-Ubuntu-AI-Mini-PC-for-LM-Studio

BorisHrzenjak / OllamaBrah

olesxg / flap

ChaitanyaParate / Deskai

Happynood / llm-inference-benchmark

Improve this page

Add this topic to your repo