Releases: AtomicBot-ai/atomic-llama-cpp-turboquant
TurboQuant macOS ARM64 (c419fd5)
TurboQuant KV Cache — macOS ARM64 (Metal)
Built from feature/turboquant-kv-cache branch at commit c419fd5.
What's included
- llama-server with --cache-type-k turbo3/turbo4 support
- llama-cli, llama-bench, llama-perplexity
- Metal backend with BF16 + embedded shader library
Usage
# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz
./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3
For Atomic Chat integration
Replace the binary at:
~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server
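The replacement step can be scripted so that the existing binary is backed up first. The following is only a sketch: the helper name replace_llama_server and the .bak suffix are hypothetical and not part of the release, and it assumes that only the llama-server binary needs to be swapped, as stated above. The <version> placeholder is left as-is and must be replaced with the backend version directory present on your machine.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with the release): swap in a new
# llama-server binary and keep a backup of the one it replaces.
replace_llama_server() {
    new_bin="$1"   # path to the extracted build/bin/llama-server
    target="$2"    # path to the llama-server that Atomic Chat uses

    [ -f "$new_bin" ] || { echo "missing new binary: $new_bin" >&2; return 1; }
    [ -f "$target" ]  || { echo "missing target binary: $target" >&2; return 1; }

    # Keep the original next to the target so the swap can be undone.
    cp -p "$target" "$target.bak" || return 1
    cp "$new_bin" "$target" || return 1
    chmod +x "$target"
}

# Example call; substitute the real backend version directory for <version>:
# replace_llama_server ./build/bin/llama-server \
#   "$HOME/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server"
```

To roll back, copy the .bak file over the target again. Quit Atomic Chat before swapping so the binary is not in use.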
TurboQuant macOS ARM64 (9ca009a)
TurboQuant KV Cache — macOS ARM64 (Metal)
Built from feature/turboquant-kv-cache branch at commit 9ca009a.
What's included
- llama-server with --cache-type-k turbo3/turbo4 support
- llama-cli, llama-bench, llama-perplexity
- Metal backend with BF16 + embedded shader library
Usage
# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz
./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3
For Atomic Chat integration
Replace the binary at:
~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server
TurboQuant macOS ARM64 (0dbf74d)
TurboQuant KV Cache — macOS ARM64 (Metal)
Built from feature/turboquant-kv-cache branch at commit 0dbf74d.
What's included
- llama-server with --cache-type-k turbo3/turbo4 support
- llama-cli, llama-bench, llama-perplexity
- Metal backend with BF16 + embedded shader library
Usage
# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz
./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3
For Atomic Chat integration
Replace the binary at:
~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server
TurboQuant macOS ARM64 (0a635dc)
TurboQuant KV Cache — macOS ARM64 (Metal)
Built from feature/turboquant-kv-cache branch at commit 0a635dc.
What's included
- llama-server with --cache-type-k turbo3/turbo4 support
- llama-cli, llama-bench, llama-perplexity
- Metal backend with BF16 + embedded shader library
Usage
# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz
./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3
For Atomic Chat integration
Replace the binary at:
~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server
TurboQuant macOS ARM64 (e381dc9)
TurboQuant KV Cache — macOS ARM64 (Metal)
Built from feature/turboquant-kv-cache branch at commit e381dc9.
What's included
- llama-server with --cache-type-k turbo3/turbo4 support
- llama-cli, llama-bench, llama-perplexity
- Metal backend with BF16 + embedded shader library
Usage
# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz
./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3
For Atomic Chat integration
Replace the binary at:
~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server
TurboQuant macOS ARM64 (dcd8d77)
TurboQuant KV Cache — macOS ARM64 (Metal)
Built from feature/turboquant-kv-cache branch at commit dcd8d77.
What's included
- llama-server with --cache-type-k turbo3/turbo4 support
- llama-cli, llama-bench, llama-perplexity
- Metal backend with BF16 + embedded shader library
Usage
# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz
./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3
For Atomic Chat integration
Replace the binary at:
~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server
TurboQuant macOS ARM64 (b1a7d71)
TurboQuant KV Cache — macOS ARM64 (Metal)
Built from feature/turboquant-kv-cache branch at commit b1a7d71.
What's included
- llama-server with --cache-type-k turbo3/turbo4 support
- llama-cli, llama-bench, llama-perplexity
- Metal backend with BF16 + embedded shader library
Usage
# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz
./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3
For Atomic Chat integration
Replace the binary at:
~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server
TurboQuant macOS ARM64 (514e600)
TurboQuant KV Cache — macOS ARM64 (Metal)
Built from feature/turboquant-kv-cache branch at commit 514e600.
What's included
- llama-server with --cache-type-k turbo3/turbo4 support
- llama-cli, llama-bench, llama-perplexity
- Metal backend with BF16 + embedded shader library
Usage
# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz
./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3
For Atomic Chat integration
Replace the binary at:
~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server
TurboQuant macOS ARM64 (98bbdfe)
TurboQuant KV Cache — macOS ARM64 (Metal)
Built from feature/turboquant-kv-cache branch at commit 98bbdfe.
What's included
- llama-server with --cache-type-k turbo3/turbo4 support
- llama-cli, llama-bench, llama-perplexity
- Metal backend with BF16 + embedded shader library
Usage
# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz
./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3
For Atomic Chat integration
Replace the binary at:
~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server
TurboQuant macOS ARM64 (2e81dc5)
TurboQuant KV Cache — macOS ARM64 (Metal)
Built from feature/turboquant-kv-cache branch at commit 2e81dc5.
What's included
- llama-server with --cache-type-k turbo3/turbo4 support
- llama-cli, llama-bench, llama-perplexity
- Metal backend with BF16 + embedded shader library
Usage
# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz
./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3
For Atomic Chat integration
Replace the binary at:
~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server