Skip to content

Releases: AtomicBot-ai/atomic-llama-cpp-turboquant

TurboQuant macOS ARM64 (c419fd5)

09 Jun 16:42

Choose a tag to compare

TurboQuant KV Cache — macOS ARM64 (Metal)

Built from feature/turboquant-kv-cache branch at commit c419fd5.

What's included

  • llama-server with --cache-type-k turbo3 / turbo4 support
  • llama-cli, llama-bench, llama-perplexity
  • Metal backend with BF16 + embedded shader library

Usage

# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz

./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3

For Atomic Chat integration

Replace the binary at:

~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server

TurboQuant macOS ARM64 (9ca009a)

09 Jun 16:43

Choose a tag to compare

Pre-release

TurboQuant KV Cache — macOS ARM64 (Metal)

Built from feature/turboquant-kv-cache branch at commit 9ca009a.

What's included

  • llama-server with --cache-type-k turbo3 / turbo4 support
  • llama-cli, llama-bench, llama-perplexity
  • Metal backend with BF16 + embedded shader library

Usage

# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz

./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3

For Atomic Chat integration

Replace the binary at:

~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server

TurboQuant macOS ARM64 (0dbf74d)

09 Jun 16:46
0dbf74d

Choose a tag to compare

Pre-release

TurboQuant KV Cache — macOS ARM64 (Metal)

Built from feature/turboquant-kv-cache branch at commit 0dbf74d.

What's included

  • llama-server with --cache-type-k turbo3 / turbo4 support
  • llama-cli, llama-bench, llama-perplexity
  • Metal backend with BF16 + embedded shader library

Usage

# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz

./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3

For Atomic Chat integration

Replace the binary at:

~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server

TurboQuant macOS ARM64 (0a635dc)

13 May 17:33
0a635dc

Choose a tag to compare

Pre-release

TurboQuant KV Cache — macOS ARM64 (Metal)

Built from feature/turboquant-kv-cache branch at commit 0a635dc.

What's included

  • llama-server with --cache-type-k turbo3 / turbo4 support
  • llama-cli, llama-bench, llama-perplexity
  • Metal backend with BF16 + embedded shader library

Usage

# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz

./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3

For Atomic Chat integration

Replace the binary at:

~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server

TurboQuant macOS ARM64 (e381dc9)

12 May 19:12
e381dc9

Choose a tag to compare

Pre-release

TurboQuant KV Cache — macOS ARM64 (Metal)

Built from feature/turboquant-kv-cache branch at commit e381dc9.

What's included

  • llama-server with --cache-type-k turbo3 / turbo4 support
  • llama-cli, llama-bench, llama-perplexity
  • Metal backend with BF16 + embedded shader library

Usage

# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz

./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3

For Atomic Chat integration

Replace the binary at:

~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server

TurboQuant macOS ARM64 (dcd8d77)

12 May 19:12
dcd8d77

Choose a tag to compare

Pre-release

TurboQuant KV Cache — macOS ARM64 (Metal)

Built from feature/turboquant-kv-cache branch at commit dcd8d77.

What's included

  • llama-server with --cache-type-k turbo3 / turbo4 support
  • llama-cli, llama-bench, llama-perplexity
  • Metal backend with BF16 + embedded shader library

Usage

# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz

./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3

For Atomic Chat integration

Replace the binary at:

~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server

TurboQuant macOS ARM64 (b1a7d71)

12 May 19:13
b1a7d71

Choose a tag to compare

Pre-release

TurboQuant KV Cache — macOS ARM64 (Metal)

Built from feature/turboquant-kv-cache branch at commit b1a7d71.

What's included

  • llama-server with --cache-type-k turbo3 / turbo4 support
  • llama-cli, llama-bench, llama-perplexity
  • Metal backend with BF16 + embedded shader library

Usage

# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz

./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3

For Atomic Chat integration

Replace the binary at:

~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server

TurboQuant macOS ARM64 (514e600)

12 May 20:40
514e600

Choose a tag to compare

Pre-release

TurboQuant KV Cache — macOS ARM64 (Metal)

Built from feature/turboquant-kv-cache branch at commit 514e600.

What's included

  • llama-server with --cache-type-k turbo3 / turbo4 support
  • llama-cli, llama-bench, llama-perplexity
  • Metal backend with BF16 + embedded shader library

Usage

# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz

./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3

For Atomic Chat integration

Replace the binary at:

~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server

TurboQuant macOS ARM64 (98bbdfe)

07 May 10:07
98bbdfe

Choose a tag to compare

Pre-release

TurboQuant KV Cache — macOS ARM64 (Metal)

Built from feature/turboquant-kv-cache branch at commit 98bbdfe.

What's included

  • llama-server with --cache-type-k turbo3 / turbo4 support
  • llama-cli, llama-bench, llama-perplexity
  • Metal backend with BF16 + embedded shader library

Usage

# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz

./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3

For Atomic Chat integration

Replace the binary at:

~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server

TurboQuant macOS ARM64 (2e81dc5)

07 May 17:12
2e81dc5

Choose a tag to compare

Pre-release

TurboQuant KV Cache — macOS ARM64 (Metal)

Built from feature/turboquant-kv-cache branch at commit 2e81dc5.

What's included

  • llama-server with --cache-type-k turbo3 / turbo4 support
  • llama-cli, llama-bench, llama-perplexity
  • Metal backend with BF16 + embedded shader library

Usage

# Option 1: zip (notarized + stapled)
unzip llama-turboquant-macos-arm64.zip
# Option 2: tar.gz
tar -xzf llama-turboquant-macos-arm64.tar.gz

./build/bin/llama-server -m model.gguf --cache-type-k turbo3 --cache-type-v turbo3

For Atomic Chat integration

Replace the binary at:

~/Library/Application Support/Atomic Chat/data/llamacpp/backends/<version>/macos-arm64/build/bin/llama-server