llama-cpp/llama at master

Files

Giancarmine Salucci e7e389c0e1 llama+compose: fix bigctx startup timing

- compose: increase start_period for bigctx services
  - gemma4-e4b-bigctx: 60s -> 150s (5 GiB model + warmup + 163840 ctx takes ~90-120s)
  - gemma4-e2b-bigctx: 60s -> 120s (large ctx 393216 allocation)
  - smollm3/qwen3-4b bigctx: 60s -> 90s
- llama: extend health poll from 30x2s=60s to 75x2s=150s
- llama: require 3 consecutive unhealthy before giving up (avoids
  false positives during Docker start_period window)

2026-05-06 19:03:31 +02:00

14 KiB

Executable File

Raw Permalink Blame History

View Raw

14 KiB Executable File Raw Permalink Blame History

14 KiB

Executable File

Raw Permalink Blame History