- ./llama (interactive menu) or ./llama <cmd> [args]
- start <model> [--bigctx] [--webui]: verify model file, warn before stopping running server, health-wait after start
- stop: stop all llama containers
- status: running model + health + env vars
- logs [--follow]: tail server logs
- build: build TurboQuant images
- bench <model>: run llama-bench via bench profile
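
The "health-wait after start" step could look roughly like the sketch below. This is not the code from the llama script: the function name `wait_healthy`, its arguments, and the retry policy are all hypothetical, and the probe command is passed in by the caller so that no particular endpoint is assumed.

```shell
#!/bin/sh
# Hypothetical helper, not taken from the llama script itself.
# Usage: wait_healthy <max_attempts> <delay_seconds> <probe command...>
wait_healthy() {
  wh_max=$1
  wh_delay=$2
  shift 2
  wh_try=1
  while [ "$wh_try" -le "$wh_max" ]; do
    # The probe is any command whose exit status 0 means "server is ready".
    if "$@" >/dev/null 2>&1; then
      echo "healthy after $wh_try attempt(s)"
      return 0
    fi
    wh_try=$((wh_try + 1))
    sleep "$wh_delay"
  done
  echo "not healthy after $wh_max attempt(s)" >&2
  return 1
}
```

A call such as `wait_healthy 30 2 curl -fsS http://localhost:8080/health` would poll every two seconds for up to 30 attempts; the URL and port here are assumptions for illustration, not values confirmed by the script.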
13 KiB
Executable File