Skip to content

hal0 docs

Welcome to the hal0 documentation. hal0 is a polished, reliable inference platform for running LLMs at home — it manages model slots, exposes an OpenAI-compatible API, and ships with a built-in dashboard and prewired chat UI.

Status: v1 pre-alpha. The docs below describe v1 as planned — some features (FLM NPU, ROCm/CUDA toolboxes, Hugging Face pulls) are still on the way. Pages marked “Coming soon” are stubs.