Self-hosting means you deploy and operate the model yourself — your servers or cloud, your GPUs. It keeps data in-house, removes per-call fees at scale, and avoids vendor lock-in, but you own the uptime, scaling, and upgrade burden.
Definition
Running open-weight models on infrastructure you control instead of calling a vendor API. Trades operational work for privacy, control, and lower marginal cost.
Self-hosting means you deploy and operate the model yourself — your servers or cloud, your GPUs. It keeps data in-house, removes per-call fees at scale, and avoids vendor lock-in, but you own the uptime, scaling, and upgrade burden.
Also known as
self-hosted, on-prem