What is dstack?
dstack is a streamlined alternative to Kubernetes and Slurm, designed specifically for AI. It simplifies container orchestration for AI workloads both in the cloud and on-prem, speeding up the development, training, and deployment of AI models.
dstack works with any cloud provider as well as on-prem servers.
Accelerators
dstack supports NVIDIA, AMD, Google TPU, and Intel Gaudi accelerators out of the box.
How does it work?
1. Set up the server
Before using dstack, ensure you've installed the server or signed up for dstack Sky.
2. Define configurations
dstack supports the following configurations:
- Dev environments — for interactive development using a desktop IDE
- Tasks — for scheduling jobs, incl. distributed ones (or running web apps)
- Services — for deploying models (or web apps)
- Fleets — for managing cloud and on-prem clusters
- Volumes — for managing network volumes (to persist data)
- Gateways — for publishing services with a custom domain and HTTPS
Configurations can be defined as YAML files within your repo.
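As an illustration of what such a YAML file might look like, here is a minimal sketch of a task configuration. The field names (type, name, python, commands, resources) are recalled from the dstack configuration reference rather than taken from this page, so treat them as assumptions and verify them against the reference for your version. The file name, run name, and train.py script are hypothetical placeholders.

```yaml
# train.dstack.yml (hypothetical file name)
# Sketch of a task configuration. Field names are assumptions recalled from
# the dstack configuration reference; verify them for your dstack version.
type: task            # one of the configuration types listed above
name: train           # hypothetical run name
python: "3.11"        # assumed: Python version to use in the container
commands:             # shell commands run in order inside the container
  - pip install -r requirements.txt
  - python train.py   # train.py is a placeholder training script
resources:
  gpu: 24GB           # assumed: request a GPU with 24GB of memory
```

Assuming the file is saved as train.dstack.yml in the repo, it would typically be passed to the CLI with `dstack apply -f train.dstack.yml`.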
3. Apply configurations
Apply the configuration via the `dstack apply` CLI command or through the programmatic API.
dstack automatically manages infrastructure provisioning and job scheduling, while also handling auto-scaling, port-forwarding, ingress, and more.
Why dstack?
dstack's founder and CEO explains the challenges dstack addresses for AI and Ops teams.
dstack streamlines infrastructure management and container usage, enabling AI teams to work with any framework across cloud platforms or on-prem servers.
Where do I start?
- Proceed to installation
- See quickstart
- Browse examples
- Join Discord