ClusterMAX 2.0Underperforming

Dstack Sky

Can rise to Bronze or Silver quickly if critical issues are fixed (security attestation, modern GPUs, etc.).

ByJordan NanosDaniel NishballDylan Patel
Published

Dstack Sky Quick Stats

ClusterMAX Tier
Underperforming (1 / 5)
Source Rating Cycle
ClusterMAX 2.0
GPUs Offered
H100
Slurm Support
Discussed in review
Kubernetes Support
Discussed in review
SOC 2 Mentioned
Yes
NCCL Benchmarks
Not in review
Last Updated
Nov 06, 2025

Want to model Dstack Sky cluster cost? Calculate H100, H200, B200 & GB200 NVL72 TCO with the ClusterMAX calculator.

Dstack the company has a really interesting orchestrator and scheduler that replaces the need for slurm or kubernetes. We love the idea of moving beyond slurm and kubernetes, and have been hearing great reviews from dstack users about their experience. On the flipside, the dstack sky marketplace offering is not at the same level. Dstack sky is a cloud broker that works similarly to their GPU orchestration product by focusing on a CLI-driven approach to provisioning GPU resources. The offering allows users to create three types of resources: a dev environment (a GPU instance accessible via an IDE), a task (a batch job), or a service (a deployed model or web app).

Under the hood, everything is powered by docker containers. As we have mentioned previously when reviewing other marketplaces and brokers, this creates an initial restriction for building developer environments that users must comply with to use the product. However, it is nice to see that dstack does not require users to build from their base image, instead allowing users to bring their own image of choice while dstack adds orchestration on top. We particularly enjoyed the convenience script that automatically edits a users local .ssh/config file to provide quick access to newly created systems.

However, the abstraction comes with a significant lack of transparency. It’s unclear how the underlying GPU provider, or “Backend,” is chosen, and there is no apparent way to view the full list of providers or filter by price when you get “Offers” from the CLI. When requesting an H100, we had no way to distinguish between PCIe and SXM models. During testing he happened to receive 2x SXM GPUs, but this seems to be a matter of chance.

Getting offers for 2x H100 from providers via dstack

Once connected, we found ourselves logged in as root, implying there is no RBAC or shared storage options on the cluster side. The machine we connected to provided a small 100GB root partition, and our connection speed was extremely slow, with seconds of lag for a carriage return to register on our CLI. This was likely due to our instance being provisioned from a provider in Thailand (“Internet Thailand Company Ltd.”). Storage performance, however, was good, taking only 6s to import torch.

This experience highlights the multiple layers of indirection in the Dstack model. We pay Dstack for credits; Dstack then pays a provider like Vast.ai for an instance; Vast.ai in turn pays the end provider to run the container (possibly the provider in Thailand uses a datacenter operator under the hood, too). It’s unclear how many layers exist and who is ultimately responsible for hardware maintenance and security, a significant concern for any serious workload.

An RL eval job on 1x H100, using the verifiers repo

With all this said, we are still able to use dstack to connect to a remote machine with 2x H100 inside VSCode in under 5 minutes, install required software in under 5 minutes, and run an RL rollout for a sample model eval. All in less than 30 minutes, paid for by the minute with existing credits. A nice experience for on-demand development that motivates us to reconsider the use of CLI’s to spin up machines. When it works, it works.

We look forward to testing dstack again in the future, and the company is planning on completing a basic security compliance attestation such as SOC 2 Type 1 soon.

Dstack Sky GPU Cloud FAQ

What tier is Dstack Sky in ClusterMAX?

Dstack Sky is rated Underperforming tier in the ClusterMAX 2.0 GPU cloud rating system by SemiAnalysis (with the ClusterMAX 2.1 Update applied April 2026). Underperforming is flagged by ClusterMAX as underperforming — capable of reaching Bronze or Silver if critical gaps are fixed. Can rise to Bronze or Silver quickly if critical issues are fixed (security attestation, modern GPUs, etc.).

Is Dstack Sky SOC 2 Type II certified?

Dstack Sky's review on ClusterMAX explicitly discusses SOC 2 posture. See the Security section of the Dstack Sky review for the current SOC 2 status, scope of the report, and any related attestations (ISO 27001, HIPAA) tracked by SemiAnalysis.

Does Dstack Sky support Slurm?

Yes. The Dstack Sky review on ClusterMAX covers their Slurm offering — including whether it is managed, self-managed, or runs as Slurm-on-Kubernetes (SUNK, Soperator, or Slinky). See the Orchestration section of the review for the specific Slurm flavor offered and SemiAnalysis' hands-on experience.

Does Dstack Sky support Kubernetes?

Yes. The Dstack Sky review on ClusterMAX covers their Kubernetes offering — whether managed Kubernetes is provided, what control plane is used, and how GPU operator, networking, and storage integrate. See the Orchestration and Storage sections of the review for details.

What GPUs does Dstack Sky offer?

Based on the SemiAnalysis hands-on review, Dstack Sky offers (or has been publicly tied to) the following NVIDIA / AMD GPU SKUs: H100. Specific inventory, region availability, and on-demand vs reserved access are detailed in the Dstack Sky ClusterMAX review.

What is the NCCL all-reduce performance on Dstack Sky?

Dstack Sky's ClusterMAX review does not yet publish hands-on NCCL all-reduce results. NCCL all-reduce bandwidth is the standard SemiAnalysis benchmark for InfiniBand / RoCE health on GPU clusters — see the ClusterMAX /health-checks page for the full benchmark methodology.

How does Dstack Sky compare to CoreWeave?

CoreWeave is the only ClusterMAX Platinum provider, while Dstack Sky is rated Underperforming. The Dstack Sky review documents the specific gaps versus CoreWeave across the 10 ClusterMAX criteria (Security, Lifecycle, Orchestration, Storage, Networking, Reliability, Monitoring, Pricing, Partnerships, Availability). See the Dstack Sky review body and the ClusterMAX /criteria page for the full comparison framework.

Is Dstack Sky recommended for LLM training?

Dstack Sky's current ClusterMAX rating (Underperforming) means SemiAnalysis does not directly recommend Dstack Sky for production LLM training without first addressing the specific gaps called out in the review. See the Dstack Sky review for the gating issues and see the ClusterMAX /cloudreview index for currently recommended alternatives in Platinum / Gold / Silver / Bronze.

All ClusterMAX™ 2.0 + 2.1 reviews