ClusterMAX 2.0Silver

Firmus / Sustainable Metal Cloud (SMC)

Adequate offering with noticeable gaps compared to Gold or Platinum. Room for improvement.

ByJordan NanosDaniel NishballDylan Patel
Published

Firmus / Sustainable Metal Cloud (SMC) Quick Stats

ClusterMAX Tier
Silver (3 / 5)
Source Rating Cycle
ClusterMAX 2.0
GPUs Offered
GB300 NVL72, GB300
Slurm Support
Discussed in review
Kubernetes Support
Discussed in review
SOC 2 Mentioned
Not flagged
NCCL Benchmarks
In review
Last Updated
Nov 06, 2025

Want to model Firmus / Sustainable Metal Cloud (SMC) cluster cost? Calculate H100, H200, B200 & GB200 NVL72 TCO with the ClusterMAX calculator.

Firmus is an Australian company that was recently backed by a strategic investment from Nvidia at a $1.9B valuation: https://www.afr.com/technology/nvidia-backs-australian-ai-factory-firmus-with-1-9b-valuation-20250915-p5mv0v. Their current ambition is to build a “Stargate for the southern hemisphere,” with a specific focus on next-generation rack-scale systems like the GB300 NVL72 and VR. Though we believe that the bulk of Firmus’s experience with immersion cooling is misguided, and now wasted, we also believe that this team is one of the few in the industry that has the engineering chops to monitor and maintain the physical layer of these DLC systems effectively. Our review of their current telemetry and failure prediction system for their immersion deployments demonstrates significant attention to detail, and a deep understanding of the physical stack, down to the signal quality and light levels in custom transceivers and optical cables. However, this experience at the lowest physical level can be undermined by a higher UX level that feels out-of-touch with customer requirements.

Our testing began with a difficult wrinkle: cluster access is gated behind a mandatory VPN. This is a significant operational bottleneck for teams accustomed to standard cloud workflows with public IPs or streamlined SSH wrappers. While some security-conscious customers (such as international federal agencies for defense, intelligence, and research) may find this acceptable and even prefer isolation at Layer 2,3, 5 or 7, the general public does not operate this way. The fact that Firmus had no alternative access method prepared was telling for us.

Once connected, our slurm environment also had some configuration issues. The standard topology.conf file was not set for topology-aware scheduling, and a simple “srun -N1 –gpus-per-node=8 –pty bash” command took over a minute to execute due to an exceptionally long prolog. It seems that the Firmus team took some of our previous feedback around health checks to an extreme, filling up the prolog with unnecessary dcgm level 3 checks when level 1, 2, or just an epilog with HealthCheckProgram configured would suffice. To their credit, a pre-staged nccl-test script was provided and ran at expected bandwidth.

As mentioned previously, the Firmus monitoring stack is unique, going beyond standard DCGM metrics and feeding ML models to predict component failures before they occur. A “link flap” is formally defined as five events in one hour, triggering automated diagnostics. Their internal validation suite is exhaustive, running regression tests on spare nodes that include P2P bandwidth tests, GDR copies, small-scale llama training runs, and NCCL tests to proactively identify GPUs, NVLink, or InfiniBand interconnects that are approaching failure.

Source: Firmus Custom Monitoring Dashboard for Immersion Tanks

Source: Firmus Customized Grafana Dashboard, showing relevant GPU Utilization Metrics during a training run

This level of investment in monitoring at the physical layer is how Firmus plans to back up an aggressive “99.94% SLA”, aiming to differentiate itself from competitors by ensuring maximum goodput – something that we have also heard from top-tier providers like CoreWeave and Nebius. Their business model mirrors other major Nvidia clouds, with attractive prospective pricing for their upcoming rack-scale deployments, much of which is made possible by a low power cost in their massive expansion into Tasmania. We encourage Firmus to double-down on their focus on operational excellence from the physical layer to the orchestration layer (i.e. properly configured slurm and kubernetes clusters) without getting distracted by fancy PaaS and SaaS applications that the vendor-du-jour is pitching.

Firmus / Sustainable Metal Cloud (SMC) GPU Cloud FAQ

What tier is Firmus / Sustainable Metal Cloud (SMC) in ClusterMAX?

Firmus / Sustainable Metal Cloud (SMC) is rated Silver tier in the ClusterMAX 2.0 GPU cloud rating system by SemiAnalysis (with the ClusterMAX 2.1 Update applied April 2026). Silver is a mid-tier rating in the ClusterMAX rating system. Adequate offering with noticeable gaps compared to Gold or Platinum. Room for improvement.

Is Firmus / Sustainable Metal Cloud (SMC) SOC 2 Type II certified?

Firmus / Sustainable Metal Cloud (SMC)'s ClusterMAX review does not flag a SOC 2 Type II attestation as confirmed. SemiAnalysis treats SOC 2 Type II as a baseline expectation for any GPU cloud serving enterprise or regulated AI workloads — see the ClusterMAX criteria page for the full security baseline.

Does Firmus / Sustainable Metal Cloud (SMC) support Slurm?

Yes. The Firmus / Sustainable Metal Cloud (SMC) review on ClusterMAX covers their Slurm offering — including whether it is managed, self-managed, or runs as Slurm-on-Kubernetes (SUNK, Soperator, or Slinky). See the Orchestration section of the review for the specific Slurm flavor offered and SemiAnalysis' hands-on experience.

Does Firmus / Sustainable Metal Cloud (SMC) support Kubernetes?

Yes. The Firmus / Sustainable Metal Cloud (SMC) review on ClusterMAX covers their Kubernetes offering — whether managed Kubernetes is provided, what control plane is used, and how GPU operator, networking, and storage integrate. See the Orchestration and Storage sections of the review for details.

What GPUs does Firmus / Sustainable Metal Cloud (SMC) offer?

Based on the SemiAnalysis hands-on review, Firmus / Sustainable Metal Cloud (SMC) offers (or has been publicly tied to) the following NVIDIA / AMD GPU SKUs: GB300 NVL72, GB300. Specific inventory, region availability, and on-demand vs reserved access are detailed in the Firmus / Sustainable Metal Cloud (SMC) ClusterMAX review.

What is the NCCL all-reduce performance on Firmus / Sustainable Metal Cloud (SMC)?

The Firmus / Sustainable Metal Cloud (SMC) review on ClusterMAX includes hands-on NCCL all-reduce results from SemiAnalysis testing. NCCL bandwidth (in GB/s) is one of the most important indicators of training cluster health — see the Networking section of the review for the specific numbers and how they compare to the ClusterMAX cohort.

How does Firmus / Sustainable Metal Cloud (SMC) compare to CoreWeave?

CoreWeave is the only ClusterMAX Platinum provider, while Firmus / Sustainable Metal Cloud (SMC) is rated Silver. The Firmus / Sustainable Metal Cloud (SMC) review documents the specific gaps versus CoreWeave across the 10 ClusterMAX criteria (Security, Lifecycle, Orchestration, Storage, Networking, Reliability, Monitoring, Pricing, Partnerships, Availability). See the Firmus / Sustainable Metal Cloud (SMC) review body and the ClusterMAX /criteria page for the full comparison framework.

Is Firmus / Sustainable Metal Cloud (SMC) recommended for LLM training?

Firmus / Sustainable Metal Cloud (SMC) is in a ClusterMAX tier that SemiAnalysis directly recommends for production GPU workloads (Platinum / Gold / Silver / Bronze). The Firmus / Sustainable Metal Cloud (SMC) review details which workload profiles fit best — large-scale pretraining, fine-tuning, on-demand experimentation, or inference — based on hands-on cluster testing.

All ClusterMAX™ 2.0 + 2.1 reviews