top of page

Your House Just Became a Data Center. Here's What That Actually Means.


Audio cover
Home GPUs

A California startup called SPAN just announced something that would have sounded absurd five years ago: small AI data center nodes — called XFRA units — mounted on the outside walls of homes across America. Backed by NVIDIA and partnered with homebuilder PulteGroup, SPAN is testing a distributed compute network that turns residential electrical capacity into enterprise-grade AI infrastructure.

The hardware is real. The partners are serious. And the timing is not a coincidence.


The Problem They're Solving

Let me reframe this so it lands correctly: SPAN isn't building a quirky consumer gadget. They're attacking the same structural bottleneck I've written about before — the fact that $710 billion in committed hyperscaler capital is sitting on the wrong side of a three-to-five year grid interconnection queue. Building a 100-megawatt traditional data center costs roughly $15 million per megawatt and takes three to five years from groundbreaking to first compute. Grid interconnection alone can add years to that timeline in constrained markets. The demand for AI inference — specifically inference, not training — is growing faster than that construction timeline can accommodate.

SPAN's answer: stop waiting for new grid infrastructure. Use the grid capacity that already exists and is sitting idle inside America's residential neighborhoods.

Their smart electrical panels — the original SPAN product — can identify unused headroom in a home's existing electrical service. That headroom powers the XFRA compute node. No new transmission lines. No substation upgrades. No interconnection queue. The power is already there.


What's Actually on Your Wall

The XFRA units are small white boxes — think HVAC unit form factor — mounted on the exterior of a home. Inside: liquid-cooled NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs. No fans. No noise. The liquid cooling is the key enabler — it's what makes enterprise-grade compute viable in a residential footprint. SPAN claims 8,000 XFRA units can be deployed six times faster and at five times lower cost than an equivalent 100-megawatt centralized data center. That's not a marketing claim — that's an infrastructure math problem they've actually solved by eliminating the construction and grid connection steps entirely.


The NVIDIA angle is significant. The RTX PRO 6000 Blackwell Server Edition is a first-to-market product. SPAN is one of the first deployments. This isn't NVIDIA bolting their logo onto someone's project — they have a strategic interest in proving that distributed inference infrastructure is viable. It diversifies their customer base beyond hyperscalers and opens a completely new deployment surface for their hardware.


The Homeowner Math

Homeowners who host an XFRA node get a SPAN smart panel, battery backup, optional solar, and a fixed discounted rate on electricity and internet. SPAN cited a $150/month electricity bill credit in early reporting. In some deployments, the deal gets more aggressive — free power and internet in exchange for hosting. From the homeowner's perspective, this is essentially a lease arrangement for exterior wall space and electrical headroom they weren't using anyway. The compute load is managed dynamically by SPAN's power orchestration software, so it doesn't compete with the home's own consumption.


PulteGroup's VP of Strategic Sourcing framed it cleanly: it lowers build costs, delivers homes with lower operating costs, and keeps local infrastructure from being overburdened. For a homebuilder competing in markets where utility grid capacity is constraining new construction, that last point is not a footnote — it's a competitive advantage.


What Hyperscalers Are Actually Buying

SPAN isn't selling compute to consumers. The customers on the other end of the XFRA network are hyperscalers, neoscalers, and AI cloud providers who need inference capacity now — not in 2029 when the next traditional data center comes online.

This matters because of what inference actually demands. Training workloads — building the models — requires massive, centralized clusters with ultra-low latency between GPUs. You cannot distribute a training run across residential neighborhoods. But inference — running the models to generate responses — is fundamentally different. It's embarrassingly parallel, latency-tolerant at moderate scales, and benefits from being physically close to end users. A distributed network of XFRA nodes is essentially a CDN for AI compute. Same architectural logic that Cloudflare used to distribute web content to edge nodes — now applied to GPU inference. The hyperscaler taps into the XFRA network the same way they'd tap a traditional data center, and the distributed architecture actually reduces latency for end users compared to compute sitting in a centralized campus in Northern Virginia.


The Counterarguments Worth Taking Seriously

This model has real questions attached to it.


Security and compliance are the obvious ones. Enterprise AI workloads often carry data sensitivity requirements that traditional data centers address with physical security, compliance certifications, and audited access controls. A residential wall box introduces a threat surface that centralized facilities don't have. SPAN will need a rigorous answer for how XFRA nodes handle workload isolation, physical tampering, and chain-of-custody requirements for regulated industries.


Grid resilience is the second question. Residential power is less reliable than utility-grade data center feeds. A neighborhood power outage takes out a cluster of XFRA nodes simultaneously. SPAN's battery backup partially addresses this, but workload orchestration across a distributed network with variable uptime is a materially harder problem than managing uptime in a controlled facility.


The homeowner incentive durability question is the third. A $150/month electricity credit is meaningful today. If SPAN needs to compress that incentive as the network scales and their economics tighten, homeowner participation could erode. The model only works if the supply side stays engaged.


Why This Is a Signal, Not Just a Story

The fact that NVIDIA is a launch partner here is the tell. NVIDIA doesn't put their brand on infrastructure experiments. They back deployment models they believe will scale. Their framing — "low-latency solutions proximal to end users that can scale rapidly" — is a precise description of what inference workloads need. They see XFRA as a new deployment surface for Blackwell GPUs, one that doesn't depend on hyperscaler capex timelines. The deeper implication: the data center industry is bifurcating. Centralized hyperscale campuses will continue to dominate training workloads. But inference — which will account for more than half of all AI compute by 2030 according to SPAN's own projections — is moving to the edge. XFRA is a bet on that bifurcation being real and being soon.


SPAN is aiming for gigawatt scale by 2027. Whether they get there depends on how fast PulteGroup and other builders can integrate XFRA into new construction pipelines, and whether the hyperscaler demand signal holds. But the architecture is sound, the partners are credible, and the problem they're solving is real.

The power constraint isn't going away. The people finding creative ways around it — not through it — are the ones who will own the next layer of AI infrastructure.



Comments


Animated coffee.gif
cup2 trans.fw.png

© 2018 Rich Washburn

bottom of page