Compute Shard
A live inference swarm running a 120B model across scattered RTX 4090 GPUs.
About
Compute Shard is an experimental distributed inference system that splits a 120B parameter model across multiple scattered consumer GPUs. It features a shared terminal where users can collectively watch the swarm stream output in real-time.
Timeline
Teaser
Video
Playable
Product
Loading…
Media & coverage
sourced from 1 post


