Elon Musk’s artificial intelligence research company, xAI, recently brought online the supercomputer it calls “Colossus.” Billed as the world’s most powerful AI training system, Colossus was built to push the frontiers of large language models and cement xAI’s position at the forefront of AI innovation.

This image was created with the assistance of DALL·E

The machine is built on 100,000 Nvidia H100 GPUs, which are designed for AI workloads. These GPUs are optimized to run massive models such as xAI’s own Grok-3, the successor to its previous LLM, Grok-2. This computing power allows xAI to train more advanced AI systems, potentially outpacing competitors such as OpenAI’s GPT-4.

The significance of this is hard to overstate. While most large AI systems are trained on tens of thousands of GPUs, the scale of Colossus is unprecedented. For instance, OpenAI’s largest models reportedly run on 80,000 GPUs, which would give Colossus roughly 25 percent more GPUs. Musk said the supercomputer is just the beginning of a larger plan for xAI, with the team looking to expand the system further.
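The size comparison comes down to simple arithmetic, sketched below. Both GPU counts are the figures quoted in this article and are treated as unverified assumptions; the function name is purely illustrative.

```python
# GPU counts as quoted in the article (unverified assumptions).
COLOSSUS_GPUS = 100_000
OPENAI_GPUS = 80_000


def gpu_ratio(a: int, b: int) -> float:
    """Return how many times larger cluster a is than cluster b, by GPU count."""
    if b <= 0:
        raise ValueError("b must be positive")
    return a / b


ratio = gpu_ratio(COLOSSUS_GPUS, OPENAI_GPUS)
print(f"Colossus has {ratio:.2f}x the GPUs, or {100 * (ratio - 1):.0f}% more")
```

GPU count is only a rough proxy for training capability, since chip generation, interconnect, and software also matter.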

One post on the platform X described Colossus as a game-changer in AI, highlighting how quickly the system was developed. The post further praised xAI for “obliterating boundaries,” suggesting that Colossus could redefine AI training in ways previously unimagined.

Colossus was built in 122 days and is located in Memphis, Tennessee. But the project does not come without problems. Colossus has a very large appetite for power, an estimated 150 megawatts, and needs vast amounts of water to keep cool. Musk himself did not dispute that it has an environmental footprint, but he stressed that the benefits of advancing AI technology far outweigh the costs.
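To put the power figure in perspective, here is a rough back-of-the-envelope sketch. It uses only the 150-megawatt and 100,000-GPU figures quoted in this article, and it assumes, purely for illustration, that the facility draws its full estimated power around the clock.

```python
# Figures quoted in the article; continuous full-power operation is an
# illustrative assumption, not a reported fact.
FACILITY_POWER_MW = 150
GPU_COUNT = 100_000
HOURS_PER_YEAR = 24 * 365

# Facility-wide power budget per GPU, in watts. This covers everything on
# site (CPUs, networking, cooling), not just the GPU chips themselves.
watts_per_gpu = FACILITY_POWER_MW * 1_000_000 / GPU_COUNT

# Energy used in one year at constant full draw, in gigawatt-hours.
annual_energy_gwh = FACILITY_POWER_MW * HOURS_PER_YEAR / 1_000

print(f"Power budget per GPU: {watts_per_gpu:.0f} W")
print(f"Energy per year at full draw: {annual_energy_gwh:.0f} GWh")
```

Under these assumptions the budget works out to 1,500 watts per GPU and about 1,314 gigawatt-hours per year. Actual consumption would depend on utilization and cooling efficiency.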

Image source: Envato

Nvidia, which collaborated with xAI on the project, commended the system’s energy efficiency and its potential to redefine the future of AI. Colossus will probably lead to far more capable LLMs, promising not only smarter systems but also more sustainable approaches to training and deploying AI models.