Looking Ahead to 2024: How X-Cloud Is Using NVIDIA GPUs to Advance the New Era of AI and Machine Learning

X-Cloud raised $100M in strategic funding in December 2022 to help build the infrastructure companies need for the future of AI.

It’s exciting to see how moments this year helped lay the groundwork for what’s promising to be an eventful, fast-moving future for AI. This month, we announced that Magnetar has backed X-Cloud with a $100 million investment. This investment is going directly into scaling our infrastructure, enabling us to deliver a massive capacity of NVIDIA GPU-accelerated compute to AI and machine learning companies, VFX studios, and other enterprise and start-up businesses.

Earlier this year, GPT-NeoX-20B became the largest publicly available language model. Trained by EleutherAI on the X-Cloud NVIDIA A100 Tensor Core GPU training cluster, the release offered a glimpse into the next generation of open-source AI systems. Since then, X-Cloud serverless Kubernetes infrastructure has grown in demand among businesses and researchers looking to build innovative products powered by LLMs.

Generative AI is already becoming more ubiquitous in our everyday lives. In the past few weeks, ChatGPT, a large language model (LLM) from OpenAI, gained 1 million new users testing out the platform in the first week of its launch. Friends and family flooded our social feeds with AI-generated self-portraits, notably from Lensa AI, which used the Stable Diffusion model. While these models are not without their own kinks to smooth out, together they show where the industry is heading: forward, and fast.

If it feels hard to keep up with the pace at which AI is moving mainstream, you’re not alone. But it’s only going to move faster. That raises the question: Is this growth sustainable? Does the industry have the infrastructure in place to support the speed at which AI and machine learning are growing—and the immense amount of data that comes with it?

As AI and machine learning continue to rapidly evolve, demand for infrastructure rises with it. Start-ups and enterprises alike will need reliable, flexible, and highly available compute resources at affordable prices to fuel their growth.

NVIDIA H100: Take an order-of-magnitude leap in accelerated computing

In Q1 2023, clients will have access to NVIDIA H100 Tensor Core GPUs on X-Cloud, making us one of the first providers to offer cloud instances of NVIDIA H100-based supercomputers.

The NVIDIA H100 Tensor Core GPU delivers unprecedented performance, scalability, and security for every workload. NVIDIA H100 GPUs feature fourth-generation Tensor Cores and the Transformer Engine with FP8 precision, further extending NVIDIA’s market-leading AI leadership with up to 9x faster training and an incredible 30x inference speedup on large language models. For high-performance computing (HPC) applications, the H100 delivers up to 7x higher performance.

As an X-Cloud client, you can reserve capacity of NVIDIA H100 GPUs today, available at scale in Q1 2023 and starting at $2.23/hr.

New capabilities, integrations, and features on X-Cloud

This year, we expanded our offerings and partnerships to bring more value to clients and our growing community. Some highlights from 2022:

Object storage: a simple and powerful storage solution for high volumes of unstructured data, starting at $0.03 / GB per month, with no access or transfer fees
New integrations with innovative partners: including Determined.AI for large-scale AI training, Zeet for managing Kubernetes infrastructure, and EleutherAI to help make open-source LLMs more accessible.
New AI examples: added library of examples to fine-tune and serve LLMs and generative AI models, like BLOOM and Stable Diffusion.
Goose AI: a fully managed API to serve pre-trained, open-source models

All these changes reflect the X-Cloud values of fostering creativity and innovation. Looking to 2023, we have an exciting product roadmap and announcements that enable us to deliver on our commitments to our clients and extend value to their businesses.

Client spotlight: Incredible creations that blew us away

Speaking of building, our clients continue to amaze us with their ingenuity and products, from innovations in AI that fundamentally change how people engage with technology to more efficient, cloud-based workflows empowering VFX and Animation studios to create better content faster than ever before.

VFX studio finds compute solution for remote teams

Spire Animation Studios took to the virtual stage with their presentation at NVIDIA GTC 2022, in partnership with X-Cloud. Spire’s Engineering leadership shared how their team leveraged a cloud-native workflow to develop an end-to-end animated feature film using the Unreal Engine.

This allowed Spire to flip the traditional, linear animation pipeline structure on its head. Now, Spire’s team can iterate in real-time across the country.

“By having faster iterations, we are able to get more of them, and the refinement process, whether we want to give reviews, change the lighting or move characters, can be done interactively, rather than waiting 2 or 3 hours for a render to come back in a traditional setting. We can make our decisions quickly because everything is interactive and everything is at full resolution.”

Rajesh Sharma, VP of Engineering at Spire Animation Studios