NVIDIA Unveils Rubin CPX GPU to Tackle Long-Context AI Inference



NVIDIA is preparing to expand its artificial intelligence infrastructure lineup with the Rubin CPX, a new GPU designed specifically for exceptionally long context windows. The processor, part of the company's upcoming Rubin series, is expected to become available by the end of 2026 and targets workloads that process sequences of more than one million tokens.

The Rubin CPX is built with a focus on disaggregated inference, an approach that separates the distinct stages of AI inference into compute-bound and memory-bandwidth-bound operations. This separation allows specialized hardware to target each phase more [...]
Tags: AI

Source: Hosting Journalist
URL source: https://hostingjournalist.com/news/nvidia-unveils-rubin-cpx-gpu-to-tackle-long-context-ai-inference