Most Viewed Content:

Cygnus space cargo spacecraft arrives at International Space Station with only half of its solar array

NASA astronaut Nicole Mann, backed up by NASA astronaut...

Google to bring PWA application backup & restore function for Chrome/android

According to thespAndroid reports, GitHub's Chromium repository recently added...

OpenAI Launched Assistants API, Allowing Developers to Customize AI Assistants with One Click

At today's OpenAI's first developer conference, OpenAI launched the...

NVIDIA uses AI to design and develop GPUs – Latest Hopper already has 13,000 circuit instances

Over the past few years, NVIDIA has been deep into the AI space, and their GPUs have become the first choice not only for HPC but also for data centers, including AI and deep learning ecosystems. In a newly published developer blog post, NVIDIA announced that they are using AI to design and develop GPUs, and that their latest Hopper GPU has nearly 13,000 circuit instances that were created entirely by AI.

In a new blog posted on NVIDIA Develope, the company reiterates its strengths and how it itself used its AI capabilities to design its most powerful GPU to date, the Hopper H100. NVIDIA GPUs are primarily designed using state-of-the-art EDA (electronic design automation) tools, but with the help of AI that leverages the PrefixRL approach help, using deep reinforcement learning to optimize parallel prefix circuits, the company was able to design smaller, faster and more energy-efficient chips while delivering better performance.

Arithmetic circuits in computer chips are constructed using networks of logic gates (such as NAND, NOR and XOR) and wires. The ideal circuit should have the following characteristics.

● Small: Smaller area so that more circuits can be mounted on the chip.

● Fast: lower latency to improve chip performance.

● Consume less power: lower power consumption of the chip.

NVIDIA has designed nearly 13,000 AI-assisted circuits using this approach, reducing their area by 25% compared to equally fast and functionally identical EDA tools. But PrefixRL was mentioned as a very computationally demanding task, and for each GPU physically simulated, it required 256 CPUs and over 32,000 GPU hours. To remove this bottleneck, NVIDIA developed Raptor, an in-house distributed reinforcement learning platform that specifically leverages NVIDIA hardware for this industrial reinforcement learning.

Raptor has several features that improve scalability and training speed, such as job scheduling, custom networks, and GPU-aware data structures. In the context of PrefixRL, Raptor enables mixed job allocation across CPU, GPU, and Spot instances.

The networks in this reinforcement learning application are diverse and benefit from the following.

● Raptor’s ability to switch between NCCLs for peer-to-peer transfers to transfer model parameters directly from the learner GPU to the inference GPU.

● Redis for asynchronous and smaller messages, such as rewards or statistics.

● A JIT-compiled RPC for handling high-volume and low-latency requests, such as uploading experience data.

NVIDIA concluded that applying AI to real-world circuit design problems could lead to better GPU designs in the future. The full paper is here, and you can visit the developer blog here for more information.

Latest

Starting from 48,900, Geely Panda Karting officially starts pre-sale

Geely Panda Karting officially started pre-sale. The pre-sale price...

Ford: Expand charging network, fuel/ hybrid/ pure electric in parallel

Recently, Ford released the company's comprehensive annual report for...

Chery’s two new cars are exposed, targeting overseas markets

Recently, some media exposed the actual cars of two...

New Trumpchi Shadow Leopard to launch on May 1, upgraded performance rims

Recently, we learned from the official that the 2024...

Newsletter

Don't miss

Starting from 48,900, Geely Panda Karting officially starts pre-sale

Geely Panda Karting officially started pre-sale. The pre-sale price...

Ford: Expand charging network, fuel/ hybrid/ pure electric in parallel

Recently, Ford released the company's comprehensive annual report for...

Chery’s two new cars are exposed, targeting overseas markets

Recently, some media exposed the actual cars of two...

New Trumpchi Shadow Leopard to launch on May 1, upgraded performance rims

Recently, we learned from the official that the 2024...

Samsung Galaxy S25 Ultra expected to feature 5000mAh + 45W Combo

Technology media WccFtech recently reported that Samsung will not...
Threza Gabriel
Threza Gabrielhttps://www.techgoing.com
Threza Gabriel is a news writer at TechGoing. TechGoing is a global tech media to brings you the latest technology stories, including smartphones, electric vehicles, smart home devices, gaming, wearable gadgets, and all tech trending.

Chery Fulwin E06 announced, positioned as large five-seater

Chery Automobile "unannounced" several new cars during the Beijing Auto Show. Without any prior warm-up, it drove a new SUV model - Fulwin E06....

Chery’s two new cars are exposed, targeting overseas markets

Recently, some media exposed the actual cars of two Chery export models, namely the Omoda 7 and the Sterra ES export version. It is...

Zeekr Mix to be launched in the second half of this year

We learned from the official that Zeekr MIX, a medium-sized MPV under the Zeekr brand, will be launched in the second half of this...