Most Viewed Content:

Cygnus space cargo spacecraft arrives at International Space Station with only half of its solar array

NASA astronaut Nicole Mann, backed up by NASA astronaut...

Google to bring PWA application backup & restore function for Chrome/android

According to thespAndroid reports, GitHub's Chromium repository recently added...

Toyota responds to continued production cuts in the next 3 months: easing pressure on dealer earnings

In response to the news that "production will continue...

OpenAI Triton has begun merging AMD ROCm code

According to news on September 3, Triton is a Python-like Open-source programming language, which enables researchers without CUDA experience to write efficient GPU codes (can be understood as a simplified version of CUDA), and it is said that Xiaobai can also write codes comparable to professionals. Less effort to achieve maximum hardware performance, but Triton initially only supports Nvidia GPUs.

OpenAI claims that Triton can achieve comparable performance to cuBLAS on FP16 matrix multiplication with only 25 lines of code.

From Github, we can see that OpenAI has begun to merge AMD ROCm-related branch codes in the latest Triton version, which has exposed a lot of things. In other words, the latest Triton backend has been adapted to the AMD platform, which is of great significance.

Officially, they have passed most of the unit tests on “test_core.py”, but skipped some tests for various reasons.

OpenAI also announced that it will hold the Triton Developer Conference at the Microsoft Silicon Valley Park in Mountain View, California, from 10:00 am to 4:00 pm on September 20th, and the schedule includes “Introducing Triton to AMD GPU ” and “Triton’s Intel XPU”, it is expected that Triton will soon get rid of the history of NVIDIA CUDA monopoly.

It is worth mentioning that Triton is open-source. Compared with closed-source CUDA, other hardware accelerators can be directly integrated into Triton, which greatly reduces the time to build an AI compiler stack for new hardware.

In the previously released PyTorch 2.0 version, TorchInductor introduced OpenAI Triton support, which can automatically generate fast code for multiple accelerators and backends, and at the same time implement Python instead of CUDA programming to write code for the underlying hardware. In other words, Triton is already a key component of the PyTorch 2.0 backend compiler.

In fact, previously AMD ROCm mainly used the Hipify tool to achieve CUDA compatibility, and as AMD began to provide ROCm support for RDNA 3 consumer graphics cards, it is expected that more platforms will choose to adapt to AMD hardware in the future.

Latest

Starting from 48,900, Geely Panda Karting officially starts pre-sale

Geely Panda Karting officially started pre-sale. The pre-sale price...

Ford: Expand charging network, fuel/ hybrid/ pure electric in parallel

Recently, Ford released the company's comprehensive annual report for...

Chery’s two new cars are exposed, targeting overseas markets

Recently, some media exposed the actual cars of two...

New Trumpchi Shadow Leopard to launch on May 1, upgraded performance rims

Recently, we learned from the official that the 2024...

Newsletter

Don't miss

Starting from 48,900, Geely Panda Karting officially starts pre-sale

Geely Panda Karting officially started pre-sale. The pre-sale price...

Ford: Expand charging network, fuel/ hybrid/ pure electric in parallel

Recently, Ford released the company's comprehensive annual report for...

Chery’s two new cars are exposed, targeting overseas markets

Recently, some media exposed the actual cars of two...

New Trumpchi Shadow Leopard to launch on May 1, upgraded performance rims

Recently, we learned from the official that the 2024...

Samsung Galaxy S25 Ultra expected to feature 5000mAh + 45W Combo

Technology media WccFtech recently reported that Samsung will not...
James Lopez
James Lopezhttps://www.techgoing.com
James Lopez joined Techgoing as Senior News Editor in 2022. He's been a tech blogger since before the word was invented, and will never log off.

Meizu 21 Note to be equipped with black/white narrow-edge straight screen

A new Meizu phone model M468Q recently passed the national 3C certification. It was previously speculated to be Meizu 21X, but was later confirmed...

Nothing Phone (2a) Blue special edition phone to launch on April 29

According to source Technerd_9, Nothing will launch a special blue version of Nothing Phone (2a) on April 29. This version is specially designed for...

New Trumpchi Shadow Leopard to launch on May 1, upgraded performance rims

Recently, we learned from the official that the 2024 Shadow Leopard will be officially launched on May 1. As a new model, the new...