Most Viewed Content:

New Apple AirPods patent can monitor the wearer’s brainwaves and other biosignals

According to the latest list published by the United...

China Seeya Technology purchased 32.2 billion won OLEDoS production equipment

The high-resolution Micro OLED display panel (OLEDoS) currently used...

India’s JSW is negotiating with Leapmotor on licensing electric vehicle technology

In last Friday's first half of the 2023 Zero...

OpenAI Triton has begun merging AMD ROCm code

According to news on September 3, Triton is a Python-like Open-source programming language, which enables researchers without CUDA experience to write efficient GPU codes (can be understood as a simplified version of CUDA), and it is said that Xiaobai can also write codes comparable to professionals. Less effort to achieve maximum hardware performance, but Triton initially only supports Nvidia GPUs.

OpenAI claims that Triton can achieve comparable performance to cuBLAS on FP16 matrix multiplication with only 25 lines of code.

From Github, we can see that OpenAI has begun to merge AMD ROCm-related branch codes in the latest Triton version, which has exposed a lot of things. In other words, the latest Triton backend has been adapted to the AMD platform, which is of great significance.

Officially, they have passed most of the unit tests on “test_core.py”, but skipped some tests for various reasons.

OpenAI also announced that it will hold the Triton Developer Conference at the Microsoft Silicon Valley Park in Mountain View, California, from 10:00 am to 4:00 pm on September 20th, and the schedule includes “Introducing Triton to AMD GPU ” and “Triton’s Intel XPU”, it is expected that Triton will soon get rid of the history of NVIDIA CUDA monopoly.

It is worth mentioning that Triton is open-source. Compared with closed-source CUDA, other hardware accelerators can be directly integrated into Triton, which greatly reduces the time to build an AI compiler stack for new hardware.

In the previously released PyTorch 2.0 version, TorchInductor introduced OpenAI Triton support, which can automatically generate fast code for multiple accelerators and backends, and at the same time implement Python instead of CUDA programming to write code for the underlying hardware. In other words, Triton is already a key component of the PyTorch 2.0 backend compiler.

In fact, previously AMD ROCm mainly used the Hipify tool to achieve CUDA compatibility, and as AMD began to provide ROCm support for RDNA 3 consumer graphics cards, it is expected that more platforms will choose to adapt to AMD hardware in the future.

Latest

LG confirms to launch Quest 4 Pro headset with Meta

According to foreign media Pulse, LG employees have confirmed...

Gainward Launches All-in-One Water Cooling Cooler, Available in Black and White

GainWard launched its newest all-in-one water cooling cooler for...

Realme GT5 supports up to 240W fast charge, starting at 2999 RMB

Realme GT5, Realme’s fifth-anniversary product, is officially listed. The...

Fuji GFX100ⅡCamera exposed, expected to be released on September 12

According to Fujirumors, Fuji will release the successor to...

Newsletter

Don't miss

LG confirms to launch Quest 4 Pro headset with Meta

According to foreign media Pulse, LG employees have confirmed...

Gainward Launches All-in-One Water Cooling Cooler, Available in Black and White

GainWard launched its newest all-in-one water cooling cooler for...

Realme GT5 supports up to 240W fast charge, starting at 2999 RMB

Realme GT5, Realme’s fifth-anniversary product, is officially listed. The...

Fuji GFX100ⅡCamera exposed, expected to be released on September 12

According to Fujirumors, Fuji will release the successor to...

Mercedes-Benz’s E-Class All-Terrain Sedan Unveiled

Mercedes-Benz made its debut at the 2023 German International...
James Lopez
James Lopezhttps://www.techgoing.com
James Lopez joined Techgoing as Senior News Editor in 2022. He's been a tech blogger since before the word was invented, and will never log off.

Sennheiser launched the cheapest Ambeo Soundbar Mini, priced at $799

German audio brand Sennheiser launched its third and cheapest soundbar called Ambeo Soundbar Mini. This soundbar is priced at $799, which is still not...

Lexar NM790 PCIe 4.0 SSD released, up to 4TB, can be used for PS5 expansion

Lexar announced a Lexar NM790 M.2 2280 PCIe Gen4x4 NVMe solid-state drive, known as the “PS5 expansion artifact”. According to reports, the Lexar NM790 vest...

German Bionic Launches Exoskeleton Apogee+ for Healthcare Professionals

German bionics company German Bionics in early this year announced its new exoskeleton equipment Apogee is German Bionic's flag of the lightest exoskeleton equipment,...