28–29 May 2026
HUN-REN Centre
Europe/Budapest timezone

Manual AMDGCN Assembly Analysis & Optimization

28 May 2026, 10:50
30m
HUN-REN Centre

HUN-REN Centre

1054 Budapest Alkotmány utca 29.

Speaker

Nara Prasetya (StreamHPC)

Description

The performance of a GPU kernel is influenced by many factors, with some easier to change than others. In some cases, however, the resulting performance is beholden to the compiler. In this presentation we will go over a set of kernel optimization techniques that go beyond profiling and reducing memory bottlenecks, but instead focus on the analysis of AMDGCN assembly, reducing register pressure to improve occupancy, and manually recover performance due to losses from compiler changes.

Author

Nara Prasetya (StreamHPC)

Presentation materials

There are no materials yet.