Introduce `fold-unstructured-ptr` pass #210

nhat-nguyen · 2025-01-02T19:37:15Z

This PR introduces the fold-unstructured-ptr pass which is the first step towards allowing triton-shared to compile pointer sequences that cannot be analyzed by triton-to-structured (gather / scatter).

Intended lowering pipeline

triton-to-structured (no changes):
- analyzes structured addptr sequences
  - introduces tts.make_tptr %ptr_arg with offsets and strides
  - introduces tts.load and tts.store
- leaves unstructured addptr sequences and their corresponding tt.load and tt.store intact
fold-unstructured-ptr (this PR):
- converts all unstructured addptr sequences into sequences that compute pointer offsets
  - introduces tts.make_unstructured_tptr %ptr_arg %offsets
- removes all tt.addptr
structured-to-memref (to be updated in a different PR):
- currently converts everything to memref including scalar addptr and kernel arguments
- will change to just convert ops in the tts dialect to memref with the exception of tts.make_unstructured_tptr
unstructured-to-memref (to be introduced in a different PR):
- converts the remaining unstructured tt.load, tt.store, and tts.make_unstructured_tptr into memref
triton-ptr-to-memref (to be introduced in a different PR):
- converts kernel arguments with pointer type to memref

Pass design

This pass attempts to simplify the uses of triton pointers in the IR which
will help lowering triton to other mlir dialects easier.

A triton pointer has two pieces of info:

a base pointer which comes from the kernel arguments
an offset which could be either a tensor of offset or a single integer
offset

Triton pointers are created and manipulated through a sequence of tt.addptr,
tt.splat, or tt.broadcast ops. If a triton pointer is created through
tt.addptr %ptr %offset, the new pointer will contain the same base pointer
as the original pointer; its offset will also be accumulated. Triton pointers
created through tt.splat and tt.broadcast retain their base pointers and
offsets. Tensors of pointers cannot have different bases by design. In other
words, the base pointer is fixed throughout a chain of pointer manipulation
ops.

Leveraging these insights, we can simplify chains of tt.addptr,
tt.splat, and tt.broadcast which produce triton pointers to just a sequence
of offset manipulation ops and a base pointer.

In essence, this pass transforms all sequences of tt.addptr into sequences of
offset accumulation ops which are then fed into a single op
tts.make_unstructured_tptr that takes:

a base pointer from the kernel arguments
a tensor of offsets (or single offset) that indicates the offsets from
the base pointer

This simplification makes it easier for subsequent passes to lower these load
and store ops. The pass unstructured-to-memref will leverage this output to
lower the unstructured triton load / store ops into memref load / store ops
with the appropriate offsets.

See the comments in
lib/Conversion/FoldUnstructuredTritonAddPtr/FoldUnstructuredTritonAddPtrPass.cpp
for more detailed description on the approach.

nhat-nguyen added 3 commits January 2, 2025 14:25

FoldUnstructuredTritonPtr

7d25c03

Update

f8e5773

Update

b46785f

nhat-nguyen mentioned this pull request Jan 2, 2025

Introduce triton-ptr-to-memref pass #211

Open

nhat-nguyen added 2 commits January 2, 2025 15:16

Fix missing add_subdirectory

ba41383

Simplify pass name

072e903

nhat-nguyen changed the title ~~Introduce fold-unstructured-triton-ptr pass~~ Introduce fold-unstructured-ptr pass Jan 3, 2025

nhat-nguyen marked this pull request as ready for review January 6, 2025 19:21

nhat-nguyen requested review from manbearian, beicy and red1bluelost January 6, 2025 19:23

Update FoldUnstructuredPtrPass.cpp

999c080

This was referenced Jan 7, 2025

Introduce unstructured-to-memref pass #216

Draft

Update structured-to-memref pass to support the new pass pipeline #217

Draft

nhat-nguyen marked this pull request as draft January 8, 2025 18:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce `fold-unstructured-ptr` pass #210

Introduce `fold-unstructured-ptr` pass #210

nhat-nguyen commented Jan 2, 2025 •

edited

Loading

Introduce fold-unstructured-ptr pass #210

Are you sure you want to change the base?

Introduce fold-unstructured-ptr pass #210

Conversation

nhat-nguyen commented Jan 2, 2025 • edited Loading

Intended lowering pipeline

Pass design

Introduce `fold-unstructured-ptr` pass #210

Introduce `fold-unstructured-ptr` pass #210

nhat-nguyen commented Jan 2, 2025 •

edited

Loading