Connecting PyTorch sparse tensors with MLIR

Greetings! First-time poster in this forum, after incorrectly starting the topic in the user forum.

I won’t repeat those postings here (since I hope you will read them there), but in a nutshell: the MLIR Sparsifier team is very interested in connecting torch.sparse tensors with the MLIR sparse tensor types. A very first step towards this would be propagating sparsity information in the FX traced graph (without yet lowering the ops to sparse ops, or decomposing the arguments into their actual 1:N implementation arrays).

I made a very quick-and-dirty prototype for this; for the example given in the original posting, it generates output like the ExportedProgram shown below.
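
For reference, here is a minimal reconstruction of that example (the class name is my own invention; the original posting only shows that biknet.py returns x.sum() on a sparse CSR input):

    import torch

    class BikNet(torch.nn.Module):
        def forward(self, x):
            return x.sum()

    x = torch.eye(64).to_sparse_csr()
    prog = torch.export.export(BikNet(), (x,))
    print(prog)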

ExportedProgram:
    class GraphModule(torch.nn.Module):
        def forward(self, l_x_: "f32[64, 64]:torch.sparse_csr"):   # ADDED!
            # File: biknet.py:27, code: return x.sum()
            sum_1: "f32[]" = torch.ops.aten.sum.default(l_x_);  l_x_ = None
            return (sum_1,)
           
Graph signature: ExportGraphSignature(
    input_specs=[
        InputSpec(
            kind=<InputKind.USER_INPUT: 1>,
            arg=TensorArgument(name='l_x_'),
            target=None,
            layout=torch.sparse_csr)       # ADDED!
    ],
    output_specs=[
        OutputSpec(
            kind=<OutputKind.USER_OUTPUT: 1>,
            arg=TensorArgument(name='sum_1'),
            target=None)
    ])

This will hopefully enable me to prototype further in torch-mlir, so I can report back here on whether this is a viable approach and then continue working on the actual feature request.

I have posted a quick-and-dirty prototype “PR” that implements part of the requested feature (getting the sparse type into the forward() parameter list of the FX graph). This PR is of course not meant to be submitted as-is, but hopefully some of the core developers can give some guidelines or assistance in turning the idea into a production-quality implementation.

In the meantime, sparse support in the torch-mlir part is making great progress. With a simple “wrapper” exporter (one that builds the FX graph for dense tensors and then annotates the sparse arguments afterwards; see the sketch after the IR below), something like

class MatMulNet(torch.nn.Module):
    def __init__(self):
        super().__init__()

    def forward(self, x, y):
        return torch.matmul(x, y)

m = export_and_import(MatMulNet(), A_coo, B_dense)

actually goes through the PyTorch graph exporter all the way down to something that can be further processed by MLIR.


#sparse = #sparse_tensor.encoding<{ map = (d0, d1) -> (d0 : compressed(nonunique), d1 : singleton) }>
module {
  func.func @main(%arg0: !torch.vtensor<[64,64],f32,#sparse>, 
                  %arg1: !torch.vtensor<[64,64],f32>) -> !torch.vtensor<[64,64],f32> {
    %0 = torch.aten.mm %arg0, %arg1 : !torch.vtensor<[64,64],f32,#sparse>, !torch.vtensor<[64,64],f32> -> !torch.vtensor<[64,64],f32>
    return %0 : !torch.vtensor<[64,64],f32>
  }
}
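
A minimal sketch of what such a wrapper exporter could look like (sparse_export and the "sparsity" meta key are my own names here, not the actual torch-mlir implementation):

    import torch

    def sparse_export(model: torch.nn.Module, args: tuple):
        # Export with dense stand-ins of the same shape/dtype so the
        # stock exporter accepts the arguments.
        dense_args = tuple(
            a.to_dense() if a.layout != torch.strided else a for a in args
        )
        prog = torch.export.export(model, dense_args)
        # Re-attach the original layouts to the placeholder nodes so the
        # FX importer can recover the sparsity (this assumes the model has
        # no lifted parameters, so placeholders map 1:1 to user inputs).
        placeholders = [n for n in prog.graph.nodes if n.op == "placeholder"]
        for node, arg in zip(placeholders, args):
            if arg.layout != torch.strided:
                node.meta["sparsity"] = arg.layout
        return prog

The importer can then map each recorded layout onto the corresponding #sparse_tensor.encoding, as in the IR above.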

Minor update: we now have sufficient machinery in torch-mlir to run a simple PyTorch model “end-to-end” with sparse tensors as input. Take, for example, the following code that uses MatMulNet. We get the same results when running with the normal PyTorch engine as with torch-mlir execution (which operates on the underlying numpy arrays):

    net = MatMulNet()
    a = torch.tensor([[1, 0, 0, 0, 0, 0, 0, 0],
                      [0, 0, 0, 0, 0, 0, 0, 0],
                      [0, 0, 2, 0, 0, 0, 0, 0],
                      [0, 0, 0, 0, 0, 0, 0, 0],
                      [0, 0, 0, 0, 0, 0, 0, 0],
                      [0, 0, 0, 0, 0, 0, 0, 3],
                      [0, 0, 0, 0, 0, 0, 0, 4],
                      [0, 0, 0, 0, 0, 0, 0, 5]], dtype=torch.float32)
    sparse_input = a.to_sparse_csr()
    res0 = net(a, a)
    res1 = net(sparse_input, a)
    res2 = sparse_jit(net, sparse_input, a)   # uses TORCH-MLIR +sparse

all yield the following numpy data:

[[ 1.  0.  0.  0.  0.  0.  0.  0.]
 [ 0.  0.  0.  0.  0.  0.  0.  0.]
 [ 0.  0.  4.  0.  0.  0.  0.  0.]
 [ 0.  0.  0.  0.  0.  0.  0.  0.]
 [ 0.  0.  0.  0.  0.  0.  0.  0.]
 [ 0.  0.  0.  0.  0.  0.  0. 15.]
 [ 0.  0.  0.  0.  0.  0.  0. 20.]
 [ 0.  0.  0.  0.  0.  0.  0. 25.]]
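
Under the hood, the MLIR-compiled code never receives a single opaque sparse buffer; a CSR input arrives as its 1:N decomposition into positions, indices, and values. A hedged illustration of that flattening (not the actual sparse_jit code):

    import torch

    def flatten_csr(t: torch.Tensor):
        # Decompose a CSR tensor into the implementation arrays that the
        # compiled MLIR code operates on as plain numpy buffers.
        assert t.layout == torch.sparse_csr
        return (t.crow_indices().numpy(),
                t.col_indices().numpy(),
                t.values().numpy())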

To give an update on this work: over the past months we have added proper “export” support for sparse tensor types (COO and all the compressed formats) as metadata in the FX graphs, which means that an external compiler (like MLIR) can act as a “sparse compiler” backend for the torch sparse extension. Feature request 117188 has been closed, and a new test

test/export/test_sparse.py

has been added to ensure this feature remains functional going forward.
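
As a quick illustration (assuming current torch.export behavior; the exact metadata plumbing may differ), the preserved layout can be read back from the placeholder nodes of the exported graph:

    import torch

    class SumNet(torch.nn.Module):
        def forward(self, x):
            return x.sum()

    prog = torch.export.export(SumNet(), (torch.eye(8).to_sparse_csr(),))
    for node in prog.graph.nodes:
        if node.op == "placeholder":
            # node.meta["val"] is a fake tensor carrying the sparse layout
            print(node.name, node.meta["val"].layout)  # torch.sparse_csr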

Note that I don’t think the work is fully done (is it ever? ;-). There will be occasional corner cases that need to be fixed (I have already found a few). Please feel free to file bug reports and/or fix them (and add new tests). But the overall machinery for making sure sparsity is preserved in the exported graph is there!

This was a longer road than expected, but lots of fun nevertheless! I would also like to thank Pearu Peterson and Edward Yang for being exemplary mentors for this work!

You can find more information on the MLIR progress in this MPACT posting.