[Codegen] Skinny mm/bmm/mv : use to vector distribute by newling · Pull Request #21679 · iree-org/iree

newling · 2025-08-14T05:37:58Z

Before this PR, use of vector distribute was gated on this ad hoc constraint:

isa<linalg::ReduceOp, linalg::GenericOp>(op) && llvm::any_of(op.getIteratorTypesArray(), linalg::isReductionIterator);

For this PR I wanted to relax that gate to just

llvm::any_of(op.getIteratorTypesArray(), linalg::isReductionIterator)

But that failed because there is a test pooling_dynamic in nvvm_pipeline.mlir
that, when it goes down vector distribution, hits an assertion failure in
configure-tensor-layouts about one of the operands not having a projection
permutation.

I assume that's because layout inference scheme doesn't work well unless the maps
are just permutations/projections?

This PR works around this by gating on not having any non-projecting maps.

kuhar

Seems like many tests are failing?

kuhar · 2025-08-14T13:14:52Z

+  const auto loopRanges = op.getStaticLoopRanges();
+  const auto loopTypes = op.getIteratorTypesArray();
+  const auto indexingMaps = op.getIndexingMapsArray();


Please spell these out: https://llvm.org/docs/CodingStandards.html#use-auto-type-deduction-to-make-code-more-readable

Also, we probably don't need const on most of the local variables in this file, especially Types/Values/etc.

kuhar · 2025-08-14T13:14:57Z

+  //
+  // false is returned. These maps appear in convolution/pooling ops, which are
+  // not currently supported by the vector distribute pipeline.
+  for (const auto m : indexingMaps) {


kuhar · 2025-08-14T13:15:15Z

+  }
+
+  // Check for non-unit reduction dim.
+  for (const auto [type, range] : llvm::zip(loopTypes, loopRanges)) {


Use zip_equal if these have the same length

newling · 2025-08-14T15:15:09Z

Seems like many tests are failing?

Yeah I thought I'd found a good solution but some e2e tests didn't compile. Pipeline selection is quite a slippery path! Curious, what is your approach for testing before pushing a PR @kuhar and others? I generally just do

ctest -R odegen -j60

And if that passes then I'll make a PR to check if there are other tests which fail in CI. Obvs if I am checking numerics I then compile an e2e test and run that locally on a GPU. But in general I assume that if lit tests pass, and numerics are fine.

Ideally an e2e numerical test will never be the first place we see a compilation failure, ideally it should be caught in a lit test. Wondering if there's a way to get all numerical e2e tests just compile locally.

Groverkss · 2025-08-14T15:18:23Z

Seems like many tests are failing?

Yeah I thought I'd found a good solution but some e2e tests didn't compile. Pipeline selection is quite a slippery path! Curious, what is your approach for testing before pushing a PR @kuhar and others? I generally just do
ctest -R odegen -j60
And if that passes then I'll make a PR to check if there are other tests which fail in CI. Obvs if I am checking numerics I then compile an e2e test and run that locally on a GPU. But in general I assume that if lit tests pass, and numerics are fine.

Ideally an e2e numerical test will never be the first place we see a compilation failure, ideally it should be caught in a lit test. Wondering if there's a way to get all numerical e2e tests just compile locally.

I think you want to run e2e lit tests: https://iree.dev/developers/general/testing-guide/#code-coverage

kuhar · 2025-08-14T15:29:53Z

Curious, what is your approach for testing before pushing a PR @kuhar and others? I generally just do

My go-to test command is: ninja all iree-test-deps && ctest all -j32 --output-on-failure -E 'cuda|metal'

newling · 2025-08-18T23:09:41Z

This isn't needed, the way to go is #21720

new pipeline spaghetti with test updates

c6c8f3f

newling requested review from Groverkss, MaheshRavishankar, kuhar and qedawkins as code owners August 14, 2025 05:37

kuhar reviewed Aug 14, 2025

View reviewed changes

newling closed this Aug 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Codegen] Skinny mm/bmm/mv : use to vector distribute#21679

[Codegen] Skinny mm/bmm/mv : use to vector distribute#21679
newling wants to merge 1 commit into
iree-org:mainfrom
newling:vector_distribute_for_matvec

newling commented Aug 14, 2025

Uh oh!

kuhar left a comment

Uh oh!

kuhar Aug 14, 2025

Uh oh!

kuhar Aug 14, 2025

Uh oh!

kuhar Aug 14, 2025

Uh oh!

kuhar Aug 14, 2025

Uh oh!

newling commented Aug 14, 2025

Uh oh!

Groverkss commented Aug 14, 2025

Uh oh!

kuhar commented Aug 14, 2025

Uh oh!

newling commented Aug 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

newling commented Aug 14, 2025

Uh oh!

kuhar left a comment

Choose a reason for hiding this comment

Uh oh!

kuhar Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

kuhar Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

kuhar Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

kuhar Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

newling commented Aug 14, 2025

Uh oh!

Groverkss commented Aug 14, 2025

Uh oh!

kuhar commented Aug 14, 2025

Uh oh!

newling commented Aug 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants