Skip to content

[X86][AVX512] Suboptimal codegen with llvm.masked.store.v192i8 #198154

@Andarwinux

Description

@Andarwinux

original C code: https://test.godbolt.org/z/x19oc35f5

reduced IR: https://test.godbolt.org/z/Yfcxcrbzv

define void @UpsampleBgrLinePair_C(<192 x i1> %interleaved.mask474) {
entry:
  br label %vector.body

vector.body:                                      ; preds = %vector.body, %entry
  tail call void @llvm.masked.store.v192i8.p0(<192 x i8> zeroinitializer, ptr null, <192 x i1> %interleaved.mask474)
  br label %vector.body
}

; Function Attrs: nocallback nofree nosync nounwind willreturn memory(argmem: write)
declare void @llvm.masked.store.v192i8.p0(<192 x i8>, ptr captures(none), <192 x i1>) #0

attributes #0 = { nocallback nofree nosync nounwind willreturn memory(argmem: write) }

Metadata

Metadata

Assignees

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions