[Convergence] allow non-convergent ops before entry and loop intrinsics #65939

ssahasra · 2023-09-11T09:23:55Z

The only real requirement is that entry and loop intrinsics should not be preceded by convergent operations in the same basic block. They do not need to be the first in the block.

Relaxing the constraint on the entry and loop intrinsics avoids having to make changes in the construction of LLVM IR, such as getFirstInsertionPt(). It also avoids added complexity in the lowering to Machine IR, where COPY instructions may be added to the start of the basic block.

The only real requirement is that entry and loop intrinsics should not be preceded by convergent operations in the same basic block. They do not need to be the first in the block. Relaxing the constraint on the entry and loop intrinsics avoids having to make changes in the construction of LLVM IR, such as getFirstInsertionPt(). It also avoids added complexity in the lowering to Machine IR, where COPY instructions may be added to the start of the basic block.

nhaehnle

Seems reasonable to me.

ruiling · 2023-09-11T13:11:51Z

llvm/include/llvm/ADT/GenericConvergenceVerifier.h

@@ -63,6 +63,8 @@ template <typename ContextT> class GenericConvergenceVerifier {
  // and not the token values.
  DenseMap<const InstructionT *, const InstructionT *> Tokens;

+  bool SeenFirstConvOp = false;


The value will not be reset across basic block, right?

It gets reset at the start of every block in the visit method.

I think not? The visit() method you pointed out is only called in Verifier::visitCallBase(). So it cannot reset the flag in a new block.

Ouch! You are right. I was paying too much attention on the MachineConvergenceVerifier (not checked in yet), and didn't see how the ConvergenceVerifier traverses the function. I have a fix in my branch, will push it out real soo now. Thanks for catching it!

arsenm

probably useful for alloca

…cs (llvm#65939) The only real requirement is that entry and loop intrinsics should not be preceded by convergent operations in the same basic block. They do not need to be the first in the block. Relaxing the constraint on the entry and loop intrinsics avoids having to make changes in the construction of LLVM IR, such as getFirstInsertionPt(). It also avoids added complexity in the lowering to Machine IR, where COPY instructions may be added to the start of the basic block.

ssahasra requested review from jayfoad, arsenm, nhaehnle and ruiling September 11, 2023 09:23

ssahasra requested a review from a team as a code owner September 11, 2023 09:23

llvmbot added the llvm:ir label Sep 11, 2023

nhaehnle approved these changes Sep 11, 2023

View reviewed changes

ssahasra merged commit 08da343 into llvm:main Sep 11, 2023

ssahasra deleted the ssahasra/token-not-first branch September 11, 2023 12:56

ruiling reviewed Sep 11, 2023

View reviewed changes

arsenm reviewed Sep 12, 2023

View reviewed changes

michaelrj-google mentioned this pull request Sep 12, 2023

[libc] Move long double table option to new config #66151

Merged

vzakhari mentioned this pull request Sep 12, 2023

internap proc trampolines #66156

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Convergence] allow non-convergent ops before entry and loop intrinsics #65939

[Convergence] allow non-convergent ops before entry and loop intrinsics #65939

ssahasra commented Sep 11, 2023

nhaehnle left a comment

ruiling Sep 11, 2023

ssahasra Sep 12, 2023

ruiling Sep 13, 2023

ssahasra Sep 13, 2023

arsenm left a comment

[Convergence] allow non-convergent ops before entry and loop intrinsics #65939

[Convergence] allow non-convergent ops before entry and loop intrinsics #65939

Conversation

ssahasra commented Sep 11, 2023

nhaehnle left a comment

Choose a reason for hiding this comment

ruiling Sep 11, 2023

Choose a reason for hiding this comment

ssahasra Sep 12, 2023

Choose a reason for hiding this comment

ruiling Sep 13, 2023

Choose a reason for hiding this comment

ssahasra Sep 13, 2023

Choose a reason for hiding this comment

arsenm left a comment

Choose a reason for hiding this comment