* Deprecate _syncthreads (the CUDA name) in favor of the new
_barrier_sync (named after the NVPTX instruction barrier.sync).
* The barrier.sync instruction is equivalent to barrier.sync.aligned
prior to sm_70, and will lead to errors/deadlock if passes (such as MIR
JumpThreading) lose the aligned property.
rust-lang/rust#137086
https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#parallel-synchronization-and-communication-instructions-bar
* Since MIR does not currently have a way to apply something like
LLVM's convergent attribute (and because convergent does not preserve
alignment, which can be broken by inlining), we cannot prevent loss of
alignment, and thus we require target feature sm_70.
https://llvm.org/docs/ConvergentOperations.html
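
The hazard described in the bullets above can be sketched on the host. This is only an illustration under stated assumptions: `barrier_stand_in`, `original`, and `jump_threaded` are hypothetical names, the barrier is a counter rather than the real intrinsic, and the second function is a hand-written approximation of what a jump-threading-style pass may produce, not actual MIR output. Both functions compute the same value, but the second reaches the barrier from two distinct call sites, so threads of one warp that disagree on `cond` would no longer execute the same barrier instruction.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};

// Hypothetical stand-in for a GPU barrier. On an NVPTX target this would be
// the barrier intrinsic; here it only counts calls so the sketch runs on a host.
static BARRIER_CALLS: AtomicUsize = AtomicUsize::new(0);

fn barrier_stand_in() {
    BARRIER_CALLS.fetch_add(1, Ordering::SeqCst);
}

// Original shape: `cond` is tested twice, with a single barrier call site
// in between that every thread reaches regardless of `cond`.
fn original(cond: bool) -> u32 {
    let x = if cond { 1 } else { 2 };
    barrier_stand_in(); // one call site shared by all threads
    if cond { x + 10 } else { x + 20 }
}

// Approximate shape after threading the second test of `cond` into the
// first: the barrier is duplicated into each arm, giving two call sites.
fn jump_threaded(cond: bool) -> u32 {
    if cond {
        let x = 1;
        barrier_stand_in(); // call site A
        x + 10
    } else {
        let x = 2;
        barrier_stand_in(); // call site B
        x + 20
    }
}
```

On a CPU the two versions are interchangeable, which is why a pass that is unaware of convergence may consider the rewrite legal. The difference only matters on a target where the barrier must be executed in an aligned fashion.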
WIP: compile-time dependency on ptx60