AMD RAD's Triton-based framework for seamless multi-GPU programming
communication distributed-computing ml async-programming gpgpu triton rdma hip shmem gemm rma rocm multigpu kernel-fusion fused-kernel workgroup-specialization symmetric-memory remote-memory-access
Updated Oct 5, 2025 - Python