Cited By
View all- Zhou KSubramanian KLin PFey MYin BLi J(2024)FASTEN: Fast GPU-accelerated Segmented Matrix Multiplication for Heterogenous Graph Neural NetworksProceedings of the 38th ACM International Conference on Supercomputing10.1145/3650200.3656593(511-524)Online publication date: 30-May-2024
- Wang YHao MHe HZhang WTang QSun XWang Z(2024)DRLCAP: Runtime GPU Frequency Capping With Deep Reinforcement LearningIEEE Transactions on Sustainable Computing10.1109/TSUSC.2024.33626979:5(712-726)Online publication date: Sep-2024
- Li XLaguna IFang BSwirydowicz KLi AGopalakrishnan GButt AMi NChard K(2023)Design and Evaluation of GPU-FPX: A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUsProceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing10.1145/3588195.3592991(59-71)Online publication date: 7-Aug-2023
- Show More Cited By