Conferences
[
Google scholar]
[
SC'22]
EL-Rec: Efficient Large-scale Recommendation Model Training via Tensor-train Embedding
[
To appear]
Zheng Wang, Yuke Wang, Boyuan Feng, Dheevatsa Mudigere, Bharath Muthiah, Yufei Ding
[
SC'22]
ATL: Accelerated Training for Transformer-based Models on GPUs
[
To appear]
Xiaohui Wang, Yang Wei, Guyue Huang, Ying Xiong, Xian Qian, Yufei Ding, Lei Li, Mingxuan Wang
[
USENIX ATC'22]
Faith: An Efficient Framework for Transformer Verification on GPUs
[
To appear]
Boyuan Feng, Tianqi Tang, Yuke Wang, Zhaodong Chen, Zheng Wang, Shu Yang, Yuan Xie, Yufei Ding
[
DAC'22]
Shfl-BW: Accelerating Deep Neural Network Inference with Tensor-Core Aware Weight Pruning
[
To appear]
Guyue Huang, Haoran Li, Minghai Qin, Fei Sun, Yufei Ding and Yuan Xie.
[
DAC'22]
Heuristic Adaptability to Input Dynamics for SpMM on GPUs
[
To appear]
Guohao Dai, Guyue Huang, Shang Yang, Zhongming Yu, Hengrui Zhang, Yufei Ding, Yuan Xie, Huazhong Yang, Yu Wang.
[
MLSys’22]
Understanding GNN Computational Graph: A Coordinated Computation, IO, and Memory Perspective
[
To appear]
Hengrui Zhang, Zhongming Yu, Guohao Dai, Guyue Huang, Yufei Ding, Yuan Xie, Yu Wang.
[
PPoPP'22]
QGTC: Accelerating Quantized GNN via GPU Tensor Core GPUs
[
To appear]
Yuke Wang*, Boyuan Feng*, Yufei Ding. (* co-primary authors)
[
CIKM'21]
An Efficient Quantitative Approach for Optimizing Convolutional Neural Networks (Spotlight)
[
PDF]
Yuke Wang, Boyuan Feng, Xueqiao Peng, Yufei Ding.
[
SC'21]
APNN-TC: Accelerating Arbitrary-Precision Neural Networks on Tensor Cores
[
PDF]
Boyuan Feng*, Yuke Wang*, Tong Geng, Ang Li, Yufei Ding (* co-primary authors).
[
SC'21]
Efficient Tensor Core-based GPU Kernels for Structured Sparsity under Reduced Precision
[
PDF]
`
Zhaodong Chen*, Zheng Qu*, Liu Liu, Yufei Ding, Yuan Xie (* co-primary authors).
[
USENIX ATC'21]
Palleon: A Runtime System for Efficient Video Processing toward Dynamic Class Skew
[
PDF]
Boyuan Feng, Yuke Wang, Gushu Li, Yuan Xie, Yufei Ding.
[
OSDI'21]
GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs
[
PDF]
Yuke Wang, Boyuan Feng, Gushu Li, Shuangchen Li, Lei Deng, Yuan Xie, Yufei Ding.
[
IPDPS'21]
DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolution
[
PDF]
Yuke Wang, Boyuan Feng, Yufei Ding.
[
PPoPP'21]
EGEMM-TC: Accelerating Scientific Computing on Tensor Cores with Extended Precision
[
PDF]
Boyuan Feng, Yuke Wang, Guoyang Chen, Weifeng Zhang, Yuan Xie, Yufei Ding.
[
SysML'18]
TOP: A Compiler-Based Framework for Optimizing Machine Learning Algorithms through Generalized Triangle Inequality
[
PDF]
Yufei Ding, Lin Ning, Hui Guan, Xipeng Shen, Madanlal Musuvathi, Todd Mytkowicz.
SysML, Feb 16th, 2018. Paul Brest Hall · Stanford University.
[
ICDE'18]
Reuse-Centric K-Means Configuration
[
PDF]
Hui Guan, Yufei Ding, Xipeng Shen, Hamid Krim.
34th IEEE International Conference on Data Engineering. April 16th – 20th, 2018.
[
PLDI'17]
Generalizations of the Theory and Deployment of Triangular Inequality for Compiler-Based Strength Reduction
[
PDF]
Yufei Ding, Lin Ning, Hui Guan, Xipeng Shen.
The ACM SIGPLAN Conference on Programming Language Design and Implementation 2017. [Acceptance ratio: 15% (47/322).]
[
ICDE'17]
Sweet KNN: An Efficient KNN on GPU through Reconciliation of Redundancy and Regularity
[
PDF]
Guoyang Chen, Yufei Ding, Xipeng Shen.
2017 IEEE International Conference on Data Engineering, San Diego, California, April 19-22, 2017.
[
PLDI'15]
Autotuning algorithmic choice for input sensitivity
[
PDF]
Yufei Ding, Jason Ansel, Kalyan Veeramachaneni, Xipeng Shen, Una-May O'Reilly, Saman Amarasinghe.
ACM SIGPLAN conference on Programming Language Design and Implementation, Portland, Orgon, June 13-17, 2015. [Acceptance ratio: 19% (58/303).]
[
VLDB'15]
TOP: A Framework for Enabling Algorithmic Optimizations for Distance-Related Problems
[
PDF]
Yufei Ding, Xipeng Shen, Madan Musuvathi, Todd Mytkowicz.
The 41st International Conference on Very Large Data Bases, Kohala Coast, Hawaii, August, 2015.
Follow Me On