FlashFuser: Boosting Deep Learning Efficiency for Energy Innovations
Researchers from the National University of Defense Technology in China have developed a new compiler framework called FlashFuser, designed to optimize the performance of deep learning workloads on modern GPUs. The team, led by Ziyu Huang and Yangjie
FlashFuser: Boosting Deep Learning Efficiency for Energy Innovations Read More »










