Publications
Publications by category in reverse chronological order.
2025
- [ATC] Towards Optimal Rack-scale μs-level CPU Scheduling through In-Network Workload Shaping. In Proceedings of USENIX Annual Technical Conference (ATC 2025), 2025
- [ASPLOS] Harmonia: A Unified Framework for Heterogeneous FPGA Acceleration in the Cloud. In Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2025), 2025
- [INFOCOM] A Generic and Efficient Communication Framework for Message-level In-Network Computing. In Proceedings of the IEEE International Conference on Computer Communications (INFOCOM 2025), 2025
- [ASPLOS] Design and Operation of Shared Machine Learning Clusters on Campus. In Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2025), 2025
- [EuroSys] Achieving Fairness Generalizability for Learning-based Congestion Control with Jury. In Proceedings of the 20th ACM European Conference on Computer Systems (EuroSys 2025), 2025
2024
- [SIGCOMM] Fast, Scalable, and Accurate Rate Limiter for RDMA NICs. In Proceedings of the ACM Special Interest Group on Data Communication (SIGCOMM 2024), 2024
- [EuroSys] Astraea: Towards Fair and Efficient Learning-based Congestion Control. In Proceedings of the 19th ACM European Conference on Computer Systems (EuroSys 2024), 2024
- [NSDI] Accelerating Neural Recommendation Training with Embedding Scheduling. In Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI 2024), 2024
- [NSDI] Towards Domain-Specific Network Transport for Distributed DNN Training. In Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI 2024), 2024
2023
- [APNet] Accurate and Scalable Rate Limiter for RDMA NICs. In Proceedings of the 7th Asia-Pacific Workshop on Networking (APNet 2023), 2023
- [SIGMOD] Scalable and Efficient Full-Graph GNN Training for Large Graphs. In Proceedings of the ACM on Management of Data (SIGMOD 2023), 2023
- [NSDI] SRNIC: A Scalable Architecture for RDMA NICs. In Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI 2023), 2023
2022
- [ICNP] DGS: Communication-Efficient Graph Sampling for Distributed GNN Training. In Proceedings of the 30th IEEE International Conference on Network Protocols (ICNP 2022), 2022
2021
- [arXiv] Tacc: A full-stack cloud computing infrastructure for machine learning tasks. arXiv preprint arXiv:2110.01556, 2021
2020
- [arXiv] Domain-specific communication optimization for distributed DNN training. arXiv preprint arXiv:2008.08445, 2020
- [APNet] RAT - Resilient Allreduce Tree for Distributed Machine Learning. In Proceedings of the 4th Asia-Pacific Workshop on Networking (APNet 2020), 2020