Summary
Tongping Liu is a Principal Engineer in the San Francisco Bay Area with 13+ years of experience in runtime systems, operating systems, and compilers, focusing on machine learning performance, profiling, failure diagnosis, and resource management. He currently drives ML training and inference performance and stability at XPENG. At ByteDance, he led a non-intrusive GPU memory management library that reduced GPU memory usage by 20–40% and developed a GPU memory profiler to pinpoint OOM causes, along with a high-throughput CPU/GPU training framework. He also serves as Adjunct Associate Professor at the University of Massachusetts Amherst, where he works on profilers for cache issues, NUMA memory management, memory-footprint reduction for ML, and deadlock detection. He earned a Ph.D. in Computer Science from UMass Amherst and an M.Eng. in Electronics and Information Engineering from Huazhong University of Science and Technology. Based in the SF Bay Area, he blends theoretical depth with hands-on engineering to deliver reliable, scalable systems that accelerate ML workloads.
13 years of coding experience
8 years of employment as a software developer
University of Massachusetts Amherst
BA, Automatic Measurement and Control, BA, Automatic Measurement and Control at Harbin Institute of Technology
Master of Engineering (M.Eng.), Electronics and Information Engineering, Master of Engineering (M.Eng.), Electronics and Information Engineering at Huazhong University of Science and Technology