Guanhua Wang is a research scientist on Meta's PyTorch team specializing in accelerating GenAI training at production scale. He holds a PhD in Computer Science from UC Berkeley's AMPLab/RISELab under Professor Ion Stoica, with research focused on distributed machine learning and networking. Previously, he was a senior researcher and tech lead on Microsoft's DeepSpeed team, where he drove projects such as Domino and ZeRO++ to reduce communication overhead and speed up large-scale training for Microsoft and OpenAI models, and worked on GPU communication optimizations within DGX clusters. With about a decade of experience across research labs, industry internships, and consulting on distributed ML infrastructure, he combines academic rigor with systems-first engineering to make large-model training practical.