Yihe Dong

Research Engineer at Princeton University

New Jersey, United States
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

👤
Senior
🎓
Top School
Yihe Dong is a math-driven research engineer with a decade of experience translating theory into scalable ML systems. He specializes in geometric deep learning and large language models, applying rigorous mathematical principles to areas such as efficient optimal transport, graph representation learning, and robust statistics. Currently a Research Engineer at Princeton University, he bridges language-model research with practical ML engineering, drawing on prior leadership roles at Google and Microsoft that spanned search relevance, multi-modal grounding, private ML, and scalable nearest-neighbor and OT algorithms. An active open-source contributor, he has shaped Google Research projects—modifying run_exp.sh and KNF/evaluation.py and contributing AdaRank-related code—and maintains a strong scholarly/engineering portfolio at yihedong.me and twistedcubic on GitHub. Based in New Jersey, he blends academic rigor with production-ready engineering to push the boundaries of ML systems.
code11 years of coding experience
job7 years of employment as a software developer
bookMathematics, Mathematics at Princeton University
github-logo-circle

Github Skills (4)

machine-learning10
tensorflow10
python10
bash8

Programming languages (6)

JuliaC++CJavaScriptJupyter NotebookPython

Github contributions (5)

github-logo-circle
Google Research
Role in this project:
userML Engineer
Contributions:1 commit in 1 day
Contributions summary:Yihe contributed to the implementation and refinement of machine learning models within the Google Research repository. Their commits focused on modifying scripts, particularly `run_exp.sh` and `KNF/evaluation.py`, to replicate experimental results and add artifacts related to the paper. The changes include modifications to model parameters, dataset configurations, and evaluation metrics, indicating an active role in model training and result verification. Furthermore, the user committed source scripts, specifically for AdaRank, suggesting a focus on developing and evaluating ranking algorithms within the broader research context.
googlemachine-learningai
twistedcubic/learn-to-hash

Oct 2019 - Jan 2021

Contributions:9 commits, 9 pushes, 1 branch in 1 year 2 months
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial