Helen Gao

Data Scientist

United States
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

🤩
Rockstar
🎓
Top School
Tianyu Gao is a PhD candidate at Princeton University and a software engineer with 10 years of experience focused on NLP and neural relation extraction. As a back-end developer on the open-source OpenNRE project, he implemented core CNN-based components—CNNSentenceEncoder and CNNSoftmax—handling word and position embeddings, convolutional architectures, and training logic. He blends academic rigor with production-minded engineering, turning research models into maintainable, contributor-friendly code. Based in the United States, he is particularly skilled at translating nuanced model design choices (like position embedding strategies) into robust backend implementations.
code11 years of coding experience
job2 years of employment as a software developer
bookBachelor of Science - BS, Computer Science, Bachelor of Science - BS, Computer Science at Princeton University
github-logo-circle

Github Skills (11)

mask-rcnn10
faster-rcnn10
pytorch10
machine-learning10
word-embeddings10
fasterrcnn10
word-embedding10
nlp10
neural-networks10
relation-extraction10
python10

Programming languages (6)

RustCTeXHTMLJupyter NotebookPython

Github contributions (5)

github-logo-circle
thunlp/OpenNRE

Jul 2018 - Dec 2021

An Open-Source Package for Neural Relation Extraction (NRE)
Role in this project:
userBack-end Developer
Contributions:211 commits, 13 PRs, 146 pushes in 3 years 5 months
Contributions summary:Helen's commits primarily revolve around implementing and modifying the `CNNSentenceEncoder` and `CNNSoftmax` models within the `nrekit` package. These changes involve defining the architecture, including word embeddings, position embeddings, and convolutional layers. The user appears to be working on the core functionality of a neural relation extraction (NRE) model based on convolutional neural networks (CNNs), including the training process.
extractionrelation-extractionnreinformation-retrievalrelation
princeton-nlp/LM-BFF

Dec 2020 - Aug 2022

ACL'2021: LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
Contributions:19 commits, 3 PRs, 16 pushes in 1 year 8 months
nlplanguage-modelarxivabsfine
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial