Daya Guo

Research Intern at Microsoft

Haidian District, Beijing, China

Summary

🤩
Rockstar
🎓
Top School
Daya Guo is an NLP-focused machine learning researcher and engineer with 7 years of experience, currently a research intern at Microsoft and enrolled in the SYSU–MSRA joint PhD program. She pairs top academic performance (ranked 1/88 in her CS bachelor's) with practical wins, including contributing to a first-place solution in the 2020 Tencent College Algorithm Contest. Her contributions span production ML pipelines, word‑embedding engineering, and model architecture tuning, and she has worked on CodeXGLUE tasks like clone and defect detection for code analysis. Active in Kaggle and Tianchi competitions, she bridges research and applied engineering to deliver high-performance NLP and code-intelligence systems.
7 years of coding experience
PhD, Computer Science and Technology at Sun Yat-sen University
Bachelor's degree, Computer Science, GPA: 4.3/5, Rank: 1/88 at Sun Yat-sen University

Github Skills (16)

pytorch: 10
machine-learning: 10
code-analysis: 10
tensorflow2: 10
nlp: 10
tensorflow: 10
python: 10
word2vec: 10
natural-language-processing: 10
data-analysis: 9
pandas: 9
algorithms: 8
algorithm: 8
modeling: 8
trainings: 8

Programming languages (5)

C#, Makefile, JavaScript, HTML, Python

Github contributions (5)

guoday/Tencent2020_Rank1st

Jul 2020 - Jun 2022

Code for the 2020 Tencent College Algorithm Contest; the online result ranked 1st.
Role in this project:
Data Scientist
Contributions: 18 commits, 1 PR, 8 pushes in 1 year 10 months
Contributions summary: Daya primarily focused on updating and refining machine learning model components. The commits modified preprocessing steps in `preprocess.py` and adjusted the model architecture in `model.py` to incorporate additional features and layers, likely to improve performance. They also updated the training configuration in `run.py` and adjusted how word embeddings are generated and used in `w2v.py` with the `Word2Vec` model. These changes suggest an active role in optimizing the overall machine learning pipeline.
Tags: javascript, ranks, tencent
microsoft/CodeXGLUE

Sep 2020 - Nov 2021

CodeXGLUE
Role in this project:
ML Engineer
Contributions: 8 commits, 2 PRs, 49 comments in 1 year 2 months
Contributions summary: Daya primarily worked on the `Clone-detection-POJ-104`, `Clone-detection-BigCloneBench`, and `Defect-detection` tasks. The work involved updating the run scripts, model training, and evaluation for these tasks, which indicates a focus on machine learning models for code analysis.