Peter Wang

Machine Learning Engineer at Meta

San Jose, California, United States
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

🤩
Rockstar
🎓
Top School
Peter Wang is a Software Engineer based in Old Toronto with six years of experience focused on backend systems for data and metadata platforms. He is an active open-source contributor to DataHub, where he enhanced Superset ingestion by implementing dataset and column-level lineage and adding ownership metadata for charts, dashboards, and datasets. Peter also improved system reliability and performance by introducing timeout controls and threaded Superset API calls to prevent hanging queries and reduce resource usage, reflecting a pragmatic production-first mindset. An avid rock climber, he brings the same methodical problem-solving and composure under pressure to complex engineering challenges.
code7 years of coding experience
job16 years of employment as a software developer
bookBachelor's, Civil Engineering, Bachelor's, Civil Engineering at Tongji University
bookMaster's, Applied Mathematics, Master's, Applied Mathematics at University of Illinois Urbana-Champaign
languagesEnglish
github-logo-circle

Github Skills (16)

data-ingestion10
data-lineage10
discovery10
data-governance10
data-catalog10
metadata10
service-discovery10
auto-discovery10
python10
api9
api-doc9
multithreading8
github-ci4
github-actions-workflows4
github-actions-workflow4

Programming languages (8)

TypeScriptJavaShellC++JavaScriptGoJupyter NotebookPython

Github contributions (5)

github-logo-circle
datahub-project/datahub

Jan 2025 - Mar 2025

The Metadata Platform for your Data and AI Stack
Role in this project:
userBack-end Developer
Contributions:20 reviews, 8 PRs, 30 comments in 2 months
Contributions summary:Peter's contributions center around enhancing the Superset data ingestion capabilities within the DataHub project. They implemented features to integrate Superset's dataset lineage, and added ownership information for charts, dashboards and datasets, improving the data catalog. Furthermore, the user introduced column-level lineage for both datasets and charts, augmenting the data lineage capabilities. They also addressed potential issues with hanging queries and resource usage by introducing timeout values to the Superset API calls and leveraging threads for API calls, improving system performance.
data-managementdata-discoverydata-stackmodern-data-stackdata-catalog
PeteMango/PeteMango

Dec 2022 - Feb 2025

Contributions:2 PRs, 57 pushes, 1 branch in 2 years 2 months
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial