Andrew Montanez

Head Of Engineering at DataCebo

San Francisco, California, United States
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

🤩
Rockstar
🎓
Top School
Andrew Montanez is Head of Engineering at DataCebo with 10 years of experience building ML-driven backend systems and synthetic data tooling from San Francisco. He combines hands-on Python development and backend/devops expertise with research roots at MIT, where he worked on the Synthetic Data Vault and copula modeling. An active open-source contributor to SDV projects (CTGAN, Copulas, SDV), he balances model development with package and build health—managing dependency upgrades, tests, and versioning to keep libraries production-ready. Known for turning research-grade algorithms into reliable, well-tested pipelines, he leads teams that deliver scalable synthetic-data solutions for real-world privacy and feature-generation problems.
code10 years of coding experience
job7 years of employment as a software developer
bookMassachusetts Institute of Technology
github-logo-circle

Github Skills (31)

data-generation10
constraints10
pytorch10
python10
package-management10
pandas10
machine-learning10
generative-adversarial-network10
constraint10
ci-cd10
synthetic-data10
generative-adversarial-networks10
devops10
data-science9
testing9

Programming languages (2)

JavaScriptPython

Github contributions (5)

github-logo-circle
sdv-dev/SDV

Jul 2018 - Jan 2023

Synthetic data generation for tabular data
Role in this project:
userBack-end Developer & Data Scientist
Contributions:29 releases, 1066 reviews, 224 commits in 4 years 7 months
Contributions summary:Andrew primarily contributed to the codebase by implementing and refining constraints for synthetic data generation. Their work involved fixing issues related to handling lambda functions and functions returned from other functions within the constraint framework. They also addressed and resolved bugs related to duplicate IDs when utilizing reject-sampling, improving the functionality and robustness of the synthetic data generation process. These changes focused on core components of the synthetic data generation process using Python and potentially related frameworks.
relational-datasetssynthetictime-seriessynthetic-datamulti-table
sdv-dev/CTGAN

Jul 2021 - Jan 2023

Conditional GAN for generating synthetic tabular data.
Role in this project:
userML Engineer
Contributions:12 releases, 91 reviews, 20 commits in 1 year 6 months
Contributions summary:Andrew primarily worked on the CTGAN project, a conditional GAN for generating synthetic tabular data. Their commits involved updating the `rdt` dependency version, renaming and modifying tests related to synthesizers, and bumping the version of the library, indicating direct involvement in the model's core functionality and maintenance. They also fixed warnings and addressed package maintenance, which are important aspects of maintaining the project's health and stability.
pytorchdeep-learningconditionaltabular-datagenerative-adversarial-network
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial