Staff Software Engineer (Machine Learning Infrastructure) at Thumbtack
San Francisco, California, United States
Join Prog.AI to see contacts
Join Prog.AI to see contacts
Summary
👤
Senior
🎓
Top School
Oleksandr Pryimak is a Staff Software Engineer specializing in Machine Learning Infrastructure, currently leading ML platform efforts at Thumbtack in San Francisco. With 14 years of experience across Yandex, Criteo, and Thumbtack, he builds data-intensive pipelines and scalable ML systems that empower scientists and engineers to extract value from their data. He has driven reliability improvements—from big data processing pipelines at Criteo to extending ML infrastructure at Thumbtack for high-cardinality features, and leading internal tooling for ML workflows. An active open-source contributor, he has enhanced the XGBoost testing infrastructure and documentation and contributed to Priam’s backup/recovery tooling for Cassandra. He holds a Master's degree in Computer Science from Moscow Institute of Physics and Technology and studied data mining at Yandex School of Data Mining, bringing a rigorous, production-focused mindset to AI/ML infrastructure.
15 years of coding experience
11 years of employment as a software developer
Master's degree, Computer Science, Master's degree, Computer Science at Moscow Institute of Physics and Technology (State University) (MIPT)
Additional education, math statistics, data mining, algorithms, Additional education, math statistics, data mining, algorithms at Yandex School of Data Mining
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Role in this project:
QA Engineer / Test Automation Engineer
Contributions:8 commits, 11 PRs, 165 comments in 2 years 9 months
Contributions summary:Oleksandr primarily focused on improving the testing infrastructure and documentation within the XGBoost repository. Their contributions include adding instructions for running tests locally, modifying test scripts, and updating documentation related to testing procedures. They also added optional dependencies to setup.py, and addressed Pandas-related test failures. The user was also involved in updating the documentation for the xgboost-spark project and ensuring that JVM tests clean up after themselves.
Co-Process for backup/recovery, Token Management, and Centralized Configuration management for Cassandra.
Role in this project:
Backend Developer
Contributions:11 commits in 14 days
Contributions summary:Oleksandr primarily focused on enhancing the Priam codebase, which is related to backup and recovery of Cassandra. Their contributions included adding error checking and output logging to the Cassandra start/stop processes, improving reliability and troubleshooting capabilities. Furthermore, the user added a backuper and restorer tool, demonstrating their focus on core functionality. Further work included adding an incremental backup tool and refactoring code for better initialization and shutdown.
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.