Eric Zeng is a computer security and privacy researcher and software engineer with 13 years of experience who focuses on real-world harms people face on online platforms and builds prototypes and measurements to mitigate them. He holds a PhD from the University of Washington and has published at USENIX Security, CHI, IMC, and NDSS. Eric blends full‑stack engineering (React, TypeScript, Chromium), infrastructure (Kubernetes, Docker), ML/data science (LLMs, R, pandas) and human‑centered methods to design and evaluate usable security features—from HTTPS ecosystem fixes to smart‑home authentication. On the open‑source side he contributes to Microsoft’s autogen project (improving flaml test coverage and API clarity) and to ekzhu/datasketch where he implemented and optimized MinHash and HyperLogLog primitives for scalable similarity and cardinality estimation. Now a Postdoctoral Associate in Washington, D.C., he brings a pragmatic, measurement-driven approach that connects user research, statistical analysis, and production engineering.
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Role in this project:
Software Engineer & Test Automation
Contributions:19 releases, 2009 reviews, 859 PRs in 1 year 6 months
Contributions summary:Eric's commits primarily focused on enhancing the robustness and testing capabilities of the `flaml` submodule within the `autogen` repository. They implemented new tests for training logs, Python loggers, and the overall `flaml` module, thereby increasing test coverage. Furthermore, the user contributed to fixing code, clarifying API documentation and modifying core features such as structured output.
Contributions:28 releases, 31 reviews, 191 commits in 7 years 6 months
Contributions summary:Eric primarily contributed to the core data structure and related functionality of the project, evidenced by the addition and refinement of MinHash, HyperLogLog, and related similarity measures. Their work included the implementation and optimization of these probabilistic data structures, serialization capabilities, and benchmark creation for measuring performance and accuracy. The user also demonstrated expertise in applying these data sketches for cardinality estimation and Jaccard similarity calculations.
operationpythonensembledata-summarydatalog
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.