Xiangpeng Hao

Madison, Wisconsin, United States
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

🤩
Rockstar
Xiangpeng Hao is a software engineer and PhD candidate with 10 years of experience specializing in database storage engines, KV stores, and larger-than-memory indexes. He builds reliable systems in Rust with a strong emphasis on performance and correctness. His background blends academic research at the University of Wisconsin–Madison with industry internships at Google, Microsoft, and InfluxData, bringing research rigor to production problems. An active open-source contributor, he implemented BinaryView and Utf8View support and IPC handling in the official Apache Arrow Rust project and optimized Apache DataFusion to avoid unnecessary data copies, demonstrating deep knowledge of data structures, serialization, and query engine internals. Based in Madison, Wisconsin, he moves comfortably between low-level storage design and end-to-end data processing to turn complex research ideas into robust, deployable software.
code10 years of coding experience
github-logo-circle

Github Skills (19)

arrowkeys10
data-serialization10
data-structure10
serialization10
fusion10
apache-arrow10
sql10
arrowjs10
data-processing10
parquet10
data-structures10
query-engine10
rust10
arrows10
arrow-js10

Programming languages (9)

TypeScriptC++RustCVueJavaScriptHTMLPython

Github contributions (5)

github-logo-circle
apache/arrow-rs

Apr 2023 - Apr 2025

Official Rust implementation of Apache Arrow
Role in this project:
userBack-end Developer
Contributions:83 reviews, 44 PRs, 196 comments in 2 years
Contributions summary:Xiangpeng contributed to the official Rust implementation of Apache Arrow by adding new data types, specifically `BinaryView` and `Utf8View`, which involved modifying the schema and data structures. They also addressed code review suggestions and updated relevant files. Furthermore, the user worked on IPC format support for the new view types, including changes to the reader and writer. They demonstrated knowledge of data structures, serialization formats, and core Arrow data types.
rustolapapachearrowapache-arrow
apache/datafusion

Apr 2023 - Mar 2025

Apache DataFusion SQL Query Engine
Role in this project:
userBack-end Developer
Contributions:42 reviews, 29 PRs, 103 comments in 2 years
Contributions summary:Xiangpeng contributed primarily to the core data processing logic and query engine functionality. Their work involved implementing support for new data types like `Utf8View` and `BinaryView`, and addressing related issues, including bugs in aggregate functions. They also focused on optimizing performance by avoiding unnecessary data copies when reading arrow files and ensuring correct behavior in various data processing scenarios. Furthermore, the user worked on enhancing the project's functionality, such as fixing issues related to `COUNT(DISTINCT)` on string views and improving the handling of parquet settings.
querypythonquery-enginedataframerust
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Xiangpeng Hao