Heran Lin

Co-Founder and CTO at LakeSail

Shenzhen, Guangdong Province, China

Summary

🤩 Rockstar
🎓 Top School
Heran Lin is a Co-Founder and CTO in Shenzhen with 12 years of experience building ML-driven search, recommendation and real-time data systems. He has led product classification and cold-start ranking efforts at Tencent and A9.com and built online query services and streaming pipelines as a freelance ML engineer. A hands-on backend engineer and open-source contributor, he has improved the Apache DataFusion SQL engine by adding UDTF access and refining internal expression and protobuf handling. Heran bridges research and product—his background includes research internships at Microsoft Research Asia and Bosch where he worked on visible light communication and speech-recognition error reduction—anchored by a CMU MS in Computational Data Science (4.07/4.33) and a Tsinghua BS. He focuses on shipping robust, scalable ML systems that translate academic techniques into production search and recommendation features.
12 years of coding experience
7 years of employment as a software developer
Bachelor’s Degree, Computer Software at Tsinghua University
Master’s Degree, Computational Data Science, 4.07/4.33 at Carnegie Mellon University
Languages: Chinese, English

Github Skills (15)

fusion: 10
query-engine: 10
arrowkeys: 10
rust: 10
sql: 10
back-end-development: 10
arrowjs: 10
arrows: 10
arrow-js: 10
protobuf: 9
protobuf3: 9
protobufs: 9
protobuff: 9
data-structures: 8
data-structure: 8

Programming languages (6)

TypeScript, Java, Rust, JavaScript, Go, Python

Github contributions (5)

apache/datafusion

Jun 2024 - Mar 2025

Apache DataFusion SQL Query Engine
Role in this project:
Back-end Developer
Contributions: 4 reviews, 5 PRs, 13 comments in 9 months
Contributions summary: Heran made several contributions to the Apache DataFusion SQL query engine. He modified the handling of `Expr::Wildcard`, updating its type and the associated protobuf definitions. He also made UDTFs accessible through the `SessionContext` and added support for deregistering them. Other changes involved code formatting, returning references, and updates to internal data structures.
query, python, query-engine, dataframe, rust
lakehq/sail

Aug 2024 - Apr 2025

LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive (AI) workloads.
Contributions: 164 reviews, 266 PRs, 400 pushes in 8 months
arrow, big-data, data, datafusion, pyspark