Maksim Levental is a Senior ML Infrastructure Engineer in Boston with 11 years of experience building compilers, ML runtimes, and accelerator backends. He specializes in DSLs, compilers, accelerator architectures, and applied math, and contributes to high-profile open-source projects like torch-mlir (eager-mode backend), Triton (AMD GPU codegen and FP8 matmul support), and LLVM/MLIR Python bindings. At Apple he works on CoreML + MLIR, and prior roles at AMD involved compiler research and Triton/IREE work for Instinct/CDNA and Xilinx AI Engines, while earlier positions at Meta/Facebook focused on PyTorch pipeline parallelism and JIT memory optimizations. Known for bridging research and production, he turns MLIR prototypes into eager GPU execution and performance-focused compiler integrations that improve real-world model throughput. He combines advanced computer science training and a BS in mathematics with an unusual background in communication—having served as a Peace Corps secondary school teacher early in his career.
12 years of coding experience
5 years of employment as a software developer
Bachelor of Science (BS), Mathematics, 3.73, Bachelor of Science (BS), Mathematics, 3.73 at Florida State University
Master of Science - MS, Computer Science, 4.0, Master of Science - MS, Computer Science, 4.0 at University of Chicago
Master’s Degree, Computer Science, 3.72, Master’s Degree, Computer Science, 3.72 at University of Florida
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
Role in this project:
Back-end Developer
Contributions:118 reviews, 19 commits, 87 PRs in 10 months
Contributions summary:Maksim primarily focuses on extending the `torch-mlir` framework to support eager mode execution for PyTorch. They implemented an eager mode backend, enabling the compilation of PyTorch operations on-the-fly. The user also addresses bugs, refactors code, and enhances the framework with features like the decomposition of certain operations. They also added examples and made updates for the use of the eager mode.
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
Role in this project:
Back-end Developer
Contributions:344 reviews, 213 PRs, 138 pushes in 2 years 7 months
Contributions summary:Maksim primarily contributed to the Python bindings for the LLVM MLIR project. Their work involved adding and modifying Python interfaces for various MLIR features, specifically the GPU dialect, LLVM dialect, and the `linalg` dialect. They implemented attribute and type casting for new classes and improved the behavior of existing functionalities such as the handling of `memref` objects. The user's changes also included adjustments to testing frameworks, test cases, and the inclusion of support for the `CLANG_CL` compiler.
compilerstechnologiesclangsubmittoolchain
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Maksim Levental - Senior ML Infrastructure Engineer at Apple