🤖
Makora (Formerly Mako)
Posted on Mar 10, 2026
AI Engineer
Table of Contents
Summary
Our Compiler R&D team is developing a next generation machine learning compiler that integrates the latest in code generating LLMs into our compilation flow.
We're looking for an AI engineer with experience in code generating LLMs, and a strong background in building and evaluating AI agents.
This job is based in New York City. Remote work will be considered for exceptional candidates.
About Makora
Makora is a venture-backed AI lab building building tools to automate algorithm discovery and GPU performance engineering. There are two core components:
MakoraGenerate writes GPU kernels in CUDA, HIP, and Triton using LLMs
MakoraOptimize automatically selects and swaps GPU kernels in combination with tuning inference engine (vLLM, SGlang, etc..) hyperparameters to optimize performance
Responsibilities
Create an LLM-based AI agent using proprietary frontier models, open-source models, fine-tuning, prompt engineering, RAG, or any other combination of technologies you can think of
Explore and analyze performance of various LLMs in terms of tool use and code generation
Develop tools and APIs for LLMs to use in the process of generating and augmenting GPU kernels
Implement programming solutions in C/C++ and Python.
Qualifications
Strong programming skills in C/C++ and Python.
Experience developing AI agents that leverage LLMs and external tools
General experience with the training, fine-tuning, and deployment of LLMs
Bonus Points
Proven experience with kernel optimizations on CUDA, ROCm, or other accelerators
Deep understanding and experience in GPU performance optimizations
Our Benefits
Competitive salary and equity package
Comprehensive health insurance coverage for you and your family
Remote work option for exceptional candidates
Generous vacation and paid time off policy
Modern and comfortable work environment with state-of-the-art equipment and facilities