Our Compiler R&D team is developing a next generation machine learning compiler that integrates the latest in code generating LLMs into our compilation flow.

We're looking for an AI engineer with experience in code generating LLMs, and a strong background in building and evaluating AI agents.

This job is based in New York City. Remote work will be considered for exceptional candidates.

About Makora

Makora is a venture-backed AI lab building building tools to automate algorithm discovery and GPU performance engineering. There are two core components:

MakoraGenerate writes GPU kernels in CUDA, HIP, and Triton using LLMs

MakoraOptimize automatically selects and swaps GPU kernels in combination with tuning inference engine (vLLM, SGlang, etc..) hyperparameters to optimize performance

Responsibilities

Create an LLM-based AI agent using proprietary frontier models, open-source models, fine-tuning, prompt engineering, RAG, or any other combination of technologies you can think of

Explore and analyze performance of various LLMs in terms of tool use and code generation

Develop tools and APIs for LLMs to use in the process of generating and augmenting GPU kernels

Implement programming solutions in C/C++ and Python.

Qualifications

Strong programming skills in C/C++ and Python.

Experience developing AI agents that leverage LLMs and external tools

General experience with the training, fine-tuning, and deployment of LLMs

Bonus Points

Proven experience with kernel optimizations on CUDA, ROCm, or other accelerators

Deep understanding and experience in GPU performance optimizations

Our Benefits

Competitive salary and equity package

Comprehensive health insurance coverage for you and your family

Remote work option for exceptional candidates

Generous vacation and paid time off policy

Modern and comfortable work environment with state-of-the-art equipment and facilities

To Apply

Fill out this form

See more open positions at Makora (Formerly Mako)

Join our constellation

🤖

AI Engineer

Summary