Contribute to our monokernel pipeline, the single persistent GPU program that covers the full decode pass from QKV projection to LM head sampling, across AMD and NVIDIA architectures.
Work on low-level GPU optimization, including impossibly fast grid synchronizations, inter-GPU collectives, and GEMM and attention kernels optimized for specific batch sizes and context lengths.
Build profiling infrastructure inside a monokernel, including custom instrumentation, device-timestamp frameworks, and per-stage analysis to translate machine behavior into concrete engineering decisions.
Scale the stack to third-party MoE models such as DeepSeek v4 and Qwen 3 to push generation speed on the models that matter in production today.
Contribute to building AI agents that will perform GPU engineering research and kernel optimization autonomously, calibrated to the hardware target and workload, starting from the inference foundations we are building now.
Requirements
You have written GPU kernels where performance was the central constraint. You must show the code to move forward in the process.
PyTorch custom ops are an acceptable starting point if the kernels show a genuine understanding of the hardware below the framework level.
Stronger signals include inline PTX or CDNA ISA in public repositories, experience with latency-sensitive execution paths, an understanding of why memory bandwidth utilization (MBU) matters more than model FLOPs utilization (MFU) at batch size 1, and a background in inference engine components.
A degree from a top engineering school or a PhD with concrete GPU work counts, even without industry experience.
You will spend at least 50% of your time in our Paris office.
Tech Stack
PyTorch
Benefits
Direct access to AMD and NVIDIA datacenter GPUs from day one
A team where creativity and technical judgment carry weight and where the people closest to the problem shape the key decisions
Problems that sit on the critical path of model execution speed and that directly influence what the system can become
Compensation aligned with top technical profiles in the Paris AI market, including equity