About the role
Optimize ML and HPC kernels for Cerebras' AI processor.
- •Develop and optimize high-performance ML and HPC kernels for Cerebras' wafer-scale AI processor, implementing low-level routines and scaling algorithms to maximize compute utilization.
- •Key Responsibilities Design ML and linear algebra kernels and map them to Cerebras hardware.
- •Develop and debug low-level assembly and C-like (CSL) kernel routines.
- •Use mathematical models to measure and inform performance decisions.
- •Integrate unit and system testing for kernel libraries.
- •Collaborate with chip and system architects to optimize microarchitecture and IO.
- •Requirements Bachelor's, Master's, or PhD in CS, CE, Mathematics, or related field.
- •Strong C++ and Python programming skills.
- •Understanding of hardware architecture concepts and debugging complex stacks.
- •Knowledge of library/API development best practices.
Tech stack
C++Python
Match insights
Tech:C++, Python
Level:Senior