Staff + Senior Software Engineer, Inference

AnthropicGenerative AI, company

San Francisco, United States$320,000 - $485,000 USDLead

Software Engineering

Bookmark Apply on site→

About the role

Design and build distributed systems for AI model inference at scale.

•The Inference team builds and maintains the critical systems that serve Claude to millions of users worldwide, handling compute-agnostic inference deployments across diverse AI accelerators.
•The team focuses on maximizing compute efficiency and enabling breakthrough research by providing high-performance inference infrastructure.
•Key Responsibilities Design, build, and maintain distributed systems for serving Claude.
•Develop intelligent request routing, load balancing, and traffic management systems.
•Maximize compute efficiency through autoscaling and orchestration.
•Build and operate production-grade deployment pipelines for new models.
•Provide high-performance inference infrastructure for researchers.
•Requirements Significant software engineering experience, particularly with distributed systems.
•Results-oriented with a bias towards flexibility and impact.
•Willingness to pick up slack and enjoy pair programming.

View original posting →

View original posting for full requirements →

Tech stack

PythonRustKubernetesAWSGoogle CloudAzureDockerCI/CDgRPCREST API

Match insights

Tech:Python, Rust, Kubernetes, AWS, Google Cloud

Level:Lead

More roles at Anthropic

View open roles at Anthropic