Staff + Senior Software Engineer, Inference
AnthropicGenerative AI, company
San Francisco, United States$320,000 - $485,000 USDLead
Software Engineering
About the role
Design and build distributed systems for AI model inference at scale.
- •The Inference team builds and maintains the critical systems that serve Claude to millions of users worldwide, handling compute-agnostic inference deployments across diverse AI accelerators.
- •The team focuses on maximizing compute efficiency and enabling breakthrough research by providing high-performance inference infrastructure.
- •Key Responsibilities Design, build, and maintain distributed systems for serving Claude.
- •Develop intelligent request routing, load balancing, and traffic management systems.
- •Maximize compute efficiency through autoscaling and orchestration.
- •Build and operate production-grade deployment pipelines for new models.
- •Provide high-performance inference infrastructure for researchers.
- •Requirements Significant software engineering experience, particularly with distributed systems.
- •Results-oriented with a bias towards flexibility and impact.
- •Willingness to pick up slack and enjoy pair programming.
Tech stack
PythonRustKubernetesAWSGoogle CloudAzureDockerCI/CDgRPCREST API
Match insights
Tech:Python, Rust, Kubernetes, AWS, Google Cloud
Level:Lead