About the role
Build and operate systems serving foundational models.
- •Join H Company's Inference team to build and operate systems serving foundational models.
- •Key Responsibilities Build and operate the inference stack for H's multimodal agentic models Improve latency, throughput, and cost of model serving Research and implement inference techniques tailored to agent workloads Collaborate with cross-functional teams to integrate inference into agentic AI products Stay current with advancements in inference, model serving, and accelerator technology Requirements Strong software engineering track record Proficient in Python and at least one systems language (Rust, C++, or Go) Hands-on experience with deep learning frameworks (PyTorch, JAX) Solid distributed systems fundamentals Excellent communication and presentation skills Strong collaboration and teamwork skills
Tech stack
PythonRustC++GoPyTorchJAX
Match insights
Tech:Python, Rust, C++, Go, PyTorch
Level:Mid