About the role
Join a small research group focused on fine-tuning, retrieval, and agent evaluation. You bridge papers to production.
What you'll do
- Run controlled experiments on agent quality and cost
- Build evaluation harnesses and offline benchmarks
- Publish internal research notes that drive product decisions
What we're looking for
- MS / PhD in CS, ML, or related — or equivalent experience
- Proven track record shipping evaluation or fine-tuning systems
- Fluent in PyTorch