Research Engineer (Machine Learning)

Research Engineer, Machine Learning (Horizons) London, UKAnthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.As a Research Engineer on the Reinforcement Learning Fundamentals team, you will collaborate with a diverse group of researchers and engineers to advance the capabilities and safety of large language models through fundamental research in reinforcement learning, improving reasoning abilities in areas such as code generation and mathematics, and exploring reinforcement learning for agentic / open-ended tasks.Representative projects:Develop and implement novel reinforcement learning techniques to improve the performance and safety of large language models.Design and run experiments to enhance models'' reasoning capabilities, particularly in code generation and mathematics.Are proficient in Python and have experience with deep learning frameworks such as PyTorch or Jax.Have a strong software engineering background and are interested in working closely with researchers and engineers.Enjoy pair programming.Care about code quality, testing, and performance.Are passionate about AI''s potential impact and committed to developing safe, beneficial systems.Have a background in machine learning, reinforcement learning, or high-performance computing.Contributed to open-source projects or published relevant research.Experience with LLMs or machine learning research prior.Bachelor’s degree in a related field or equivalent experience.Location-based hybrid policy:
Currently, all staff are expected to be in the office at least 25% of the time, with some roles requiring more.Visa sponsorship:
We sponsor visas! We will make every effort to assist with visa processes if we make an offer.Diversity and representation are important to us, and we value different perspectives in our team.We believe impactful AI research is big science, focusing on large-scale efforts with high impact, akin to empirical sciences like physics and biology. Our recent research includes GPT-3, interpretability, multimodal neurons, scaling laws, AI and compute, safety, and human preferences.Anthropic is headquartered in San Francisco, offering competitive compensation, benefits, equity donation matching, generous leave, flexible hours, and a collaborative office environment.indicates a required fieldPhoneWebsiteAre you open to working in-office 25% of the time? *AI Policy acknowledgment *Require visa sponsorship now or in future? *Open to relocation? *
Other jobs of interest...


Perform a fresh search...
-
Create your ideal job search criteria by
completing our quick and simple form and
receive daily job alerts tailored to you!