Research Scientist (Senior & Staff levels)

Remote / Online - Candidates ideally in
San Francisco, San Francisco County, California, 94199, USA
Listing for: Trades Workforce Solutions
Remote/Work from Home position
Listed on 2026-01-12
Job specializations:
  • Research/Development
    Data Scientist, Artificial Intelligence
  • IT/Tech
    Data Scientist, Artificial Intelligence
Salary/Wage Range or Industry Benchmark: 250000 - 400000 USD Yearly
Job Description & How to Apply Below
Position: Research Scientist - (Senior & Staff levels)

Responsibilities

Want to push the boundaries of what reinforcement learning can achieve with frontier models?

In this role you will advance reinforcement learning methods for large‑scale AI systems, applying RL techniques to enhance reasoning, planning, and decision‑making in models that directly impact fields from biology to climate and materials science.

Your work will combine RL with large language models: experimenting with RLHF, PPO, and DPO, designing evaluation frameworks, and fine‑tuning models. The aim is to go beyond benchmarks and deliver models that researchers can use to accelerate discovery.

You will be a driving force in a team that is building towards a broader superintelligence platform: models that don’t just generate text or data, but drive breakthroughs across multiple domains. As part of this, you’ll collaborate with domain experts to ensure your research translates into real‑world scientific progress.

Qualifications
  • Deep expertise in reinforcement learning (policy optimisation, value‑based, or model‑based methods).
  • Experience applying RL to large models (RLHF, PPO, DPO).
  • Hands‑on experience with model training and fine‑tuning at scale.
  • PhD in Computer Science, Machine Learning, Robotics, or related field, with contributions to top‑tier conferences (NeurIPS, ICML, ICLR, AAAI).
  • Experience with distributed computing platforms (cloud or HPC clusters).
  • Track record of running rigorous experiments and improving models based on results.

Experience with multi‑agent RL, hierarchical/offline RL, or domain‑specific work with scientific datasets would make you an ideal candidate for this position.

Package: $250k - $400k base + bonus + stock

Location: SF Bay Area, or potentially remote with travel to the office when needed.

If you want to see your RL research power the next generation of superintelligence, this is the role for you!

Position Requirements
10+ Years work experience