
Researcher, Agentic RL for LLMs

Job in Markham, Ontario, Canada
Listing for: Huawei Canada
Contract position
Listed on 2026-03-01
Job specializations:
  • IT/Tech
    Data Scientist
Salary/Wage Range or Industry Benchmark: 30,000 - 60,000 CAD yearly
Job Description & How to Apply Below
Position: Researcher, Agentic RL for LLMs (Contract)
A prominent technology company in York Region, Canada, is seeking a Researcher in Reinforcement Learning. The role focuses on improving Large Language Models (LLMs) through new training techniques and evaluation methods. The ideal candidate holds a PhD in Computer Science, has strong deep learning and reinforcement learning skills, and is proficient in Python with experience in tools such as PyTorch.

This is an entry-level position on a 12-month contract.