Principal Researcher – Reinforcement Learning for Large Foundation Models

Tencent

Full-time · Posted: Oct 12, 2025

Job Description

📋 Job Overview

Tencent AI Lab is seeking a Principal Researcher specializing in reinforcement learning for large foundation models to advance AI technologies. The role focuses on developing stable and efficient RL algorithms to enhance model capabilities in complex reasoning, autonomous exploration, and continuous learning. Responsibilities include leading research, conducting experiments, and publishing impactful papers.

📍 Location: Bellevue, Washington, United States

🏢 Business Unit: TEG

📄 Full Description

What the Role Entails
About the Position
Tencent AI Lab is dedicated to advancing cutting-edge AI technologies, with a particular focus on innovative breakthroughs in large foundation models. We are currently seeking expert-level researchers in the area of reinforcement learning (RL) for large foundation models, with an emphasis on developing stable and efficient RL algorithms. The goal is to empower large foundation models in complex reasoning and agent tasks and enhance their capabilities in autonomous exploration and continuous learning. The final position level will be determined based on the candidate's experience and accomplishments.
Key Responsibilities
1. Lead cutting-edge algorithm research, with a focus on the design and optimization of RL algorithms for large foundation models. Research areas include but are not limited to Reinforcement Learning Algorithms, Reward Modeling, and World Models.
2. Conduct large-scale experiments with RL algorithms in scenarios such as complex reasoning and autonomous agents, deliver impactful algorithms for real-world applications, and publish influential research papers.
3. Explore frontier technologies in large foundation models, and provide technical solutions aligned with future practical application scenarios.

Who We Look For
Qualifications
1. Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or a related field from a top university.
2. Experience working at leading global companies in the field of large foundation models.
3. Proficient in Python programming and experienced in developing with deep learning frameworks such as PyTorch or TensorFlow.
4. Strong academic background, with publications in top-tier conferences such as NeurIPS, ICLR, ICML, ACL, and EMNLP; experience in deep learning research or engineering projects.
5. Excellent communication and teamwork skills, capable of collaborating with cross-functional teams to drive project success and innovation.
Location State(s)
US-Washington-Bellevue
The expected base pay range for this position in the location(s) listed above is $163,800.00 to $307,600.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience.
Employees hired for this position may be eligible for a sign-on payment, relocation package, and restricted stock units, which will be evaluated on a case-by-case basis.
Subject to the terms and conditions of the plans in effect, hired applicants are also eligible for medical, dental, vision, life and disability benefits, and participation in the Company’s 401(k) plan. Employees are also eligible for up to 15 to 25 days of vacation per year (depending on tenure), up to 13 days of holidays throughout the calendar year, and up to 10 days of paid sick leave per year.
Your benefits may be adjusted to reflect your location, employment status, duration of employment with the company, and position level. Benefits may also be pro-rated for those who start working during the calendar year.

Equal Employment Opportunity at Tencent
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
Work Location: US-Washington-Bellevue

🛠️ Required Skills

  • Proficiency in Python programming
  • Experience with deep learning frameworks such as PyTorch or TensorFlow
  • Expertise in Reinforcement Learning Algorithms, Reward Modeling, and World Models
  • Strong research skills in deep learning
  • Excellent communication and teamwork skills

🎁 Benefits

  • Medical, dental, vision, life and disability benefits
  • Participation in the Company’s 401(k) plan
  • Up to 15 to 25 days of vacation per year (depending on tenure)
  • Up to 13 days of holidays throughout the calendar year
  • Up to 10 days of paid sick leave per year
  • Eligibility for sign-on payment, relocation package, and restricted stock units (case-by-case)

Locations

  • Bellevue, Washington, United States

Salary

$163,800 to $307,600 USD per year
