RESUME AND JOB

混元大模型强化学习研究员

Tencent

混元大模型强化学习研究员

Tencent

full-timePosted: Nov 18, 2025

Job Description

混元大模型强化学习研究员

📋 Job Overview

Tencent is seeking a Reinforcement Learning Researcher for the Hunyuan Large Model, focusing on leading advanced algorithm research in RL for large models. The role involves designing and optimizing RL algorithms, conducting large-scale experiments in complex reasoning and autonomous learning scenarios, and driving practical applications in industry. Responsibilities include exploring cutting-edge technologies, providing innovative solutions, and collaborating with cross-functional teams to achieve technical breakthroughs.

📍 Location: Shenzhen, China

🏢 Business Unit: TEG

📄 Full Description

1.带领团队进行前沿算法研究，专注于大模型中强化学习算法的设计与优化，涵盖强化学习算法、奖励建模、世界模型等多个方向；
2.在大模型的复杂推理等自主探索与学习等场景中进行大规模实验验证，推动研究成果在行业内的实际应用，并发表具有影响力的学术论文；
3.探索大模型的前沿技术，结合未来实际应用场景，提供创新的技术解决方案；
4.与跨职能团队合作，确保项目进展顺利，并在技术突破方面发挥领导作用。

🎯 Key Responsibilities

Lead the team in cutting-edge algorithm research, focusing on the design and optimization of reinforcement learning algorithms in large models, covering directions such as RL algorithms, reward modeling, world models, etc.
Conduct large-scale experimental validation in scenarios like complex reasoning, autonomous exploration, and learning in large models, promote the practical application of research results in the industry, and publish influential academic papers.
Explore cutting-edge technologies in large models, combined with future practical application scenarios, to provide innovative technical solutions.
Collaborate with cross-functional teams to ensure smooth project progress and play a leading role in technical breakthroughs.

Locations

Shenzhen, China

Salary

Estimated Salary Rangemedium confidence

400,000 - 800,000 CNY / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Responsibilities

Lead the team in cutting-edge algorithm research, focusing on the design and optimization of reinforcement learning algorithms in large models, covering directions such as RL algorithms, reward modeling, world models, etc.
Conduct large-scale experimental validation in scenarios like complex reasoning, autonomous exploration, and learning in large models, promote the practical application of research results in the industry, and publish influential academic papers.
Explore cutting-edge technologies in large models, combined with future practical application scenarios, to provide innovative technical solutions.
Collaborate with cross-functional teams to ensure smooth project progress and play a leading role in technical breakthroughs.

Target Your Resume for "混元大模型强化学习研究员" , Tencent

Get personalized recommendations to optimize your resume specifically for 混元大模型强化学习研究员. Takes only 15 seconds!

AI-powered keyword optimization

Skills matching & gap analysis

Experience alignment suggestions

Check Your ATS Score for "混元大模型强化学习研究员" , Tencent

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check

Keyword optimization analysis

Skill matching & gap identification

Format & readability score

Tags & Categories

TencentShenzhenChinaTEGTEG

Answer 10 quick questions to check your fit for 混元大模型强化学习研究员 @ Tencent.

10 Questions

~2 Minutes

Instant Score

Related Books and Jobs

No related jobs found at the moment.

Privacy Terms & Conditions About Us Refund Policy Recruiter Login Sitemap

混元大模型强化学习研究员

Tencent

混元大模型强化学习研究员

Tencent

full-timePosted: Nov 18, 2025

Job Description

混元大模型强化学习研究员

📋 Job Overview

📍 Location: Shenzhen, China

🏢 Business Unit: TEG

📄 Full Description

🎯 Key Responsibilities

Lead the team in cutting-edge algorithm research, focusing on the design and optimization of reinforcement learning algorithms in large models, covering directions such as RL algorithms, reward modeling, world models, etc.
Conduct large-scale experimental validation in scenarios like complex reasoning, autonomous exploration, and learning in large models, promote the practical application of research results in the industry, and publish influential academic papers.
Explore cutting-edge technologies in large models, combined with future practical application scenarios, to provide innovative technical solutions.
Collaborate with cross-functional teams to ensure smooth project progress and play a leading role in technical breakthroughs.

Locations

Shenzhen, China

Salary

Estimated Salary Rangemedium confidence

400,000 - 800,000 CNY / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Responsibilities

Lead the team in cutting-edge algorithm research, focusing on the design and optimization of reinforcement learning algorithms in large models, covering directions such as RL algorithms, reward modeling, world models, etc.
Conduct large-scale experimental validation in scenarios like complex reasoning, autonomous exploration, and learning in large models, promote the practical application of research results in the industry, and publish influential academic papers.
Explore cutting-edge technologies in large models, combined with future practical application scenarios, to provide innovative technical solutions.
Collaborate with cross-functional teams to ensure smooth project progress and play a leading role in technical breakthroughs.

Target Your Resume for "混元大模型强化学习研究员" , Tencent

Get personalized recommendations to optimize your resume specifically for 混元大模型强化学习研究员. Takes only 15 seconds!

AI-powered keyword optimization

Skills matching & gap analysis

Experience alignment suggestions

Check Your ATS Score for "混元大模型强化学习研究员" , Tencent

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check

Keyword optimization analysis

Skill matching & gap identification

Format & readability score

Tags & Categories

TencentShenzhenChinaTEGTEG

Answer 10 quick questions to check your fit for 混元大模型强化学习研究员 @ Tencent.

10 Questions

~2 Minutes

Instant Score

Related Books and Jobs

No related jobs found at the moment.

Privacy Terms & Conditions About Us Refund Policy Recruiter Login Sitemap