Resume and JobRESUME AND JOB
Tencent logo

混元强化训练框架研发工程师(深圳/北京/上海/杭州)

Tencent

Software and Technology Jobs

混元强化训练框架研发工程师(深圳/北京/上海/杭州)

full-timePosted: Nov 17, 2025

Job Description

混元强化训练框架研发工程师(深圳/北京/上海/杭州)

📋 Job Overview

Tencent is seeking a Reinforcement Learning (RL) Training Framework R&D Engineer to develop and optimize large-scale RL training frameworks for big models. The role involves accelerating RL training for various business applications like text generation, multimodal understanding, and generation, while validating model performance in collaboration with business teams. Responsibilities include developing and optimizing modules for training, inference, parameter transfer, and advanced features such as train-inference separation and agent scenarios, supporting efficient and stable training at thousand or ten-thousand card scales.

📍 Location: Shenzhen, China

🏢 Business Unit: TEG

📄 Full Description

1.参与开发大模型RL训练框架,支持千卡或万卡规模高效稳定RL训练;
2.参与文生文、多模态理解、多模态生成等业务的RL训练加速,并联合业务进行模型效果验证;
3.参与训练、推理、参数传输等模块开发和优化;
4.参与训推分离、partial rollout、agent场景开发和优化。

🎯 Key Responsibilities

  • 参与开发大模型RL训练框架,支持千卡或万卡规模高效稳定RL训练
  • 参与文生文、多模态理解、多模态生成等业务的RL训练加速,并联合业务进行模型效果验证
  • 参与训练、推理、参数传输等模块开发和优化
  • 参与训推分离、partial rollout、agent场景开发和优化

Locations

  • Shenzhen, China

Salary

Estimated Salary Rangemedium confidence

300,000 - 800,000 CNY / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Responsibilities

  • 参与开发大模型RL训练框架,支持千卡或万卡规模高效稳定RL训练
  • 参与文生文、多模态理解、多模态生成等业务的RL训练加速,并联合业务进行模型效果验证
  • 参与训练、推理、参数传输等模块开发和优化
  • 参与训推分离、partial rollout、agent场景开发和优化

Target Your Resume for "混元强化训练框架研发工程师(深圳/北京/上海/杭州)" , Tencent

Get personalized recommendations to optimize your resume specifically for 混元强化训练框架研发工程师(深圳/北京/上海/杭州). Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "混元强化训练框架研发工程师(深圳/北京/上海/杭州)" , Tencent

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

TencentShenzhenChinaTEGTEG

Answer 10 quick questions to check your fit for 混元强化训练框架研发工程师(深圳/北京/上海/杭州) @ Tencent.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.

Tencent logo

混元强化训练框架研发工程师(深圳/北京/上海/杭州)

Tencent

Software and Technology Jobs

混元强化训练框架研发工程师(深圳/北京/上海/杭州)

full-timePosted: Nov 17, 2025

Job Description

混元强化训练框架研发工程师(深圳/北京/上海/杭州)

📋 Job Overview

Tencent is seeking a Reinforcement Learning (RL) Training Framework R&D Engineer to develop and optimize large-scale RL training frameworks for big models. The role involves accelerating RL training for various business applications like text generation, multimodal understanding, and generation, while validating model performance in collaboration with business teams. Responsibilities include developing and optimizing modules for training, inference, parameter transfer, and advanced features such as train-inference separation and agent scenarios, supporting efficient and stable training at thousand or ten-thousand card scales.

📍 Location: Shenzhen, China

🏢 Business Unit: TEG

📄 Full Description

1.参与开发大模型RL训练框架,支持千卡或万卡规模高效稳定RL训练;
2.参与文生文、多模态理解、多模态生成等业务的RL训练加速,并联合业务进行模型效果验证;
3.参与训练、推理、参数传输等模块开发和优化;
4.参与训推分离、partial rollout、agent场景开发和优化。

🎯 Key Responsibilities

  • 参与开发大模型RL训练框架,支持千卡或万卡规模高效稳定RL训练
  • 参与文生文、多模态理解、多模态生成等业务的RL训练加速,并联合业务进行模型效果验证
  • 参与训练、推理、参数传输等模块开发和优化
  • 参与训推分离、partial rollout、agent场景开发和优化

Locations

  • Shenzhen, China

Salary

Estimated Salary Rangemedium confidence

300,000 - 800,000 CNY / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Responsibilities

  • 参与开发大模型RL训练框架,支持千卡或万卡规模高效稳定RL训练
  • 参与文生文、多模态理解、多模态生成等业务的RL训练加速,并联合业务进行模型效果验证
  • 参与训练、推理、参数传输等模块开发和优化
  • 参与训推分离、partial rollout、agent场景开发和优化

Target Your Resume for "混元强化训练框架研发工程师(深圳/北京/上海/杭州)" , Tencent

Get personalized recommendations to optimize your resume specifically for 混元强化训练框架研发工程师(深圳/北京/上海/杭州). Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "混元强化训练框架研发工程师(深圳/北京/上海/杭州)" , Tencent

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

TencentShenzhenChinaTEGTEG

Answer 10 quick questions to check your fit for 混元强化训练框架研发工程师(深圳/北京/上海/杭州) @ Tencent.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.