RESUME AND JOB
Tencent
Tencent is seeking a Reinforcement Learning (RL) Training Framework R&D Engineer to develop and optimize large-scale RL training frameworks for big models. The role involves accelerating RL training for various business applications like text generation, multimodal understanding, and generation, while validating model performance in collaboration with business teams. Responsibilities include developing and optimizing modules for training, inference, parameter transfer, and advanced features such as train-inference separation and agent scenarios, supporting efficient and stable training at thousand or ten-thousand card scales.
📍 Location: Shenzhen, China
🏢 Business Unit: TEG
300,000 - 800,000 CNY / yearly
Source: ai estimated
* This is an estimated range based on market data and may vary based on experience and qualifications.
Get personalized recommendations to optimize your resume specifically for 混元强化训练框架研发工程师(深圳/北京/上海/杭州). Takes only 15 seconds!
Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.
Answer 10 quick questions to check your fit for 混元强化训练框架研发工程师(深圳/北京/上海/杭州) @ Tencent.

No related jobs found at the moment.

© 2026 Pointers. All rights reserved.

Tencent
Tencent is seeking a Reinforcement Learning (RL) Training Framework R&D Engineer to develop and optimize large-scale RL training frameworks for big models. The role involves accelerating RL training for various business applications like text generation, multimodal understanding, and generation, while validating model performance in collaboration with business teams. Responsibilities include developing and optimizing modules for training, inference, parameter transfer, and advanced features such as train-inference separation and agent scenarios, supporting efficient and stable training at thousand or ten-thousand card scales.
📍 Location: Shenzhen, China
🏢 Business Unit: TEG
300,000 - 800,000 CNY / yearly
Source: ai estimated
* This is an estimated range based on market data and may vary based on experience and qualifications.
Get personalized recommendations to optimize your resume specifically for 混元强化训练框架研发工程师(深圳/北京/上海/杭州). Takes only 15 seconds!
Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.
Answer 10 quick questions to check your fit for 混元强化训练框架研发工程师(深圳/北京/上海/杭州) @ Tencent.

No related jobs found at the moment.

© 2026 Pointers. All rights reserved.