Resume and JobRESUME AND JOB
Tencent logo

混元大模型推理研发专家(深圳/北京/上海/杭州)

Tencent

Software and Technology Jobs

混元大模型推理研发专家(深圳/北京/上海/杭州)

full-timePosted: Nov 19, 2025

Job Description

混元大模型推理研发专家(深圳/北京/上海/杭州)

📋 Job Overview

The role involves leading the architecture design and implementation of end-to-end inference systems for deep learning algorithms in the Deep Synergy Algorithm Team, focusing on high throughput and low latency for large models. Responsibilities include deep performance analysis of the full inference chain, optimization through operator tuning, quantization, and resource scheduling to maximize throughput while controlling costs. Additionally, it entails optimizing the underlying inference framework, enhancing modules like dynamic batching and caching, and building engineering capabilities for usability and debuggability to support stable large-scale inference services.

📍 Location: Shenzhen, China

🏢 Business Unit: TEG

📄 Full Description

1.深度协同算法团队,主导深度学习算法端到端推理系统的架构设计与落地实践,聚焦高吞吐、低延时核心目标,攻克大模型推理工程化落地关键技术瓶颈;
2.针对大模型推理全链路进行性能瓶颈深度剖析,通过算子优化、量化策略、资源调度等手段实现推理吞吐最大化;建立性能 - 成本评估体系,制定资源利用率极致优化方案,实现推理成本可控化;
3.主导大模型推理框架底层架构优化,完善框架功能模块(如动态批处理、推理缓存、容错机制);构建工程化能力体系,提升框架易用性(API 设计、配置化能力)与可调试性(日志系统、性能埋点、调试工具链),支撑大规模推理服务稳定迭代。

🎯 Key Responsibilities

  • 主导深度学习算法端到端推理系统的架构设计与落地实践,聚焦高吞吐、低延时核心目标,攻克大模型推理工程化落地关键技术瓶颈
  • 针对大模型推理全链路进行性能瓶颈深度剖析,通过算子优化、量化策略、资源调度等手段实现推理吞吐最大化;建立性能 - 成本评估体系,制定资源利用率极致优化方案,实现推理成本可控化
  • 主导大模型推理框架底层架构优化,完善框架功能模块(如动态批处理、推理缓存、容错机制);构建工程化能力体系,提升框架易用性(API 设计、配置化能力)与可调试性(日志系统、性能埋点、调试工具链),支撑大规模推理服务稳定迭代

Locations

  • Shenzhen, China

Salary

Estimated Salary Rangemedium confidence

800,000 - 1,500,000 CNY / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Responsibilities

  • 主导深度学习算法端到端推理系统的架构设计与落地实践,聚焦高吞吐、低延时核心目标,攻克大模型推理工程化落地关键技术瓶颈
  • 针对大模型推理全链路进行性能瓶颈深度剖析,通过算子优化、量化策略、资源调度等手段实现推理吞吐最大化;建立性能 - 成本评估体系,制定资源利用率极致优化方案,实现推理成本可控化
  • 主导大模型推理框架底层架构优化,完善框架功能模块(如动态批处理、推理缓存、容错机制);构建工程化能力体系,提升框架易用性(API 设计、配置化能力)与可调试性(日志系统、性能埋点、调试工具链),支撑大规模推理服务稳定迭代

Target Your Resume for "混元大模型推理研发专家(深圳/北京/上海/杭州)" , Tencent

Get personalized recommendations to optimize your resume specifically for 混元大模型推理研发专家(深圳/北京/上海/杭州). Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "混元大模型推理研发专家(深圳/北京/上海/杭州)" , Tencent

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

TencentShenzhenChinaTEGTEG

Answer 10 quick questions to check your fit for 混元大模型推理研发专家(深圳/北京/上海/杭州) @ Tencent.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.

Tencent logo

混元大模型推理研发专家(深圳/北京/上海/杭州)

Tencent

Software and Technology Jobs

混元大模型推理研发专家(深圳/北京/上海/杭州)

full-timePosted: Nov 19, 2025

Job Description

混元大模型推理研发专家(深圳/北京/上海/杭州)

📋 Job Overview

The role involves leading the architecture design and implementation of end-to-end inference systems for deep learning algorithms in the Deep Synergy Algorithm Team, focusing on high throughput and low latency for large models. Responsibilities include deep performance analysis of the full inference chain, optimization through operator tuning, quantization, and resource scheduling to maximize throughput while controlling costs. Additionally, it entails optimizing the underlying inference framework, enhancing modules like dynamic batching and caching, and building engineering capabilities for usability and debuggability to support stable large-scale inference services.

📍 Location: Shenzhen, China

🏢 Business Unit: TEG

📄 Full Description

1.深度协同算法团队,主导深度学习算法端到端推理系统的架构设计与落地实践,聚焦高吞吐、低延时核心目标,攻克大模型推理工程化落地关键技术瓶颈;
2.针对大模型推理全链路进行性能瓶颈深度剖析,通过算子优化、量化策略、资源调度等手段实现推理吞吐最大化;建立性能 - 成本评估体系,制定资源利用率极致优化方案,实现推理成本可控化;
3.主导大模型推理框架底层架构优化,完善框架功能模块(如动态批处理、推理缓存、容错机制);构建工程化能力体系,提升框架易用性(API 设计、配置化能力)与可调试性(日志系统、性能埋点、调试工具链),支撑大规模推理服务稳定迭代。

🎯 Key Responsibilities

  • 主导深度学习算法端到端推理系统的架构设计与落地实践,聚焦高吞吐、低延时核心目标,攻克大模型推理工程化落地关键技术瓶颈
  • 针对大模型推理全链路进行性能瓶颈深度剖析,通过算子优化、量化策略、资源调度等手段实现推理吞吐最大化;建立性能 - 成本评估体系,制定资源利用率极致优化方案,实现推理成本可控化
  • 主导大模型推理框架底层架构优化,完善框架功能模块(如动态批处理、推理缓存、容错机制);构建工程化能力体系,提升框架易用性(API 设计、配置化能力)与可调试性(日志系统、性能埋点、调试工具链),支撑大规模推理服务稳定迭代

Locations

  • Shenzhen, China

Salary

Estimated Salary Rangemedium confidence

800,000 - 1,500,000 CNY / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Responsibilities

  • 主导深度学习算法端到端推理系统的架构设计与落地实践,聚焦高吞吐、低延时核心目标,攻克大模型推理工程化落地关键技术瓶颈
  • 针对大模型推理全链路进行性能瓶颈深度剖析,通过算子优化、量化策略、资源调度等手段实现推理吞吐最大化;建立性能 - 成本评估体系,制定资源利用率极致优化方案,实现推理成本可控化
  • 主导大模型推理框架底层架构优化,完善框架功能模块(如动态批处理、推理缓存、容错机制);构建工程化能力体系,提升框架易用性(API 设计、配置化能力)与可调试性(日志系统、性能埋点、调试工具链),支撑大规模推理服务稳定迭代

Target Your Resume for "混元大模型推理研发专家(深圳/北京/上海/杭州)" , Tencent

Get personalized recommendations to optimize your resume specifically for 混元大模型推理研发专家(深圳/北京/上海/杭州). Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "混元大模型推理研发专家(深圳/北京/上海/杭州)" , Tencent

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

TencentShenzhenChinaTEGTEG

Answer 10 quick questions to check your fit for 混元大模型推理研发专家(深圳/北京/上海/杭州) @ Tencent.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.