WeCom - Machine Learning Platform Scheduling Engineer (Beijing/Chengdu)

Tencent

Software and Technology Jobs

Full-time | Posted: Nov 18, 2025

Job Description

📋 Job Overview

The WeCom Machine Learning Platform Scheduling Engineer leads global resource scheduling for large-scale GPU clusters, improving resource utilization and keeping offline and online tasks running efficiently and stably. The role coordinates high-speed RDMA networks, distributed storage, and compute resources to remove performance bottlenecks in large-scale training tasks. It also involves building high-availability scheduling frameworks on Kubernetes and Docker that support distributed training, and exploring areas such as hybrid cloud and heterogeneous computing to advance the platform.

📍 Location: Guangzhou, China

🏢 Business Unit: WXG

📄 Full Description

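1. Lead global resource scheduling for large-scale GPU clusters, significantly improving resource utilization through fine-grained management and optimization strategies, and ensuring that offline and online tasks run efficiently and stably;
2. Optimize in depth the coordinated scheduling of high-speed RDMA networks, distributed storage, and compute resources to resolve performance bottlenecks in large-scale training tasks and improve overall computing efficiency;
3. Build a high-availability scheduling framework on cloud-native technologies such as Kubernetes and Docker, with full support for distributed training frameworks, providing task orchestration, disaster recovery, and workload co-location capabilities; work in depth on the development of the K8s scheduler, CSI plugins, and CRDs to bring large-scale training and inference technology into production;
4. Actively explore frontier directions such as hybrid cloud, virtualization, and heterogeneous computing, continuously driving upgrades and innovation in technology and platform capabilities.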

Salary

Estimated Salary Range (medium confidence)

300,000 - 600,000 CNY / year

Source: AI estimate

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

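  • Large-scale GPU cluster resource scheduling (intermediate)
  • RDMA high-speed network optimization (intermediate)
  • Coordination of distributed storage and compute resources (intermediate)
  • Kubernetes scheduler development (intermediate)
  • Docker and cloud-native technologies (intermediate)
  • CSI plugin and CRD development (intermediate)
  • Distributed training framework support (intermediate)
  • Hybrid cloud, virtualization, and heterogeneous computing (intermediate)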

Tags & Categories

Tencent, Guangzhou, China, WXG
