Hunyuan Speech and Audio Understanding Researcher (Speech Understanding Track) (Beijing/Shenzhen/Shanghai)

Tencent

Full-time

Posted: Oct 26, 2025

Job Description

📋 Job Overview

We are building a large-scale multimodal model system that natively supports vision, audio, and text, enabling AI systems to perceive and understand the physical world comprehensively. You will join the core speech and audio research team and work on key research tasks: developing general-purpose end-to-end large speech models, advancing speech representation learning, exploring alignment and fusion mechanisms in multimodal models, and building high-quality datasets.

📍 Location: Shenzhen, China

🏢 Business Unit: TEG

📄 Full Description

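We are building a large-scale multimodal model system that natively supports vision, audio, and text, so that AI systems can comprehensively perceive and understand the physical world. You will join the core research team for speech and audio and work on the following key research tasks:
1. Develop general-purpose end-to-end large speech models, covering multilingual speech recognition, speech translation, paralinguistic information understanding, audio understanding, and related tasks;
2. Advance research on speech representation learning and speech encoding/decoding architectures, building unified acoustic representations suited to multi-task, multimodal settings;
3. Explore mechanisms for aligning and fusing audio and speech representations in multimodal large models, with joint modeling alongside images and text;
4. Build and maintain high-quality multimodal speech datasets, along with automatic annotation and data synthesis techniques.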

🎯 Key Responsibilities

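  • Develop general-purpose end-to-end large speech models, covering multilingual speech recognition, speech translation, paralinguistic information understanding, audio understanding, and related tasks
  • Advance research on speech representation learning and speech encoding/decoding architectures, building unified acoustic representations suited to multi-task, multimodal settings
  • Explore mechanisms for aligning and fusing audio and speech representations in multimodal large models, with joint modeling alongside images and text
  • Build and maintain high-quality multimodal speech datasets, along with automatic annotation and data synthesis techniques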

Locations

  • Shenzhen, China

Salary

Estimated Salary Range (medium confidence)

500,000 - 1,200,000 CNY per year

Source: AI estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.
