Resume and JobRESUME AND JOB
Tencent logo

混元数据算法工程师(北京)

Tencent

Software and Technology Jobs

混元数据算法工程师(北京)

full-timePosted: Nov 23, 2025

Job Description

混元数据算法工程师(北京)

📋 Job Overview

The Data Algorithm Engineer position at Tencent focuses on developing algorithms for understanding and processing massive multimodal data including text, images, videos, audio, and 3D content. Responsibilities include building data pipelines for collection, cleaning, annotation, and quality assessment, while collaborating with model teams to automate processes supporting continuous model iteration. The role also involves detailed data analysis to identify and resolve issues like sample deficiencies and imbalances, driving improvements in data quality, diversity, and overall large model performance.

📍 Location: Shenzhen, China

🏢 Business Unit: TEG

📄 Full Description

1.数据特征算法:负责海量文本&多模态数据(图像,视频,音频,3D)的内容理解(如分类标签体系、embedding表征、Caption生成等),质量检测(低质识别检测、优质美学评价等),去重/聚类分析,数据合成等算法;
2.数据pipeline建设:负责数据采集、筛选清洗、标注与质量评估pipeline的建设。与模型业务团队紧密配合,充分分析挖掘数据资源,建立自动化数据处理流程与机制,支持模型持续迭代;
3.数据实验分析:对模型训练数据进行详细分析,建立科学数据实验机制,识别样本不足、质量问题、配比不均衡等潜在问题,驱动数据优化提升数据覆盖、质量、多样性需求,最终带来大模型生成效果的持续提升。

🎯 Key Responsibilities

  • Develop algorithms for content understanding of massive text and multimodal data (images, videos, audio, 3D), including classification labeling systems, embedding representations, caption generation, quality detection (low-quality identification, aesthetic evaluation), deduplication/clustering analysis, and data synthesis.
  • Build data pipelines for data collection, screening and cleaning, annotation, and quality assessment. Collaborate closely with model business teams to analyze and mine data resources, establish automated data processing workflows and mechanisms to support continuous model iteration.
  • Conduct detailed analysis of model training data, establish scientific data experimentation mechanisms, identify potential issues such as sample shortages, quality problems, and imbalanced ratios, drive data optimization to enhance coverage, quality, and diversity, ultimately improving large model generation effects.

Locations

  • Shenzhen, China

Salary

Estimated Salary Rangemedium confidence

300,000 - 600,000 CNY / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Responsibilities

  • Develop algorithms for content understanding of massive text and multimodal data (images, videos, audio, 3D), including classification labeling systems, embedding representations, caption generation, quality detection (low-quality identification, aesthetic evaluation), deduplication/clustering analysis, and data synthesis.
  • Build data pipelines for data collection, screening and cleaning, annotation, and quality assessment. Collaborate closely with model business teams to analyze and mine data resources, establish automated data processing workflows and mechanisms to support continuous model iteration.
  • Conduct detailed analysis of model training data, establish scientific data experimentation mechanisms, identify potential issues such as sample shortages, quality problems, and imbalanced ratios, drive data optimization to enhance coverage, quality, and diversity, ultimately improving large model generation effects.

Target Your Resume for "混元数据算法工程师(北京)" , Tencent

Get personalized recommendations to optimize your resume specifically for 混元数据算法工程师(北京). Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "混元数据算法工程师(北京)" , Tencent

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

TencentShenzhenChinaTEGTEG

Answer 10 quick questions to check your fit for 混元数据算法工程师(北京) @ Tencent.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.

Tencent logo

混元数据算法工程师(北京)

Tencent

Software and Technology Jobs

混元数据算法工程师(北京)

full-timePosted: Nov 23, 2025

Job Description

混元数据算法工程师(北京)

📋 Job Overview

The Data Algorithm Engineer position at Tencent focuses on developing algorithms for understanding and processing massive multimodal data including text, images, videos, audio, and 3D content. Responsibilities include building data pipelines for collection, cleaning, annotation, and quality assessment, while collaborating with model teams to automate processes supporting continuous model iteration. The role also involves detailed data analysis to identify and resolve issues like sample deficiencies and imbalances, driving improvements in data quality, diversity, and overall large model performance.

📍 Location: Shenzhen, China

🏢 Business Unit: TEG

📄 Full Description

1.数据特征算法:负责海量文本&多模态数据(图像,视频,音频,3D)的内容理解(如分类标签体系、embedding表征、Caption生成等),质量检测(低质识别检测、优质美学评价等),去重/聚类分析,数据合成等算法;
2.数据pipeline建设:负责数据采集、筛选清洗、标注与质量评估pipeline的建设。与模型业务团队紧密配合,充分分析挖掘数据资源,建立自动化数据处理流程与机制,支持模型持续迭代;
3.数据实验分析:对模型训练数据进行详细分析,建立科学数据实验机制,识别样本不足、质量问题、配比不均衡等潜在问题,驱动数据优化提升数据覆盖、质量、多样性需求,最终带来大模型生成效果的持续提升。

🎯 Key Responsibilities

  • Develop algorithms for content understanding of massive text and multimodal data (images, videos, audio, 3D), including classification labeling systems, embedding representations, caption generation, quality detection (low-quality identification, aesthetic evaluation), deduplication/clustering analysis, and data synthesis.
  • Build data pipelines for data collection, screening and cleaning, annotation, and quality assessment. Collaborate closely with model business teams to analyze and mine data resources, establish automated data processing workflows and mechanisms to support continuous model iteration.
  • Conduct detailed analysis of model training data, establish scientific data experimentation mechanisms, identify potential issues such as sample shortages, quality problems, and imbalanced ratios, drive data optimization to enhance coverage, quality, and diversity, ultimately improving large model generation effects.

Locations

  • Shenzhen, China

Salary

Estimated Salary Rangemedium confidence

300,000 - 600,000 CNY / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Responsibilities

  • Develop algorithms for content understanding of massive text and multimodal data (images, videos, audio, 3D), including classification labeling systems, embedding representations, caption generation, quality detection (low-quality identification, aesthetic evaluation), deduplication/clustering analysis, and data synthesis.
  • Build data pipelines for data collection, screening and cleaning, annotation, and quality assessment. Collaborate closely with model business teams to analyze and mine data resources, establish automated data processing workflows and mechanisms to support continuous model iteration.
  • Conduct detailed analysis of model training data, establish scientific data experimentation mechanisms, identify potential issues such as sample shortages, quality problems, and imbalanced ratios, drive data optimization to enhance coverage, quality, and diversity, ultimately improving large model generation effects.

Target Your Resume for "混元数据算法工程师(北京)" , Tencent

Get personalized recommendations to optimize your resume specifically for 混元数据算法工程师(北京). Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "混元数据算法工程师(北京)" , Tencent

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

TencentShenzhenChinaTEGTEG

Answer 10 quick questions to check your fit for 混元数据算法工程师(北京) @ Tencent.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.