RESUME AND JOB

Multimodal Large Model Algorithm Engineer 106501

Tencent

Multimodal Large Model Algorithm Engineer 106501

Tencent

internshipPosted: Nov 18, 2025

Job Description

Multimodal Large Model Algorithm Engineer 106501

📋 Job Overview

The Multimodal Large Model Algorithm Engineer role at Tencent's Technology Engineering Group involves researching and developing advanced multimodal large model technologies, focusing on cross-modal alignment and understanding to create industry-leading models. The position requires tracking state-of-the-art algorithms, participating in model design, training, optimization, and evaluation, and applying these innovations to business scenarios. This role is based in Singapore and supports Tencent's infrastructure R&D through collaborative efforts.

📍 Location: CapitaSky, Singapore

🏢 Business Unit: TEG

📄 Full Description

Business Unit
Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers, TEG provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia,TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.

What the Role Entails
Conduct research and development of multimodal large model technologies, including cross-modal alignment and multimodal understanding tasks, to build industry-leading multimodal large models.
Continuously track state-of-the-art algorithms in multimodal large models, participate in the design, training, optimization, and evaluation of these models, and promote their application in business scenarios.

Who We Look For
Master’s degree or higher in Computer Science, Machine Learning, Artificial Intelligence, Applied Mathematics, or related fields.
Solid research background in multimodal understanding (e.g., natural language processing, computer vision, speech understanding/generation), with familiarity in mainstream models and algorithms such as CLIP, LLaVA, VALL-E, etc..
Proficiency in deep learning frameworks like TensorFlow or PyTorch; knowledge of distributed training frameworks (e.g., DeepSpeed, Megatron-LM) and practical experience in multi-node/multi-GPU distributed training.
Strong engineering skills with proficiency in at least one programming language: C/C++, Java, or Python.
Publication record in top-tier conferences (e.g., ICLR, NeurIPS, CVPR, ICCV, ECCV, ACL, EMNLP) is preferred.
Excellent learning ability, technical curiosity, and strong teamwork and communication skills.

Equal Employment Opportunity at Tencent
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
Work Location: Singapore-CapitaSky

🎯 Key Responsibilities

Conduct research and development of multimodal large model technologies, including cross-modal alignment and multimodal understanding tasks, to build industry-leading multimodal large models
Continuously track state-of-the-art algorithms in multimodal large models
Participate in the design, training, optimization, and evaluation of these models
Promote their application in business scenarios

✅ Required Qualifications

Master’s degree or higher in Computer Science, Machine Learning, Artificial Intelligence, Applied Mathematics, or related fields
Solid research background in multimodal understanding (e.g., natural language processing, computer vision, speech understanding/generation), with familiarity in mainstream models and algorithms such as CLIP, LLaVA, VALL-E, etc.
Proficiency in deep learning frameworks like TensorFlow or PyTorch; knowledge of distributed training frameworks (e.g., DeepSpeed, Megatron-LM) and practical experience in multi-node/multi-GPU distributed training
Strong engineering skills with proficiency in at least one programming language: C/C++, Java, or Python

⭐ Preferred Qualifications

Publication record in top-tier conferences (e.g., ICLR, NeurIPS, CVPR, ICCV, ECCV, ACL, EMNLP) is preferred

🛠️ Required Skills

Solid research background in multimodal understanding
Familiarity with mainstream models and algorithms such as CLIP, LLaVA, VALL-E
Proficiency in deep learning frameworks like TensorFlow or PyTorch
Knowledge of distributed training frameworks (e.g., DeepSpeed, Megatron-LM)
Practical experience in multi-node/multi-GPU distributed training
Strong engineering skills
Proficiency in at least one programming language: C/C++, Java, or Python
Excellent learning ability
Technical curiosity
Strong teamwork and communication skills

🎁 Benefits

Equal Employment Opportunity: Diverse voices fuel innovation and better serve users and the community
Supportive environment where every employee feels inspired to achieve individual and common goals
Work Location: Singapore-CapitaSky

Locations

CapitaSky, Singapore

Salary

Estimated Salary Rangemedium confidence

120,000 - 180,000 SGD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

Solid research background in multimodal understandingintermediate
Familiarity with mainstream models and algorithms such as CLIP, LLaVA, VALL-Eintermediate
Proficiency in deep learning frameworks like TensorFlow or PyTorchintermediate
Knowledge of distributed training frameworks (e.g., DeepSpeed, Megatron-LM)intermediate
Practical experience in multi-node/multi-GPU distributed trainingintermediate
Strong engineering skillsintermediate
Proficiency in at least one programming language: C/C++, Java, or Pythonintermediate
Excellent learning abilityintermediate
Technical curiosityintermediate
Strong teamwork and communication skillsintermediate

Required Qualifications

Master’s degree or higher in Computer Science, Machine Learning, Artificial Intelligence, Applied Mathematics, or related fields (experience)
Solid research background in multimodal understanding (e.g., natural language processing, computer vision, speech understanding/generation), with familiarity in mainstream models and algorithms such as CLIP, LLaVA, VALL-E, etc. (experience)
Proficiency in deep learning frameworks like TensorFlow or PyTorch; knowledge of distributed training frameworks (e.g., DeepSpeed, Megatron-LM) and practical experience in multi-node/multi-GPU distributed training (experience)
Strong engineering skills with proficiency in at least one programming language: C/C++, Java, or Python (experience)

Preferred Qualifications

Publication record in top-tier conferences (e.g., ICLR, NeurIPS, CVPR, ICCV, ECCV, ACL, EMNLP) is preferred (experience)

Responsibilities

Conduct research and development of multimodal large model technologies, including cross-modal alignment and multimodal understanding tasks, to build industry-leading multimodal large models
Continuously track state-of-the-art algorithms in multimodal large models
Participate in the design, training, optimization, and evaluation of these models
Promote their application in business scenarios

Benefits

general: Equal Employment Opportunity: Diverse voices fuel innovation and better serve users and the community
general: Supportive environment where every employee feels inspired to achieve individual and common goals
general: Work Location: Singapore-CapitaSky

Target Your Resume for "Multimodal Large Model Algorithm Engineer 106501" , Tencent

Get personalized recommendations to optimize your resume specifically for Multimodal Large Model Algorithm Engineer 106501. Takes only 15 seconds!

AI-powered keyword optimization

Skills matching & gap analysis

Experience alignment suggestions

Check Your ATS Score for "Multimodal Large Model Algorithm Engineer 106501" , Tencent

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check

Keyword optimization analysis

Skill matching & gap identification

Format & readability score

Tags & Categories

TencentCapitaSkySingaporeTEGTEG

Answer 10 quick questions to check your fit for Multimodal Large Model Algorithm Engineer 106501 @ Tencent.

10 Questions

~2 Minutes

Instant Score

Related Books and Jobs

No related jobs found at the moment.

Privacy Terms & Conditions About Us Refund Policy Recruiter Login Sitemap

Multimodal Large Model Algorithm Engineer 106501

Tencent

Multimodal Large Model Algorithm Engineer 106501

Tencent

internshipPosted: Nov 18, 2025

Job Description

Multimodal Large Model Algorithm Engineer 106501

📋 Job Overview

📍 Location: CapitaSky, Singapore

🏢 Business Unit: TEG

📄 Full Description

🎯 Key Responsibilities

Conduct research and development of multimodal large model technologies, including cross-modal alignment and multimodal understanding tasks, to build industry-leading multimodal large models
Continuously track state-of-the-art algorithms in multimodal large models
Participate in the design, training, optimization, and evaluation of these models
Promote their application in business scenarios

✅ Required Qualifications

Master’s degree or higher in Computer Science, Machine Learning, Artificial Intelligence, Applied Mathematics, or related fields
Solid research background in multimodal understanding (e.g., natural language processing, computer vision, speech understanding/generation), with familiarity in mainstream models and algorithms such as CLIP, LLaVA, VALL-E, etc.
Proficiency in deep learning frameworks like TensorFlow or PyTorch; knowledge of distributed training frameworks (e.g., DeepSpeed, Megatron-LM) and practical experience in multi-node/multi-GPU distributed training
Strong engineering skills with proficiency in at least one programming language: C/C++, Java, or Python

⭐ Preferred Qualifications

Publication record in top-tier conferences (e.g., ICLR, NeurIPS, CVPR, ICCV, ECCV, ACL, EMNLP) is preferred

🛠️ Required Skills

Solid research background in multimodal understanding
Familiarity with mainstream models and algorithms such as CLIP, LLaVA, VALL-E
Proficiency in deep learning frameworks like TensorFlow or PyTorch
Knowledge of distributed training frameworks (e.g., DeepSpeed, Megatron-LM)
Practical experience in multi-node/multi-GPU distributed training
Strong engineering skills
Proficiency in at least one programming language: C/C++, Java, or Python
Excellent learning ability
Technical curiosity
Strong teamwork and communication skills

🎁 Benefits

Equal Employment Opportunity: Diverse voices fuel innovation and better serve users and the community
Supportive environment where every employee feels inspired to achieve individual and common goals
Work Location: Singapore-CapitaSky

Locations

CapitaSky, Singapore

Salary

Estimated Salary Rangemedium confidence

120,000 - 180,000 SGD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

Solid research background in multimodal understandingintermediate
Familiarity with mainstream models and algorithms such as CLIP, LLaVA, VALL-Eintermediate
Proficiency in deep learning frameworks like TensorFlow or PyTorchintermediate
Knowledge of distributed training frameworks (e.g., DeepSpeed, Megatron-LM)intermediate
Practical experience in multi-node/multi-GPU distributed trainingintermediate
Strong engineering skillsintermediate
Proficiency in at least one programming language: C/C++, Java, or Pythonintermediate
Excellent learning abilityintermediate
Technical curiosityintermediate
Strong teamwork and communication skillsintermediate

Required Qualifications

Master’s degree or higher in Computer Science, Machine Learning, Artificial Intelligence, Applied Mathematics, or related fields (experience)
Solid research background in multimodal understanding (e.g., natural language processing, computer vision, speech understanding/generation), with familiarity in mainstream models and algorithms such as CLIP, LLaVA, VALL-E, etc. (experience)
Proficiency in deep learning frameworks like TensorFlow or PyTorch; knowledge of distributed training frameworks (e.g., DeepSpeed, Megatron-LM) and practical experience in multi-node/multi-GPU distributed training (experience)
Strong engineering skills with proficiency in at least one programming language: C/C++, Java, or Python (experience)

Preferred Qualifications

Publication record in top-tier conferences (e.g., ICLR, NeurIPS, CVPR, ICCV, ECCV, ACL, EMNLP) is preferred (experience)

Responsibilities

Conduct research and development of multimodal large model technologies, including cross-modal alignment and multimodal understanding tasks, to build industry-leading multimodal large models
Continuously track state-of-the-art algorithms in multimodal large models
Participate in the design, training, optimization, and evaluation of these models
Promote their application in business scenarios

Benefits

general: Equal Employment Opportunity: Diverse voices fuel innovation and better serve users and the community
general: Supportive environment where every employee feels inspired to achieve individual and common goals
general: Work Location: Singapore-CapitaSky

Target Your Resume for "Multimodal Large Model Algorithm Engineer 106501" , Tencent

Get personalized recommendations to optimize your resume specifically for Multimodal Large Model Algorithm Engineer 106501. Takes only 15 seconds!

AI-powered keyword optimization

Skills matching & gap analysis

Experience alignment suggestions

Check Your ATS Score for "Multimodal Large Model Algorithm Engineer 106501" , Tencent

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check

Keyword optimization analysis

Skill matching & gap identification

Format & readability score

Tags & Categories

TencentCapitaSkySingaporeTEGTEG

Answer 10 quick questions to check your fit for Multimodal Large Model Algorithm Engineer 106501 @ Tencent.

10 Questions

~2 Minutes

Instant Score

Related Books and Jobs

No related jobs found at the moment.

Privacy Terms & Conditions About Us Refund Policy Recruiter Login Sitemap