Research Intern - Multimodal Foundation Model for Vision

Sony

Full-time | Posted: Dec 12, 2025

Job Description

Job Overview

The Research Intern position at Sony focuses on developing multimodal foundation models for vision applications, integrating computer vision, natural language processing, and other modalities to advance AI research. Interns will collaborate with a team of researchers to design, implement, and evaluate innovative models that push the boundaries of multimodal AI. This role offers hands-on experience in cutting-edge research within Sony's AI division, contributing to real-world applications in imaging and entertainment technologies.

Location: Schlieren, Switzerland

Company: Sony Europe Limited, Switzerland Branch

Posted: 30+ days ago

Key Responsibilities

  • Assist in designing and developing multimodal foundation models for vision tasks
  • Conduct experiments to evaluate model performance across various modalities
  • Collaborate with research team to integrate models into Sony's product pipelines
  • Analyze and interpret results to improve model efficiency and accuracy
  • Contribute to research papers and internal documentation

Required Qualifications

  • Currently pursuing a PhD or Master's degree in Computer Science, Electrical Engineering, or a related field
  • Strong foundation in machine learning and deep learning principles
  • Experience with Python programming and machine learning frameworks
  • Familiarity with computer vision and multimodal data processing

Preferred Qualifications

  • Prior research experience in foundation models or large-scale AI systems
  • Publications in top conferences such as CVPR, NeurIPS, or ICML
  • Knowledge of generative AI techniques and transformer architectures
  • Experience with distributed training and large dataset handling

Required Skills

  • Python programming
  • Machine learning frameworks (e.g., PyTorch, TensorFlow)
  • Deep learning techniques
  • Computer vision algorithms
  • Multimodal data integration
  • Research methodology and experimentation
  • Strong analytical and problem-solving skills
  • Effective communication and teamwork

Benefits & Perks

  • Competitive hourly pay for interns
  • Flexible work hours and remote options
  • Access to Sony's state-of-the-art research facilities and computing resources
  • Mentorship from leading AI researchers
  • Networking opportunities within Sony's innovation ecosystem
  • Potential for full-time offers upon successful completion

Locations

  • Schlieren, Switzerland

Salary

Estimated Salary Range (medium confidence)

60,000 - 85,000 USD per year

Source: AI estimate

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • Python programming (intermediate)
  • Machine learning frameworks (e.g., PyTorch, TensorFlow) (intermediate)
  • Deep learning techniques (intermediate)
  • Computer vision algorithms (intermediate)
  • Multimodal data integration (intermediate)
  • Research methodology and experimentation (intermediate)
  • Strong analytical and problem-solving skills (intermediate)
  • Effective communication and teamwork (intermediate)

Tags & Categories

Switzerland, Schlieren, Technology, Electronics
