Resume and JobRESUME AND JOB
NVIDIA logo

Software Engineer, LLM Inference

NVIDIA

Software and Technology Jobs

Software Engineer, LLM Inference

full-timePosted: Sep 29, 2025

Job Description

NVIDIA has continuously reinvented itself over two decades. NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world.This is our life’s work — to amplify human imagination and intelligence. AI becomes more and more important in Auto Driving and AI City. NVIDIA is at the forefront of the Auto Driving and AI City revolution and providing powerful solutions for them. All these solutions are based on GPU-accelerated libraries, such as CUDA, TensorRT and V/LLM inference framework etc. Now, we are now looking for an LLM inference framework developer engineer based in Shanghai.What you’ll be doing:Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performancePerformance analysis, optimization and tuningClosely follow academic developments in the field of artificial intelligence and feature updateCollaborate across the company to guide the direction of machine learning inferencing, working with software, research and product teamsWhat we need to see:Masters or higher degree in Computer Engineering, Computer Science, Applied Mathematics or related computing focused degree (or equivalent experience)3+ years of relevant software development experience.Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.Strong curiosity about artificial intelligence, awareness of the latest developments in deep learning like LLMs, generative modelsExperience working with deep learning frameworks like PyTorchProactive and able to work without supervisionExcellent written and oral communication skills in EnglishStrong customer communication skills, powerfully motivated to provide highly responsive support as needed#deeplearning

Locations

  • Shanghai, China

Salary

Estimated Salary Rangemedium confidence

3,000,000 - 8,000,000 INR / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • C/C++ programmingintermediate
  • software designintermediate
  • debuggingintermediate
  • performance analysisintermediate
  • test designintermediate
  • artificial intelligenceintermediate
  • deep learningintermediate
  • LLMsintermediate
  • generative modelsintermediate
  • PyTorchintermediate
  • CUDAintermediate
  • TensorRTintermediate
  • V/LLM inference frameworkintermediate
  • performance optimizationintermediate
  • performance tuningintermediate
  • written communicationintermediate
  • oral communicationintermediate
  • customer communicationintermediate

Target Your Resume for "Software Engineer, LLM Inference" , NVIDIA

Get personalized recommendations to optimize your resume specifically for Software Engineer, LLM Inference. Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Software Engineer, LLM Inference" , NVIDIA

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

China

Answer 10 quick questions to check your fit for Software Engineer, LLM Inference @ NVIDIA.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.

NVIDIA logo

Software Engineer, LLM Inference

NVIDIA

Software and Technology Jobs

Software Engineer, LLM Inference

full-timePosted: Sep 29, 2025

Job Description

NVIDIA has continuously reinvented itself over two decades. NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world.This is our life’s work — to amplify human imagination and intelligence. AI becomes more and more important in Auto Driving and AI City. NVIDIA is at the forefront of the Auto Driving and AI City revolution and providing powerful solutions for them. All these solutions are based on GPU-accelerated libraries, such as CUDA, TensorRT and V/LLM inference framework etc. Now, we are now looking for an LLM inference framework developer engineer based in Shanghai.What you’ll be doing:Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performancePerformance analysis, optimization and tuningClosely follow academic developments in the field of artificial intelligence and feature updateCollaborate across the company to guide the direction of machine learning inferencing, working with software, research and product teamsWhat we need to see:Masters or higher degree in Computer Engineering, Computer Science, Applied Mathematics or related computing focused degree (or equivalent experience)3+ years of relevant software development experience.Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.Strong curiosity about artificial intelligence, awareness of the latest developments in deep learning like LLMs, generative modelsExperience working with deep learning frameworks like PyTorchProactive and able to work without supervisionExcellent written and oral communication skills in EnglishStrong customer communication skills, powerfully motivated to provide highly responsive support as needed#deeplearning

Locations

  • Shanghai, China

Salary

Estimated Salary Rangemedium confidence

3,000,000 - 8,000,000 INR / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • C/C++ programmingintermediate
  • software designintermediate
  • debuggingintermediate
  • performance analysisintermediate
  • test designintermediate
  • artificial intelligenceintermediate
  • deep learningintermediate
  • LLMsintermediate
  • generative modelsintermediate
  • PyTorchintermediate
  • CUDAintermediate
  • TensorRTintermediate
  • V/LLM inference frameworkintermediate
  • performance optimizationintermediate
  • performance tuningintermediate
  • written communicationintermediate
  • oral communicationintermediate
  • customer communicationintermediate

Target Your Resume for "Software Engineer, LLM Inference" , NVIDIA

Get personalized recommendations to optimize your resume specifically for Software Engineer, LLM Inference. Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Software Engineer, LLM Inference" , NVIDIA

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

China

Answer 10 quick questions to check your fit for Software Engineer, LLM Inference @ NVIDIA.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.