Resume and JobRESUME AND JOB
NVIDIA logo

Senior Software Engineer, Deep Learning Inference

NVIDIA

Software and Technology Jobs

Senior Software Engineer, Deep Learning Inference

full-timePosted: Jul 6, 2025

Job Description

NVIDIA has been at the forefront of the deep learning revolution, pioneering innovations that have transformed the entire field. As the leading provider of GPUs and AI computing platforms, NVIDIA has empowered researchers and engineers worldwide to accelerate breakthroughs in artificial intelligence.We seek a versatile Senior Software Engineer who is passionate about performance optimization and generative AI. Our team builds software solutions that enable efficient inference on the latest and greatest generative AI models. We tackle problems on all levels of the stack—from server-level request batching to GPU kernel fusion—and collaborate with teams across diverse disciplines to push Nvidia's hardware to its full potential.What you’ll be doing:Cooperate with research teams to onboard new LLMs and VLMs into Nvidia's opensource AI runtimesOptimize inference workloads using sophisticated profiling and simulation toolsBuild SOLID, extendable inference software systems, and refine robust APIsImplement and debug low-level GPU code to harness the latest HW featuresOwn end-to-end inference acceleration features and work with teams around the world to deliver production-grade productsWhat we need to see:B.Sc., M.Sc. or equivalent experience in Computer Science or Computer Engineering5+ years of relevant hands-on software engineering experienceProfound knowledge of software design principlesStrong proficiency in at least one system and one scripting languageStrong grasp of machine learning conceptsPeople person with excellent communication skills that enjoys collaboration and teamwork.Ways to stand out from the crowd:Familiarity with Nvidia's DL software stack, e.g. Triton Inference Server, TensorRT-LLM, and Model OptimizerProven track record of performance modeling, profiling, debugging, and development in a performance-critical setting with Nvidia's accelerators.Familiarity with LLM quantization, fine-tunning, and caching algorithmsProficiency in GPU kernel programming (CUDA or OpenCL)Prior experience working on a large software project with 50+ contributorsNVIDIA is widely considered one of the world’s most desirable employers in the technology field. We have some of the most forward-thinking and hardworking people working for us. If you're creative and autonomous, we want to hear from you! We are committed to fostering a diverse work environment and are proud to be an equal-opportunity employer. We highly value diversity in our current and future employees. We do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

Locations

  • Tel Aviv, Israel

Salary

Estimated Salary Rangemedium confidence

12,000,000 - 24,000,000 INR / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • performance optimizationintermediate
  • generative AIintermediate
  • deep learningintermediate
  • GPUsintermediate
  • AI computing platformsintermediate
  • artificial intelligenceintermediate
  • LLMsintermediate
  • VLMsintermediate
  • opensource AI runtimesintermediate
  • inference workloadsintermediate
  • profilingintermediate
  • simulation toolsintermediate
  • SOLID software designintermediate
  • inference software systemsintermediate
  • robust APIsintermediate
  • low-level GPU codeintermediate
  • GPU kernel fusionintermediate
  • server-level request batchingintermediate
  • end-to-end inference accelerationintermediate
  • software design principlesintermediate
  • system programming languageintermediate
  • scripting languageintermediate
  • machine learning conceptsintermediate
  • communication skillsintermediate
  • collaborationintermediate
  • teamworkintermediate
  • Nvidia's DL software stackintermediate
  • Triton Inference Serverintermediate
  • TensorRT-LLMintermediate
  • Model Optimizerintermediate
  • performance modelingintermediate
  • debuggingintermediate
  • Nvidia's acceleratorsintermediate
  • LLM quantizationintermediate
  • fine-tuningintermediate
  • caching algorithmsintermediate
  • GPU kernel programmingintermediate
  • CUintermediate

Target Your Resume for "Senior Software Engineer, Deep Learning Inference" , NVIDIA

Get personalized recommendations to optimize your resume specifically for Senior Software Engineer, Deep Learning Inference. Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Senior Software Engineer, Deep Learning Inference" , NVIDIA

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

Israel

Answer 10 quick questions to check your fit for Senior Software Engineer, Deep Learning Inference @ NVIDIA.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.

NVIDIA logo

Senior Software Engineer, Deep Learning Inference

NVIDIA

Software and Technology Jobs

Senior Software Engineer, Deep Learning Inference

full-timePosted: Jul 6, 2025

Job Description

NVIDIA has been at the forefront of the deep learning revolution, pioneering innovations that have transformed the entire field. As the leading provider of GPUs and AI computing platforms, NVIDIA has empowered researchers and engineers worldwide to accelerate breakthroughs in artificial intelligence.We seek a versatile Senior Software Engineer who is passionate about performance optimization and generative AI. Our team builds software solutions that enable efficient inference on the latest and greatest generative AI models. We tackle problems on all levels of the stack—from server-level request batching to GPU kernel fusion—and collaborate with teams across diverse disciplines to push Nvidia's hardware to its full potential.What you’ll be doing:Cooperate with research teams to onboard new LLMs and VLMs into Nvidia's opensource AI runtimesOptimize inference workloads using sophisticated profiling and simulation toolsBuild SOLID, extendable inference software systems, and refine robust APIsImplement and debug low-level GPU code to harness the latest HW featuresOwn end-to-end inference acceleration features and work with teams around the world to deliver production-grade productsWhat we need to see:B.Sc., M.Sc. or equivalent experience in Computer Science or Computer Engineering5+ years of relevant hands-on software engineering experienceProfound knowledge of software design principlesStrong proficiency in at least one system and one scripting languageStrong grasp of machine learning conceptsPeople person with excellent communication skills that enjoys collaboration and teamwork.Ways to stand out from the crowd:Familiarity with Nvidia's DL software stack, e.g. Triton Inference Server, TensorRT-LLM, and Model OptimizerProven track record of performance modeling, profiling, debugging, and development in a performance-critical setting with Nvidia's accelerators.Familiarity with LLM quantization, fine-tunning, and caching algorithmsProficiency in GPU kernel programming (CUDA or OpenCL)Prior experience working on a large software project with 50+ contributorsNVIDIA is widely considered one of the world’s most desirable employers in the technology field. We have some of the most forward-thinking and hardworking people working for us. If you're creative and autonomous, we want to hear from you! We are committed to fostering a diverse work environment and are proud to be an equal-opportunity employer. We highly value diversity in our current and future employees. We do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

Locations

  • Tel Aviv, Israel

Salary

Estimated Salary Rangemedium confidence

12,000,000 - 24,000,000 INR / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • performance optimizationintermediate
  • generative AIintermediate
  • deep learningintermediate
  • GPUsintermediate
  • AI computing platformsintermediate
  • artificial intelligenceintermediate
  • LLMsintermediate
  • VLMsintermediate
  • opensource AI runtimesintermediate
  • inference workloadsintermediate
  • profilingintermediate
  • simulation toolsintermediate
  • SOLID software designintermediate
  • inference software systemsintermediate
  • robust APIsintermediate
  • low-level GPU codeintermediate
  • GPU kernel fusionintermediate
  • server-level request batchingintermediate
  • end-to-end inference accelerationintermediate
  • software design principlesintermediate
  • system programming languageintermediate
  • scripting languageintermediate
  • machine learning conceptsintermediate
  • communication skillsintermediate
  • collaborationintermediate
  • teamworkintermediate
  • Nvidia's DL software stackintermediate
  • Triton Inference Serverintermediate
  • TensorRT-LLMintermediate
  • Model Optimizerintermediate
  • performance modelingintermediate
  • debuggingintermediate
  • Nvidia's acceleratorsintermediate
  • LLM quantizationintermediate
  • fine-tuningintermediate
  • caching algorithmsintermediate
  • GPU kernel programmingintermediate
  • CUintermediate

Target Your Resume for "Senior Software Engineer, Deep Learning Inference" , NVIDIA

Get personalized recommendations to optimize your resume specifically for Senior Software Engineer, Deep Learning Inference. Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Senior Software Engineer, Deep Learning Inference" , NVIDIA

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

Israel

Answer 10 quick questions to check your fit for Senior Software Engineer, Deep Learning Inference @ NVIDIA.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.