Resume and JobRESUME AND JOB
Amazon logo

Software Engineer-AI/ML, AWS Neuron Inference

Amazon

Software and Technology Jobs

Software Engineer-AI/ML, AWS Neuron Inference

full-timePosted: Aug 26, 2025

Job Description

AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machinelearning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc.The team works side by side with chip architects, compiler engineers and runtime engineers to deliver performance and accuracy on Neuron devices across a range of models such as Llama 3.3 70B, 3.1 405B, DBRX, Mixtral, and so on.Key job responsibilitiesResponsibilities of this role include adapting latest research in LLM optimization to Neuron chips to extract best performance from both open source as well as internally developed models. Working across teams and organizations is key.About the teamOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.

Locations

  • United States, WA, Seattle, Seattle, WA, United States

Salary

Estimated Salary Rangehigh confidence

180,000 - 300,000 USD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • - 3+ years of non-internship professional software development experienceintermediate
  • - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experienceintermediate
  • - Experience programming with at least one software programming languageintermediate
  • - Fundamentals of Machine learning models, their architecture, training and inference lifecycles along with work experience on some optimizations for improving the model performance.intermediate

Required Qualifications

  • - 3+ years of non-internship professional software development experience (experience, 3 years)
  • - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience (experience, 2 years)
  • - Experience programming with at least one software programming language (experience)
  • - Fundamentals of Machine learning models, their architecture, training and inference lifecycles along with work experience on some optimizations for improving the model performance. (experience)

Preferred Qualifications

  • - 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience (experience, 3 years)
  • - Bachelor's degree in computer science or equivalent (degree in computer science or equivalent)
  • - Hands-on experience with PyTorch or Jax - preferably involving developing and deploying LLMs in production on GPUs, Neuron, TPU or other AI acceleration hardware. (experience)
  • Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site. (experience)

Responsibilities

  • Responsibilities of this role include adapting latest research in LLM optimization to Neuron chips to extract best performance from both open source as well as internally developed models. Working across teams and organizations is key.
  • About the team
  • Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.

Target Your Resume for "Software Engineer-AI/ML, AWS Neuron Inference" , Amazon

Get personalized recommendations to optimize your resume specifically for Software Engineer-AI/ML, AWS Neuron Inference. Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Software Engineer-AI/ML, AWS Neuron Inference" , Amazon

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

aws.team-annapurna-labsaws.team-utility-computingSoftware Development

Answer 10 quick questions to check your fit for Software Engineer-AI/ML, AWS Neuron Inference @ Amazon.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.

Amazon logo

Software Engineer-AI/ML, AWS Neuron Inference

Amazon

Software and Technology Jobs

Software Engineer-AI/ML, AWS Neuron Inference

full-timePosted: Aug 26, 2025

Job Description

AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machinelearning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc.The team works side by side with chip architects, compiler engineers and runtime engineers to deliver performance and accuracy on Neuron devices across a range of models such as Llama 3.3 70B, 3.1 405B, DBRX, Mixtral, and so on.Key job responsibilitiesResponsibilities of this role include adapting latest research in LLM optimization to Neuron chips to extract best performance from both open source as well as internally developed models. Working across teams and organizations is key.About the teamOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.

Locations

  • United States, WA, Seattle, Seattle, WA, United States

Salary

Estimated Salary Rangehigh confidence

180,000 - 300,000 USD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • - 3+ years of non-internship professional software development experienceintermediate
  • - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experienceintermediate
  • - Experience programming with at least one software programming languageintermediate
  • - Fundamentals of Machine learning models, their architecture, training and inference lifecycles along with work experience on some optimizations for improving the model performance.intermediate

Required Qualifications

  • - 3+ years of non-internship professional software development experience (experience, 3 years)
  • - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience (experience, 2 years)
  • - Experience programming with at least one software programming language (experience)
  • - Fundamentals of Machine learning models, their architecture, training and inference lifecycles along with work experience on some optimizations for improving the model performance. (experience)

Preferred Qualifications

  • - 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience (experience, 3 years)
  • - Bachelor's degree in computer science or equivalent (degree in computer science or equivalent)
  • - Hands-on experience with PyTorch or Jax - preferably involving developing and deploying LLMs in production on GPUs, Neuron, TPU or other AI acceleration hardware. (experience)
  • Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site. (experience)

Responsibilities

  • Responsibilities of this role include adapting latest research in LLM optimization to Neuron chips to extract best performance from both open source as well as internally developed models. Working across teams and organizations is key.
  • About the team
  • Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.

Target Your Resume for "Software Engineer-AI/ML, AWS Neuron Inference" , Amazon

Get personalized recommendations to optimize your resume specifically for Software Engineer-AI/ML, AWS Neuron Inference. Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Software Engineer-AI/ML, AWS Neuron Inference" , Amazon

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

aws.team-annapurna-labsaws.team-utility-computingSoftware Development

Answer 10 quick questions to check your fit for Software Engineer-AI/ML, AWS Neuron Inference @ Amazon.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.