Software Development Engineer - Generative AI, AGIF | Inference Engine

Amazon

full-time

Posted: July 29, 2025

Number of Vacancies: 1

Job Description

Are you interested in advancing Amazon's Generative AI capabilities? Come work with a talented group of engineers and scientists in a highly collaborative and friendly team. We are building state-of-the-art Generative AI technology that will benefit all Amazon businesses and customers.

Key job responsibilities

As a Software Development Engineer, you will be responsible for designing, developing, testing, and deploying high-performance model inference capabilities, including but not limited to multi-modality, SOTA model architectures, latency, throughput, and cost. You will collaborate closely with a team of engineers and scientists to influence our overall strategy and define the team’s roadmap. You will drive system architecture, spearhead best practices, and mentor junior engineers.

A day in the life

  • You will consult with scientists to draw inspiration from emerging techniques and blend those into our roadmap.
  • You will design and experiment with new algorithms from public and internal papers, and benchmark the latency and accuracy of your implementations.
  • Most importantly, you will implement production-grade solutions and see them through deployment swiftly.
  • You may need to collaborate with other science and engineering teams to get things done properly.
  • You will hold the highest bar in operational excellence, support production systems, and constantly create solutions to minimize the ops load.

About the team

Our mission is to build best-in-class, fast, accurate, and cost-efficient frontier model inference solutions and infrastructure that will enable Amazon businesses to deliver more value to their customers.

Locations

  • Boston, MA, United States
  • New York, NY, United States

Salary

Salary not disclosed

Estimated Salary Range (high confidence)

180,000 - 280,000 USD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • 3+ years of non-internship professional software development experience (intermediate)
  • Must have one of the following two: 1) prior experience with software performance optimization, or 2) knowledge of Deep Learning and Transformer architectures (intermediate)

Required Qualifications

  • 3+ years of non-internship professional software development experience
  • Must have one of the following two: 1) prior experience with software performance optimization, or 2) knowledge of Deep Learning and Transformer architectures

Preferred Qualifications

  • 3+ years of full software development life cycle experience, including coding standards, code reviews, source control management, build processes, testing, and operations
  • Bachelor's degree in computer science or equivalent
  • Experience with Large Language Model Inference
  • Experience with GPU programming (TensorRT-LLM)
  • Experience with Python, PyTorch, and C++ programming and performance optimization
  • Experience with Trainium and Inferentia Development

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.

Responsibilities

  • As a Software Development Engineer, you will be responsible for designing, developing, testing, and deploying high performance model inference capabilities, including but not limited to multi-modality, SOTA model architectures, latency, throughput, and cost. You will collaborate closely with a team of engineers and scientists to influence our overall strategy, and define the team’s roadmap. You will drive system architecture, spearhead best practices, and mentor junior engineers.

Tags & Categories

amazon.artificial-intelligence, Software Development
