Resume and JobRESUME AND JOB
Apple logo

On-Device ML Infrastructure Engineer (ML User Experience APIs & Integration)

Apple

Software and Technology Jobs

On-Device ML Infrastructure Engineer (ML User Experience APIs & Integration)

full-timePosted: Aug 29, 2025

Job Description

The On-Device Machine Learning team at Apple is responsible for enabling the Research to Production lifecycle of cutting edge machine learning models that power magical user experiences on Apple’s hardware and software platforms. Apple is the best place to do on-device machine learning, and this team sits at the heart of that discipline, interfacing with research, SW engineering, HW engineering, and products. The team builds critical infrastructure that begins with onboarding the latest machine learning architectures to embedded devices, optimization toolkits to optimize these models to better suit the target devices, machine learning compilers and runtimes to execute these models as efficiently as possible, and the benchmarking, analysis and debugging toolchain needed to improve on new model iterations. This infrastructure underpins most of Apple’s critical machine learning workflows across Camera, Siri, Health, Vision, etc., and as such is an integral part of Apple Intelligence. We are seeking an ML Infrastructure Engineer to focus on ML user experience APIs and Integration. In this role, which offers terrific exposure to developing new ML model conversion & authoring APIs that will be a part of coremltools (CoreML’s authoring/conversion toolkit), you will also be central to integrating the APIs into internal and external systems (e.g., HuggingFace) to demonstrate the most efficient way of ingesting models into CoreML from these systems. This integration could involve a gamut of optimizations ranging from authored program optimizations (e.g., in PyTorch), to custom optimizations on CoreML’s model representation. As an engineer in this critical role, you will develop and use APIs in coremltools to enable ML engineers to efficiently author/convert ML models to CoreML. You will be integrating coremltools into internal and external ML model repositories to evaluate and demonstrate how ML models can ingested into CoreML. You will ideate, design, and stress test the gamut of optimizations required to ingest these models, ranging from source level optimizations (e.g., in the PyTorch program), to custom optimizations after converting to CoreML’s model representation. The role requires a good understanding of ML modeling (architectures, training vs inference trade-offs, etc.), ML deployment optimizations (e.g., quantization), and a good understanding of designing Python APIs We are building the first end-to-end developer experience for ML development that, by taking advantage of Apple’s vertical integration, allows developers to iterate on model authoring, optimization, transformation, execution, debugging, profiling and analysis. The coremltools authoring and conversion APIs are the entrypoint to the rest of the infrastructure stack.

Locations

  • Cupertino, California, United States 95014

Salary

Estimated Salary Rangemedium confidence

30,000,000 - 60,000,000 INR / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • on-device machine learningintermediate
  • machine learning architecturesintermediate
  • optimization toolkitsintermediate
  • machine learning compilersintermediate
  • machine learning runtimesintermediate
  • benchmarkingintermediate
  • analysisintermediate
  • debuggingintermediate
  • ML model conversionintermediate
  • ML model authoringintermediate
  • developing APIsintermediate
  • integrating APIsintermediate
  • PyTorchintermediate
  • CoreMLintermediate
  • HuggingFaceintermediate
  • authored program optimizationsintermediate
  • custom optimizationsintermediate
  • quantizationintermediate
  • ML modelingintermediate
  • architecturesintermediate
  • training vs inference trade-offsintermediate
  • ML deployment optimizationsintermediate
  • designing Python APIsintermediate
  • model authoringintermediate
  • model optimizationintermediate
  • model transformationintermediate
  • model executionintermediate
  • profilingintermediate
  • stress testingintermediate
  • SW engineeringintermediate
  • HW engineeringintermediate

Required Qualifications

  • Bachelors in Computer Sciences, Engineering, or related discipline. (degree in computer sciences)
  • Highly proficient in Python programming, familiarity with C++ is required. (experience)
  • Proficiency in at least one ML authoring framework, such as PyTorch, TensorFlow, JAX, MLX. (experience)
  • Strong understanding of ML fundamentals, including common architectures such as Transformers. (experience)
  • Understanding of common ML inference optimizations, such as quantization, pruning, KV caching, etc. (experience)

Preferred Qualifications

  • Experience with any on-device ML stack, such as TFLite, ONNX, etc. (experience)
  • Experience with designing Python APIs and production deployment of python packages is a strong plus. (experience)
  • Experience with HuggingFace or any other model repository is a strong plus. (experience)
  • Experience with MLIR/LLVM or any compiler toolchains is a strong plus. (experience)
  • Good communication skills, including ability to communicate with cross-functional audiences. (experience)

Responsibilities

  • As an engineer in this critical role, you will develop and use APIs in coremltools to enable ML engineers to efficiently author/convert ML models to CoreML. You will be integrating coremltools into internal and external ML model repositories to evaluate and demonstrate how ML models can ingested into CoreML. You will ideate, design, and stress test the gamut of optimizations required to ingest these models, ranging from source level optimizations (e.g., in the PyTorch program), to custom optimizations after converting to CoreML’s model representation. The role requires a good understanding of ML modeling (architectures, training vs inference trade-offs, etc.), ML deployment optimizations (e.g., quantization), and a good understanding of designing Python APIs
  • We are building the first end-to-end developer experience for ML development that, by taking advantage of Apple’s vertical integration, allows developers to iterate on model authoring, optimization, transformation, execution, debugging, profiling and analysis. The coremltools authoring and conversion APIs are the entrypoint to the rest of the infrastructure stack.

Target Your Resume for "On-Device ML Infrastructure Engineer (ML User Experience APIs & Integration)" , Apple

Get personalized recommendations to optimize your resume specifically for On-Device ML Infrastructure Engineer (ML User Experience APIs & Integration). Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "On-Device ML Infrastructure Engineer (ML User Experience APIs & Integration)" , Apple

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

Hardware

Answer 10 quick questions to check your fit for On-Device ML Infrastructure Engineer (ML User Experience APIs & Integration) @ Apple.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.

Apple logo

On-Device ML Infrastructure Engineer (ML User Experience APIs & Integration)

Apple

Software and Technology Jobs

On-Device ML Infrastructure Engineer (ML User Experience APIs & Integration)

full-timePosted: Aug 29, 2025

Job Description

The On-Device Machine Learning team at Apple is responsible for enabling the Research to Production lifecycle of cutting edge machine learning models that power magical user experiences on Apple’s hardware and software platforms. Apple is the best place to do on-device machine learning, and this team sits at the heart of that discipline, interfacing with research, SW engineering, HW engineering, and products. The team builds critical infrastructure that begins with onboarding the latest machine learning architectures to embedded devices, optimization toolkits to optimize these models to better suit the target devices, machine learning compilers and runtimes to execute these models as efficiently as possible, and the benchmarking, analysis and debugging toolchain needed to improve on new model iterations. This infrastructure underpins most of Apple’s critical machine learning workflows across Camera, Siri, Health, Vision, etc., and as such is an integral part of Apple Intelligence. We are seeking an ML Infrastructure Engineer to focus on ML user experience APIs and Integration. In this role, which offers terrific exposure to developing new ML model conversion & authoring APIs that will be a part of coremltools (CoreML’s authoring/conversion toolkit), you will also be central to integrating the APIs into internal and external systems (e.g., HuggingFace) to demonstrate the most efficient way of ingesting models into CoreML from these systems. This integration could involve a gamut of optimizations ranging from authored program optimizations (e.g., in PyTorch), to custom optimizations on CoreML’s model representation. As an engineer in this critical role, you will develop and use APIs in coremltools to enable ML engineers to efficiently author/convert ML models to CoreML. You will be integrating coremltools into internal and external ML model repositories to evaluate and demonstrate how ML models can ingested into CoreML. You will ideate, design, and stress test the gamut of optimizations required to ingest these models, ranging from source level optimizations (e.g., in the PyTorch program), to custom optimizations after converting to CoreML’s model representation. The role requires a good understanding of ML modeling (architectures, training vs inference trade-offs, etc.), ML deployment optimizations (e.g., quantization), and a good understanding of designing Python APIs We are building the first end-to-end developer experience for ML development that, by taking advantage of Apple’s vertical integration, allows developers to iterate on model authoring, optimization, transformation, execution, debugging, profiling and analysis. The coremltools authoring and conversion APIs are the entrypoint to the rest of the infrastructure stack.

Locations

  • Cupertino, California, United States 95014

Salary

Estimated Salary Rangemedium confidence

30,000,000 - 60,000,000 INR / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • on-device machine learningintermediate
  • machine learning architecturesintermediate
  • optimization toolkitsintermediate
  • machine learning compilersintermediate
  • machine learning runtimesintermediate
  • benchmarkingintermediate
  • analysisintermediate
  • debuggingintermediate
  • ML model conversionintermediate
  • ML model authoringintermediate
  • developing APIsintermediate
  • integrating APIsintermediate
  • PyTorchintermediate
  • CoreMLintermediate
  • HuggingFaceintermediate
  • authored program optimizationsintermediate
  • custom optimizationsintermediate
  • quantizationintermediate
  • ML modelingintermediate
  • architecturesintermediate
  • training vs inference trade-offsintermediate
  • ML deployment optimizationsintermediate
  • designing Python APIsintermediate
  • model authoringintermediate
  • model optimizationintermediate
  • model transformationintermediate
  • model executionintermediate
  • profilingintermediate
  • stress testingintermediate
  • SW engineeringintermediate
  • HW engineeringintermediate

Required Qualifications

  • Bachelors in Computer Sciences, Engineering, or related discipline. (degree in computer sciences)
  • Highly proficient in Python programming, familiarity with C++ is required. (experience)
  • Proficiency in at least one ML authoring framework, such as PyTorch, TensorFlow, JAX, MLX. (experience)
  • Strong understanding of ML fundamentals, including common architectures such as Transformers. (experience)
  • Understanding of common ML inference optimizations, such as quantization, pruning, KV caching, etc. (experience)

Preferred Qualifications

  • Experience with any on-device ML stack, such as TFLite, ONNX, etc. (experience)
  • Experience with designing Python APIs and production deployment of python packages is a strong plus. (experience)
  • Experience with HuggingFace or any other model repository is a strong plus. (experience)
  • Experience with MLIR/LLVM or any compiler toolchains is a strong plus. (experience)
  • Good communication skills, including ability to communicate with cross-functional audiences. (experience)

Responsibilities

  • As an engineer in this critical role, you will develop and use APIs in coremltools to enable ML engineers to efficiently author/convert ML models to CoreML. You will be integrating coremltools into internal and external ML model repositories to evaluate and demonstrate how ML models can ingested into CoreML. You will ideate, design, and stress test the gamut of optimizations required to ingest these models, ranging from source level optimizations (e.g., in the PyTorch program), to custom optimizations after converting to CoreML’s model representation. The role requires a good understanding of ML modeling (architectures, training vs inference trade-offs, etc.), ML deployment optimizations (e.g., quantization), and a good understanding of designing Python APIs
  • We are building the first end-to-end developer experience for ML development that, by taking advantage of Apple’s vertical integration, allows developers to iterate on model authoring, optimization, transformation, execution, debugging, profiling and analysis. The coremltools authoring and conversion APIs are the entrypoint to the rest of the infrastructure stack.

Target Your Resume for "On-Device ML Infrastructure Engineer (ML User Experience APIs & Integration)" , Apple

Get personalized recommendations to optimize your resume specifically for On-Device ML Infrastructure Engineer (ML User Experience APIs & Integration). Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "On-Device ML Infrastructure Engineer (ML User Experience APIs & Integration)" , Apple

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

Hardware

Answer 10 quick questions to check your fit for On-Device ML Infrastructure Engineer (ML User Experience APIs & Integration) @ Apple.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.