Netflix is one of the world's leading entertainment services, with over 300 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.

As Netflix grows, we keep advancing innovations in personalization and discovery, experimentation and decision-making, understanding our members and our titles, and backend infrastructure. These developments constantly create new opportunities for research to drive meaningful impact. By exploring the frontiers of AI/ML and intersecting fields, the Machine Learning and Inference Research team turns these opportunities into tangible benefits for our members and our business.

The Machine Learning and Inference Research team is a dedicated research team building up Netflix’s technical capabilities by tackling fundamental research questions tied to our most important challenges and partnering closely with teams across the business to translate research into impact at scale. As a member of the team, you will leverage your technical expertise to shape roadmaps, collaborate across functions, and bring new ideas from exploration to impact. You will also engage actively with the broader research community by publishing at top venues, presenting at conferences, mentoring interns, and fostering academic collaborations.

We are seeking an early-career researcher who can grow to define and execute a strong research agenda with both internal and external visibility, disseminate knowledge effectively and inspire others, collaborate with colleagues to deliver tangible impact, and help foster an open environment of innovation, intellectual rigor, and curiosity.

What you bring
Ph.D. in Computer Science or a related field with a specialization in post-training LLMs for downstream tasks, especially using RL (e.g., RLVR, RLHF, offline or online, policy- or value-based), and possibly also including reasoning, alignment, distillation/compression, tool use, memory, calibration, or related.
A track record of top-tier publications demonstrating deep expertise in the specialization.
Passion for collaboration and for building strong relationships to tackle big, cross-functional problems.
Strong technical communication skills, with the ability to adapt to different audiences.
Self-motivated with an ability to thrive and to lead with minimal oversight and process.
Curiosity and judgment in identifying and framing ambiguous research and business problems, and connecting the two.
Eagerness to elevate the broader organization through sharing knowledge and guiding the adoption of new methods.

Our compensation structure consists solely of an annual salary; we do not have bonuses. You choose each year how much of your compensation you want in salary versus stock options. To determine your personal top of market compensation, we rely on market indicators and consider your specific job family, background, skills, and experience to determine your compensation in the market range. The range for this role is $170,000 - $720,000.

Netflix provides comprehensive benefits including Health Plans, Mental Health support, a 401(k) Retirement Plan with employer match, Stock Option Program, Disability Programs, Health Savings and Flexible Spending Accounts, Family-forming benefits, and Life and Serious Injury Benefits. We also offer paid leave of absence programs. Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off. Full-time salaried employees are immediately entitled to flexible time off. See more detail about our Benefits here.

Netflix has a unique culture and environment. Learn more here.

Inclusion is a Netflix value and we strive to host a meaningful interview experience for all candidates. If you want an accommodation/adjustment for a disability or any other reason during the hiring process, please send a request to your recruiting partner.

We are an equal-opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.
Locations
Los Gatos, California, United States of America
New York, New York, United States of America
Salary
170,000 - 720,000 USD / yearly
Estimated Salary Range (high confidence)
250,000 - 400,000 USD / yearly
Source: AI-estimated
* This is an estimated range based on market data and may vary based on experience and qualifications.
Skills Required
specialization in post-training LLMs for downstream tasks (intermediate)
using RL (e.g., RLVR, RLHF, offline or online, policy- or value-based) (intermediate)
reasoning (intermediate)
alignment (intermediate)
distillation/compression (intermediate)
tool use (intermediate)
memory (intermediate)
calibration (intermediate)
top-tier publications demonstrating deep expertise (intermediate)
Strong technical communication skills (intermediate)
Required Qualifications
Ph.D. in Computer Science or a related field with a specialization in post-training LLMs for downstream tasks, especially using RL (e.g., RLVR, RLHF, offline or online, policy- or value-based), and possibly also including reasoning, alignment, distillation/compression, tool use, memory, calibration, or related. (experience)
A track record of top-tier publications demonstrating deep expertise in the specialization. (experience)
Passion for collaboration and for building strong relationships to tackle big, cross-functional problems. (experience)
Strong technical communication skills, with the ability to adapt to different audiences. (experience)
Self-motivated with an ability to thrive and to lead with minimal oversight and process. (experience)
Curiosity and judgment in identifying and framing ambiguous research and business problems, and connecting the two. (experience)
Eagerness to elevate the broader organization through sharing knowledge and guiding the adoption of new methods. (experience)
Responsibilities
Leverage your technical expertise to shape roadmaps, collaborate across functions, and bring new ideas from exploration to impact.
Engage actively with the broader research community by publishing at top venues, presenting at conferences, mentoring interns, and fostering academic collaborations.
Grow to define and execute a strong research agenda with both internal and external visibility, disseminate knowledge effectively and inspire others, collaborate with colleagues to deliver tangible impact, and help foster an open environment of innovation, intellectual rigor, and curiosity.
Benefits
general: Health Plans
general: Mental Health support
general: a 401(k) Retirement Plan with employer match
general: Stock Option Program
general: Disability Programs
general: Health Savings and Flexible Spending Accounts
general: Family-forming benefits
general: Life and Serious Injury Benefits
general: paid leave of absence programs
general: Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off.
general: Full-time salaried employees are immediately entitled to flexible time off.