Machine Learning Research Engineer (Human Sensing), SIML - ISE

Apple

| Seattle, Washington, United States of America (On Site) | Full Time | 1 months ago

Apply Now

Job Summary

The System Intelligence Machine Learning (SIML) organization seeks Research Engineers with strong ML and Computer Vision skills to develop next-gen multi-modal Human Sensing technologies. This role involves building foundation models for facial and full-body perception, contributing to Apple Intelligence, Camera, Photos, and Visual Intelligence features. You will design, implement, and deploy cutting-edge AI/ML models for robust cross-domain identity recognition systems, driving innovation in human-device interactions and personalized intelligent experiences.

Must Have

Design, implement, and deploy state-of-the-art visual recognition systems
Build foundation models for facial and full-body perception
Drive data quality excellence through strategic dataset curation, validation, and generation
Build tools and frameworks for systematic failure analysis and continuous model improvement
Interact with cross-functional stakeholders to gather product requirements
Translate requirements into actionable plans for ML research and development
Effectively communicate results and insights to partners and senior leaders
Stay current with latest trends in machine learning, multi-modal foundation models, computer vision, and natural language understanding
Actively contribute to Apple's ML community, disseminate research, enhance shared infrastructure, and mentor practitioners
Master's or Ph.D. in Computer Science, Computer Engineering, or related fields; or equivalent professional experience in ML research and development
Proficient in Python, PyTorch or equivalent deep learning frameworks
Proven track record of designing and implementing solutions using modern ML architectures
Background in research and innovation (publications, patents, impactful software developments)

Good to Have

Expert-level knowledge of state-of-the-art methods in face recognition or other facial analysis and biometric systems
Hands-on experience training multi-modal large language models
Experience with on-device ML, model optimization, or production ML systems

Perks & Benefits

Opportunity to become an Apple shareholder through discretionary employee stock programs
Eligible for discretionary restricted stock unit awards
Can purchase Apple stock at a discount through Employee Stock Purchase Plan
Comprehensive medical and dental coverage
Retirement benefits
Range of discounted products and free services
Reimbursement for certain educational expenses (including tuition) for formal education related to advancing your career at Apple
Might be eligible for discretionary bonuses or commission payments
Might be eligible for relocation

Job Description

The System Intelligence Machine Learning (SIML) organization is looking for Research Engineers with a strong foundation in Machine Learning and Computer Vision to develop the next generation of multi-modal Human Sensing technologies. You will be part of a fast-paced, impact-driven Applied Research organization building foundation models for facial and full-body perception, and working on cutting-edge machine learning that is at the heart of the most loved features on Apple platforms including Apple Intelligence, Camera, Photos, Visual Intelligence, etc. These innovations form the foundation of the seamless, intelligent experiences our users enjoy every day!

As a Machine Learning Research Engineer, you will be responsible for designing and developing cutting-edge AI/ML models for Human Sensing, with a focus on building robust cross-domain identity recognition systems. Multi-modal Human Sensing is a foundational capability that powers intelligent experiences based on key human traits such as identity, expression, clothing, action, gesture, gaze and human-object interaction. Major Apple Intelligence experiences such as personalized Natural Language Search, Memories Creation, as well as personalized Image Generation are powered by our ability to learn robust representations of visual human traits. Efficient real-time visual human sensing powers flagship Photography experiences such as Cinematic mode and Photographic Styles, communication experiences such as Center Stage, and paves the way for more natural human-device interactions, e.g., with the DockKit framework. YOUR PRIMARY RESPONSIBILITIES WILL INCLUDE: Designing, implementing, and deploying state-of-the-art visual recognition systems. Building foundation models for facial and full-body perception. Driving data quality excellence through strategic dataset curation, validation, and generation to support world-class model development. Building tools and frameworks for systematic failure analysis, identifying edge cases, and driving continuous model improvement. Directly interacting with all cross-functional stakeholders to gather product requirements and translating these into actionable plans for ML research and development. Effectively communicating results and insights to partners and senior leaders, providing clear and actionable recommendations. Staying current with the latest trends, technologies, and standard methodologies in machine learning, multi-modal foundation models, computer vision and natural language understanding. Actively contributing to Apple's ML community by disseminating research ideas and results, enhancing shared infrastructure, and mentoring fellow practitioners.

Master's or Ph.D. in Computer Science, Computer Engineering, or related fields; or equivalent professional experience in ML research and development.
Proficient in Python, PyTorch or equivalent deep learning frameworks.
Proven track record of designing and implementing solutions using modern ML architectures.
Background in research and innovation, demonstrated through publications in top-tier journals or conferences, patents, or impactful software developments.

Expert-level knowledge of state-of-the-art methods in face recognition or other facial analysis and biometric systems.
Hands-on experience training multi-modal large language models.
Experience with on-device ML, model optimization, or production ML systems.

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $171,600 and $258,100, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses — including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant

Apple accepts applications to this posting on an ongoing basis.

8 Skills Required For This Role

Cross Functional Game Texts Html Pytorch Deep Learning Computer Vision Python Machine Learning

Similar Jobs

Research Development

Machine Learning Engineer Intern

Match Group • Palo Alto, California, United States (Hybrid)