AIML - Machine Learning Engineer, Model Evaluations

2 Months ago • All levels • Research Development • $175,800 PA - $312,200 PA

Job Summary

Job Description

This role focuses on evaluating and mitigating safety risks in generative AI features within Apple Intelligence. The responsibilities include developing metrics for safety and fairness evaluations, designing datasets, collaborating with cross-functional teams, translating regional safety requirements into evaluation criteria, building expertise in machine translation and data synthesis, and developing ML-based enhancements to improve product quality. The role involves working with highly sensitive and potentially offensive content. The candidate will work at the intersection of applied data science, empirical analysis, cultural and linguistic expertise, and stakeholder communication.
Must have:
  • Develop metrics for evaluation of safety and fairness risks.
  • Design datasets, identify data needs, and work on creative solutions.
  • Collaborate with cross-functional partners.
  • Translate regional safety and inclusivity requirements into measurable evaluation criteria.

Job Details

Apple Intelligence is driven by intentional data design, spanning careful sampling, creation, and curation of high-quality datasets enriched with precise annotations. Our data powers our ability to evaluate and mitigate safety risks in new generative AI features. This role sits at the intersection of applied data science, empirical analysis, cultural and linguistic expertise, and stakeholder communication. It requires strong scientific judgment, cross-functional collaboration, and the ability to translate evaluation findings into actionable insights.
  • Develop metrics for evaluation of safety and fairness risks inherent to generative models and Gen-AI features.
  • Design datasets, identify data needs, and work on creative solutions, scaling and expanding data coverage through human and synthetic generation methods.
  • Collaborate with cross-functional partners, including engineering, product, and research teams, to ensure evaluations align with feature goals and deployment plans.
  • Partner with policy teams to translate regional safety and inclusivity requirements into measurable evaluation criteria.
  • Build expertise in machine translation and data synthesis techniques to generate localized and culturally aligned evaluation datasets at scale.
  • Develop ML-based enhancements to red teaming, model evaluation, and other processes to improve the quality of Apple Intelligence’s user-facing products.
  • Work with highly sensitive content, with exposure to offensive and controversial content.
