Software Engineer - GenAI Evaluations, AiDP

8 Minutes ago • 2 Years + • Research Development • $181,100 PA - $318,400 PA

Job Summary

Job Description

Join Apple’s Generative AI Evaluations team as a Software Engineer to define how AI systems are measured, monitored, and improved for next-generation user experiences. You will design robust evaluation frameworks, translate research into practical tooling, and collaborate with cross-functional teams to ensure trustworthy, efficient, and high-quality GenAI solutions. This role influences Apple’s AI platforms and sets standards for evaluating generative applications at scale.
Must have:
  • Design and develop platform features that let solution developers experiment and identify optimal configurations for high-quality GenAI applications.
  • Evaluate and analyze the performance of GenAI applications, and actively collaborate in driving performance improvements.
  • Translate the latest research into reliable and scalable evaluations that can deliver high-quality experiences for users.
  • Actively engage in all aspects of feature development, from ideation and experimentation to deployment and maintenance.
  • Communicate complex technical topics effectively to a diverse audience.
  • Programming skills in Python.
  • Experience developing scalable and robust services with FastAPI or similar frameworks.
  • Experience in Machine Learning, with a particular emphasis on Large Language Models (LLMs), Retrieval Augmented Generation (RAG) or GenAI Agents.
  • Experience with evaluating and optimizing Generative AI platforms or applications.
Good to have:
  • Experience with GenAI RAG and Agent evaluation frameworks like RAGAS, DeepEvals, OpenEvals, AgentEvals or OpenAI Evals.
  • Familiarity with LLM Observability techniques and best practices.
  • Proven ability to comprehend, interpret, and apply cutting-edge research into tangible applications.
  • Proven problem-solving and leadership abilities, with the capacity to steer the team's research and build practical applications in a collaborative and fast-paced environment.
  • Customer-focused with strong business acumen, capable of translating business needs into impactful technical solutions and a proven history of successfully shipping products that drive significant outcomes.
  • Experience with cloud platforms like AWS, GCP, or Azure.
  • Knowledge of containerization and orchestration tools like Docker and Kubernetes.
  • Creative, collaborative, and project-focused, with the ability to work hands-on in multi-functional teams.
  • Excellent communication skills with the ability to communicate with all stakeholders effectively, including senior leadership.
  • Master’s in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
Perks:
  • Comprehensive medical and dental coverage
  • Retirement benefits
  • Range of discounted products and free services
  • Reimbursement for certain educational expenses (tuition) for formal education related to career advancement at Apple
  • Opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs
  • Eligibility for discretionary restricted stock unit awards
  • Option to purchase Apple stock at a discount through Employee Stock Purchase Plan
  • Eligibility for discretionary bonuses or commission payments
  • Relocation assistance

Job Details

We are seeking a driven and analytical Software Engineer to join Apple’s Generative AI Evaluations team. In this role, you will help define how we measure, monitor, and improve the performance of AI systems that power next-generation user experiences. You will design robust evaluation frameworks, translate cutting-edge research into practical tooling, and collaborate closely with cross-functional teams to ensure our GenAI solutions are trustworthy, efficient, and high-quality. This is a unique opportunity to influence both the inner workings of Apple’s AI platforms and the broader standard for evaluating generative applications at scale.

As a Software Development Engineer on the Evaluations Team, you’ll join a phenomenal team of hardworking engineers and will be entrusted with a range of responsibilities. Your tasks will include:
  • Designing and developing platform features that help solution developers experiment and identify optimal configurations for delivering high-quality GenAI applications.
  • Evaluating and analyzing the performance of GenAI applications, and actively collaborating with the team in driving performance improvements.
  • Translating the latest research into reliable and scalable evaluations that can deliver high-quality experiences for our users.
  • Actively engaging in all aspects of feature development, from ideation and experimentation to deployment and maintenance.
  • Communicating complex technical topics effectively to a diverse audience.

Must have:
  • Bachelor’s in Computer Science, Artificial Intelligence, Machine Learning, or a related field, or equivalent experience
  • 2+ years of software engineering experience
  • Programming skills in Python
  • Experience developing scalable and robust services with FastAPI or similar frameworks
  • Experience in Machine Learning, with a particular emphasis on Large Language Models (LLMs), Retrieval Augmented Generation (RAG) or GenAI Agents
  • Experience with evaluating and optimizing Generative AI platforms or applications
Good to have:
  • Experience with GenAI RAG and Agent evaluation frameworks like RAGAS, DeepEvals, OpenEvals, AgentEvals or OpenAI Evals
  • Familiarity with LLM Observability techniques and best practices
  • Proven ability to comprehend, interpret, and apply cutting-edge research into tangible applications
  • Proven problem-solving and leadership abilities, with the capacity to steer the team's research and build practical applications in a collaborative and fast-paced environment
  • Customer-focused with strong business acumen, capable of translating business needs into impactful technical solutions and a proven history of successfully shipping products that drive significant outcomes
  • Experience with cloud platforms like AWS, GCP, or Azure
  • Knowledge of containerization and orchestration tools like Docker and Kubernetes
  • Creative, collaborative, and project-focused, with the ability to work hands-on in multi-functional teams
  • Excellent communication skills with the ability to communicate with all stakeholders effectively, including senior leadership
  • Master’s in Computer Science, Artificial Intelligence, Machine Learning, or a related field

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $181,100 and $318,400, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and, for formal education related to advancing your career at Apple, reimbursement for certain educational expenses, including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation assistance.

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.
