Senior ML Infrastructure Engineer - Training Algorithms, SIML
Apple
Job Summary
The Intelligence System Experience (ISE) team is seeking a Senior ML Infrastructure Engineer to work on groundbreaking generative modeling technologies. This role focuses on building infrastructure for training, adapting, and deploying large-scale generative models. You will collaborate with algorithm design and infrastructure engineers to benchmark, prototype, and steer algorithmic choices for training and deployment. The team operates at the intersection of multimodal machine learning and system experiences, contributing to seamless, intelligent user experiences.
Must Have
- Training optimizations & profiling targeting vision/language pre-training
- Researching training recipes for effective scheduling of multimodal training workloads
- Experimentation & tooling for post-training ablations including reward modeling, distillation and prompt optimization
- Coordinating with post-training algorithm owners for analyzing quality / performance tradeoffs of downstream capabilities
- Ablations involving optimization aware fine-tuning
- Experienced in training / adapting LLM and Diffusion models
- Advanced Fluency in PyTorch
- Excellent programming skills and experience contributing software to large projects
- Experience with distributed training of large models
Good to Have
- Strong ML Fundamentals
- Experience working with large cross-functional and diverse teams
Perks & Benefits
- Comprehensive medical and dental coverage
- Retirement benefits
- Range of discounted products and free services
- Reimbursement for certain educational expenses (including tuition) related to advancing your career
- Opportunity to become an Apple shareholder through discretionary employee stock programs
- Ability to purchase Apple stock at a discount through Employee Stock Purchase Plan
- Discretionary bonuses or commission payments (if eligible)
- Relocation (if eligible)
Job Description
Are you passionate about Generative AI? Are you interested in working on groundbreaking generative modeling technologies to enrich billions of people? We are the Intelligence System Experience (ISE) team within Apple’s software organization. The team operates at the intersection of multimodal machine learning and system experiences. Our multidisciplinary ML teams focus on a broad spectrum of areas, including Visual Generative Foundation Models, Multimodal Understanding, Visual Understanding of People, Text, Handwriting, and Scenes, Personalization, Knowledge Extraction, Conversation Analysis, Behavioral Modeling for Proactive Suggestions, and Privacy-Preserving Learning. These innovations form the foundation of the seamless, intelligent experiences our users enjoy every day. We are seeking engineers experienced in building infrastructure for training, adapting and deploying large-scale generative models. In this role, you will be working with closely with a cross functional team of algorithm design and infrastructure engineers to benchmark, prototype and steer algorithmic choices to best fit our training & deployment infrastructure.
In this role you will be technically hands on, with deep subject matter expertise in ML infrastructure. Responsibilities Include:
- Training optimizations & profiling targeting vision/language pre-training
- Researching training recipes for effective scheduling of multimodal training workloads
- Experimentation & tooling for post-training ablations including reward modeling, distillation and prompt optimization
- Coordinating with post-training algorithm owners for analyzing quality / performance tradeoffs of downstream capabilities
- Ablations involving optimization aware fine-tuning
- Bachelors, Masters, or PhD in Electrical Engineering/Computer Science or a related field (mathematics, physics or computer engineering), with a focus on machine learning; or comparable professional experience
- Experienced in training / adapting LLM and Diffusion models
- Advanced Fluency in PyTorch
- Excellent programming skills and experience contributing software to large projects
- Experience with distributed training of large models
- Strong ML Fundamentals
- Experience working with large cross-functional and diverse teams.
At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $181,100 and $318,400, and your base pay will depend on your skills, qualifications, experience, and location.
Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses — including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant
.
Apple accepts applications to this posting on an ongoing basis.