AIML - Staff Machine Learning Engineer - Reinforcement Learning
- 10+ years of ML experience in search, natural language processing/understanding, and conversational AI.
- Proven experience in LLM post-training, including but not limited to SFT, RLHF, RLAIF, reward modeling, chain-of-thought, and agentic LLMs.
- Hands-on experience building RL pipelines and training agents in simulation or real-world environments.
- Growth mindset and ability to learn new technologies
- MS or Ph.D. in Computer Science, Machine Learning with a specialty in reinforcement learning, or a related field
- Deep expertise in reinforcement learning-based post-training of LLMs, reward modeling, RLHF, RLAIF, chain-of-thought, and agentic AI R&D.
- Deep understanding of cutting-edge RL algorithms and large language models.
- Deep understanding of LLM pre-training and post-training.
- Strong product intuition and ownership
- Excellent communication skills
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.