Member of Technical Staff - Macrohard / Computer Control - Post Training (RL)

23 Minutes ago • All levels • $180,000 PA - $440,000 PA

Research Development

Job Description

The Computer Use Agents team at xAI is developing LLM agents to autonomously use computers, aiming to build Macrohard, an AI software company. This role involves building best-in-class Computer Use Agents, focusing on automating repetitive processes, software development, and long-horizon tasks. Responsibilities include designing RL algorithms and efficient tools for agent learning, collaborating on data acquisition, and building evaluation suites to enhance product experience.

Good To Have:

Extensive experience developing and implementing reinforcement learning (RL) algorithms for intelligent agents
Focus on agent-based systems
Proven expertise in designing and training purely vision-based RL agents or algorithms
Leveraging visual inputs for decision-making
Demonstrated ability to translate research ideas into deployable models or agents
Successfully delivering models/agents to end users

Must Have:

Build best-in-class Computer Use Agents
Automate repetitive processes
Build and test software
Perform long-horizon tasks (e.g., research, plan, execute complex tasks)
Design RL algorithms
Design reliable and efficient tools for agent learning
Acquire data
Build relevant evaluation suites
Ensure good product experience

Add these skills to join the top 1% applicants for this job

team-management

communication

game-texts

reinforcement-learning

algorithms

About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About the Team

The Computer Use Agents team aims to develop LLM agents that can use computers the way (or better) human use today. The team will build the base towards the goal of Macrohard – an autonomous AI software company.

About the Role

In this role you will:

Build the best in class Computer Use Agents focussing on automating repetitive processes, building and testing software, performing long-horizon tasks (e.g. research, plan and execute on a complex task).
Design RL algorithms, reliable and efficient tools for agent to learn by interacting w/ realistic environments.
Work closely with the team to acquire data, build relevant evaluation suites and good product experience.

Exceptional candidates may have:

Extensive experience developing and implementing reinforcement learning (RL) algorithms for intelligent agents, with a focus on agent-based systems.
Proven expertise in designing and training purely vision-based RL agents or algorithms, leveraging visual inputs for decision-making.
Demonstrated ability to translate research ideas inspired by real-world use cases into deployable models or agents, successfully delivering them to end users.

Interview Process

After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15 minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews:

1. Coding assessment in a language of your choice.

2. Researcher technical sessions (2): These sessions will be testing your ability to formulate, design and solve concrete problems in real world with LLM. It can be research or engineering, depending on background/experience.

3. Meet the Team: Present your past exceptional work and your vision with xAI to a small audience.

Our goal is to finish the main process within one week. All interviews will be conducted via Google Meet.

Set alerts for more jobs like Member of Technical Staff - Macrohard / Computer Control - Post Training (RL)

Set alerts for new jobs by xAI

Set alerts for new Research Development jobs in United States

Set alerts for new jobs in United States

Set alerts for Research Development (Remote) jobs