We develop software that enables the TPU, Google’s custom-built AI computation chip, to run large-scale AI hypercomputation in Google’s data centers, thereby powering cutting-edge AI innovation for Google (DeepMind, Search, Ads, and more) and for Cloud customers.
Our team covers a broad range of the TPU software stack, including system software that enables a single TPU machine, superpod software that connects thousands of TPU chips into an AI hypercomputer, and health monitoring software that ensures the TPUs, their interconnects, and their networking are healthy.
We play a key role in the introduction of each new TPU chip, from design and system bringup to productionization of individual machines and of large-scale AI hypercomputers comprising thousands of machines. We are involved in all stages of the project, from concept, planning, development, and deployment through end of life in the data centers.
Software Engineering Managers not only have the technical expertise to take on and provide technical leadership to major projects, but also manage a team of Engineers. You not only optimize your own code but also make sure Engineers are able to optimize theirs. As a Software Engineering Manager, you manage your project goals, contribute to product strategy, and help develop your team. Teams work all across the company, in areas such as information retrieval, artificial intelligence, natural language processing, distributed computing, large-scale system design, networking, security, data compression, and user interface design; the list goes on and is growing every day. Operating with scale and speed, our exceptional software engineers are just getting started, and as a manager, you guide the way.
The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.
We prioritize security, efficiency, and reliability across everything we do, from developing our latest TPUs to running a global network, while working to shape the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.
A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.