Site Reliability Engineer I

3 Months ago • 1+ Years • DevOps

Job Summary

Job Description

The Site Reliability Engineer I (SRE-I) role at Take-Two involves maintaining the health, availability, and reliability of games and services. The SRE team acts as the first line of defense for production issues, monitoring infrastructure and providing on-call support. Responsibilities include Windows and Linux administration, managing virtual environments with VMware, cloud administration across AWS, Azure, Splunk, and GCP, risk assessment, issue identification, service troubleshooting, and incident management. The role requires effective communication and collaboration with game studios, ensuring timely information relay during incidents. Proactive problem-solving and continuous improvement are also essential.
Must have:
  • Manage and maintain Windows servers.
  • Utilize CheckMK for monitoring and alerting.
  • Diagnose and resolve issues on Linux systems.
  • Manage virtual environments using VMware.
  • Administer cloud services across AWS, Azure, Splunk, and GCP.
  • Assess potential risks on game services and revenue.
  • Identify issues using provided dashboards.
  • Troubleshoot basic issues across various services.
  • Relay accurate and timely information to game studios.
Good to have:
  • Utilize broad understanding of IT principles and concepts.
  • Solve complex problems effectively.
  • Network and collaborate with senior personnel.
  • Innovate solutions to tactical business issues.
  • Lead the team with advanced knowledge.
  • Exercise decision-making autonomy.
Perks:
  • Great Company Culture.
  • Growth opportunities.
  • Gym reimbursement up to INR1150 per month.
  • Charitable giving program.
  • Access to learning platforms.
  • Gaming events.

Job Details

Job Title: Site Reliability Engineer I (SRE-I)

 

Who We Are:

Headquartered in New York City, Take-Two Interactive Software, Inc. is a leading developer, publisher, and marketer of interactive entertainment for consumers around the globe. The Company develops and publishes products principally through Rockstar Games, 2K, Private Division, and Zynga. Our products are currently designed for console gaming systems, PC, and Mobile, including smartphones and tablets, and are delivered through physical retail, digital download, online platforms, and cloud streaming services. The Company’s common stock is publicly traded on NASDAQ under the symbol TTWO.

While our offices (physical and virtual) are casual and inviting, we are deeply committed to our core tenets of creativity, innovation and efficiency, and individual and team development opportunities. Our industry and business are continually evolving and fast-paced, providing numerous opportunities to learn and hone your skills. We work hard, but we also like to have fun, and believe that we provide a great place to come to work each day to pursue your passions. 

 

The Challenge

The SRE team serves as a centralised operations unit under the Technical Operation Centre (TOC), tasked with maintaining the health, availability, and reliability of our games and services. From a broader perspective, our primary mission is to ensure high uptime. As the first line of defence for all production issues, SREs take the lead in monitoring infrastructure and providing primary on-call support, ensuring a quick response to any incidents. We also play a critical role in emergency response, managing communication and coordination to resolve issues as efficiently as possible.

In addition to these primary responsibilities, the SREs work proactively with the NOC team to improve latency, performance, and efficiency across all services. Our work extends to capacity planning and optimization at both the system and cloud levels, ensuring that services scale efficiently to meet the demands of our games. We don’t just respond to incidents; we continuously look for ways to enhance the performance and reliability of the infrastructure.

Ultimately, SRE strives to achieve world-class uptime for all Take-Two products, working to reduce the frequency and impact of downtime while resolving issues promptly and comprehensively. With a focus on the entire production stack, we take a holistic approach to reliability engineering, ensuring that every layer, from the infrastructure to the application level, contributes to the best possible user experience.

 

What You’ll Take On

  • Windows Administration

    • Manage and maintain Windows servers, ensuring their stability, security, and performance.

  • CheckMK

    • Utilize CheckMK for comprehensive monitoring and alerting, ensuring all systems are functioning optimally.

  • Linux Administration

    • Diagnose and resolve issues on Linux systems, ensuring minimal downtime and maximum efficiency.

  • VMware

    • Manage virtual environments using VMware, ensuring resources are optimized and available.

  • vSAN Understanding

    • Demonstrate a solid understanding of vSAN for effective storage management and troubleshooting.

  • Cloud Administration

    • Administer and manage cloud services across AWS, Azure, Splunk, and GCP, ensuring seamless integration and operation.

  • Risk Assessment

    • Assess potential risks and impacts on game services and revenue, taking proactive measures to mitigate them.

  • Issue Identification

    • Identify issues, alerts, and critical service incidents using provided dashboards and monitoring tools.

  • Service Troubleshooting

    • Utilize studio playbooks to troubleshoot and diagnose basic issues across various services.

  • Communication

    • Relay accurate and timely information regarding service impacts to game studios, ensuring effective communication during incidents.

  • Incident Management

    • Spearhead outage management, including communication, triage, and escalation.

  • Daily On Call

    • Responsible for triaging and troubleshooting critical alerts from critical systems.

What You Bring 

  • Experience:

    • Live Services Knowledge: Understanding of live services and their operational requirements.

    • Change/Crisis Management: Experience in managing change and crisis situations, ensuring minimal disruption to services.

    • Effective Communicator: Able to relay accurate and timely information to the game studio and other stakeholders.

    • Team Player: Works well in a collaborative environment, sharing knowledge and supporting team members.

  • Proactive Problem-Solving:

    • A commitment to continuous improvement and proactive issue resolution.

    • Proven experience in troubleshooting production problems affecting live services.

    • Able to identify potential issues before they become critical and manage details effectively.

  • Background:

    • At least 1 year of experience in a similar role and/or 3 years of experience in a relevant role.

Great to Have: 

  • Apply Advanced Knowledge: 

    • Utilize your broad understanding of principles, theories, and concepts in IT, integrating advanced knowledge from related fields.

    • Solve Complex Problems: Address diverse and moderately complex problems, using sound judgment to select the best methods and techniques.

    • Network and Collaborate: Engage with senior internal and external personnel to maximize the application of functional expertise.

  • Problem Solving:

    • Innovate Solutions: Develop and recommend solutions to tactical business issues, proactively identifying and addressing potential problems.

    • Lead with Expertise: Use your advanced knowledge to guide your team and drive effective solutions.

  • Decision Making:

    • Exercise Autonomy: Make decisions with considerable latitude, consulting with senior engineers or managers on complex issues and recommending solutions as necessary.

 

What We Offer You:

  • Great Company Culture. We pride ourselves on being one of the most creative and innovative places to work. Creativity, innovation, efficiency, diversity, and philanthropy are among the core tenets of our organization and are integral drivers of our continued success.
  • Growth. As a global entertainment company, we pride ourselves on creating environments where employees are encouraged to be themselves, to be inquisitive and collaborative, and to grow within and around the company.
  • Work Hard, Enjoy Life. Our employees bond, blow off steam, and flex some creative muscles through our office gaming spaces, company parties, game release events, monthly socials, and team challenges.
  • Benefits. Benefits include, but are not limited to: discretionary bonus, provident fund contributions, 1+5 medical insurance with top-up options, access to the Practo online doctor consultation app, employee assistance program, 3X CTC life assurance, 3X CTC personal accident insurance, childcare services, and 20 days of holiday plus statutory holidays.
  • Perks. Gym reimbursement up to INR1150 per month, charitable giving program, access to learning platforms, gaming events. 
 
Please be aware that Take-Two does not conduct job interviews or make job offers over third-party messaging apps such as Telegram, WhatsApp, or others. Take-Two also does not engage in any financial exchanges during the recruitment or onboarding process, and the Company will never ask a candidate for their personal or financial information over an app or other unofficial chat channel. Any attempt to do so may be the result of a scam or phishing exercise. Take-Two’s in-house recruitment team will only contact individuals through their official Company email addresses (i.e., via a take2games.com email domain). If you need to report an issue or otherwise have questions, please contact Careers@take2games.com.
 
As an equal opportunity employer, Take-Two Interactive Software, Inc. (“Take-Two”) is committed to fostering and celebrating the diverse thoughts, cultures, and backgrounds of its talent, partners, and communities throughout its organization. Consistent with this commitment, Take-Two does not discriminate or retaliate against any employee or job applicant because of their race, color, religion, sex (including pregnancy, sexual orientation, and gender identity), national origin, age, disability, and genetic information (including family medical history), or on the basis of any other trait protected by applicable law. If you need to report a concern or have questions regarding Take-Two’s equal opportunity commitment, please contact Careers@take2games.com.
 
#LI-Hybrid

About The Company

Take-Two Interactive Software, Inc. is a leading developer, publisher, and marketer of interactive entertainment for consumers around the globe. We develop and publish products principally through Rockstar Games, 2K, and Zynga. Our products are designed for console gaming systems, PC, and mobile, including smartphones and tablets. We deliver our products through physical retail, digital download, online platforms, and cloud streaming services. For more information, visit
