Site Reliability Engineer II

1 Month ago • 5-9 Years • DevOps

About the job

Job Description

As a Site Reliability Engineer II on the Commerce Reliability Engineering team, you'll ensure the high availability and resilience of PlayStation's monetization platform. You'll work closely with service development teams to automate and improve operational excellence, proactively identifying and driving process and technology advancements. Responsibilities include managing 90+ commerce and payment services within an AWS environment, ensuring availability, scalability, and performance. You'll integrate AWS managed services, automate operational processes, enhance platform observability, and collaborate with other SRE teams for optimal back-end performance. You'll also conduct performance and capacity analysis, review service architecture for resilience and scalability, and provide on-call support for production incidents.
Must have:
  • 5+ years hands-on AWS experience
  • 5+ years of relevant work experience in a high-volume production environment
  • 5+ years of software engineering or supporting/maintaining software systems experience (Java and/or c++ services)
  • 3+ years of experience with building automation
  • Experience with container technologies and orchestration (Docker, Kubernetes, EKS)
  • Experience with AWS managed data services (RDS, DynamoDB, Elasticache)
  • Experience with monitoring and log management tools (DataDog, CloudWatch, Splunk)
  • Hands-on experience in triaging and tuning Java cloud applications
  • Solid understanding of AWS networking systems and protocols
  • Experience with CI/CD pipelines
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.

Why PlayStation?

PlayStation isn’t just the Best Place to Play — it’s also the Best Place to Work. Today, we’re recognized as a global leader in entertainment producing The PlayStation family of products and services including PlayStation®5, PlayStation®4, PlayStation®VR, PlayStation®Plus, acclaimed PlayStation software titles from PlayStation Studios, and more.

PlayStation also strives to create an inclusive environment that empowers employees and embraces diversity. We welcome and encourage everyone who has a passion and curiosity for innovation, technology, and play to explore our open positions and join our growing global team.

The PlayStation brand falls under Sony Interactive Entertainment, a wholly-owned subsidiary of Sony Corporation.

As a member of the Commerce Reliability Engineering team, you will carry the responsibility of keeping our monetization platform highly available and resilient, while continually enabling our service teams to deliver new and exciting product and technical features. Our team strives to iteratively learn, improve and automate our processes every single day, which continually improves operational excellence within our organization. You will be empowered to be a technical leader on our team, helping identify and proactivity drive improvements in both process and technology.

Responsibilities:

Your responsibilities will include hands-on application management of over 90+ commerce and payment related services within an AWS cloud environment, ensuring availability, resiliency, scalability and performance. You will work side by side with our service development teams to develop, automate and ensure the production readiness of all new services and features introduced.

Other responsibilities include:

  • Apply, integrate and automate the configuration and ongoing operations of AWS managed services.
  • Identify areas for operational process improvement and automation. Drive hands on development efforts to automate these processes within our environment.
  • Increase observability on our platform by implementing robust monitoring and alerting patterns across our services. Develop rich, informative dashboards / reports on our services that provide valuable insight, and develop meaningful alerting patterns to drive down the MTTD and MTTR on platform incidents.
  • Collaborate and partner with other SRE teams that specialize in areas such as platform hosting, Kubernetes, CICD, and data services to inspire changes and ensure optimal application performance and resiliency across all back-end services within PlayStation.
  • Iteratively lead performance and capacity validation analysis for our commerce platform services. Use AWS patterns and technologies such as spot instances, dynamic auto-scaling and EKS to efficiently make the most of our AWS spend.
  • Review service flows and architecture to influence resiliency, availability scalability and consistency for all services within our platform
  • Provide rotational on-call support where you’ll respond, detect, triage and resolve production incidents on the commerce and payments platform.
  • Conduct, document and present root cause analysis documents to share incident insights and findings with our broader engineering organization.

Qualifications:

  • A "BS degree or equivalent experience" in Computer Science, Engineering, or a related technical subject area is preferred.
  • 5+ years hands-on AWS experience – integrating, developing and managing applications
  • 5+ years of relevant work experience in a high-volume and/or critical production, software environment
  • 5+ years of hands on software engineering or supporting/maintaining software systems experience (Java and/or c++ services)
  • 3+ years of experience with building automation into daily operational processes through one or more programming languages
  • Experience with container technologies and orchestration (ie: Docker, Kubernetes, EKS)
  • Experience in configuring, tuning and automating operational responsibilities for AWS managed data services including RDS, DynamoDB and Elasticache
  • Experience with monitoring and log management tools (ie: DataDog, CloudWatch, Splunk)
  • Hands-on experience in triaging and tuning Java cloud applications with integration into AWS managed services
  • Solid understanding of AWS networking systems and protocols (ie: ALB, R53, API-Gateway, TCP/IP, HTTP/HTTPS, DNS)
  • Experience with developing or supporting Continuous Integration and Continuous Delivery/Deployment pipelines (CI/CD)

 

#LI-GM1

Equal Opportunity Statement:

Sony is an Equal Opportunity Employer. All persons will receive consideration for employment without regard to gender (including gender identity, gender expression and gender reassignment), race (including colour, nationality, ethnic or national origin), religion or belief, marital or civil partnership status, disability, age, sexual orientation, pregnancy, maternity or parental status, trade union membership or membership in any other legally protected category.

We strive to create an inclusive environment, empower employees and embrace diversity. We encourage everyone to respond. 

PlayStation is a Fair Chance employer and qualified applicants with arrest and conviction records will be considered for employment.

View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Want to take your career to the next level? Search open job vacancies at any of the Sony Interactive sites by visiting playstation.com/careers/


Sony Interactive Entertainment pushes the boundaries of entertainment and innovation, starting from the launch of the original PlayStation in Japan in 1994. Today, we continue to deliver innovative and thrilling experiences to a global audience through our PlayStation line of products and services that include generation-defining hardware, pioneering network services, and award-winning games. Headquartered in San Mateo, California, with global functions in California, London, and Tokyo, and game development studios around the world as part of PlayStation Studios, we believe that the power of play is borderless. Sony Interactive Entertainment is a wholly owned subsidiary of Sony Group Corporation.  


For more information about our company, please visit SonyInteractive.com. For more information about PlayStation products, please visit PlayStation.com.

United States (Remote)

United States (Remote)

Helsinki, Uusimaa, Finland (On-Site)

Guildford, England, United Kingdom (On-Site)

Los Angeles, California, United States (Hybrid)

Aliso Viejo, California, United States (On-Site)

London, England, United Kingdom (Hybrid)

Aliso Viejo, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by PlayStation Global

Similar Jobs

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Meta - Software Engineer, Android

Meta, United States (On-Site)

Luxoft - Java Team Lead

Luxoft, Canada (On-Site)

MatchGroup - Android Software Engineer

MatchGroup, South Korea (Hybrid)

The Walt Disney Company - Lead Software Engineer, Ad Platforms

The Walt Disney Company, United States (On-Site)

Sporty Group - LatAM Site Reliability Engineer

Sporty Group, (On-Site)

Aptiv - Android Audio - Technical Lead

Aptiv, India (On-Site)

Get notifed when new similar jobs are uploaded

Jobs in London, England, United Kingdom

Take-Two Interactive - Paralegal

Take-Two Interactive, United Kingdom (On-Site)

ION - Sales Executive

ION, United Kingdom (On-Site)

Salesforce - Account Executive - Nonprofit, Public Sector UK

Salesforce, United Kingdom (On-Site)

Rare - Release Manager

Rare, United Kingdom (Hybrid)

ElevenLabs - Backend Engineer

ElevenLabs, United Kingdom (Remote)

The Walt Disney Company - Business and Sales Internship

The Walt Disney Company, United Kingdom (On-Site)

Red Rover Interactive - Senior Concept Artist

Red Rover Interactive, United Kingdom (Hybrid)

Scopely - Junior Level Designer

Scopely, United Kingdom (Hybrid)

Granicus - Associate Product Manager

Granicus, United Kingdom (Remote)

Get notifed when new similar jobs are uploaded