Systems Development Engineer, Operations Support

1 Month ago • 5-9 Years • DevOps

Job Description

The Systems Development Engineer, Operations Support, on the Google Distributed Cloud Edge (GDCE) team provides crucial support for GDCE installations at customer sites. Responsibilities include troubleshooting installation challenges (cabling errors, network issues, hardware failures, etc.), managing workload migration issues, and designing automated remediation solutions. This role involves rotating on-call schedules, interfacing with hardware vendors, and collaborating with internal labs. The ideal candidate has extensive experience in data centers, networking operations, scripting (Shell, Python, or Go), and Linux system administration. The position emphasizes automation, observability, and improving the operational efficiency of Google's systems.
Must have:
  • 5+ years data center/hardware experience
  • 5+ years networking operations experience
  • 2+ years scripting (Shell, Python, Go)
  • 2+ years Linux system administration
  • Troubleshooting and automation skills
Good to have:
  • Experience with hardware vendors
  • CI/CD tool experience (Git, Piper, etc.)
  • Cloud experience (Google Cloud)
  • IT infrastructure and security knowledge

Minimum qualifications:

  • 5 years of experience in data centers and hardware operations or design.
  • 5 years of experience in Networking Operations.
  • 2 years of experience in Scripting/Practical Coding in Shell, Python, or Go.
  • 2 years of experience with Linux system administration.

Preferred qualifications:

  • Experience working with hardware vendors and supply chain.
  • Experience with CI/CD tools (e.g., Git, Piper).
  • Experience in Cloud (e.g., Google Cloud).
  • Knowledge of IT infrastructure and security standards, with the ability to troubleshoot issues.

About the job

Systems Development Engineering (SDE) at Google is a role where you manage services and systems at scale. SDEs creatively put their engineering discipline to use automating the mundane and reducing toil. We don’t just write code to fix bugs, but emphasize the development of tools and solutions that fix classes of problems. We know it’s hard to control what you can’t measure, so we focus on observability: instrumenting first, then turning data into knowledge, and finally knowledge into action. We know that the operational efficiency of Google systems, services, virtual compute environments, and the operating systems that power them impacts the environment, not just the bottom line. We know that working together we can do more, and that community matters.

Google brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

Together we engineer and build the infrastructure, tools, access and telemetry for systems that enable orchestration of Google-scale services. Come build things that matter.

Google Distributed Cloud Edge (GDCE) is a portfolio of managed hardware and software solutions that extends Google Cloud’s infrastructure to the edge and to customers’ data centers. The Operations Support team will be the primary point of contact for field deployers encountering challenges during GDCE installation at customer sites.

Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

Responsibilities

  • Provide support for installation challenges, including GDCE zone setup issues (e.g., cabling errors, network issues, hardware DOA), edgeOS bootup problems, networking complications, and cluster creation hurdles.
  • Participate in rotating on-call schedules, including weekends and holidays, for production operations.
  • Interface with external hardware vendors to ensure the requirements and specifications of GDCE servers are implemented, and with the internal labs to ensure the lab deployment reflects the product design.
  • Manage troubleshooting of customer workload migration issues, including script/playbook errors and store-specific configuration discrepancies.
  • Identify issues and design automated remediation solutions to reduce toil.

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.