Software Engineer, Site Reliability Engineering

3 Months ago • 2 Years + • Devops

Job Summary

Job Description

The Site Reliability Engineer (SRE) role combines software and systems engineering to build and maintain large-scale, fault-tolerant systems. The SRE will focus on ensuring services have the required reliability, uptime, and improvement rate. Responsibilities include engaging in the entire service lifecycle, supporting services before they go live, maintaining services by monitoring system health, scaling systems through automation, and participating in incident response. The SRE will have the opportunity to manage complex challenges and use their expertise in coding, algorithms, and system design.
Must have:
  • 2+ years of software development experience in one or more programming languages.
  • 2+ years of Linux system administration experience.
  • 1+ years of experience in distributed systems and leading projects.
  • Hands-on experience in planning and deploying services in production.
Good to have:
  • Experience in cloud solutions in virtualized environments.
  • Experience in deployment and orchestration technologies.
  • Experience in building and deploying automation systems.
  • Experience in big data systems related to data access.
  • Experience in operating online web services.
  • Experience in operating services on AWS and GCP.
  • Experience in database management (e.g., MongoDB, Cassandra, MySQL).
  • Security knowledge, such as firewall setup.
  • Working knowledge of virtualization and cloud infrastructures.

Job Details

About Appier 

Appier is a software-as-a-service (SaaS) company that uses artificial intelligence (AI) to power business decision-making. Founded in 2012 with a vision of democratizing AI, Appier’s mission is turning AI into ROI by making software intelligent. Appier now has 17 offices across APAC, Europe, and the U.S., and is listed on the Tokyo Stock Exchange (ticker number: 4180). Visit www.appier.com for more information.

 

About the role

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Appier's services, both internally critical and externally visible, have reliability and uptime appropriate to customers' needs and a fast rate of improvement. Additionally, SREs keep an ever-watchful eye on our systems' capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure, and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale that are unique to Appier while using your expertise in coding, algorithms, complexity analysis, and large-scale system design. The work spans source code management, continuous integration, artifact packaging, continuous deployment, service traffic management, service registration and discovery, holistic observability, and the underlying compute runtime and container orchestration: a collection of platforms and capabilities that accelerate development velocity while protecting Appier’s production availability. We are hiring at all levels of seniority in this space. This is a local hire position.

 

Responsibilities 

  • Engage in and improve the whole lifecycle of services—from inception and design, through to deployment, operation and refinement.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Practice sustainable incident response and blameless postmortems.
  • Participate in the on-call rotation (remote on-call).

About you

[Minimum qualifications]

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 2+ years of experience with software development in one or more programming languages.
  • 2+ years of experience with Linux system administration.
  • 1+ years of experience in designing, analyzing, and troubleshooting large-scale distributed systems, and 1+ years of experience leading projects and providing technical leadership.
  • Hands-on experience in planning and deploying services in production.
  • Proficiency in Chinese.

[Preferred qualifications]

  • Experience in architecting, developing, or maintaining production-grade cloud solutions in virtualized environments.
  • Experience in deployment and orchestration technologies (such as Docker, Puppet, Kubernetes, Chef, Salt, Ansible).
  • Experience in building and deploying automation and continuous integration systems.
  • Experience in operating big data systems related to data access, collection, processing, and storage.
  • Experience in operating and deploying online web services.
  • Experience in operating services on IaaS such as AWS and GCP.
  • Experience in database management (e.g., database system setup, backup and restore, system tuning); MongoDB, Cassandra, MySQL, and PostgreSQL experience is a plus.
  • Security knowledge, such as firewall setup, security policy design, and network attack defense.
  • Working knowledge of virtualization, hosted services, multi-tenant cloud infrastructures, storage systems, and content delivery networks.

 




