Software Engineer, Site Reliability Engineering

5 Months ago • 2 Years + • Full Stack Development

Job Summary

Job Description

Appier's Site Reliability Engineering (SRE) team combines software and systems engineering to build and run large-scale, distributed systems. SRE ensures high reliability and uptime for both internal and external systems. Responsibilities include managing the lifecycle of services (design, deployment, operation, refinement), supporting services before launch, maintaining live services, scaling systems sustainably through automation, practicing incident response and postmortems, and participating in on-call rotation. The ideal candidate will have experience with software development, Linux system administration, and large-scale distributed systems. Experience with cloud solutions, automation, and big data systems is preferred.
Must have:
  • Software development experience (2+ years)
  • Linux system administration (2+ years)
  • Large-scale distributed systems experience (1+ years)
  • Project leadership and technical leadership (1+ years)
  • Production service planning and deployment
Good to have:
  • Cloud solutions (AWS, GCP)
  • Docker, Puppet, Chef, Salt, Ansible
  • CI/CD systems
  • Big data systems experience
  • Web services operation
  • Database management (MongoDB, Cassandra, MySQL, PostgreSQL)
  • Security knowledge (Firewall, security policy)

Job Details

About Appier 

Appier is a software-as-a-service (SaaS) company that uses artificial intelligence (AI) to power business decision-making. Founded in 2012 with a vision of democratizing AI, Appier’s mission is turning AI into ROI by making software intelligent. Appier now has 17 offices across APAC, Europe and U.S., and is listed on the Tokyo Stock Exchange (Ticker number: 4180). Visit www.appier.com for more information.

 

About the role

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Appier's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance.

 

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Appier, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. This includes source code management, continuous integration, artifact packaging, continuous deployment, service traffic management, service registration and discovery, as well as holistic observability and the underlying compute runtime and container orchestration. A collection of platforms and capabilities which accelerate development velocity while protecting Appier’s production availability. We are looking for all levels of seniority in the space. This is a local hire position. 

 

Responsibilities 

  • Engage in and improve the whole lifecycle of services—from inception and design, through to deployment, operation and refinement.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Practice sustainable incident response and blameless postmortems.
  • Participate in on-call rotation.(remote on-call)

 

About you

[Minimum qualifications]

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 2+ years of experience with software development in one or more programming languages.
  • 2+ years of experience with Linux system administration.
  • 1+ years of experience in designing, analyzing, and troubleshooting large-scale distributed systems, and 1+ years of experience leading projects and providing technical leadership.
  • Hands-on experience in planning and deploying services on production.

 

[Preferred qualifications]

  • Experience in architecting, developing, or maintaining production-grade cloud solutions in virtualized environments
  • Experience in deployment and orchestration technologies (such as Docker, Puppet, Chef, Salt, Ansible)
  • Experience in building and deploying automation and continuous integration systems
  • Experience in operating a big data systems related to data access, collection, processing and storage
  • Experience in operating and deploying online web services
  • Experience in operating services on IaaS such as AWS and GCP.
  • Experience in Database management (e.g.Database System Setup, Backup & Restore, System Tuning), MongoDB, Cassandra, MySQL, and PostgreSQL will be plus.
  • Security Knowledge such as setting up Firewall, proper security policy design, network attack defense.
  • Working knowledge of virtualization, hosted services, multi-tenant cloud infrastructures, storage systems and content delivery networks.

Similar Jobs

Google - Senior Software Engineer, Visual Language and Multimodal Modeling

Google

Sydney, New South Wales, Australia (On-Site)
1 Week ago
OKX - Graduate Hire 2024/25 - Software Engineer

OKX

Hong Kong (On-Site)
6 Months ago
ByteDance - Researcher - Large Language Models, Applied Machine Learning

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
Microsoft - Member of Technical Staff, AI - Reinforcement Learning Systems

Microsoft

Mountain View, California, United States (Hybrid)
1 Week ago
ByteDance - Research Scientist, Foundation Model, Speech Understanding

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Google - Software Engineer III, Server Frameworks

Google

Mexico City, Mexico City, Mexico (On-Site)
1 Week ago
Google - Staff Software Engineer, Google Cloud Dataproc, Open Source

Google

Sunnyvale, California, United States (On-Site)
1 Week ago
Google - Software Engineer III, Google Cloud

Google

Dublin, County Dublin, Ireland (On-Site)
1 Week ago
Google - Software Engineer, Mobile, iOS, Photos

Google

Sydney, New South Wales, Australia (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Rackspace Technology - Principal Java Engineer (GCP)

Rackspace Technology

United States (Remote)
1 Month ago
Google - Software Engineer II, Site Reliability Engineering, Cloud Logs

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Week ago
Google - Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
6 Days ago
Google - Software Engineer III, AI/ML, Recommendations, Rankings, Predictions

Google

Mountain View, California, United States (On-Site)
4 Days ago
ByteDance - Student Researcher (Doubao (Seed) - Machine Learning System) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Techland - Junior Rendering Programmer

Techland

Wrocław, Lower Silesian Voivodeship, Poland (On-Site)
2 Weeks ago
Google - Software Engineer III, Google Cloud Platforms

Google

Kirkland, Washington, United States (On-Site)
5 Months ago
Genies - Lead Applied ML Engineer, Real-time 3D Asset Optimization

Genies

San Mateo, California, United States (On-Site)
4 Weeks ago
Google - Software Engineer II, Health Platform Nova

Google

Bucharest, Bucharest, Romania (On-Site)
1 Week ago
Inkittt - Fullstack Martech Engineer

Inkittt

San Francisco, California, United States (Hybrid)
2 Days ago

Get notifed when new similar jobs are uploaded

Jobs in Taipei City, Taiwan

Google - Software Engineer III, Mainline Engineering Productivity

Google

New Taipei, New Taipei City, Taiwan (On-Site)
4 Days ago
Google - Senior System Engineer, Product Software

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Week ago
Google - Software Engineering Manager II, Pixel Performance

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Week ago
Google - Senior Software Engineer, Generative AI and LLMs

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Week ago
Trend Micro - (Sr.) Data Engineer/AI Trainer

Trend Micro

Taipei City, Taiwan (On-Site)
6 Months ago
Google - Electrical Engineering Manager, Google Cloud

Google

Taipei City, Taiwan (On-Site)
4 Days ago
Google - Staff Software Engineer, Large Language Model and GenAI

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Week ago
Google - Machine Learning Engineer, Pixel AI

Google

New Taipei, New Taipei City, Taiwan (On-Site)
6 Days ago
Google - Manufacturing Test Engineering, Rack Integration

Google

Taipei City, Taiwan (On-Site)
6 Days ago
Google - Firmware Engineer, AS Layer 3, Modem Reliability Engineering

Google

New Taipei City, Taiwan (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Full Stack Development Jobs

N-iX - Middle .NET Engineer

N-iX

Poland (Remote)
1 Month ago
The Walt Disney Company - Sr Software Engineer

The Walt Disney Company

Seattle, Washington, United States (On-Site)
1 Month ago
Luxoft - Senior Angular JS Developer

Luxoft

New York, New York, United States (On-Site)
4 Months ago
Technorizen Software Solutions - React Native | Node Js Developer

Technorizen Software Solutions

Indore, Madhya Pradesh, India (On-Site)
9 Months ago
CloudLinux - Java Developer

CloudLinux

Tbilisi, Tbilisi, Georgia (Remote)
2 Weeks ago
PwC - IN-Manager _Technical Delivery Manager_ Emerging Technologies_ Advisory_ Bengaluru

PwC

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Warner Bros Games - Senior Software Engineer - Roku (Adtech Team)

Warner Bros Games

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Lakshya Digital - Dot Net Developer

Lakshya Digital

Haryana, India (On-Site)
1 Month ago
Next Level Business Services - Apigee API Developer

Next Level Business Services

San Francisco, California, United States (On-Site)
5 Months ago
Nagarro - Principal Engineer, Java Fullstack

Nagarro

India (Remote)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

San Francisco, California, United States (Remote)

Taipei City, Taiwan (On-Site)

Taipei City, Taiwan (On-Site)

Seoul, South Korea (On-Site)

Taipei City, Taiwan (On-Site)

San Francisco, California, United States (On-Site)

Taipei City, Taiwan (On-Site)

View All Jobs

Get notified when new jobs are added by Appier

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug