Software Engineer, Site Reliability Engineering

1+ years • DevOps

Job Summary

Job Description

Appier is an AI-powered SaaS company focused on business decision-making. The Site Reliability Engineering (SRE) role involves combining software and systems engineering to build and run large-scale, distributed, fault-tolerant systems. SREs ensure reliability, uptime, and continuous improvement of Appier's internal and external services, while also monitoring capacity and performance. The role focuses on optimizing existing systems, building infrastructure, and automating tasks, managing challenges of scale using expertise in coding, algorithms, complexity analysis, and large-scale system design.

Job Details

About Appier

Appier is a software-as-a-service (SaaS) company that uses artificial intelligence (AI) to power business decision-making. Founded in 2012 with a vision of democratizing AI, Appier’s mission is turning AI into ROI by making software intelligent. Appier now has 17 offices across APAC, Europe, and the U.S., and is listed on the Tokyo Stock Exchange (ticker number: 4180). Visit www.appier.com for more information.

About the role

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Appier's services, both our internally critical and our externally visible systems, have reliability, uptime appropriate to customers' needs, and a fast rate of improvement. Additionally, SREs keep an ever-watchful eye on our systems' capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure, and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale that are unique to Appier, while using your expertise in coding, algorithms, complexity analysis, and large-scale system design. This includes source code management, continuous integration, artifact packaging, continuous deployment, service traffic management, service registration and discovery, as well as holistic observability and the underlying compute runtime and container orchestration. Together, these form a collection of platforms and capabilities that accelerate development velocity while protecting Appier’s production availability. We are hiring at all levels of seniority in this space. This is a local hire position.

Responsibilities

  • Engage in and improve the whole lifecycle of services—from inception and design, through to deployment, operation and refinement.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Practice sustainable incident response and blameless postmortems.
  • Participate in the on-call rotation (remote on-call).

About you

Minimum qualifications

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 2+ years of experience with software development in one or more programming languages.
  • 2+ years of experience with Linux system administration.
  • 1+ years of experience in designing, analyzing, and troubleshooting large-scale distributed systems, and 1+ years of experience leading projects and providing technical leadership.
  • Hands-on experience planning and deploying services in production.

Preferred qualifications

  • Experience architecting, developing, or maintaining production-grade cloud solutions in virtualized environments.
  • Experience with deployment and orchestration technologies (such as Docker, Puppet, Chef, Salt, Ansible).
  • Experience building and deploying automation and continuous integration systems.
  • Experience operating big data systems for data access, collection, processing, and storage.
  • Experience operating and deploying online web services.
  • Experience operating services on IaaS platforms such as AWS and GCP.
  • Experience in database management (e.g., database system setup, backup and restore, system tuning); experience with MongoDB, Cassandra, MySQL, and PostgreSQL is a plus.
  • Security knowledge, such as setting up firewalls, designing proper security policies, and defending against network attacks.
  • Working knowledge of virtualization, hosted services, multi-tenant cloud infrastructures, storage systems, and content delivery networks.
