Staff Site Reliability Engineer

2 Months ago • 7 Years + • Devops • $156,000 PA - $240,000 PA

Job Summary

Job Description

Attentive is seeking a Staff Site Reliability Engineer to join their Platform Infrastructure team. This role is responsible for designing and implementing solutions that enhance the reliability and scalability of their systems. The team owns compute, persistence, networking, observability, and deployments, handling billions of events daily. The engineer will lead strategic initiatives, collaborate with various engineering teams (AI/ML, Data, Platform, Product), establish standards, champion reliability goals, mentor team members, and drive continuous improvement. Key responsibilities include designing and delivering high-impact solutions for reliability, observability, and incident management, and ensuring platform scalability.
Must have:
  • 7+ years of experience in Production Engineering, SRE, or similar
  • Proficient in Golang, Python, Java, or Typescript
  • Experience delivering medium to large-scale projects
  • Deep understanding of reliability concepts (SLIs, SLOs, incident management)
  • Excellent verbal and written communication skills
Good to have:
  • Familiarity with dynamic, reliability-focused production environments
Perks:
  • Competitive perks and benefits
  • Health & wellness
  • Equity

Job Details

Attentive® is the AI-powered mobile marketing platform transforming the way brands personalize consumer engagement. Attentive enables marketers to craft tailored journeys for every subscriber, driving higher recurring revenue and maximizing campaign performance. Activating real-time data from multiple channels and advanced AI, the platform personalizes content, tone, and timing to deliver 1:1 messages that truly resonate.

With a top-rated customer success team recognized on G2, Attentive partners with marketers to provide strategic guidance and optimize SMS and email campaigns. Trusted by leading global brands like Neiman Marcus, Samsung, Wayfair, and Dyson, Attentive ensures enterprise-grade compliance and deliverability, supporting trillions of interactions across more than 70 industries. To learn more or request a demo, visit www.attentive.com or follow us on LinkedIn, X (formerly Twitter), or Instagram.

Attentive’s growth has been recognized by Deloitte’s Fast 500, Linkedin’s Top Startups and Forbes Cloud 100 all thanks to the hard work from our global employees!

About the Role
Our Platform Infrastructure team is the backbone of everything we do at Attentive, providing a resilient and cost-effective platform that seamlessly handles billions of events from over 100 million customers daily. We own everything from compute, persistence, and networking to observability and deployments. Joining our team offers a high-growth career opportunity to collaborate with some of the world’s most talented engineers in a high-performance, high-impact culture.

As part of the Infrastructure and Platform organization, the Production Engineering Team is focused on delivering a fast and reliable platform that empowers Attentive engineers to deliver solutions quickly and safely. We build scalable systems that automate routine tasks so we can focus on other impactful efforts. Reliability, scalability, and security are our areas of expertise. We focus on release, observability, and cost optimization. Our mission is to create robust platforms and tools that allow stakeholders to concentrate on delivering exceptional products.

As a Staff Engineer, you will take a strategic role in designing and implementing solutions that enhance the reliability and scalability of our systems, while mentoring others and influencing technical roadmaps across the organization.

What You'll Accomplish
  • Design and Deliver High-Impact Solutions: Design and implement systems that enhance reliability, observability, traceability, and incident management, ensuring the platform scales effectively
  • Lead Strategic Initiatives: Take ownership of cross-team collaborations and drive impactful projects by providing technical leadership and guidance
  • Partner Across Teams: Collaborate with engineers from AI/ML, Data, Platform, and Product teams to develop best-in-class services
  • Partner with engineers from AI/ML, Data, Platform, Product, and other groups to deliver best-in-class services
  • Establish Standards and Best Practices: Define and enforce production standards, processes, and tools to ensure operational excellence
  • Champion Reliability Goals: Advocate for and implement SLIs, SLOs, and other reliability-focused metrics across the engineering organization
  • Mentorship and Knowledge Sharing: Guide and mentor team members, fostering technical growth and helping to develop the next generation of engineering leaders
  • Innovate and Inspire: Drive continuous improvement by bringing creative ideas and challenging the status quo

Your Expertise
  • 7+ years of experience in Production Engineering, Backend Engineering, SRE, DevOps or similar role
  • Proficient Problem-Solver: Strong coding ability in at least one language (e.g., Golang, Python, Java, Typescript) with the capability to solve complex issues through code
  • Track Record of Success: Demonstrated experience delivering medium to large-scale projects that drive meaningful improvements in platform reliability and scalability
  • Reliability Expertise: Deep understanding of production reliability concepts, including SLIs, SLOs, and incident management
  • Strong Communicator: Excellent verbal and written communication skills with the ability to influence and collaborate across technical and non-technical teams
  • Fast-Paced Experience: Familiarity with working in dynamic, reliability-focused production environments (preferred)

What We Use
  • Our infrastructure runs primarily in Kubernetes hosted in AWS’s EKS
  • Infrastructure tooling includes Istio, Datadog, Terraform, CloudFlare, and Helm
  • Our backend is Java / Spring Boot microservices, built with Gradle, coupled with things like DynamoDB, Kinesis, AirFlow, Postgres, Planetscale, and Redis, hosted via AWS
  • Our frontend is built with React and TypeScript, and uses best practices like GraphQL, Storybook, Radix UI, Vite, esbuild, and Playwright
  • Our automation is driven by custom and open source machine learning models, lots of data and built with Python, Metaflow, HuggingFace 🤗, PyTorch, TensorFlow, and Pandas

You'll get competitive perks and benefits, from health & wellness to equity, to help you bring your best self to work.

For US based applicants:
- The US base salary range for this full-time position is $156,000 - $240,000 annually + equity + benefits
- Equity is a substantial part of the total compensation package
- Our salary ranges are determined by role, level and location

#LI-EF1

Attentive Company Values
Default to Action - Move swiftly and with purpose
Be One Unstoppable Team - Rally as each other’s champions
Champion the Customer - Our success is defined by our customers' success
Act Like an Owner - Take responsibility for Attentive’s success

Learn more about AWAKE, Attentive’s collective of employee resource groups.

If you do not meet all the requirements listed here, we still encourage you to apply! No job description is perfect, and we may also have another opportunity that closely matches your skills and experience.

At Attentive, we know that our Company's strength lies in the diversity of our employees. Attentive is an Equal Opportunity Employer and we welcome applicants from all backgrounds. Our policy is to provide equal employment opportunities for all employees, applicants and covered individuals regardless of protected characteristics. We prioritize and maintain a fair, inclusive and equitable workplace free from discrimination, harassment, and retaliation. Attentive is also committed to providing reasonable accommodations for candidates with disabilities. If you need any assistance or reasonable accommodations, please let your recruiter know. 

Similar Jobs

flix interactive - VFX Artist

flix interactive

Birmingham, England, United Kingdom (Remote)
3 Months ago
PwC - Associate - IFS - Secretary and Admin

PwC

Jakarta, Jakarta, Indonesia (On-Site)
10 Months ago
D-market - Customer Support Specialist

D-market

Ukraine (On-Site)
1 Month ago
PwC - Deals | Senior Associate Financial Due Diligence Barcelona

PwC

Barcelona, Catalonia, Spain (On-Site)
10 Months ago
Penumbrainc - Manufacturing Engineer I - Development

Penumbrainc

Alameda, California, United States (On-Site)
3 Months ago
Unity - Automation Infrastructure Engineer

Unity

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
4 Months ago
ARHS - DevOps - AWS Cloud Engineer

ARHS

Brussels, Brussels, Belgium (On-Site)
1 Month ago
dbt Labs - Solutions Architect, Enterprise

dbt Labs

New York, New York, United States (Hybrid)
1 Month ago
Canonical - Cloud Field Engineer

Canonical

(Remote)
3 Months ago
Tesla - Site Reliability Engineer, Energy Software

Tesla

North Holland, Netherlands (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Evolution  - Compliance Officer (Costa Rica)

Evolution

San José, San José Province, Costa Rica (On-Site)
1 Year ago
Visa - Software Engineer - Backend

Visa

Warsaw, Masovian Voivodeship, Poland (Hybrid)
9 Months ago
GoMotive - Senior Marketing Analytics Specialist

GoMotive

Islamabad, Islamabad Capital Territory, Pakistan (Hybrid)
1 Month ago
InMobiInMobi - Assistant Manager - Public Policy and Partnerships

InMobiInMobi

New Delhi, Delhi, India (On-Site)
3 Months ago
Atari - Senior Creative Designer, Digital Marketing Campaigns

Atari

United States (Remote)
1 Month ago
Reveal - Senior Software Development Engineer in Test

Reveal

Hyderabad, Telangana, India (On-Site)
2 Months ago
Riot Games - Project Coordinator, Wild Rift

Riot Games

Shanghai, China (On-Site)
2 Months ago
kaizen gaming  - Key Account Manager

kaizen gaming

Prague, Prague, Czechia (Hybrid)
2 Months ago
gismart - UA Manager (Paid Social)

gismart

(Remote)
3 Months ago
MiQ - Account Manager

MiQ

Manila, Metro Manila, Philippines (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in United States

Interface AI - Account Executive

Interface AI

San Francisco, California, United States (Remote)
2 Months ago
Alten Technology - Software Development Engineer

Alten Technology

Greensboro, North Carolina, United States (On-Site)
4 Weeks ago
zoox - Senior/Staff Technical Program Manager - Simulation

zoox

Foster City, California, United States (Hybrid)
5 Months ago
Apple - Pre-silicon Metal Framework Engineer

Apple

Cupertino, California, United States (On-Site)
3 Months ago
bounteous - Senior Product Manager

bounteous

United States (Remote)
4 Weeks ago
Loft Orbital - Director of Sales Engineering

Loft Orbital

Golden, Colorado, United States (Hybrid)
1 Month ago
bytedance - Talent Acquisition Partner - Data

bytedance

Seattle, Washington, United States (On-Site)
4 Months ago
C3 IoT - Solution Engineer

C3 IoT

Redwood City, California, United States (On-Site)
1 Month ago
Trek - Future Store Manager - Portland Area

Trek

Portland, Oregon, United States (On-Site)
6 Months ago
Anavation - Cyber Security Operations Analyst

Anavation

Bethesda, Maryland, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Open Systems Technologies - DevOps Engineer

Open Systems Technologies

Guadalajara, Jalisco, Mexico (Hybrid)
1 Month ago
Kwalee - DevSecOps Engineer

Kwalee

Royal Leamington Spa, England, United Kingdom (On-Site)
3 Months ago
smarsh - Cloud Engineer III-Observability

smarsh

India (Hybrid)
7 Months ago
Capgemini - SAP End to End Solution Architect

Capgemini

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Saviynt - Manager Cloud Security, Infosec

Saviynt

Bengaluru, Karnataka, India (Hybrid)
8 Months ago
Lead Venture - Salesforce Administrator - Service Cloud

Lead Venture

United States (Remote)
1 Month ago
Illumina - Intelligent Automation Engineer

Illumina

Bengaluru, Karnataka, India (On-Site)
1 Year ago
Deepgram - Senior Pre-Sales Solutions Engineer

Deepgram

California, United States (Remote)
2 Months ago
Capgemini - AZURE SOLUTION ARCHITECT

Capgemini

Mumbai, Maharashtra, India (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

About The Company

United States (Remote)

London, England, United Kingdom (Remote)

New York, United States (Remote)

United States (Remote)

United States (Remote)

London, England, United Kingdom (Hybrid)

San Francisco, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by attentive