Senior Production Support Engineer

2 Months ago • 3-3 Years

Job Summary

Job Description

As a Production Support Engineer, you will support, monitor, and maintain the high availability of Tala's platform across all markets. This role involves close collaboration with engineering, CX, product, and program teams globally for production incident response and post-mortem processes. You'll also work closely with CX and Collections teams to identify product improvements based on customer feedback and communication. Responsibilities include coordinating incident responses, debugging issues, improving monitoring systems, identifying patterns in customer issues, sharing knowledge with global teams, and tracking production performance metrics. You'll be responsible for the risk event process for the Philippines and East Asia timezone. Excellent debugging and documentation skills are a must.
Must have:
  • 3+ years in a technology environment with microservices experience
  • 3+ years in incident response or similar role
  • Experience working with a global, remote team
  • Knowledge of monitoring platforms (CloudWatch, SumoLogic, APM)
  • Knowledge of relational databases and BI querying languages
  • Experience with tools like Postman or scripting API queries
  • Excellent debugging and documentation skills
  • Ability to coordinate incident response and communicate effectively
  • Ability to remain calm under pressure

Job Details

About Tala


Tala is on a mission to unleash the economic power of the Global Majority – the 4 billion people overlooked by existing financial systems. With nearly half a billion dollars raised from equity and debt, we are serving millions of customers across three continents. Tala has been named by the Fortune Impact 20 list, CNBC’s Disruptor 50 five years in a row, CNBC’s World's Top Fintech Company, Forbes’ Fintech 50 list for eight years running, and Chief's The New Era of Leadership Award. We are expanding across product offerings, countries and crypto and are looking for people who have an entrepreneurial spirit and are passionate about our mission.


By creating a unique platform that enables lending and other financial services around the globe, people in emerging markets are able to start and expand small businesses, manage day-to-day needs, and pursue their financial goals with confidence. Currently, over nine million people across Kenya, the Philippines, Mexico, and India have used Tala products. Due to our global team, we have a remote-first approach, and also have offices in Santa Monica, CA (HQ); Nairobi, Kenya; Mexico City, Mexico; Manila, the Philippines; and Bangalore, India.


Most Talazens join us because they connect with our mission. If you are energized by the impact you can make at Tala, we’d love to hear from you!


About the Role


As a Production Support Engineer you will support, monitor and maintain the high availability of our platform across all of our markets. You’ll work closely with engineering, CX, product and program teams globally for production incident response and post-mortem processes. You’ll work closely with the CX and Collections teams to discover areas of improvement for our product based on their feedback and the customer communication. You’ll continuously review and improve our existing monitoring and alerting systems.

What You'll Do

    • Ownership of risk event process for the PH & East Asia timezone: you’ll help coordinate teams responding to an incident, communicate effectively, oversee post-mortem and monitor that the follow-up action items are completed
    • Ownership of escalations from the in-country CXCL guild: debugging and identifying problems, resolving when possible and escalating to appropriate teams when necessary.
    • Tracking and reporting on pending issues, and regular updates on open items
    • Continuous improvement of our monitoring dashboards and alerts
    • In collaboration with the CX team, identify patterns in customer and product issues and propose improvements
    • In collaboration with the Production Support Engineers globally, share product learning, knowledge and exchange ideas
    • Identify and communicate repeating themes around risk events and propose improvements to prevent recurrence of the same issues
    • Keep track of metrics related to production performance and identify areas of improvement
    • Continuous improvements of our documentation library to allow faster onboarding of new team members and more efficient response times 

What You'll Need

    • 3+ years of experience working in technology environment with experience in microservices architecture
    • 3+ years of experience in incident response or similar role
    • Experience working with a remote team in a global environment
    • Knowledge of various monitoring platforms such as AWS CloudWatch, SumoLogic, APM monitoring (NewRelic, Instana), mobile (Crashlytics data), BI (Looker, Snowflake)
    • Knowledge of relational databases, BI querying languages  to be able to construct queries during investigations
    • Experience working with tools like Postman, or scripting API queries 
    • Excellent debugging and documentation skills
    • Ability to coordinate incident response and communicate effectively with stakeholders from variety of teams across different timezones
    • Ability to remain calm under pressure during a production incident resolution
    • Candidates with SRE background are welcome 
Our vision is to build a new financial ecosystem where everyone can participate on equal footing and access the tools they need to be financially healthy. We strongly believe that inclusion fosters innovation and we’re proud to have a diverse global team that represents a multitude of backgrounds, cultures, and experience. We hire talented people regardless of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status.

Similar Jobs

Riot Games - Senior User Researcher

Riot Games

Shanghai, Shanghai, China (On-Site)
10 Months ago
bytedance - Network Engineer, Optical Long-Haul and Submarine

bytedance

Seattle, Washington, United States (On-Site)
3 Months ago
Reddit - Senior Client Partner, Mid-Market (App Dev)

Reddit

New York, United States (On-Site)
2 Weeks ago
outer dawn - Senior Game Designer

outer dawn

Auckland, Auckland, New Zealand (On-Site)
1 Week ago
Playtouch.net - Game Developer Fullstack

Playtouch.net

Grand Baie, Rivière Du Rempart District, Mauritius (On-Site)
1 Year ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

PlayStation Global - Director, People Technology Services

PlayStation Global

Aliso Viejo, California, United States (On-Site)
1 Month ago
Haleon - Enterprise Architect - Finance Strategy & Technology

Haleon

London, England, United Kingdom (On-Site)
1 Week ago
Token Metrics - Crypto Senior Backend Engineer (Remote)

Token Metrics

Medellín, Antioquia, Colombia (Remote)
7 Months ago
Accenture - Solution Enablement Senior Manager

Accenture

Bengaluru, Karnataka, India (On-Site)
1 Month ago
lucas films - Groom Artist (All Levels)

lucas films

London, England, United Kingdom (Hybrid)
6 Days ago
Ziff Davis - Marketing Manager

Ziff Davis

United States (Remote)
2 Weeks ago
The Walt Disney Company - Entertainment Technician - Full Time

The Walt Disney Company

Hong Kong (On-Site)
6 Months ago
Take-Two Interactive - Senior Producer

Take-Two Interactive

Massachusetts, United States (Remote)
1 Month ago
bytedance - Software Engineer, Multi Cloud CDN - San Jose / Seattle / Boston

bytedance

Seattle, Washington, United States (On-Site)
5 Months ago
Google - Analytical Lead, Large Customer Sales

Google

New York, New York, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Manila, Metro Manila, Philippines

Workato - Associate Customer Success Manager, Scale

Workato

Muntinlupa, Metro Manila, Philippines (On-Site)
2 Weeks ago
gitlab - Senior GL Accountant (APAC)

gitlab

Philippines (Remote)
2 Weeks ago
London stock Exchange - Analyst, Operational Support – DSM

London stock Exchange

Taguig, Metro Manila, Philippines (On-Site)
4 Days ago
PwC - AML/KYC Associate

PwC

Makati, Metro Manila, Philippines (On-Site)
8 Months ago
WebFX - Technical Sourcing Specialist/Recruiter

WebFX

Philippines (Remote)
3 Months ago
Simcorp - Lead DevOps Engineer

Simcorp

Manila, Metro Manila, Philippines (Hybrid)
3 Weeks ago
Sovrun - Senior Rust Engineer

Sovrun

Makati, Metro Manila, Philippines (Remote)
2 Months ago
PwC - SAP Basis Manager

PwC

Makati, Metro Manila, Philippines (On-Site)
8 Months ago
Nagarro - Staff Engineer, PHP Drupal

Nagarro

Philippines (Remote)
7 Months ago

Get notifed when new similar jobs are uploaded

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Mexico City, Mexico (Remote)

Mexico City, Mexico (Hybrid)

Mexico City, Mexico (Remote)

Manila, Metro Manila, Philippines (Hybrid)

Mexico City, Mexico (Hybrid)

Mexico City, Mexico (Hybrid)

Manila, Metro Manila, Philippines (Hybrid)

Mexico City, Mexico (Hybrid)

United States (Remote)

Mexico City, Mexico (Hybrid)

View All Jobs

Get notified when new jobs are added by TALA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug