Senior Infrastructure/Application Support Engineer

1 Month ago • 5 Years + • DevOps

Job Summary

Job Description

As a Senior Infrastructure/Application Support Engineer at TMS, you'll be part of a dedicated EU-based team supporting a global promotions platform for a major fast-food chain. Responsibilities include providing advanced application support, collaborating with developers to troubleshoot issues, understanding the platform's capabilities for internal/external team interactions, monitoring system performance, leading incident response, creating detailed reports, maintaining support documentation, assisting with deployments, and mentoring junior engineers. You'll also configure and maintain cloud environments using automated scripts and participate in on-call support. The initial 6 months require working within the US Eastern/Central time zone for onboarding.
Must have:
  • 5+ years application support experience
  • API testing/scripting
  • Basic SQL
  • AWS services and security
  • Centralized logging systems
  • Kubernetes
  • Problem-solving skills
  • Communication skills
Good to have:
  • System profiling
  • Athena (S3 queries)
  • Grafana/Prometheus
  • JVM memory management
  • NoSQL databases
  • ElastiCache, DynamoDB, API Gateway, Lambda, CloudWatch, Cognito
  • Load testing
Perks:
  • Health insurance
  • Psychotherapy assistance
  • Gym allowance
  • Educational budget
  • Paid vacations
  • Paid sick leave
  • Paid public holidays

Job Details

GT was founded in 2019 by a former Apple, Nest, and Google executive. GT’s mission is to connect the world’s best talent with product careers offered by high-growth companies in the UK, USA, Canada, Germany, and the Netherlands.

On behalf of TMS, GT is looking for a Senior Infrastructure/Application Support Engineer interested in joining a team of dedicated engineers.

About the Client

The Marketing Store (TMS) is a privately held global agency that innovates, optimizes, and drives marketing promotions and supply chains for many of the best-known brands in the world. With 1200+ employees across 26 countries, they offer an impressive range of solutions, from inspiration and innovation to category management and delivery.

Headquartered in Chicago with 10 offices worldwide, they are responsible for some of the world’s most successful and iconic long-term marketing platforms, including McDonald’s Happy Meal and MONOPOLY programs. Operating as a creative agency, a strategic consultancy, and a technology provider, they engage with over 110 million customers every single day for clients including McDonald’s, adidas, T-Mobile, Starbucks, Vue, and O2.

Why we think you will love this role

You will be joining a team of dedicated engineers who value their community of knowledge sharing, communication, and growth. We encourage open dialogue between our people and you’ll have the opportunity to interface with your peers, leadership, and product teams daily. In this role, which is both technical and collaborative, you will make significant contributions to the success of our internationally popular promotional games – bringing smiles to faces around the world every day.

Project Details

You will be a part of the EU-based team of application support & DevOps engineers who will support & help ensure the smooth delivery of our API-driven global promotions platform for one of the largest fast-food restaurant chains in the world 😉

  • Essentially, the team is responsible for configuring, deploying & supporting the backend of the games and will be expected to support multiple games at the same time.

Team: Senior Infrastructure/Application Support Engineer, 2x Middle Application Support Engineer, DevOps Engineer, Project Manager.

Working Schedule:

  • Due to the complexity of TMS software, it is essential to work in the US Eastern/Central time zone for the first 6 months to ensure an effective onboarding process.

  • After the first 6 months, the overlap with the US Eastern/Central time zone can be reduced to 3-4 hours.

What you will bring

The experience and energy to roll up your sleeves and commit your excellence to a product, your team, and our customers. You will gain a deep understanding of our API-driven global promotions gaming platform so that you and the Applications team are confident in ensuring the best quality, accuracy, and availability of the high-visibility games the system supports. This role also interfaces on technical subjects with internal stakeholders, external vendors, and clients for troubleshooting and general support. 

Responsibilities:

  • Provide advanced application support for our promotion platform, addressing and resolving issues promptly to ensure optimal system performance. 

  • Collaborate with the development team to diagnose and troubleshoot application-related issues, identifying root causes and implementing effective solutions.

  • Gain a strong understanding of our platform capabilities so you may confidently interface with internal, vendor, and client technical teams for smooth integrations and troubleshooting.

  • Monitor system performance and conduct regular health checks to proactively identify potential issues and areas for improvement.

  • Lead incident response to ensure swift resolution of anomalies detected in internal platforms, downstream client environments, as well as partner systems.

  • Keep key stakeholders continuously up to date on incident response.

  • Author official incident response reports providing an executive summary, a timeline of key moments, root cause analysis, and recommendations for system and process improvements to prevent a recurrence. Provide a walk-through of the report for the larger team and incorporate their feedback.

  • Develop and maintain support documentation, including troubleshooting guides, FAQs, and system configuration details.

  • Assist in the deployment of new releases and updates, ensuring smooth transitions and minimal disruption to services.

  • Contribute to the continuous improvement of support processes and tools, leveraging feedback and data to enhance efficiency and effectiveness.

  • Mentor and train junior support engineers, sharing knowledge and best practices to build a strong, capable support team.

  • Configure and maintain cloud environments for application runtimes, using automated scripts to streamline deployment, scaling, and management processes and to ensure optimal performance and reliability.

  • Participate in on-call support rotations to provide 24/7 assistance as needed.

Essential knowledge, skills & experience:

  • 5+ years of professional experience as an Application Support Specialist, Software Engineer, DevOps Engineer, or equivalent.

  • Basic understanding of reading metrics and HTTP status codes.

  • Skill with system scripting languages such as Python, Ruby, or Bash/Zsh.

  • Strong understanding of API testing/scripting, with knowledge of how to test using tools such as Postman or equivalent.

  • Proficient in basic SQL, with the ability to execute standard SQL commands from the command line to satisfy ad-hoc support requests.

  • Experience kicking off tests with a testing tool (Gatling or similar, such as BlazeMeter or JMeter) and interpreting the resulting metrics, anomalies, and results.

  • Strong understanding of AWS services and security best practices.

  • Experienced with centralized logging systems such as Graylog or Loki, with strong skills in writing sophisticated queries to extract, analyze, and monitor log data for operational insights and troubleshooting.

  • Solid understanding of security and encryption practices.

  • Knowledge and understanding of Kubernetes concepts.

  • Experience with proxy, debugging, and profiling tools.

  • Proficiency in reading and writing XML/XSD.

  • Comfortable with version control systems such as Git.

  • Excellent problem-solving and analytical skills, with the ability to diagnose and resolve complex technical issues.

  • Strong communication and interpersonal skills, with the ability to work effectively with technical and non-technical stakeholders.

  • Proactive, detail-oriented, and able to work independently and as part of a team.

Desired:

  • Experience with system profiling and performance tuning.

  • Ability to use Athena to perform ad-hoc support queries against data in S3 buckets.

  • Proficient in monitoring and diagnostic tools such as Grafana and Prometheus, with strong capabilities in setting up dashboards, interpreting real-time data, and implementing alerting systems for proactive incident management.

  • Understanding of JVM memory management and garbage collection tuning.

  • Experience with NoSQL databases.

  • Familiarity with ElastiCache, DynamoDB, API Gateway, Lambda, CloudWatch, and Cognito.

  • Experience with load testing high-performance distributed systems.

Interview Steps

  1. GT interview with Recruiter

  2. Introductory interview with the TMS team

  3. Technical interview

  4. Final interview

  5. Reference Check

We go beyond usual perks… By working with us, you will get:

  • Health insurance

  • Psychotherapy assistance allowance

  • Gym allowance

  • Individual educational budget

  • Paid vacations

  • Paid sick leave

  • All public holidays are paid days off

GT working model:

You will work directly with a client through our Extended Team model. We try to do things differently and integrate you as deeply as possible into the client’s team. You work with the same tools and technologies as they do and are managed directly by the client without any intermediary. We help you build relationships and create an environment where you genuinely feel like a member of the client’s team. We also encourage you to visit the client and join team-building and after-work activities. Our Extended Team model is focused on long-term projects that last several years.
