Senior Infrastructure/Application Support Engineer | TMS, US

2 Days ago • 5 Years + • DevOps

About the job

Job Description

The Senior Infrastructure/Application Support Engineer will provide advanced application support for TMS's API-driven global promotions platform, ensuring optimal performance. Responsibilities include troubleshooting application issues, collaborating with the development team, monitoring system performance, leading incident response, developing support documentation, assisting with deployments, and mentoring junior engineers. The role requires strong understanding of AWS services, API testing, SQL, scripting languages (Python, Ruby, or Bash/Zsh), Kubernetes, and centralized logging systems. The engineer will also be responsible for configuring and maintaining cloud environments and participating in on-call support rotations. This position involves working closely with internal stakeholders, external vendors, and clients for troubleshooting and general support, ensuring the smooth delivery of the platform for a major fast-food client. The initial six months require working in the Eastern/Central US time zone, with potential for flexibility afterward.
Must have:
  • 5+ years application support experience
  • API testing/scripting & SQL proficiency
  • AWS & Kubernetes knowledge
  • Scripting (Python, Ruby, Bash)
  • Incident response & problem-solving
  • Strong communication skills
Good to have:
  • System profiling & performance tuning
  • Athena & S3 experience
  • Grafana & Prometheus proficiency
  • JVM memory management
  • NoSQL database experience
  • Load testing experience
Perks:
  • Health insurance
  • Psychotherapy assistance
  • Gym allowance
  • Educational budget
  • Paid vacations
  • Paid sick leaves
  • Paid public holidays
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.

GT was founded in 2019 by a former Apple, Nest, and Google executive. GT’s mission is to connect the world’s best talent with product careers offered by high-growth companies in the UK, USA, Canada, Germany, and the Netherlands.

On behalf of TMS, GT is looking for a Senior Infrastructure/Application Support Engineer, interested in joining a team of dedicated engineers.

About the Client

The Marketing Store (TMS) is a privately held global Agency that innovates, optimizes, and drives marketing promotions and supply chains of many of the best-known brands in the world. With 1200+ employees across 26 countries, they offer an impressive range of solutions — from inspiration and innovation to category management and delivery.

Headquartered in Chicago with 10 offices worldwide, they are responsible for some of the world’s most successful and iconic long-term marketing platforms, including McDonald’s Happy Meal and MONOPOLY programs. Operating as a creative agency, a strategic consultancy, and a technology provider, they engage with over 110 million customers every single day for clients including McDonald’s, adidas, T-Mobile, Starbucks, Vue, and O2.

Why we think you will love this role

You will be joining a team of dedicated engineers who value their community of knowledge sharing, communication, and growth. We encourage open dialogue between our people and you’ll have the opportunity to interface with your peers, leadership, and product teams daily. In this role, which is both technical and collaborative, you will make significant contributions to the success of our internationally popular promotional games – bringing smiles to faces around the world every day.

Project Details

You will be a part of the EU-based team of application support & DevOps engineers who will support & help ensure the smooth delivery of our API-driven global promotions platform for one of the largest fast-food restaurant chains in the world 😉

  • Essentially, the team is responsible for configuring, deploying & supporting the backend of the games and will be expected to support multiple games at the same time.

Team: Senior Infrastructure/Application Support Engineer, 2x Middle Application Support Engineer, DevOps Engineer, Project Manager.

Working Schedule:

  • Due to the complexity of TMS software, it is essential to work via the eastern/central US time zone for the first 6 months to ensure an effective onboarding process.

  • After a 6-month period it's possible to reduce the overlap to 3-4hrs with the eastern/central US time zone.

What you will bring

The experience and energy to roll up your sleeves and commit your excellence to a product, your team, and our customers. You will gain a deep understanding of our API-driven global promotions gaming platform so that you and the Applications team are confident in ensuring the best quality, accuracy, and availability of the high-visibility games the system supports. This role also interfaces on technical subjects with internal stakeholders, external vendors, and clients for troubleshooting and general support. 

Responsibilities:

  • Provide advanced application support for our promotion platform, addressing and resolving issues promptly to ensure optimal system performance. 

  • Collaborate with the development team to diagnose and troubleshoot application-related issues, identifying root causes and implementing effective solutions.

  • Gain a strong understanding of our platform capabilities so you may confidently interface with internal, vendor, and client technical teams for smooth integrations and troubleshooting.

  • Monitor system performance and conduct regular health checks to proactively identify potential issues and areas for improvement.

  • Lead incident response to ensure swift resolution of anomalies detected in internal platforms, downstream client environments, as well as partner systems.

  • Ensure continuous up-to-date communication on incident response to key stakeholders

  • Author official incident response reports providing an executive summary, a timeline of key moments, root cause analysis, and recommendations for system and process improvements to prevent a recurrence. Provide a walk-through of the report for the larger team and incorporate their feedback.

  • Develop and maintain support documentation, including troubleshooting guides, FAQs, and system configuration details.

  • Assist in the deployment of new releases and updates, ensuring smooth transitions and minimal disruption to services.

  • Contribute to the continuous improvement of support processes and tools, leveraging feedback and data to enhance efficiency and effectiveness.

  • Mentor and train junior support engineers, sharing knowledge and best practices to build a strong, capable support team.

  • Responsible for configuring and maintaining cloud environments for application runtimes utilizing automated scripts to streamline deployment, scaling, and management processes, ensuring optimal performance and reliability.

  • Participate in on-call support rotations to provide 24/7 assistance as needed.

Essential knowledge, skills & experience:

  • 5+ years of professional experience as an Application Support Specialist, Software Engineer, DevOps Engineer, or equivalent.

  • Basic understanding of reading metrics and HTTP status codes.

  • Skill with system scripting languages such as Python, Ruby, or Bash/Zsh.

  • Strong understanding of API testing/scripting, with knowledge of how to test using tools such as Postman or equivalent.

  • Proficient in basic SQL, with the ability to execute standard SQL commands from the command line to satisfy ad-hoc support requests.

  • Kicking off tests using a testing tool (Gatling or similar such as BlazeMeter, Jmeter, etc), interpreting metrics, anomalies, results, etc.

  • Strong understanding of AWS services and security best practices.

  • Experienced with centralized logging systems such as Graylog or Loki, with strong skills in writing sophisticated queries to extract, analyze, and monitor log data for operational insights and troubleshooting.

  • Solid understanding of security and encryption practices.

  • Knowledge and understanding of Kubernetes concepts.

  • Experience with proxy, debugging, and profiling tools.

  • Proficiency in reading and writing XML/XSD.

  • Comfortable with version control such as GIT.

  • Excellent problem-solving and analytical skills, with the ability to diagnose and resolve complex technical issues.

  • Strong communication and interpersonal skills, with the ability to work effectively with technical and non-technical stakeholders.

  • Proactive, detail-oriented, and able to work independently and as part of a team.

Desired

  • Experience with system profiling and performance tuning.

  • Ability to use Athena to perform ad-hoc support queries against data in S3 buckets.

  • Proficient in monitoring and diagnostic tools such as Grafana and Prometheus, with strong capabilities in setting up dashboards, interpreting real-time data, and implementing alerting systems for proactive incident management.

  • Understanding of JVM memory management and garbage collection tuning.

  • Experience with NoSQL databases.

  • Familiarity with Elasticache, DynamoDB, APIGateway, Lambda, Cloudwatch, Cognito.

  • Experience with load testing high-performance distributed systems.

Interview Steps

  1. GT interview with Recruiter

  2. Introductory interview with the TMS team

  3. Technical interview

  4. Final interview

  5. Reference Check

We go beyond usual perks… By working with us, you will get:

  • Health insurance

  • Psychotherapy assistance allowance

  • Gym allowance

  • Individual educational budget

  • Paid vacations.

  • Paid sick leaves.

  • All public holidays are paid days off.

GT working model:

You will work directly with a client through our Extended Team model. We try to do things differently and try to integrate you as deeply as possible into the client’s team. You work with the same tools and technologies as they do and are managed directly by the client without any intermediary. We help you build relationships and create an environment where you genuinely feel like a member of the client’s team. We also encourage trips to a client and join teambuilding and after-work activities. Our Extended Team model is focused on long-term projects that last over several years.

View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

View All Jobs

Get notified when new jobs are added by GT

Similar Jobs

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Get notifed when new similar jobs are uploaded

DevOps Jobs

GoTo Group - Site Reliability Engineer - EP (SE4)

GoTo Group, India (On-Site)

Hitachi - Cloud Solutions Architect

Hitachi, Costa Rica (Remote)

version 1 - Senior Azure Platform Consultant

version 1, United Kingdom (On-Site)

ZeniMax Media - Sr. Systems Engineer

ZeniMax Media, United States (On-Site)

Limit Break - Senior Site Reliability Engineer

Limit Break, Japan (On-Site)

Luxoft - Senior Software Support Engineer

Luxoft, Czechia (Remote)

Guerrilla - SENIOR PLATFORM ENGINEER [DECIMA]

Guerrilla, Netherlands (On-Site)

Nielsen Holdings - Principal Data Engineer - AWS

Nielsen Holdings, India (Hybrid)

Get notifed when new similar jobs are uploaded