Site Reliability Engineer (SRE)

4 Months ago • 5+ Years • DevOps

Job Summary

Job Description

As a Site Reliability Engineer (SRE) at Sword Health, you will be responsible for maintaining the health and uptime of our services. You will work with development teams to build and operate scalable and resilient systems, troubleshoot issues, and automate repetitive tasks. Your work will span monitoring and incident management, performance optimization, security and compliance, documentation, and database management. This role requires a strong understanding of various technologies and a proactive approach to keeping our systems running smoothly and efficiently. You will also collaborate with a team of talented colleagues to help build a pain-free world.

Job Details

Sword Health is on a mission to free two billion people from pain.


With 67% of members achieving a pain-free life and a 70% reduction in surgery intent, we at Sword are using AI Care to change lives and save millions for our 25,000+ enterprise clients across three continents. Today, we hold the majority of industry patents, win 70% of competitive evaluations, and have raised more than $300 million from top venture firms like Founders Fund, Sapphire Ventures, General Catalyst, and Khosla Ventures.


Our recognition as a Forbes Best Startup Employer in 2025 highlights our focus on being a destination for the best and brightest talent. Not only have we experienced unprecedented growth since our market debut in 2020, but we’ve also created a remarkable mission- and value-driven environment that is loved by our growing team. With a recent valuation of $3 billion, we are in a phase of hyper growth and expansion, and we’re looking for individuals with passion, commitment, and energy to help us scale our global impact.


Joining Sword means committing to a set of core values, chief amongst them to “do it for the patients” every day, and to always “deliver more than expected” on behalf of our members and clients.


This is an opportunity for you to make a significant difference on a massive scale as you work alongside 900+ (and growing!) talented colleagues, spanning three continents. Your charge? To help us build a pain-free world, powered by AI, enhanced by people — accessible to all.



As a Site Reliability Engineer (SRE) at Sword Health, you will play a critical role in maintaining the health and uptime of our services. You will collaborate with development teams to build and operate scalable and resilient systems, troubleshoot issues across the stack, and implement automation to reduce manual work.


What you'll be doing:
  • Monitoring and Incident Management: Develop and maintain monitoring and alerting solutions. Respond to incidents, troubleshoot issues, and perform root cause analysis.
  • Automation and Tooling: Automate repetitive tasks and improve deployment processes. Develop and maintain tools to support infrastructure and applications.
  • Performance Optimization: Analyze system performance and implement optimizations to improve efficiency and reduce latency.
  • Security and Compliance: Ensure systems are secure and compliant with relevant standards and regulations.
  • Documentation and Knowledge Sharing: Maintain comprehensive documentation of systems and processes. Share knowledge and best practices with team members.
  • Database Management: Ensure the reliability, performance, and scalability of databases. Perform database optimization, maintenance, and troubleshooting.


What you need to have:
  • Proficiency in programming languages such as Python, Go, or JavaScript.
  • 5+ years of experience with cloud platforms such as AWS, Google Cloud, or Azure.
  • Strong understanding of Linux/Unix systems and networking.
  • Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).
  • Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
  • Knowledge of CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
  • Database Experience: Proficiency with relational and NoSQL databases (e.g., MySQL, PostgreSQL, Redis, Elasticsearch).
  • Team Player: Willingness to collaborate and share knowledge with colleagues to drive collective success.
  • Ownership: Taking responsibility for your work and demonstrating accountability for outcomes.


What we would love to see:
  • Innovative Mindset: A passion for exploring new technologies and methodologies to improve reliability and performance.
  • Proactive Approach: Ability to anticipate potential issues and implement preventive measures.
  • Continuous Improvement: A dedication to learning and growing in your role, staying updated with industry trends and best practices.


Portugal - Sword Benefits & Perks:
  • Health, dental and vision insurance
  • Meal allowance
  • Equity shares
  • Remote work allowance
  • Flexible working hours
  • Work from home
  • Discretionary vacation
  • Snacks and beverages
  • English class



Note: This position does not offer relocation assistance. Candidates must possess a valid EU visa and be based in Portugal.



Sword Health, which includes SWORD Health, Inc. and Sword Health Professionals (consisting of Sword Health Care Providers, P.A., SWORD Health Care Providers of NJ, P.C., SWORD Health Care Physical Therapy Providers of CA, P.C.), complies with applicable Federal and State civil rights laws and does not discriminate on the basis of Age, Ancestry, Color, Citizenship, Gender, Gender expression, Gender identity, Gender information, Marital status, Medical condition, National origin, Physical or mental disability, Pregnancy, Race, Religion, Caste, Sexual orientation, or Veteran status.
