SWORD Health Logo

SWORD Health

Senior Site Reliability Engineer (SRE)

Reposted 14 Days Ago
Be an Early Applicant
Hybrid
Porto
Senior level
Hybrid
Porto
Senior level
The Senior Site Reliability Engineer will ensure the uptime of services, automate tasks, optimize performance, and manage databases while collaborating with teams.
The summary above was generated by AI
Sword Health is on a mission to free two billion people from pain. 

With 67% of members achieving a pain-free life and a 70% reduction in surgery intent, at Sword, we are using AI Care to change lives, and save millions for our 25,000+ enterprise clients across three continents. Today, we hold the majority of industry patents, win 70% of competitive evaluations, and have raised more than $400 million from top venture firms like Founders Fund, Sapphire Ventures, General Catalyst, and Khosla Ventures.

Recognized as a Forbes Best Startup Employer in 2025, this award highlights our focus on being a destination for the best and brightest  talent. Not only have we experienced unprecedented growth since our market debut in 2020,  but we’ve also created a remarkable mission and value-driven environment that is loved by our growing team. With a recent valuation of $4 billion, we are in a phase of hyper growth and expansion, and we’re looking for individuals with passion, commitment, and energy to help us scale our global impact. 

Joining Sword means committing to a set of core values, chief amongst them to “do it for the patients” every day, and to always “deliver more than expected” on behalf of our members and clients.

This is an opportunity for you to make a significant difference on a massive scale as you work alongside 1000+ (and growing!) talented colleagues, spanning three continents. Your charge? To help us build a pain-free world, powered by AI, enhanced by people — accessible to all.


As a Site Reliability Engineer (SRE) at Sword Health, you will play a critical role in maintaining the health and uptime of our services. You will collaborate with development teams to build and operate scalable and resilient systems, troubleshoot issues across the stack, and implement automation to reduce manual work.

What you'll be doing:

  • Monitoring and Incident Management: Develop and maintain monitoring and alerting solutions. Respond to incidents, troubleshoot issues, and perform root cause analysis.
  • Automation and Tooling: Automate repetitive tasks and improve deployment processes. Develop and maintain tools to support infrastructure and applications.
  • Performance Optimization: Analyze system performance and implement optimizations to improve efficiency and reduce latency.
  • Security and Compliance: Ensure systems are secure and compliant with relevant standards and regulations.
  • Documentation and Knowledge Sharing: Maintain comprehensive documentation of systems and processes. Share knowledge and best practices with team members.
  • Database Management: Ensure the reliability, performance, and scalability of databases. Perform database optimization, maintenance, and troubleshooting.

What you need to have:

  • Proficiency in programming languages such as Python, Go, Javascript.
  • 5+ years of experience with cloud platforms such as AWS, Google Cloud, or Azure.
  • Strong understanding of Linux/Unix systems and networking.
  • Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).
  • Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
  • Knowledge of CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
  • Database Experience: Proficiency with relational and NoSQL databases (e.g., MySQL, PostgreSQL, Redis, Elasticsearch).
  • Team Player: Willingness to collaborate and share knowledge with colleagues to drive collective success.
  • Ownership: Taking responsibility for your work and demonstrating accountability for outcomes.

What we would love to see:

  • Innovative Mindset: A passion for exploring new technologies and methodologies to improve reliability and performance.
  • Proactive Approach: Ability to anticipate potential issues and implement preventive measures.
  • Continuous Improvement: A dedication to learning and growing in your role, staying updated with industry trends and best practices.

To ensure you feel good solving a big Human problem, we offer:

  • A stimulating, fast-paced environment with lots of room for creativity;
  • A bright future at a promising high-tech startup company;
  • Career development and growth, with a competitive salary;
  • The opportunity to work with a talented team and to add real value to an innovative solution with the potential to change the future of healthcare;
  • A flexible environment where you can control your hours (remotely) with unlimited vacation; 
  • Access to our health and well-being program (digital therapist sessions);
  • Remote or Hybrid work policy (Portugal only);
  • To get to know more about our Tech Stack, check here.

Portugal - Sword Benefits & Perks:

• Health, dental and vision insurance
• Meal allowance
• Equity shares
• Remote work allowance
• Flexible working hours
• Work from home
• Discretionary vacation
• Snacks and beverages
• English class


Note: Please note that this position does not offer relocation assistance. Candidates must possess a valid EU visa and be based in Portugal.


Sword Health complies with applicable Federal and State civil rights laws and does not discriminate on the basis of Age, Ancestry, Color, Citizenship, Gender, Gender expression, Gender identity, Gender information, Marital status, Medical condition, National origin, Physical or mental disability, Pregnancy, Race, Religion, Caste, Sexual orientation, and Veteran status.

Top Skills

AWS
Azure
Docker
Elasticsearch
Elk Stack
Gitlab Ci
Go
GCP
Grafana
JavaScript
Jenkins
Kubernetes
Linux
MySQL
Postgres
Prometheus
Python
Redis
Unix

Similar Jobs

22 Days Ago
In-Office
Porto, PRT
Senior level
Senior level
Other
The role involves designing and maintaining key infrastructure for a cloud platform, automating deployment and infrastructure provisioning, and improving system reliability and processes.
Top Skills: AnsibleArgocdAtlantisBashElasticsearchGithub ActionsGitopsHelm ChartsJavaKotlinKubernetesMongoDBMySQLPostgresPrometheusPythonRedisRubyScalaSpaceliftTerraform
10 Days Ago
Easy Apply
In-Office
Porto, PRT
Easy Apply
Mid level
Mid level
Software
The Senior Site Reliability Engineer will design and maintain cloud systems, monitor performance, and improve service scalability, requiring experience with cloud providers, programming, and infrastructure tools.
Top Skills: Ci/CdDockerGitopsGoGCPKubernetesNode.jsPrometheusPythonRubyTerraform
22 Days Ago
In-Office or Remote
35 Locations
Entry level
Entry level
Machine Learning • Natural Language Processing
Welo Data seeks contributors fluent in Portuguese for various AI tasks including annotation, evaluation, and prompt creation. Remote work is available.
Top Skills: AIDigital Tools

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account