As a Senior Software Engineer in Observability, you'll develop and operate Atlassian's observability platforms, driving projects, managing systems, and mentoring junior engineers.
About the Observability Patronus Team
The Observability Patronus team is responsible for building and operating Atlassian's core observability platforms, including Grafana Mimir and self-hosted Sentry (FedRAMP). Our mission is to empower engineering teams with reliable, scalable, and compliant monitoring solutions that ensure the health and performance of Atlassian's products.
The Role
As a Senior Software Engineer in Observability, you'll be building the platforms and defining practices that keep Atlassian's products rock-solid and lightning-fast for millions of users.
We're not talking about your average telemetry here: we're collecting, moving, and ingesting multiple petabytes of data every single day, and making it all available within seconds. It's a massive scale, and it's critical to everything we do. You'll empower hundreds of Atlassian engineering teams with world-class observability solutions, helping them ship reliable, high-quality products our customers love.
This is a platform team that sits at the intersection of infrastructure and software engineering. Some candidates expect all software engineering, others expect all infrastructure; here, you'll need to be passionate and skilled in both. You'll be building and operating ultra-high-scale systems for metrics, including one of the world's largest Mimir clusters (at 10x the certified load!). You'll face architectural and scaling challenges that will stretch your skills and imagination.
If you're excited by big challenges, love working with smart people, and want to make a real impact, you'll fit right in.
What you'll do
- Drive complex projects autonomously, from technical design to launch.
- Manage and improve mission-critical observability platforms, including Grafana Mimir and self-hosted Sentry (FedRAMP).
- Lead initiatives to automate processes, enhance operational excellence, and reduce costs.
- Be a role model for delivering clean, high-quality code, and mentor more junior team members in their growth.
- Promote the best observability standards and practices to hundreds of Atlassian teams.
- Deliver work that impacts thousands of engineers and services, improving the reliability of our products.
Requirements
Sound like an exciting opportunity? We think so too. To set you up for impact from day one, we'll expect you to have the following:
- 6+ years of experience building and operating highly available observability (metrics) platforms at scale.
- Experience with Prometheus, OpenTelemetry, or similar observability tools.
- Proficiency with at least one programming language (e.g., Golang, Python).
- Expertise in managing infrastructure on AWS, GCP, or other cloud providers.
- Knowledge of high-volume data pipelines (e.g., Kinesis, Kafka).
- Comfort working with ambiguous requirements, and the ability to identify the most valuable problems to solve and pave a path forward.
- Demonstrated skills in the observability domain and working knowledge of multiple tech domains.
- Great soft skills, with the ability to deliver collaboratively with other world-class engineers.
We'd also see it as a big bonus if you have any of the following skills:
- Expertise in operating Grafana Mimir at scale.
- Hands-on experience with SignalFx, Splunk, and Sentry.
Top Skills
AWS
GCP
Go
Grafana Mimir
Kafka
Kinesis
OpenTelemetry
Prometheus
Python
Sentry
SignalFx
Splunk

