Akamai Technologies Logo

Akamai Technologies

Site Reliability Engineer II

Posted Yesterday
Be an Early Applicant
In-Office or Remote
2 Locations
Junior
In-Office or Remote
2 Locations
Junior
As a Site Reliability Engineer II, you will ensure system reliability, oversee deployment and maintenance of platforms, improve performance, and troubleshoot issues while collaborating cross-functionally.
The summary above was generated by AI

Do you like collaborating across teams to solve complex problems?

Do you enjoy solving large scale systems problems?

Join our Site Reliability Engineering team

The Edge Experience SRE Team designs interfaces, applications, APIs, and services to enhance Akamai's user experience. This team enables customers and stakeholders to develop software using leading internal and third-party technologies.

Be part of enhancing our organization

As a Site Reliability Engineer II, ensure optimal performance and uptime of Akamai's microservices-based Portal platform. Maintain critical infrastructure while collaborating with operations and development teams. Develop tools and software to monitor and enhance system reliability. Work across technologies, releasing new applications and modernising existing tools.

As a Site Reliability Engineer II, you will be responsible for:

  • Deploying and maintaining internal platforms and tools to support daily operations while reducing manual effort.
  • Ensuring product reliability, scalability, usability, and overall system availability by partnering with teams.
  • Troubleshooting complex issues using automation, scripting, and systems programming techniques.
  • Collaborating across engineering, operations, and support to resolve and investigate technical problems.
  • Improving the Portal platform for faster error detection, better performance, and enhanced system reliability.
  • Participating in on-call rotations while contributing to service restoration, stability improvements, and code enhancements.

Do what you love

To be successful in this role you will:

  • Have 2+ years' experience as Systems Performance/SRE and a Bachelor's degree in Computer Science or its equivalent
  • Have good analytical and troubleshooting skills while managing tasks with incomplete information.
  • Demonstrate observability and monitoring expertise, including adherence to SLOs.
  • Excel in minimising toil through automation utilising Python, Golang, and scripting languages.
  • Utilise Terraform, Ansible, Jenkins, and Linux; familiarity with Docker and Kubernetes considered valuable.
  • Utilise monitoring tools such as Prometheus, Grafana, and observability practices, including adherence to SLOs.

Work in a way that works for you

FlexBase, Akamai's Global Flexible Working Program, is based on the principles that are helping us create the best workplace in the world. When our colleagues said that flexible working was important to them, we listened. We also know flexible working is important to many of the incredible people considering joining Akamai. FlexBase, gives 95% of employees the choice to work from their home, their office, or both (in the country advertised). This permanent workplace flexibility program is consistent and fair globally, to help us find incredible talent, virtually anywhere. We are happy to discuss working options for this role and encourage you to speak with your recruiter in more detail when you apply.
Learn what makes Akamai a great place to work

Connect with us on social and see what life at Akamai is like!

We power and protect life online, by solving the toughest challenges, together.

At Akamai, we're curious, innovative, collaborative and tenacious. We celebrate diversity of thought and we hold an unwavering belief that we can make a meaningful difference. Our teams use their global perspectives to put customers at the forefront of everything they do, so if you are people-centric, you'll thrive here.

Working for you

At Akamai, we will provide you with opportunities to grow, flourish, and achieve great things. Our benefit options are designed to meet your individual needs for today and in the future. We provide benefits surrounding all aspects of your life:

  • Your health
  • Your finances
  • Your family
  • Your time at work
  • Your time pursuing other endeavors

Our benefit plan options are designed to meet your individual needs and budget, both today and in the future.

About us

Akamai powers and protects life online. Leading companies worldwide choose Akamai to build, deliver, and secure their digital experiences helping billions of people live, work, and play every day. With the world's most distributed compute platform from cloud to edge we make it easy for customers to develop and run applications, while we keep experiences closer to users and threats farther away.

Join us

Are you seeking an opportunity to make a real difference in a company with a global reach and exciting services and clients? Come join us and grow with a team of people who will energize and inspire you!

Top Skills

Ansible
Docker
Go
Grafana
Jenkins
Kubernetes
Linux
Prometheus
Python
Terraform

Similar Jobs

Yesterday
In-Office or Remote
2 Locations
Mid level
Mid level
Cloud • Security • Software • Cybersecurity
As a Site Reliability Engineer II, you'll manage cloud platforms and ensure reliability through automation, configuration management, and monitoring solutions.
Top Skills: AnsibleChefGrafanaJenkinsPrometheusPythonSalt StackShellTerraformYaml
8 Days Ago
Easy Apply
Remote
India
Easy Apply
Mid level
Mid level
Food • Mobile
Provide 24/7 support for OpenTable's global data platforms, focusing on database operations, maintaining high availability, backups, and performance optimization.
Top Skills: CloudwatchDockerGitGitGoGrafanaKubernetesMongoDBPostgresPrometheusPuppetPythonRedisShell BashSQL Server
21 Days Ago
Easy Apply
Remote
India
Easy Apply
Senior level
Senior level
Natural Language Processing • Software • Conversational AI
Maintain and scale cloud-native platform infrastructure: manage Kubernetes (GKE/EKS), build Terraform modules, standardize deployments with Helm, implement GitLab CI/CD, enhance observability (Prometheus/Grafana/Datadog), automate tooling in Python/Go/Shell, participate in on-call rotation, perform RCAs, and collaborate with development teams to improve reliability.
Top Skills: Api GatewayArgocdAWSDatadogEksFluxGCPGitlab Ci/CdGkeGoGrafanaHelmIstioKubernetesLinkerdLinuxPagerdutyPrometheusPythonServicenowShellTerraform

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account