Ford Motor Company Logo

Ford Motor Company

Site Reliability Engineer

Posted 3 Days Ago
Be an Early Applicant
In-Office
Chennai, Tamil Nadu, IND
Senior level
In-Office
Chennai, Tamil Nadu, IND
Senior level
The SRE will automate cloud environments, manage scalability, optimize cloud solutions, monitor applications, and lead incident response for an eCommerce application.
The summary above was generated by AI

The specific responsibilities of an SRE managing a large, distributed eCommerce application involving Adobe Experience manager as Content Management System and Service layer built on microservices, spring boot, and Google Cloud.
 

Responsibilities
  • Automate and manage a highly available and scalable cloud environment that allows development teams to deploy and run their services.

  • Having depth knowledge in Terraform (Infrastructure as Cloud) and able to create new terraform or modify the existing file according to Ford formats to create new Monitoring Dashboards / Alert policies and SLA.

  • Collaborating with engineering and Architects teams to evaluate and identify optimal cloud solutions, also leveraging scalability, high-performance and security.

  • Extensive Log monitoring and analysis for both application and deployment pipeline to keep the Cloud Run services up and running without any issues.

  • Creating SLO / SLA / SLI with GCP / Grafana / Dynatrace dashboards.

  • Ability to support incident escalation and troubleshooting and conducting blameless postmortem on the incident resolution.

  • Ensuring efficient functioning of data storage and processing functions in accordance with company security policies and best practices in cloud security.

  • Collaborate with Engineering teams to identify optimization strategies, help develop self-healing capabilities.

  • Experience in developing a strong observability capability.

  • Regularly reviewing performance analysis of existing systems and making recommendations for improvements.

  • Participating in 24x7 on-call production support rotations and handling incident response to minimize disruptions. 

Qualifications
  • 4 Year College Degree in Computer Science or Equivalent

  • 5 - 6 years’ experience with JAVA, J2EE, NoSQL/SQL Datastore, Spring Boot, GCP/AWS/Azure & Docker/K8 in Maintenance and Development of multi-tier applications.

  • Proven work experience in designing, deploying, and operating mid to large scale public cloud environments.

    • Professional Certification 

    • Public Cloud >> GCP is a Must have.

  • Proven work experience in provisioning Infrastructure as Code (IaC) using Terraform Enterprise or community edition.

  • Experience in package, config, and deployment management.

  • Strong knowledge in GitHub, DevOps (Tekton is an advantage)

  • Should be proficient in scripting and coding, that include traditional languages like Python, Node.js and React.

  • Extensive knowledge and hands-on experience in Dynatrace, Grafana and Prometheus micro libraries.

  • Exposure to Cloud Monitoring and logging.

  • Experience with automation tools should be a priority. 

Top Skills

Adobe Experience Manager
DevOps
Docker
Dynatrace
Git
GCP
Grafana
Kubernetes
Microservices
Node.js
NoSQL
Python
React
Spring Boot
SQL
Terraform

Similar Jobs

9 Days Ago
In-Office
Chennai, Tamil Nadu, IND
Mid level
Mid level
Artificial Intelligence • Machine Learning
Seeking a Site Reliability Engineer with Go proficiency to enhance tooling automation, incident response, and standards in a multi-cloud environment. Responsible for self-service enablement and operational support.
Top Skills: ArgocdAWSDatadogFluxGoGrafanaKubernetesOpaPrometheusTerraform
10 Days Ago
In-Office
Chennai, Tamil Nadu, IND
Senior level
Senior level
Artificial Intelligence • Machine Learning
The Staff Software Engineer - SRE will design and build automation frameworks, enhance incident response processes, and enforce operational standards while collaborating with feature teams to scale engineering excellence.
Top Skills: AWSGoKubernetesTerraform
21 Days Ago
In-Office
Chennai, Tamil Nadu, IND
Senior level
Senior level
Automotive
The SRE Manager will define SRE strategies, lead teams for reliability, monitor systems, oversee incident management, and collaborate cross-functionally.
Top Skills: AiopsAWSAzureCi/CdCloud-NativeDynatraceGCPGrafanaHybrid CloudIacKubernetesMicroservicesPrometheusTerraform

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account