Site Reliability Engineering Lead

Posted 12 Days Ago
Be an Early Applicant
Chennai, Tamil Nadu
5-7 Years Experience
Hardware • Information Technology • Other • Software • Analytics
The Role
Lead and manage the Site Reliability Engineering team responsible for designing, implementing, and maintaining high-availability and scalable systems in the public cloud. Drive automation, monitoring, and incident response to ensure system performance and stability. Collaborate with software development teams for reliable releases and mentor junior team members.
Summary Generated by Built In

Title: Site Reliability Engineering Lead

Location: Chennai , India 

Department: Trimble Cloud Core Platform

Are you interested in cutting edge cloud technologies, ready to dirt your hands in the cloud world? Do you like to be part of a core team with industry leading site reliability engineering standards?

What You Will Do

Are you a self-motivated and enthusiastic Site Reliability Engineer with hands-on experience in cloud computing? If so, our Trimble cloud core platform division is looking for people like you to join our dynamic SRE team. You will join Trimble cloud core platform team to work on provisioning and operating our core engineering services in the public cloud.

  • Design, implement, and maintain high-availability and scalable systems, ensuring our platforms run smoothly 24/7 with minimal downtime

  • Emphasize SRE as an engineering discipline, driven by automation. Create and improve IaC, automation tools for continuous integration, deployment, and incident response, reducing manual work and improving response times.

  • Develop and maintain comprehensive monitoring, alerting, and logging systems to provide deep insights into system performance, identifying potential issues before they impact users.

  • Monitor system performance and usage, conducting capacity planning and scaling efforts to meet growing demands.Design cost controls and rollout the cost optimization strategy.

  • Own KPIs for site stability, performance, and root cause analysis (RCA) for production issues.Develop services for automatic incident and disaster recovery.

  • Participate in troubleshooting, capacity analysis, planning, and performance analysis.

  • Lead incident response efforts, perform root cause analyses, and implement post-mortem processes to prevent future issues and improve system resilience

  • Handle escalations from internal stakeholders and manage critical issues to resolution.

  • Identify problems and opportunities for improvements that are common across many teams and services.

  • Responsible for fixing compliance issues and requirements raised by CyberSecurity tools

  • Adopt reliability engineering practices such as error budgets, blameless retrospectives, chaos engineering, etc.

  • Production operational support of our global service catalog

  • Foster collaboration with software product development, architecture, and engineering team to ensure releases are delivered with repeatable and auditable processes

  • Ensure 24x7 coverage with business continuity principles.

  • Learn and be passionate about cloud computing

  • Evaluate and utilize the newer technologies coming in the industry to keep the solution on the cutting edge

  • Mentor junior SREs and other engineering team members, sharing knowledge and promoting a culture of reliability, efficiency, and continuous learning.

What Skills & Experience You Should Bring

  • Bachelor's/Master’s degree in Computer Engineering, or related field 

  • Minimum 6+ years experience in technical.

  • History of supporting applications and infrastructure in Production

  • Experience in Capacity planning and Cost optimization

  • Experience with Amazon Web Services (Azure or GCP acceptable)

  • Deep understanding of Linux/Unix operating systems

  • Experience building and deploying containers and serverless architecture.

  • Familiarity with modern web application development and architecture

  • Experience using a high-level scripting language (Python preferred) and IaC tools(Terraform , CloudFormation) and containerization

Desired Skills

  • AWS Certification (or equivalent in another public cloud)

  • Experience with microservice architecture

  • Expertise in Python or another high-level programming language

  • Experience with SaaS monitoring tool sets (Datadog, SumoLogic, PagerDuty, InfluxDB , Grafana)

  • Experience in CloudFormation, SAM Template and Terraform 

  • Experience in Github, Atlassian tools , Bitbucket , Jira and Confluence

  • Experience in Ansible and Packer

  • Experience using SQL and NoSQL databases

  • Experience with Github actions, Jenkins, Azure DevOps and Gradle for CI/CD

About Our Location

The global pandemic fundamentally changed the way we think about work and the workplace.
 

We created a Flexible Work Arrangement (FWA) Program to provide a framework for flexibility in where, when, and how we work.

Trimble’s new office in Chennai features state-of-the-art infrastructure and facilities and will enable Trimble to better serve its customers and partners from around the world. The 300,000 square feet Class A office space, with 50 meeting rooms and a seating capacity of nearly 2,000 staff simultaneously, allows for effective social distancing and compliance to local Covid guidelines.

Offering employees greater flexibility, the Chennai office will provide a hybrid working model.

About Our Trimble Cloud Core Division

Trimble Cloud Core Platform is leading Connect & Scale. As part of Trimble's Office of Digital Transformation, we create cloud-first workflows that enable Trimble's customer-centric approach in digital transformation.

Our products and common core services connect data, users, and applications across the enterprise. Our central approach enables Trimble scale, collaboration, enterprise security, and cost-efficiency

Trimble’s Inclusiveness Commitment

We believe in celebrating our differences. That is why our diversity is our strength. To us, that means actively participating in opportunities to be inclusive. Diversity, Equity, and Inclusion have guided our current success while also moving our desire to improve. We actively seek to add members to our community who represent our customers and the places we live and work.

We have programs in place to make sure our people are seen, heard, and welcomed and most importantly that they know they belong, no matter who they are or where they are coming from.

Trimble’s Privacy Policy

Our Company 

Trimble is transforming the way the world works by delivering products and services that connect the physical and digital worlds. Core technologies in positioning, modeling, connectivity and data analytics enable customers to improve productivity, quality, safety, and sustainability. From purpose-built products to enterprise lifecycle solutions, Trimble software, hardware, and services are transforming a broad range of industries such as agriculture, construction, geospatial and transportation, and logistics. For more information about Trimble (NASDAQ: TRMB), visit www.trimble.com 


 

Top Skills

Amazon Web Services
Linux
The Company
Chennai, Tamil Nadu
10,001 Employees
On-site Workplace

What We Do

Trimble is transforming the way the world works by delivering products and services that connect the physical and digital worlds. Core technologies in positioning, modeling, connectivity and data analytics enable customers to improve productivity, quality, safety and sustainability. From purpose built products to enterprise lifecycle solutions, Trimble software, hardware and services are transforming industries such as agriculture, construction, geospatial and transportation. For more information about Trimble (NASDAQ:TRMB), visit: www.trimble.com.

Trimble products are used in over 141 countries around the world. Employees in more than 30 countries, coupled with a highly capable network of dealers and distribution partners serve and support customers worldwide. As the market leader in most of our businesses, we offer a compelling value proposition to our customers based on productivity, return on investment and environmental stewardship. Come position yourself with an innovative industry leader and position yourself for success.

Jobs at Similar Companies

MediaNews Group Logo MediaNews Group

Publisher

Consumer Web • Digital Media • News + Entertainment
Hybrid
Estes Park, CO, USA
4000 Employees

MediaNews Group Logo MediaNews Group

Digital Account Executive

Consumer Web • Digital Media • News + Entertainment
Hybrid
Scranton, PA, USA
4000 Employees

ServiceNow Logo ServiceNow

Vice President of Sales, Federal Defense & National Security

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote
Hybrid
Washington, DC, USA
23000 Employees

ServiceNow Logo ServiceNow

Technical Support Engineer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote
Hybrid
Tokyo, JPN
23000 Employees

Similar Companies Hiring

CSC Thumbnail
Software • Legal Tech • Fintech • Financial Services • Data Privacy • Cybersecurity
Wilmington, DE
8000 Employees
Toast Thumbnail
Software • Information Technology • Hospitality • Food • Fintech • Cloud
Boston, MA
4500 Employees
TransUnion Thumbnail
Information Technology • Fintech • Financial Services • Cybersecurity • Business Intelligence • Big Data Analytics • Big Data
Chicago, IL
15000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account