Observability Dev Ops Lead Engineer - AVP C12 - Chennai

Posted 5 Days Ago
Be an Early Applicant
India
5-7 Years Experience
Fintech
The Role
The Observability Dev Ops Lead Engineer at Citi will architect and develop Observability solutions, drive engineering and certification of event management and monitoring platform products, collaborate with cross-functional teams, and ensure system stability and functionality.
Summary Generated by Built In

An Dev Ops position in Inventory Data & Enterprise Application Services (IDEAS) Observability group that will play a key role in Architecting / Planning and developing Observability solutions. Team Focus is the Event Management and Notification space processing events from upstream Observability tooling and processing them through AI/ML functions, Ticketing, Notifications, and other automations.

Strong focus on communication and technical skills, system stability, quality and functionality against user expectations, problem management and resolution, including issue documentation, root cause analysis and trend analysis. Ensure all processes and procedures are being always followed to comply with audit and regulatory requirements.

Responsibilities:

  • Drive engineering and certification of event management and infrastructure monitoring platform products.
  • Development of custom tooling extension and integrations of monitoring services with external systems, such as CMDB’ s, ticketing and notification systems and larger data lake technologies and deliver with automated SRE tooling functions.
  • Work closely with engineering, and operations, and applications teams across Citi to understand and collect monitoring and data analytics requirements.
  • Provide operational support to existing Event and notification systems and build SRE automations to manage production support functions.
  • Utilizes good understanding of apps support procedures and concepts and basic knowledge of other technical areas to field issues and queries from stakeholders, provide short-term resolutions and work with relevant technology partners for long term remediation. 
  • Develop a comprehensive understanding of how areas of apps support collectively integrate to contribute to achieving business goals.
  • Participates in disaster recovery testing. 
  • Participate in application releases, from development, testing and deployment into production. 
  • Perform post release checkouts after application releases and infrastructure updates.
  • Develop and maintain technical support documentation. 
  • Analyses applications to identify risks, vulnerabilities, and security issues. 
  • Makes evaluative judgments based on analysis of information, resolves problems by identifying and selecting solutions.
  • Cooperation with Development colleagues to prioritize bug fixes and support tooling requirements. 
  • Active involvement in and ownership of Support Project items, covering Stability, Efficiency, and Effectiveness initiatives. 
  • Co-ordinate with vendor management for any issues / new developments and have frequent meetings to address all the gaps.
  • Proactively check and remediate all CAMP / FEMA / CISAR / Black duck /VTM alerts to be complaint.
  • Willing to get cross trained with other applications within event management like SMRP / NOI and provide end to end support as and when required.
  • Understanding of Ansible playbook and Starfleet functionality.
  • Ensure true end-to-end ownership of production environment, exceeding stability targets through collective ownership of initiatives across all plans, build, and operate functions.
  • Collaborate with engineering teams in the improvement of CI/CD practices.
  • Support and maintenance of infrastructure to include cloud deployments.
  • Collaborate with development, QA, and engineering teams globally on various business projects.
  • Provide timely and regular communication, and overall project reporting within team, business partners, business leadership and engineering leadership.
  • Ability to communicate effectively across various levels of management, technology, architecture, and compliance.
  • Proficiency to study historical performance trends by using dashboards, data, charts, etc.
  • Handle all JIRAs assigned and make sure it is driven and completed on time.
  • Perform Incident, Change and Problem management including prioritization, root cause analysis and escalation/coordinate to appropriate groups.
  • Provide on call support during weekend or when required for the applications on a rotational basis.

Qualifications:

  • 9+ years of experience into IT infrasture domain with minimum 5 years’ Engineering of 3rd party software and services in the event management notification tooling space
  • Proficient in Agile work methods and JIRA based workflow management.
  • Knowledge on Ansible playbook automation.
  • Knowledge on yaml will be an added advantage
  • Proficient in shell scripting, Perl or python
  • Strong skillset required in Linux (RHEL) and Windows; OS concepts and services; regular expressions; end-to-end testing; performance/scalability testing.
  • Must have exposure to Incident Management tools like ServiceNow or any similar market application
  • Experience with software delivery and documentation tools like Bit bucket, Artifactory, Jenkins and Confluence.
  • Knowledge on databases like MSSQL, Oracle, MongoDB will be plus.
  • Knowledge of Event Analytics, Data Science, Machine learning and Artificial intelligence will also be a plus. 
  • Open minded and willing to learn new tools/methodologies/concepts.
  • Experience with cloud platforms like AWS, Azure or Google Cloud is a plus

Education:

  • Bachelor’s degree/University degree or equivalent experience

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Infrastructure

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Citi is an equal opportunity and affirmative action employer.

Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Citigroup Inc. and its subsidiaries ("Citi”) invite all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.

View the "EEO is the Law" poster. View the EEO is the Law Supplement.

View the EEO Policy Statement.

View the Pay Transparency Posting

Top Skills

Ansible
The Company
Chennai, Tamil Nadu
223,850 Employees
Hybrid Workplace

What We Do

Citi's mission is to serve as a trusted partner to our clients by responsibly providing financial services that enable growth and economic progress. Our core activities are safeguarding assets, lending money, making payments and accessing the capital markets on behalf of our clients. We have 200 years of experience helping our clients meet the world's toughest challenges and embrace its greatest opportunities. We are Citi, the global bank – an institution connecting millions of people across hundreds of countries and cities.

Jobs at Similar Companies

Fusion92 Logo Fusion92

Account Executive

AdTech • Agency • Digital Media • Enterprise Web • Marketing Tech • Analytics • Web3
IL, USA
263 Employees

ForeFlight Logo ForeFlight

Product Designer II

Aerospace • Software • App development
Remote
Austin, TX, USA
466 Employees

IonQ Logo IonQ

Lead Ion Trap Design Engineer

Artificial Intelligence • Hardware • Information Technology • Internet of Things • Software
Easy Apply
Seattle, WA, USA
305 Employees

Snap Inc. Logo Snap Inc.

Application Engineer, Salesforce UI

Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
Hybrid
New York, NY, USA
5000 Employees

Similar Companies Hiring

CSC Thumbnail
Software • Legal Tech • Fintech • Financial Services • Data Privacy • Cybersecurity
Wilmington, DE
8000 Employees
Toast Thumbnail
Software • Information Technology • Hospitality • Food • Fintech • Cloud
Boston, MA
4500 Employees
TransUnion Thumbnail
Information Technology • Fintech • Financial Services • Cybersecurity • Business Intelligence • Big Data Analytics • Big Data
Chicago, IL
15000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account