Astreya

IT Infrastructure Operations Engineer II

Posted 20 Days Ago
Remote
Hiring Remotely in India
Senior level
Provide technical support for enterprise server and network operations, mentor L1 engineers, and ensure high availability in a 24x7 environment.
The summary above was generated by AI

About the Job

We are looking for an experienced L2 IT Infrastructure Operations Engineer to provide advanced technical support for our enterprise server and network infrastructure. This mid-level position bridges the gap between frontline support and expert-level engineering, handling escalated incidents, performing complex troubleshooting, and contributing to operational excellence. The ideal candidate will possess hands-on experience with Dell PowerEdge servers, Cisco networking equipment, and enterprise monitoring solutions. You will mentor L1 engineers, participate in change management activities, and collaborate with cross-functional teams to ensure high availability and performance of critical infrastructure in a 24x7 global environment.

Key Responsibilities

 Provide advanced troubleshooting and fault isolation for escalated server and network incidents, utilizing iDRAC, Redfish, and Cisco CLI tools to diagnose and resolve complex issues.
 Execute firmware, BIOS, and driver updates on Dell PowerEdge servers following standardized procedures, ensuring minimal service disruption and maintaining system stability.
 Perform IOS/NX-OS firmware and software updates on Cisco routers and switches, adhering to change management protocols and conducting post-update validation.
 Manage hardware break/fix procedures for server infrastructure, coordinating with Dell support for warranty claims, parts ordering, and scheduling on-site technician dispatch.
 Conduct regular network health audits and performance analysis, identifying potential bottlenecks and recommending optimization measures to prevent service degradation.
 Collaborate with the SRE team to enhance monitoring dashboards and refine alerting thresholds, ensuring proactive detection of infrastructure instability or security events.
 Mentor and provide technical guidance to L1 engineers, conducting knowledge transfer sessions and assisting with complex ticket resolution to build team capability.
 Participate in blameless post-mortems following major incidents, contributing to root cause analysis and implementing preventative actions to improve system reliability.
 Maintain and update operational runbooks, network diagrams, and technical documentation to reflect current configurations and best practices.
 Support hardware lifecycle management activities including equipment provisioning, asset tracking, and coordination with vendors for hardware returns and repairs.
 Provide 24x7 on-call support for critical escalations, ensuring rapid response to high-priority incidents affecting production systems.
 Collaborate with the FTE IT Team Lead on capacity planning activities, providing data-driven insights on infrastructure utilization trends and growth projections.

Required Skills

 5+ years of hands-on experience in enterprise IT infrastructure operations or a related field.
 Strong proficiency with Dell PowerEdge server administration, including hardware troubleshooting, iDRAC/Redfish management, and firmware lifecycle management.
 Solid experience with Cisco networking equipment (routers, switches), including IOS/NX-OS configuration, troubleshooting, and upgrade procedures.
 Working knowledge of monitoring and logging tools, with ability to create dashboards, configure alerts, and analyze performance metrics for proactive issue detection.
 Excellent problem-solving abilities with demonstrated experience in incident management, root cause analysis, and implementing corrective actions in production environments.
 Industry certifications such as Dell Server certifications or ITIL Foundation; ability to work rotating shifts in a 24x7 global support model.

Tools Required
 Server Hardware Tools: Dell iDRAC, Lifecycle Controller, OpenManage, RAID/PERC utilities for server provisioning, firmware baselining, and remote management.
 OS Deployment Tools: PXE boot infrastructure, iDRAC Virtual Media, Windows Server & Linux ISOs with hardening and automation scripts.
 Network Tools: Cisco IOS CLI, PoE management, VLAN/QoS configuration tools, network monitoring, and bandwidth/latency testing utilities.
 Automation & Operations Tools: Ansible, Python, CMDB systems, configuration backup tools, and documentation/diagramming platforms for global 24x7 operations.

Top Skills

Ansible
Cisco CLI
Cisco Networking Equipment
Dell PowerEdge Servers
iDRAC
Linux
Python
Redfish
Windows Server
