Citi Logo

Citi

Senior Infrastructure Engineer - GenAI

Reposted 7 Days Ago
Be an Early Applicant
In-Office
Chennai, Tamil Nadu, IND
Mid level
In-Office
Chennai, Tamil Nadu, IND
Mid level
Design and maintain backend infrastructure for generative AI applications, ensuring scalability and reliability, overseeing deployment and monitoring of AI models, and collaborating with stakeholders on production-ready services.
The summary above was generated by AI

We are seeking an experienced Senior Backend Engineer to design, develop, and maintain the infrastructure powering our generative AI applications. You will work closely with AI engineers, platform teams, and product stakeholders to build scalable, reliable backend systems that support AI model deployment, inference, and integration. This role combines traditional backend engineering expertise with cutting-edge AI infrastructure challenges to deliver robust solutions at enterprise scale.

Key Responsibilities:

  • Design and implement scalable backend services and APIs for generative AI applications using microservices architecture and cloud-native patterns.

  • Build and maintain model serving infrastructure with load balancing, auto-scaling, caching, and failover capabilities for high-availability AI services.

  • Deploy and orchestrate containerized AI workloads using Docker, Kubernetes, ECS, and OpenShift across development, staging, and production environments.

  • Develop serverless AI functions using AWS Lambda, ECS Fargate, and other cloud services for scalable, cost-effective inference.

  • Implement robust CI/CD pipelines for automated deployment of AI services, including model versioning and gradual rollout strategies.

  • Create comprehensive monitoring, logging, and alerting systems for AI service performance, reliability, and cost optimization.

  • Integrate with various LLM APIs (OpenAI, Anthropic, Google) and open-source models, implementing efficient batching and optimization techniques.

  • Build data pipelines for training data preparation, model fine-tuning workflows, and real-time streaming capabilities.

  • Ensure adherence to security best practices, including authentication, authorization, API rate limiting, and data encryption.

  • Collaborate with AI researchers and product teams to translate AI capabilities into production-ready backend services.

Required Technical Skills:

  • Strong experience with backend development using Python, with familiarity in Go, Node.js, or Java for building scalable web services and APIs.

  • Hands-on experience with containerization using Docker and orchestration platforms including Kubernetes, OpenShift, and AWS ECS in production environments.

  • Proficient with cloud infrastructure, particularly AWS services (Lambda, ECS, EKS, S3, RDS, ElastiCache) and serverless architectures.

  • Experience with CI/CD pipelines using Jenkins, GitLab CI, GitHub Actions, or similar tools, including Infrastructure as Code with Terraform or CloudFormation.

  • Strong knowledge of databases including PostgreSQL, MongoDB, Redis, and experience with vector databases for AI applications.

  • Familiarity with message queues (RabbitMQ, Apache Kafka, AWS SQS/SNS) and event-driven architectures.

  • Experience with monitoring and observability tools such as Prometheus, Grafana, DataDog, or equivalent platforms.

  • Knowledge of AI/ML model serving frameworks like MLflow, Kubeflow, TensorFlow Serving, or Triton Inference Server.

  • Understanding of API design principles, load balancing, caching strategies, and performance optimization techniques.

  • Experience with microservices architecture, distributed systems, and handling high-traffic, low-latency applications.

Qualifications:

  • Bachelor’s degree in computer science, Engineering, or related technical field, or equivalent practical experience.

  • 4–6 years of experience in backend engineering with focus on scalable, production systems.

  • 2+ years of hands-on experience with containerization, Kubernetes, and cloud infrastructure in production environments.

  • Demonstrated experience with AI/ML model deployment and serving in production systems.

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Infrastructure

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Most Relevant Skills

Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills

For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

 

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.

Citi Chennai, Tamil Nadu, IND Office

C P Ramaswamy Road, Chennai, Tamil Nadu, India, 600018

Similar Jobs

An Hour Ago
Remote or Hybrid
India
Internship
Internship
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
As a TTF India Graduate Intern at Mondelēz, you will experience a supportive environment to grow, take on new challenges, and contribute to various areas in snack production and development.
An Hour Ago
In-Office
Chennai, Tamil Nadu, IND
Mid level
Mid level
Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
The Technology Manager will develop and implement technology strategies, lead customer solution development, manage PoCs, and participate in technology strategy forums.
Top Skills: Artificial IntelligenceCloud NativeGenerative AiSecuritySoftware Technology
An Hour Ago
In-Office
Chennai, Tamil Nadu, IND
Senior level
Senior level
Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
The Senior Test Engineer is responsible for testing, administering, and troubleshooting customer application systems, primarily within IT and telecommunications, utilizing manual and automated test case design.
Top Skills: BillingCatalog ManagerCisDmpEricsson ChargingJIRAOrder Care

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account