Citi Logo

Citi

Python Developer (Data Engineering/AI) – Assistant Vice President

Posted 7 Days Ago
Be an Early Applicant
In-Office
Pune, Mahārāshtra
Senior level
In-Office
Pune, Mahārāshtra
Senior level
The role involves developing NLP pipelines, data processing jobs, APIs for model inference, and supporting CI/CD deployments using various data tools.
The summary above was generated by AI

Role Summary
We are looking for a mid-level Python Developer with combined experience in Data Engineering and AI/NLP engineering. The candidate will build NLP pipelines using libraries such as Flair, BERT, and LLM frameworks, and will also work on large-scale data processing using PySpark, Pandas, and related data tools. The role includes developing APIs, integrating with platform services, and supporting CI/CD deployments using GitHub and LightSpeed Enterprise.

Key Responsibilities
  • Develop and optimize ETL/data processing jobs using PySpark, Pandas, PyArrow, and related libraries.
  • Work with Parquet files using FastParquet or pyarrow.parquet for efficient data processing.
  • Implement data parsing and serialization using json, ujson, or orjson for high-performance JSON handling.
  • Build and maintain NLP pipelines using Flair, BERT, and LLM-based models.
  • Develop scalable ingestion and data transformation pipelines for AI and analytics use cases.
  • Build and maintain Flask-based APIs for model inference and service integrations.
  • Use regular expressions for text cleaning, parsing, and NLP preprocessing.
  • Integrate caching and fast lookups using Redis.
  • Manage and deploy ML models using MLflow for tracking and versioning.
  • Support CI/CD workflows using GitHub, LightSpeed Enterprise, and deployment pipelines.
  • Create and maintain Autosys JILs for job scheduling and automation.
  • Use basic Linux commands for troubleshooting, operations, and deployment tasks.
  • Monitor application and system health using ITRS Geneos.
  • Write unit tests and improve automation test coverage (PyTest/unittest).
  • Work with REST APIs, microservices, and basic shell scripting.
  • Work with cloud services (ECS), including boto3.
Required Skills
  • 8+ years of hands-on Python programming experience.
  • Strong fundamentals in Python, OOP, and design patterns.
  • Experience with NLP libraries such as Flair, BERT, HuggingFace Transformers, or similar.
  • Solid experience with PySpark, Pandas, PyArrow, and distributed data pipelines.
  • Proficient in working with Parquet using FastParquet or pyarrow.parquet.
  • Familiarity with fast JSON parsing libraries (json, ujson, orjson).
  • Experience building APIs using Flask (FastAPI is a plus).
  • Experience with MLflow for model tracking and deployment.
  • Good understanding of CI/CD practices and Git workflows.
  • Experience working with Redis or similar in-memory stores.
  • Experience with Autosys JILs for job scheduling.
  • Comfortable with Linux command line and shell scripting.
  • Strong debugging, problem-solving, and teamwork skills.
  • Exposure to cloud services; AWS boto3 experience is an asset.
Nice-to-Have
  • Experience with Polars or Dask for high-performance data processing.
  • Experience with PyTorch or TensorFlow for model training.
  • Experience with Docker, Kubernetes, or containerized deployments.
  • Experience with monitoring tools such as ITRS Geneos.
  • Experience with FastAPI, Airflow, or Prefect.

------------------------------------------------------

Job Family Group: Technology

------------------------------------------------------

Job Family:Applications Development

------------------------------------------------------

Time Type:Full time

------------------------------------------------------

Most Relevant Skills Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

 

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.

Citi Chennai, Tamil Nadu, IND Office

C P Ramaswamy Road, Chennai, Tamil Nadu, India, 600018

Similar Jobs

An Hour Ago
Remote or Hybrid
India
Internship
Internship
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
As a TTF India Graduate Intern at Mondelēz, you will experience a supportive environment to grow, take on new challenges, and contribute to various areas in snack production and development.
An Hour Ago
In-Office
Senior level
Senior level
Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
Design, deploy, and manage third-party infrastructure solutions, focusing on HPE and EMC server and storage infrastructure within SI delivery engagements, while administering RHEL systems and developing automation scripts.
Top Skills: AnsibleEmcEsxiHpeKvmRhelVcenterVMwareVsphere
An Hour Ago
Remote or Hybrid
India
Mid level
Mid level
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
As an AVP - Finance Data Quality, you will support data services for Finance regarding compliance and risk management, collaborate with IT and business stakeholders, and document data processes.
Top Skills: AlteryxConfluenceExcelMicrosoft PowerpointMicrosoft VisioPythonQlik SenseRational Team ConcertSASSQLTableauVBA

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account