Citi Logo

Citi

Data Engineer

Posted Yesterday
Be an Early Applicant
In-Office
Pune, Mahārāshtra
Senior level
In-Office
Pune, Mahārāshtra
Senior level
The Data Engineer develops scalable data solutions, supports regulatory needs, and designs analytical models. They automate data pipelines and ensure alignment with architectural standards while mentoring team members.
The summary above was generated by AI

The Role

The Data Engineer is accountable for developing high quality data products to support the Bank’s regulatory requirements and data driven decision making. A Data Engineer will serve as an example to other team members, work closely with customers, and remove or escalate roadblocks. By applying their knowledge of data architecture standards, data warehousing, data structures, and business intelligence they will contribute to business outcomes on an agile team.

Responsibilities

  • Developing and supporting scalable, extensible, and highly available data solutions
  • Deliver on critical business priorities while ensuring alignment with the wider architectural vision
  • Identify and help address potential risks in the data supply chain
  • Follow and contribute to technical standards
  • Design and develop analytical data models

Required Qualifications & Work Experience

  • First Class Degree in Engineering/Technology (4-year graduate course)
  • 5 to 8 years’ experience implementing data-intensive solutions using agile methodologies
  • Experience of relational databases and using SQL for data querying, transformation and manipulation
  • Experience of modelling data for analytical consumers
  • Ability to automate and streamline the build, test and deployment of data pipelines
  • Experience in cloud native technologies and patterns
  • A passion for learning new technologies, and a desire for personal growth, through self-study, formal classes, or on-the-job training
  • Excellent communication and problem-solving skills

Technical Skills (Must Have)

  • ETL: Hands on experience of building data pipelines. Proficiency in two or more data integration platforms such as Ab Initio, Apache Spark, Talend and Informatica
  • Big Data: Experience of ‘big data’ platforms such as Hadoop, Hive or Snowflake for data storage and processing
  • Data Warehousing & Database Management: Understanding of Data Warehousing concepts, Relational (Oracle, MSSQL, MySQL) and NoSQL (MongoDB, DynamoDB) database design
  • Data Modeling & Design: Good exposure to data modeling techniques; design, optimization and maintenance of data models and data structures
  • Languages: Proficient in one or more programming languages commonly used in data engineering such as Python, Java or Scala
  • DevOps: Exposure to concepts and enablers - CI/CD platforms, version control, automated quality control management

Technical Skills (Valuable)

  • Ab Initio: Experience developing Co>Op graphs; ability to tune for performance. Demonstrable knowledge across full suite of Ab Initio toolsets e.g., GDE, Express>IT, Data Profiler and Conduct>IT, Control>Center, Continuous>Flows
  • Cloud: Good exposure to public cloud data platforms such as S3, Snowflake, Redshift, Databricks, BigQuery, etc. Demonstratable understanding of underlying architectures and trade-offs
  • Data Quality & Controls: Exposure to data validation, cleansing, enrichment and data controls
  • Containerization: Fair understanding of containerization platforms like Docker, Kubernetes
  • File Formats: Exposure in working on Event/File/Table Formats such as Avro, Parquet, Protobuf, Iceberg, Delta
  • Others: Basics of Job scheduler like Autosys. Basics of Entitlement management
  • Certification on any of the above topics would be an advantage.

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Digital Software Engineering

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Most Relevant Skills

Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills

Structured Query Language (SQL).

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

 

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.

Top Skills

Spark
Ci/Cd
Docker
DynamoDB
ETL
Hadoop
Hive
Informatica
Java
Kubernetes
MongoDB
Mssql
MySQL
Oracle
Python
Scala
Snowflake
Talend

Citi Chennai, Tamil Nadu, IND Office

C P Ramaswamy Road, Chennai, Tamil Nadu, India, 600018

Similar Jobs

5 Days Ago
Hybrid
Pune, Mahārāshtra, IND
Senior level
Senior level
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
The Senior Data Engineer will develop data governance strategies, manage data tools, and ensure effective utilization of data for analytics services, while leading and mentoring cross-functional teams.
Top Skills: AlteryxHadoopNifiPythonSparkSQLSsis
Yesterday
In-Office
Pune, Mahārāshtra, IND
Senior level
Senior level
Fintech • Financial Services
The Data Engineer will design and optimize ETL pipelines using PySpark, manage AWS services, and ensure data security and compliance in cloud environments.
Top Skills: Apache AirflowAWSCloudFormationEmrGitGlueLambdaPysparkPythonRedshiftS3SnowflakeSQLTerraform
Yesterday
In-Office
Pune, Mahārāshtra, IND
Senior level
Senior level
Fintech • Financial Services
The Data Engineer develops high-quality data solutions for regulatory needs and decision-making, working on data pipelines and models, and guiding team members.
Top Skills: Ab InitioSparkCi/CdDockerDynamoDBHadoopHiveInformaticaJavaKubernetesMongoDBMssqlMySQLOraclePythonScalaSnowflakeSQLTalend

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account