Citi Logo

Citi

Data engineer - Chennai

Posted 4 Days Ago
Be an Early Applicant
Tharamani, Chennai, Tamil Nadu
Senior level
Tharamani, Chennai, Tamil Nadu
Senior level
We are seeking a Senior Data Engineer to build data pipelines and leverage data architecture standards. The position involves working with Agile teams, writing code in Python and Pyspark, managing data, and ensuring high-quality data products for decision-making processes.
The summary above was generated by AI

Job Title: Data Engineer

We are looking for a hands-on Data Engineer who is passionate about solving business problems through innovation and engineering practices. As a Data Engineer, the candidate will leverage deep technical knowledge and will apply knowledge of data architecture standards, data warehousing, data structures, and business intelligence to drive the creation of high-quality data products for data driven decision making.

Required Qualifications

6+ Years of relevant experience of implementing data-intensive solutions using agile methodologies.

Code contributing member of Agile teams, working to deliver sprint goals.

Write clean, efficient, and maintainable code that meets the highest standards of quality.

Very strong in coding Python/Pyspark, UNIX shell scripting

Experience in cloud native technologies and patterns

Ability to automate and streamline the build, test and deployment of data pipelines

Technical Skills (Must Have)

  • ETL: Hands on experience of building data pipelines. Proficiency in data integration platforms such as Apache Spark

Experienced in writing Pyspark code to handle large data set ,perform data transformation , familiarity with Pyspark integration with other Apache Spark component ,such as Spark SQL , Understanding of Pyspark optimization techniques

Strong proficiency in working with relational databases and using SQL for data querying, transformation, and manipulation.

  • Big Data: Exposure to ‘big data’ platforms such as Hadoop, Hive or Iceberg for data storage and processing
  • Data Warehousing & Database Management: Understanding of Data Warehousing concepts, Relational (Oracle, MSSQL, MySQL) and NoSQL (MongoDB, DynamoDB) database design
  • Data Modeling & Design: Good exposure to data modeling techniques; design, optimization and maintenance of data models and data structures
  • Languages: Proficient in one or more programming languages commonly used in data engineering such as Python, PySpark, UNIX Shell scripting
  • DevOps: Exposure to concepts and enablers - CI/CD platforms, bitbucket/Github, JIRA, Jenkins, Tekton, Harness

Technical Skills (Valuable)

  • Data Quality & Controls: Exposure to data validation, cleansing, enrichment and data controls, framework libraries like Deequ
  • Federated Query: Starburst, Trino
  • Containerization: Fair understanding of containerization platforms like Docker, Kubernetes, Openshift
  • File Formats: Exposure in working on File/Table Formats such as Avro, Parquet, Iceberg, Delta
  • Schedulers: Basics of Job scheduler like Autosys, Airflow
  • Cloud: Experience in cloud native technologies and patterns (AWS, Google Cloud)
  • Nice to have: Java, for REST API development

Other skills :

  • Strong project management and organizational skills. 
  • Excellent problem-solving, communication, and organizational skills. 
  • Proven ability to work independently and with a team.
  • Experience in managing and implementing successful projects
  • Ability to adjust priorities quickly as circumstances dictate
  • Consistently demonstrates clear and concise written and verbal communication

Education:

  • Bachelor’s degree/University degree or equivalent experience

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Applications Development

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Citi is an equal opportunity and affirmative action employer.

Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Citigroup Inc. and its subsidiaries ("Citi”) invite all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.

View the "EEO is the Law" poster. View the EEO is the Law Supplement.

View the EEO Policy Statement.

View the Pay Transparency Posting

Top Skills

Pyspark
Python
Unix

Citi Chennai, Tamil Nadu, IND Office

C P Ramaswamy Road, Chennai, Tamil Nadu, India, 600018

Similar Jobs

2 Days Ago
Easy Apply
Chennai, Tamil Nadu, IND
Easy Apply
Mid level
Mid level
Artificial Intelligence • Consumer Web • Edtech • Enterprise Web • HR Tech • Social Impact • Generative AI
As a Data Platform Engineer, you will design and build a self-service data infrastructure for Udemy's data mesh. You'll create scalable data pipelines using AWS services and tools like Airflow and Kafka, and contribute to data quality and privacy initiatives. You will work collaboratively in a dynamic environment focused on innovation.
Top Skills: JavaPythonScala
15 Hours Ago
Tharamani, Chennai, Tamil Nadu, IND
Senior level
Senior level
Fintech • Financial Services
The Data Engineer will focus on developing data-intensive solutions by leveraging technical expertise in data architecture, data warehousing, and business intelligence. Responsibilities include building data pipelines, optimizing data processing, and ensuring high-quality data product delivery for informed decision-making, all while working in Agile teams.
Top Skills: PysparkPythonUnix
2 Days Ago
Industrial Estate, Mambalam Guindy, Chennai, Tamil Nadu, IND
Expert/Leader
Expert/Leader
Security • Cybersecurity
As a Principal Software Engineer at Gen, you'll design, develop, and maintain software applications using Java and Python. Collaborate with teams to deliver new features, ensure application quality, mentor junior engineers, and stay updated with industry trends and best practices.
Top Skills: JavaPython

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account