Citi

Python Data Engineer

Posted 2 Days Ago
In-Office
Chennai, Tamil Nadu
Mid level
Design, build, test, and deploy Python-based data migration, profiling, and processing solutions. Implement scalable PySpark pipelines, optimize SQL (Oracle) queries, integrate via JDBC, and handle streaming (Kafka/Spark Streaming). Collaborate with architects and stakeholders, perform data analysis with Pandas, ensure data integrity, and troubleshoot pipeline performance and quality issues.
The summary above was generated by AI
We are seeking a highly motivated and intuitive Python Developer to join our dynamic team, focusing on critical data migration and profiling initiatives. The ideal candidate will be a self-starter with strong engineering principles, capable of designing and implementing robust solutions for handling large datasets and complex data flows. This role offers an exciting opportunity to work on challenging projects that drive significant impact within our data ecosystem.

Responsibilities:

  • Develop, test, and deploy high-quality Python code for data migration, data profiling, and data processing.
  • Design and implement scalable solutions for working with large and complex datasets, ensuring data integrity and performance.
  • Utilize PySpark for distributed data processing and analytics on large-scale data platforms.
  • Develop and optimize SQL queries for various database systems, including Oracle, to extract, transform, and load data efficiently.
  • Integrate Python applications with JDBC-compliant databases (e.g., Oracle) for seamless data interaction.
  • Implement data streaming solutions to process real-time or near real-time data efficiently.
  • Perform in-depth data analysis using Python libraries, especially Pandas, to understand data characteristics, identify anomalies, and support profiling efforts.
  • Collaborate with data architects, data engineers, and business stakeholders to understand requirements and translate them into technical specifications.
  • Contribute to the design and architecture of data solutions, ensuring best practices in data management and engineering.
  • Troubleshoot and resolve technical issues related to data pipelines, performance, and data quality.

Qualifications:

  • 4-7 years of relevant experience in the financial services industry
  • Strong Proficiency in Python:
    • Excellent command of Python programming, including object-oriented principles, data structures, and algorithms.
  • PySpark Experience:
    • Demonstrated experience with PySpark for big data processing and analysis.
  • Database Expertise:
    • Proven experience working with relational databases, specifically Oracle, and connecting applications using JDBC.
  • SQL Mastery:
    • Advanced SQL querying skills for complex data extraction, manipulation, and optimization.
  • Big Data Handling:
    • Experience working with and efficiently processing large datasets.
  • Data Streaming:
    • Familiarity with data streaming concepts and technologies (e.g., Kafka, Spark Streaming) for processing continuous data flows.
  • Data Analysis Libraries:
    • Proficiency in data analysis libraries such as Pandas for data manipulation and exploration.
  • Software Engineering Principles:
    • Solid understanding of software engineering best practices, including version control (Git), testing, and code review.
  • Problem-Solving:
    • Intuitive problem-solver with a self-starter mindset and the ability to work both independently and as part of a team.

Education:

  • Bachelor’s degree/University degree or equivalent experience

This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required.

Preferred Skills & Qualifications (Good to Have):

  • Experience in developing and maintaining reusable Python packages or libraries for data engineering tasks.
  • Familiarity with cloud platforms (e.g., AWS, Azure, GCP) and their data services.
  • Knowledge of data warehousing concepts and ETL/ELT processes.
  • Experience with CI/CD pipelines for automated deployment.

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Applications Development

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Most Relevant Skills

Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills

For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.

Top Skills

Python, PySpark, SQL, Oracle, JDBC, Kafka, Spark Streaming, Pandas, Git, AWS, Azure, GCP, CI/CD

Citi Chennai, Tamil Nadu, IND Office

C P Ramaswamy Road, Chennai, Tamil Nadu, India, 600018
