Citi

Principal Data Engineer (Big Data - Scala/Pyspark) – C13/VP - Chennai

Posted 2 Days Ago
Chennai, Tamil Nadu
Senior level
The Principal Data Engineer will develop high-quality software products and lead engineering practices. Responsibilities include actively contributing to Agile teams, mentoring junior engineers, and creating scalable solutions while influencing technical architecture and systems.
The summary above was generated by AI

The Role

We are looking for a hands-on Principal Data Engineer who is passionate about solving business problems through innovation and engineering practices. As a Principal Data Engineer, you will leverage your deep technical knowledge to drive the creation of high-quality software products. You will also be expected to mentor other engineers, share your technical expertise, and promote a culture of technical excellence within the team. The Principal Data Engineer will report to an Engineering Manager and will be a floating member of multiple engineering teams. You will be expected to contribute to the codebase and deliver solutions against sprint-level commitments.

Responsibilities

·         Serve as a code-contributing member of multiple Agile teams, working to deliver sprint goals.

·         Demonstrate deep technical knowledge and expertise in software development, including programming languages, frameworks, and best practices. Provide guidance and mentorship to junior team members.

·         Actively contribute to the implementation of critical features and complex technical solutions. Write clean, efficient, and maintainable code that meets the highest standards of quality.

·         Collaborate with other Principal Engineers to define and evolve the overall system architecture and design.

·         Provide guidance on scalable, robust, and efficient solutions that align with business requirements and industry best practices.

·         Offer expert engineering guidance and support to multiple teams, helping them overcome technical challenges, make informed decisions, and deliver high-quality software solutions. Foster a culture of technical excellence and continuous improvement.

·         Stay up to date with emerging technologies, tools, and industry trends. Evaluate their potential impact on the organization and provide recommendations for technology adoption and innovation.

Required Qualifications

·         10+ years’ experience of implementing data-intensive solutions using agile methodologies.

·         Proficient in one or more languages or frameworks commonly used in data engineering, such as Scala or PySpark.

·         Experience with Hadoop for data storage and processing is valuable, as is exposure to modern data platforms such as Snowflake and Databricks.

·         Proven experience of providing technical vision and guidance to a data team

·         Experience of modelling data for analytical consumers

·         Strong proficiency in working with relational databases and using SQL for data querying, transformation, and manipulation.

·         Clear understanding of Data Structures and Object-Oriented Principles.

·         Multiple years of experience with software engineering best practices (unit testing, automation, design patterns, peer review, etc.)

·         Experience in cloud-native technologies and patterns (AWS, Google Cloud).

·         Multiple years of experience architecting and building horizontally scalable, highly available, highly resilient, and low latency applications

·         Multiple years of experience with Cloud-native development and Container Orchestration tools (Serverless, Docker, Kubernetes, OpenShift, etc.)

·         Ability to automate and streamline the build, test and deployment of data pipelines.

·         Thrives in a dynamic environment, capable of managing multiple tasks simultaneously while maintaining a high standard of work.

·         BA/BS degree or equivalent work experience.

Preferred Qualifications

·         Familiarity with open-source data engineering tools and frameworks (e.g., Spark, Kafka, Beam, Flink, Trino, Airflow, dbt) is a valuable asset.

·         Exposure to a range of table and file formats including Iceberg, Hive, Avro, Parquet and JSON

·         Exposure to Infrastructure as Code tools (e.g., Terraform, CloudFormation).

·         Experience of driving and/or influencing the data strategy of your team or organization

Job Family Group: Technology

Job Family: Applications Development

Time Type: Full time

Citi is an equal opportunity and affirmative action employer.

Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Citigroup Inc. and its subsidiaries ("Citi") invite all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.

View the "EEO is the Law" poster. View the EEO is the Law Supplement.

View the EEO Policy Statement.

View the Pay Transparency Posting

Top Skills

Pyspark
Scala

Citi Chennai, Tamil Nadu, IND Office

C P Ramaswamy Road, Chennai, Tamil Nadu, India, 600018

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.
