Citi Logo

Citi

Senior Data Engineer (ETL/Big Data/Python) - Assistant Vice President

Posted 4 Days Ago
Be an Early Applicant
In-Office
Chennai, Tamil Nadu, IND
Senior level
In-Office
Chennai, Tamil Nadu, IND
Senior level
Lead design, development, and maintenance of an ETL-based reporting framework to collect utilization metrics, automate billing/chargebacks, and build dashboards. Implement scalable PySpark/Databricks pipelines on Hadoop ecosystem, manage storage across Oracle, MongoDB, and Snowflake, schedule jobs with Autosys/Airflow, optimize performance, mentor team members, lead stakeholder requirements, coordinate QA and resolve production incidents.
The summary above was generated by AI

CITI Bank Enterprise Analytical Services organization is seeking a highly skilled and experienced Lead Developer for the Reporting Framework. This critical framework is essential for collecting and analyzing utilization metrics from our Analytical Services Platforms, automating billing and chargeback processes, and developing insightful dashboards. The ideal candidate will be a technical leader with a strong background in data analytics & visualization, ETL design, and performance optimization.

As the Lead Developer, you will be a key driver of the reporting framework's success, responsible for its design, development, and maintenance. You will play a pivotal role in shaping our data reporting capabilities and providing crucial business intelligence to stakeholders.

Key Responsibilities:

  • Full-Lifecycle Development:
    • Design, develop, and implement ETL based reporting framework, focusing on data collection, transformation, and presentation.
    • Utilize a robust technology stack including Databricks, Spark, Hive, Ozone, and Hadoop to process large volumes of data efficiently.
    • Write and optimize complex ETL jobs using PySpark and advanced Python scripting.
    • Design and manage data storage in various databases, including Oracle, MongoDB and Snowflake for flexible data models.
    • Develop and maintain automated job schedules using Autosys, Apache Airflow for seamless data pipeline execution.
    • Ability to develop visualization reporting dashboards.
    • Able to leverage enterprise approved productivity tools like Copilot, etc., in daily analysis & development tasks
  • Technical Leadership & Collaboration:
    • Act as the technical subject matter expert (SME) for the reporting framework, providing guidance and mentorship to the development team.
    • Lead requirements gathering discussions with business stakeholders and product owners to understand reporting needs and translate them into technical solutions.
    • Coordinate closely with the QA team to ensure thorough testing and data validation, maintaining high standards of data accuracy and integrity.
    • Collaborate with other development teams and data engineers to ensure data sources are integrated correctly and efficiently.
  • Performance & Optimization:
    • Design the framework with an "ETL design mindset," focusing on modularity, scalability, and maintainability.
    • Proactively identify and resolve performance bottlenecks in data pipelines and queries.
    • Ensure the framework's code is optimized for high performance and low latency.
    • Apply advanced scripting for automation, system administration, and data management tasks.
  • Issue Resolution & Support:
    • Timely analyze and resolve user issues and incidents related to the reporting framework as a development SME.
    • Conduct root cause analysis for production issues and implement strategic resolution.
    • Participate in code reviews to ensure code quality, best practices, and security standards are met.

Qualifications:

  • Education: Bachelor's degree in Computer Science, Information Technology, or a related field.
  • Experience:
    • 8+ years of experience in data engineering, software development, or a similar role, with at least 2 years in a lead capacity.
    • Proven experience with ETL pipeline design and development.
  • Technical Skills:
    • Expert proficiency in Python and Spark.
    • Strong experience with advanced scripting.
    • Deep knowledge of relational databases (Oracle) and NoSQL databases (MongoDB), Snowflake.
    • Solid understanding of data modeling, data warehousing, and performance tuning.
    • Experience with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI, GitHub Actions).
    • Experience with monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK Stack, Splunk, Datadog).
    • Familiarity with Databricks, data visualization tools and dashboard development is a plus.
    • Experience with IaC tools (e.g., Terraform, Ansible, CloudFormation) is a plus.
  • Soft Skills:
    • Excellent problem-solving and analytical skills, with a keen eye for detail.
    • Strong communication and leadership skills, with the ability to manage and mentor a team.
    • Proactive and self-driven, with a strong commitment to delivering high-quality, reliable solutions.
    • Ability to thrive in a fast-paced, collaborative, and results-oriented environment.

------------------------------------------------------

Job Family Group: Technology

------------------------------------------------------

Job Family:Systems & Engineering

------------------------------------------------------

Time Type:Full time

------------------------------------------------------

Most Relevant Skills Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

 

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.

Citi Chennai, Tamil Nadu, IND Office

C P Ramaswamy Road, Chennai, Tamil Nadu, India, 600018

Similar Jobs

14 Minutes Ago
In-Office
Chennai, Tamil Nadu, IND
Senior level
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Lead offshore engineering teams delivering full stack and Generative AI solutions. Provide technical leadership, mentor engineers, and ensure delivery quality.
Top Skills: Github CopilotLangchainLanggraphMongoDBNode.jsPythonReactRestful Apis
21 Minutes Ago
In-Office
Chennai, Tamil Nadu, IND
Senior level
Senior level
Artificial Intelligence • Machine Learning
The role involves managing complex programs across engineering, AI/ML, and business teams, ensuring delivery quality while adapting to rapidly changing priorities within an AI-native environment.
Top Skills: AgileConfluenceEmotion AiGenerative AiGitJIRAKnowledge Ai
23 Minutes Ago
Remote or Hybrid
India
Senior level
Senior level
AdTech • Big Data • Digital Media • Software
Lead technical operations for Magnite in India, focusing on integrations, optimisation, and strategy while guiding clients and stakeholders on ad tech solutions.
Top Skills: APIsGamJavaScriptOpenrtbPrebidPythonSpringserveSQLVast

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account