Barclays Logo

Barclays

Data Engineer - Pyspark,SQL

Job Posted 16 Days Ago Reposted 16 Days Ago
Be an Early Applicant
In-Office
Pune, Mahārāshtra
Mid level
In-Office
Pune, Mahārāshtra
Mid level
The Data Engineer role involves building and maintaining data architectures, optimizing Pyspark performance, and collaborating on machine learning models. Key responsibilities include developing data solutions on AWS, implementing ETL pipelines, and engaging with stakeholders.
The summary above was generated by AI
Job Description

Purpose of the role

To build and maintain the systems that collect, store, process, and analyse data, such as data pipelines, data warehouses and data lakes to ensure that all data is accurate, accessible, and secure. 

Accountabilities

  • Build and maintenance of data architectures pipelines that enable the transfer and processing of durable, complete and consistent data.
  • Design and implementation of data warehoused and data lakes that manage the appropriate data volumes and velocity and adhere to the required security measures.
  • Development of processing and analysis algorithms fit for the intended data complexity and volumes.
  • Collaboration with data scientist to build and deploy machine learning models.

Assistant Vice President Expectations

  • To advise and influence decision making, contribute to policy development and take responsibility for operational effectiveness. Collaborate closely with other functions/ business divisions.
  • Lead a team performing complex tasks, using well developed professional knowledge and skills to deliver on work that impacts the whole business function. Set objectives and coach employees in pursuit of those objectives, appraisal of performance relative to objectives and determination of reward outcomes
  • If the position has leadership responsibilities, People Leaders are expected to demonstrate a clear set of leadership behaviours to create an environment for colleagues to thrive and deliver to a consistently excellent standard. The four LEAD behaviours are: L – Listen and be authentic, E – Energise and inspire, A – Align across the enterprise, D – Develop others.
  • OR for an individual contributor, they will lead collaborative assignments and guide team members through structured assignments, identify the need for the inclusion of other areas of specialisation to complete assignments. They will identify new directions for assignments and/ or projects, identifying a combination of cross functional methodologies or practices to meet required outcomes.
  • Consult on complex issues; providing advice to People Leaders to support the resolution of escalated issues.
  • Identify ways to mitigate risk and developing new policies/procedures in support of the control and governance agenda.
  • Take ownership for managing risk and strengthening controls in relation to the work done.
  • Perform work that is closely related to that of other areas, which requires understanding of how areas coordinate and contribute to the achievement of the objectives of the organisation sub-function.
  • Collaborate with other areas of work, for business aligned support areas to keep up to speed with business activity and the business strategy.
  • Engage in complex analysis of data from multiple sources of information, internal and external sources such as procedures and practises (in other areas, teams, companies, etc).to solve problems creatively and effectively.
  • Communicate complex information. 'Complex' information could include sensitive information or information that is difficult to communicate because of its content or its audience.
  • Influence or convince stakeholders to achieve outcomes.

All colleagues will be expected to demonstrate the Barclays Values of Respect, Integrity, Service, Excellence and Stewardship – our moral compass, helping us do what we believe is right. They will also be expected to demonstrate the Barclays Mindset – to Empower, Challenge and Drive – the operating manual for how we behave.

Join us as a Data Engineer - Pyspark,SQL at Barclays, where you'll spearhead the evolution of our digital landscape, driving innovation and excellence. You'll harness cutting-edge technology to revolutionise our digital offerings, ensuring unparalleled customer experiences. As a part of team of developers, you will deliver technology stack, using strong analytical and problem solving skills to understand the business requirements and deliver quality solutions.
To be successful as a Data Engineer - Pyspark,SQL you should have experience with:

  • Hands on experience in Pyspark and strong knowledge on Dataframes, RDD and SparkSQL

  • Hands on experience in Pyspark performance optimization techniques .

  • Hands on Experience in developing, testing and maintaining applications on AWS Cloud.

  • Strong hold on AWS Data Analytics Technology Stack (Glue, S3, Lambda, Lake formation, Athena)

  • Design and implement scalable and efficient data transformation/storage solutions with open table formats such as DELTA, Iceberg, Hudi.

  • Experience in using DBT (Data Build Tool) with snowflake/Athena/Glue for ELT pipeline development.

  • Experience in Writing advanced SQL and PL SQL programs.

  • Hands On Experience for building reusable components using Snowflake and AWS Tools/Technology

  • Should have worked at least on two major project implementations.

  • Exposure to data governance or lineage tools such as Immuta and Alation is added advantage.

  • Experience in using Orchestration tools such as Apache Airflow or Snowflake Tasks is added advantage.

  • Knowledge on Ab-initio ETL tool is a plus

Some other highly valued skills includes:

  • Ability to engage with Stakeholders, elicit requirements/ user stories and translate requirements into ETL components

  • Ability to understand the infrastructure setup and be able to provide solutions either individually or working with teams.

  • Good knowledge of Data Marts and Data Warehousing concepts.

  • Resource should possess good analytical and Interpersonal skills.

  • Implement Cloud based Enterprise data warehouse with multiple data platform along with Snowflake and NoSQL environment to build data movement strategy.

You may be assessed on key critical skills relevant for success in role, such as risk and controls, change and transformation, business acumen, strategic thinking and digital and technology, as well as job-specific technical skills.

This role is based out of Pune.

Top Skills

Apache Airflow
AWS
Dbt
Delta
Glue
Hudi
Iceberg
Lambda
Pl Sql
Pyspark
S3
Snowflake
SQL

Barclays Chennai, Tamil Nadu, IND Office

Chennai, India, 600004

Similar Jobs

Yesterday
Hybrid
3 Locations
Senior level
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The Escalation Engineer resolves technical issues for enterprise customers, leads support teams, mentors others, and collaborates with engineering on solutions.
Top Skills: BashCassandraElastic StackGoGraphQLKafkaNgsiemOpensearchPowershellPythonRedisRest
Mid level
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Responsible for leading data governance initiatives for food safety data intelligence, ensuring project completion on time, managing risks, and supporting implementation across regions.
Top Skills: Data AnalyticsData Governance
Yesterday
Hybrid
Pune, Mahārāshtra, IND
Mid level
Mid level
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
The role involves preparing tax provisions, compliance disclosures, and supporting international tax initiatives, requiring strong skills in tax research and accounting principles.
Top Skills: BnaCchExcelOnesource Income Tax Return SoftwareOnesource Tax Provision SoftwarePeoplesoft

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account