This role involves developing scalable data integration pipelines, collaborating with teams on solutions, and ensuring data quality using various modern technologies.
This is a remote position.
About our client:
Our client develops and supports software and data solutions across a variety of industries. They want you to get ahead of the market and stay there. They offer a combination of plug and play products that can be integrated with existing systems and processes and can also be customised to client needs. Their capabilities extend to big data engineering and bespoke software development, solutions are available as both cloud-based and hosted.
What you will be doing:
- Analyzes complex customer data to determine integration needs.
- Develops and tests scalable data integration/transformation pipelines using PySpark, SparkSQL, and Python.
- Contributes to the codebase through coding, reviews, validation, and complex transformation logic.
- Automates and maintains data validation and quality checks.
- Collaborates with FPA, data engineers, and developers to align solutions with financial reporting and business objectives.
- Participates in solution architecture and technical discussions, refining user stories and acceptance criteria.
- Utilizes modern data formats/platforms (Parquet, Delta Lake, S3/Blob Storage, Databricks).
- Partners with the product team to ensure accurate customer data reflection and provide feedback based on data insights.
What our client is looking for:
- A Data Analytics Engineer with 5+ years of experience.
- Must have strong Python, PySpark, Notebook, and SQL coding skills, especially with Databricks and Delta Lake.
- Proven ability to build and deploy scalable ETL pipelines to cloud production environments using CI/CD.
- Experience with Agile/Scrum, data quality concepts, and excellent communication is essential.
- Cloud environment (Azure, AWS) and Infrastructure as Code (Terraform, Pulumi) experience beneficial.
- Telecoms industry or consulting experience, plus accounting knowledge, is a plus.
Job ID:
- J106998
For a more comprehensive list of opportunities that we have on offer, do visit our website - https://www.parvana.co.uk/careers
Requirements
Data Engineer, PySpark, Python, SQL, Databricks, ETL, CI/CD, Cloud, Azure, AWS
Similar Jobs
Fintech • Payments • Software
The Data Engineer will design, develop, and maintain data pipelines, implementing ETL processes and managing data infrastructure, ensuring high-quality data delivery and system reliability.
Top Skills:
Apache AirflowBigQueryDebeziumHelmKafkaKubernetesMongoDBMySQLPostgresPythonRedshiftSinglestoreTerraform
Automotive • eCommerce • Fintech • Transportation
The Data Engineer will develop and maintain data pipelines, perform data extraction and transformation, and support data analytics across the enterprise.
Top Skills:
AirbyteApache AirflowGoogle BigqueryKubernetesPandasPysparkPythonSQL
Artificial Intelligence • HR Tech • Information Technology • Social Impact
The Data Engineer will analyze and interpret data, design scalable databases, optimize data operations, and develop LookML models while collaborating with various teams.
Top Skills:
BigQueryDbtFivetranGitLookerLookmlPythonSQL
What you need to know about the Chennai Tech Scene
To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.



