As a Data Engineer focusing on web scraping, you will manage scraping configurations, monitor errors, oversee data retrieval, and build data pipelines.
We are growing! We are currently looking to hire a Data Engineer - Web Scraping to work with us remotely on a 4 months contract.
Who we are:
Founded in 2006, today, we’re proud to be a global business. From Shanghai to Paris, we have 12 offices and operate across four continents in 70 countries. We are home to over 160 professionals from around the world, working together to serve more than 200 luxury clients.
At CXG, we love to evolve, elevate, and transform experiences while bringing brand promises to life. We offer strategic solutions that impact performance and elevate the customer experience of some of the world’s most iconic premium and luxury brands.
What you will be doing:
- Maintain and manage website scraping configurations using Python.
- Monitor scraping configurations for errors and potential crashes.
- Oversee retrieved data to detect potential issues and blockages.
- Coordinate with stakeholders to understand scraping task requirements and report issues.
- Prepare and share periodic reports on scraping activities with stakeholders.
- Develop necessary pipelines to ingest data into the Datalake and perform required transformations.
Requirements
What you will bring along:
- Proven experience in data engineering with expertise in designing and implementing scalable data architectures.
- Strong experience with ETL processes, data modeling, and data warehousing (Airflow & DBT preferred).
- Expertise in database technologies, both relational (SQL) and NoSQL.
- Knowledge of cloud platforms, particularly Azure.
- Solid understanding of data security measures and compliance standards.
- Excellent Python experience for data engineering and automation.
- Strong collaboration skills to work closely with data scientists and analysts.
- Ability to optimize data pipelines for performance and efficiency.
- Ability to build, test, and maintain tasks and projects.
- Experience with version control systems, such as Git.
- Hands-on experience with Airflow and/or DBT.
- Experience with Terraform for infrastructure management.
- Minimum 2 years of experience in a similar role.
- Strong academic background in a relevant field.
- Fluent in English (French is a plus).
Similar Jobs
Big Data • Cloud • Information Technology
The Senior Product Manager will define product strategy, manage SaaS product enhancements, drive innovation in data governance, and collaborate across teams to deliver customer-centric solutions.
Top Skills:
Agile SdlcAIAnalyticsBusiness IntelligenceData Loss PreventionData PrivacyData VisualizationJIRAMachine LearningNatural Language ProcessingSaaS
Appliances • Manufacturing
The Communications Lead will develop and execute strategies to enhance Dyson's reputation in SEA, manage external agency partnerships, and lead a team to deliver impactful communications.
Top Skills:
Communications StrategyMedia RelationsSocial Media
Appliances • Manufacturing
The Finance Systems Analyst will manage finance systems, ensure optimal performance, support implementation, and train users while collaborating with various teams.
Top Skills:
Bi&AC#Data WarehousesErpFinancial SystemsSQLVb.Net
What you need to know about the Chennai Tech Scene
To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

