Proximity Works Logo

Proximity Works

Data Engineer

Reposted 5 Days Ago
Remote
Hiring Remotely in India
Mid level
Remote
Hiring Remotely in India
Mid level
The Software Engineer will design and maintain large-scale data pipelines, optimize workflows, and ensure data quality for Ads reporting and analytics.
The summary above was generated by AI

We are looking for a highly skilled Data Engineer. The ideal candidate will have hands-on expertise in Big Data technologies, with a strong foundation in distributed data processing, real-time pipelines, and large-scale data systems. You will design, build, and optimize data solutions that power insights, reporting, and decision-making for our advertising ecosystem.

Key Responsibilities
  • Design, develop, and maintain large-scale data pipelines to support Ads reporting, attribution, and analytics use cases.
  • Work extensively with Hive, Spark, SQL, Scala, and Kafka to process and manage petabyte-scale datasets.
  • Optimize data workflows for performance, scalability, and cost efficiency.
  • Partner with data scientists, product managers, and platform engineers to deliver high-quality, reliable datasets and APIs.
  • Ensure data quality, integrity, and consistency across multiple data sources.
    Troubleshoot and resolve issues in real-time streaming pipelines and batch data jobs.
  • Continuously evaluate new technologies to enhance the Ads Data platform.

Requirements
  • Strong programming experience in Scala (preferred), Java, or Python.
  • Hands-on experience with Apache Spark (batch & streaming) for large-scale data processing.
  • Proficiency in Hive, SQL, and data modeling for analytical workloads.
  • Experience working with Kafka for real-time event streaming.
  • Solid understanding of Big Data ecosystems (S4, Hive, Presto, Delta etc.).
  • Strong debugging, performance tuning, and problem-solving skills.
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field.
Nice to Have
  • Experience in AdTech, Attribution, or Campaign Analytics.
  • Familiarity with cloud-based big data solutions (AWS EMR, GCP BigQuery, Databricks, etc.).
  • Familiarity with scheduling services like AirFlow
  • Knowledge of data governance, security, and compliance best practices.

Benefits
  • Best in class salary: We hire only the best, and we pay accordingly.
  • Proximity Talks: Meet other designers, engineers, and product geeks — and learn from experts in the field.
  • Keep on learning with a world-class team: Work with the best in the field, challenge yourself constantly, and learn something new every day.

Top Skills

Airflow
Spark
Aws Emr
Databricks
Gcp Bigquery
Hive
Java
Kafka
Python
Scala
SQL

Similar Jobs

14 Days Ago
Easy Apply
Remote
India
Easy Apply
Senior level
Senior level
Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
As a Business Intelligence Data Engineer, you'll develop scalable data architectures and models, manage data pipelines, and enhance analytics using AI tools.
Top Skills: AirbyteAirflowAWSBigQueryDatabricksDbtFivetranPythonRedshiftRetoolSnowflakeSQLTableau
20 Days Ago
Remote or Hybrid
India
Senior level
Senior level
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
As a Lead Data Engineer, you'll design and implement solutions for Disability & Absence products, improve existing systems, and collaborate with teams to enhance customer experience.
Top Skills: Big DataCi/CdHbaseHiveKafkaNoSQLPigPythonScalaShell ScriptingSolrSpark
2 Days Ago
Remote or Hybrid
16 Locations
Mid level
Mid level
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
The Platform Engineer manages cloud infrastructure, automates tasks, and improves system reliability while collaborating with cross-functional teams to meet platform needs.
Top Skills: AnsibleAWSAzureBashDockerGitKubernetesPythonTerraformTypescript

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account