GroundTruth Logo

GroundTruth

Engineering Manager- Data Engineering

Posted 2 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in India
Senior level
Remote
Hiring Remotely in India
Senior level
The Engineering Manager leads the Data Engineering team, overseeing the design of scalable data pipelines using AWS technologies, mentoring engineers, and collaborating with stakeholders on data-first initiatives.
The summary above was generated by AI

GroundTruth is an advertising platform that turns real-world behavior into marketing that drives in-store visits and other real business results. We use observed real-world consumer behavior, including location and purchase data, to create targeted advertising campaigns across all screens, measure how consumers respond, and uncover unique insights to help optimize ongoing and future marketing efforts.

With this focus on media, measurement, and insights, we provide marketers with tools to deliver media campaigns that drive measurable impact, such as in-store visits, sales, and more.

Learn more at groundtruth.com.

We believe that innovative technology starts with the best talent and have been ranked one of Ad Age’s Best Places to Work in 2021, 2022, 2023 & 2025! Learn more about the perks of joining our team here.

About Us

GroundTruth is looking for a Data Engineering Manager with strong expertise in designing and building scalable data platforms and pipelines to join our team. The Data Engineering Team is responsible for the core data infrastructure that powers our audience platform.
As an Engineering Manager on our Audience Engineering team, you will build solutions that add new data capabilities and analytical depth to our platform while managing sophisticated AWS-native data services.

You will:

  • Architect Scalable Pipelines: Oversee the design and deployment of large-scale distributed data processing jobs using PySpark on Amazon EMR clusters and serverless AWS Glue ETL jobs.
  • Coach and mentor engineers—supporting growth in technical skills (particularly Python and Spark optimization), data modeling best practices, and career progression.
  • Partner with stakeholders and engineering leadership to evaluate, plan, and deliver data-first projects across advertising systems, analytics services, and reporting features.
  • Lead by example: Write production-ready Python and PySpark code, perform code reviews, and optimize Spark configurations to improve performance and reduce costs. Apply Agile methodologies such as Scrum to drive iterative development, foster team collaboration, and ensure continuous delivery of high-quality data solutions.
  • Support engineers through regular 1:1s, feedback, quarterly reviews, recognition, and performance management.

You have:

  • Bachelor’s degree in Computer Engineering, Data Science, or equivalent practical experience.
  • 8+ years of experience in technology, specifically focused on data engineering, data warehousing, or big data architecture.
  •  2+ years of experience of leading a data engineering team.
  • Expertise in Python & PySpark: Deep experience writing and tuning distributed processing applications, handling data skew, and optimizing Spark memory management.
  • Advanced AWS Expertise: Proven track record of managing Amazon EMR for heavy-duty processing.
  • Experience with Big Data Infrastructure: Build the systems required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS ‘big data’ technologies (S3, EMR, Glue, Athena and Lambda).
  • Expert SQL skills for complex transformations, performance tuning, and deep-dive analytics.
  • Experience with Orchestration: Advanced proficiency with Airflow and Git.
  • AI-Driven Engineering: Proven track record of leveraging AI across the data engineering process to drive modernization, automate data quality checks, and enhance delivery outcomes.
  • Hands-on familiarity with AI-native tools such as Cursor, Claude, or GitHub Copilot to scale data development.

How you can impress us:

  • Performance Tuning Specialist: Ability to debug complex PySpark  and/or Scala jobs and optimize EMR Instance Fleets/Spot Instances to balance performance with infrastructure costs.
  • Good to have experience with event-driven architecture and hands-on experience using AWS SQS for scalable, reliable event processing.
  • AWS certification is preferred, demonstrating expertise in designing and building scalable cloud-based data solutions.
  • Organized and collaborative—comfortable in a fast-moving, data-intensive environment.
  • Detail-oriented: Catches data quality issues early and implements automated course-corrections.
  • Strong communicator who aligns business needs with technical data constraints through clear trade-offs.
  • Deep problem solver who diagnoses pipeline bottlenecks and partners across teams to drive durable data solutions

Benefits

At GroundTruth, we want our employees to be comfortable with their benefits so they can focus on doing the work they love.

  • Parental leave- Maternity and Paternity
  • Flexible Time Offs (Earned Leaves, Sick Leaves, Birthday leave, Bereavement leave & Company Holidays) 
  • In Office Daily Catered Breakfast, Lunch, Snacks and Beverages
  • Health cover for any hospitalization. Covers both nuclear family and parents
  • Tele-med for free doctor consultation, discounts on health checkups and medicines
  • Wellness/Gym Reimbursement
  • Pet Expense Reimbursement
  • Childcare Expenses and reimbursements
  • Employee referral program
  • Education reimbursement program
  • Skill development program
  • Cell phone reimbursement (Mobile Subsidy program).
  • Internet reimbursement/Postpaid cell phone bill/or both.
  • Birthday treat reimbursement
  • Employee Provident Fund Scheme offering different tax saving options such as Voluntary Provident Fund and employee and employer contribution up to 12% Basic
  • Creche reimbursement
  • Co-working space reimbursement
  • National Pension System employer match
  • Meal card for tax benefit
  • Special benefits on salary account

Top Skills

Airflow
Athena
AWS
Aws Glue
Emr
Git
Lambda
Pyspark
Python
S3
SQL

Similar Jobs

8 Days Ago
Remote
Karnataka, IND
Senior level
Senior level
Other • Retail
The Senior Manager will lead a team of data engineers, designing scalable data platforms, executing strategies for analytics and machine learning, and ensuring data quality and governance.
Top Skills: Data ArchitectureData PipelinesDatabricksMachine LearningSnowflake
18 Days Ago
Remote
Karnataka, IND
Senior level
Senior level
Other • Retail
Lead a high-performing software engineering team focused on developing scalable data platforms and services, drive SDK and API development, and ensure adherence to best practices.
Top Skills: Apache AirflowApache KafkaSparkAWSAws RdsAzureAzure SqlC++DatabricksGCPGcp Cloud SqlJavaOciPythonSnowflakeSQL
5 Days Ago
Easy Apply
Remote
India
Easy Apply
Senior level
Senior level
Big Data • Logistics • Analytics
Manage a team of Data Engineers, overseeing the execution of data engineering initiatives for scalable data systems and governance.
Top Skills: AdlsAzureCloud-Native Data ServicesDatabricksSQL

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account