Design, build, optimize, and maintain high-performance Spark-based data pipelines using Scala/Java and Hive on Hadoop/CDP. Own full project lifecycle, enforce coding best practices, troubleshoot Spark/Hive/YARN performance, and collaborate with stakeholders to deliver scalable data solutions.
We need a Senior Data Engineer with 10+ years of experience who is proficient in Spark, Scala/Java, and Hive and has extensive hands-on development experience in the big data ecosystem.
Key Responsibilities:
- Design, implement, and optimize high-performance data pipelines using Spark, Scala/Java, and Hive on platforms such as Cloudera Data Platform (CDP) or other Hadoop ecosystems.
- Take complete ownership of complex data engineering projects within the big data ecosystem, covering the entire lifecycle from initial design and development to deployment and ongoing maintenance.
- Develop robust and efficient Hive queries for extensive data analysis and reporting.
- Champion and enforce best practices and coding standards for new and existing Spark, Scala/Java, and Hive data flows so that they remain robust, scalable, secure, and maintainable.
- Diagnose, troubleshoot, and resolve complex issues related to Spark, Scala/Java, and Hive applications and YARN resource management, implementing performance optimization solutions.
- Collaborate proactively and closely with stakeholders to develop solutions, with full commitment and accountability.
Technical Skills & Experience:
- Proven hands-on development expertise with Apache Spark.
- Strong programming proficiency in Scala and/or Java.
- In-depth knowledge and practical experience with Hive, including query optimization and data analysis.
- Experience with data platforms such as Cloudera Data Platform (CDP) is highly desirable.
Education:
- Bachelor's or Master's degree, or equivalent experience.
Photon Chennai, Tamil Nadu, IND Office
DLF IT Park, 1/124 Mount Poonamallee Road, Sivaji Gardens, Manapakkam, Chennai, India, 600089