Nacre Capital

Data Engineer

Posted 18 Days Ago
5 Locations
Entry level
The Data Engineer will build and maintain data pipelines and workflows for machine learning research, ensure data quality and availability, utilize databases, and collaborate with ML researchers. Responsibilities include implementing data architectures and supporting the deployment of ML models.
The summary above was generated by AI

Description

About Us

Aquaticode builds artificial intelligence solutions for aquaculture. Our core competency lies at the intersection of biology and artificial intelligence, utilizing specialized imaging technology to detect, identify, and predict traits of aquatic species. We value commitment and creativity in building real-world solutions that benefit humanity.

Position Overview

We are seeking a talented Data Engineer with experience supporting Machine Learning (ML) research to join our team. The ideal candidate will have a strong background in building robust data pipelines and workflows that facilitate ML projects, and an eagerness to learn new technologies. This role requires proficiency in data processing technologies and an understanding of the data needs specific to ML research.

Key Responsibilities

· Develop, maintain, and optimize data pipelines and workflows to support ML research and model development.

· Design and implement scalable data architectures for handling large datasets used in ML models.

· Collaborate closely with ML researchers and data scientists to understand data requirements and ensure data availability and quality.

· Work with databases and data integration processes to prepare and transform data for ML experiments.

· Utilize MongoDB and other NoSQL databases to manage unstructured and semi-structured data.

· Write efficient, reliable, and maintainable code in Python and SQL for data processing tasks.

· Implement data validation and monitoring systems to ensure data integrity and performance.

· Support the deployment of ML models by integrating data solutions into production environments.

· Ensure the scalability and performance of data systems through rigorous testing and optimization.

Required Skills & Qualifications

· Proficiency in English (spoken and written).

· Strong experience in Python and SQL.

· Hands-on experience with data processing in Apache Airflow.

· Experience working with databases, including MongoDB (NoSQL) and relational databases.

· Understanding of data modeling, ETL processes, and data warehousing concepts.

· Experience with cloud platforms like AWS, GCP, or Azure.

Good to Have

· Experience with other NoSQL databases like InfluxDB, Elasticsearch, or similar technologies.

· Experience with backend frameworks like FastAPI, Flask, or Django.

· Knowledge of containerization tools like Docker.

· Familiarity with messaging queues like RabbitMQ.

· Understanding of DevOps practices and experience with CI/CD pipelines.

· Experience with front-end development (e.g., React, Next.js).

About Nacre Capital

We were founded by Nacre Capital, a venture builder focused on AI within the life sciences. Nacre has an impressive track record in creating, building, and growing deep tech startups, including Face.com (acquired by Facebook), Fairtility, FDNA, and Seed-X.

Top Skills

Python
SQL

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.
