Saaf Finance Logo

Saaf Finance

Data Engineer

Reposted 14 Days Ago
Remote
Hiring Remotely in India
Mid level
Remote
Hiring Remotely in India
Mid level
The Data Engineer will design and maintain data pipelines, optimize data warehousing, and ensure data quality and governance while collaborating with cross-functional teams.
The summary above was generated by AI

Saaf Finance is building the AI workforce for the mortgage industry, reimagining how loans are underwritten and processed. Backed by leading financial institutions, we’re scaling fast and looking for a Data Engineer who thrives on solving complex data challenges and wants to shape the infrastructure behind one of the most data-intensive industries in the world.

As a Data Engineer at Saaf, you’ll own the backbone of our AI-driven platform: the data. From borrower interactions to loan performance metrics, every workflow we build depends on data being accurate, reliable, and available in real-time. This is a hands-on role where you’ll work closely with engineering, product, and data teams to design and operate production-grade infrastructure that powers analytics, product features, and next-generation agentic automation in mortgage origination.

This role is perfect for someone who enjoys building from scratch, wants to push the boundaries of how data pipelines can support AI systems, and is excited to reimagine financial workflows at scale.

Key Responsibilities
  • Data Pipeline Development – Design, implement, and maintain ETL/ELT pipelines for structured and unstructured datasets from internal and external sources.
  • Data Warehousing – Build and optimize warehouses and marts (Snowflake, BigQuery, or similar) for analytics, reporting, and product use cases.
  • Integration – Ingest data from APIs, SaaS platforms such as CRM and financial data APIs, and internal systems into the core data platform.
  • Data Modeling: Design, implement, and maintain conceptual, logical, and physical data models to ensure scalable, consistent, and high-quality datasets for downstream analytics and applications
  • Data Quality and Governance – Implement validation, schema management, and robust documentation to ensure data accuracy and compliance.
  • Performance Optimization – Monitor and fine-tune pipeline and warehouse performance for scalability and cost efficiency.
  • Security and Compliance – Apply data security and privacy controls aligned with financial regulatory requirements, ensuring full traceability of every transformation.
  • Analytics Enablement – Provide clean, consistent datasets for analysts, product managers, and operational teams to support fast, data-driven decisions.

Requirements

Technical Expertise

  • Strong SQL and Python development skills for data transformation and automation.
  • Experience with modern ETL/ELT frameworks such as dbt.
  • Proficiency with cloud platforms (AWS preferred) and serverless data services.
  • Strong experience with data warehouse technologies (Snowflake preferred).
  • Skilled in API integrations and ingestion from third-party systems.

Data Operations

  • Proficient in data modeling (Kimball/Star schema, Data Vault).
  • Experience implementing CI/CD practices for data workflows.
  • Ability to set up logging, monitoring, and alerting for data jobs.
Bonus Skills
  • Experience building agentic workflows and orchestrating multi-step automated processes that act on data in real time.
  • Familiarity with data engineering patterns and infrastructure required for the recent wave of AI-powered tools and automation platforms.
  • Experience working with financial datasets and APIs in a high-compliance environment.
  • Understanding of data privacy regulations such as GDPR and CCPA.
Qualifications
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field.
  • 3+ years in a data engineering or similar backend data-focused role.
  • Proven track record of delivering production-grade data pipelines at scale.
  • Experience collaborating closely with product managers, data scientists, and full stack engineers.
  • Startup mindset: hands-on, resourceful, and comfortable operating in a fast-paced environment.

Benefits
  • Competitive salary
  • High ownership from day one — your work will directly shape core systems and products
  • Fast-paced environment with quick decision cycles and minimal bureaucracy
  • Remote-first team with flexibility on work hours and location
  • Direct access to founders and cross-functional teams — no layers, no silos
  • Clear expectations, regular feedback, and support for professional growth
  • Work on real problems in a complex, high-impact industry

Top Skills

AWS
Dbt
Python
Snowflake
SQL

Similar Jobs

14 Days Ago
Easy Apply
Remote
India
Easy Apply
Senior level
Senior level
Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
As a Business Intelligence Data Engineer, you'll develop scalable data architectures and models, manage data pipelines, and enhance analytics using AI tools.
Top Skills: AirbyteAirflowAWSBigQueryDatabricksDbtFivetranPythonRedshiftRetoolSnowflakeSQLTableau
20 Days Ago
Remote or Hybrid
India
Senior level
Senior level
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
As a Lead Data Engineer, you'll design and implement solutions for Disability & Absence products, improve existing systems, and collaborate with teams to enhance customer experience.
Top Skills: Big DataCi/CdHbaseHiveKafkaNoSQLPigPythonScalaShell ScriptingSolrSpark
2 Days Ago
Remote or Hybrid
16 Locations
Mid level
Mid level
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
The Platform Engineer manages cloud infrastructure, automates tasks, and improves system reliability while collaborating with cross-functional teams to meet platform needs.
Top Skills: AnsibleAWSAzureBashDockerGitKubernetesPythonTerraformTypescript

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account