Proximity Works Logo

Proximity Works

Senior Data Scientist - LLMs, RAG & Multimodal AI (Remote | Immediate joiner)

Posted 2 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in India
Senior level
Remote
Hiring Remotely in India
Senior level
The Senior Data Scientist will design and optimize large language models and multimodal systems, focusing on retrieval-augmented generation and high-quality search solutions.
The summary above was generated by AI

Join Proximity Works, one of the world’s most ambitious AI technology companies, shaping the future of Sports, Media, and Entertainment. Since 2019, Proximity Works has created and scaled AI-driven products used by 697 million daily users, generating $73.5 billion in enterprise value for our partners. With headquarters in San Francisco and offices in Los Angeles, Dubai, Mumbai, and Bangalore, we partner with some of the biggest global brands to solve complex problems with cutting-edge AI.

We are looking for a Senior Data Scientist with deep expertise in large language models (LLMs), retrieval-augmented generation (RAG), and multimodal learning to shape the next generation of intelligent, scalable, and reliable search systems.
Role Summary

This is a hands-on applied science role at the frontier of AI. You will design, fine-tune, and optimize large-scale language and multimodal models, with a strong focus on retrieval and search. You will productionize retrieval-augmented pipelines, develop ranking and relevance techniques, and define robust evaluation frameworks. You will work closely with engineering and product teams to build systems that combine language, vision, and retrieval modalities — powering high-quality, real-world search and discovery experiences at scale.

What You’ll Do
  • Design, fine-tune, and optimize LLMs for applied multimodal generation use cases.
  • Build and productionize RAG pipelines that combine embedding-based search, metadata filtering, and LLM-driven re-ranking/summarization.
  • Apply prompt engineering, RAG techniques, and model distillation to improve grounding, reduce hallucinations, and ensure output reliability.
  • Define and implement evaluation metrics across semantic search (nDCG, Recall@K, MRR) and generation quality (grounding accuracy, hallucination rate).
  • Optimize inference pipelines for latency-sensitive use cases with strategies like token budgeting, prompt compression, and sub-100ms response targets.
  • Train and adapt models via transfer learning, LoRA/QLoRA, and checkpoint reloading, ensuring robust deployment in production environments.
  • Collaborate with product and research teams to explore innovative multimodal integrations for user-facing applications.
What Success Looks Like
  • Deployment of production-ready LLM + RAG pipelines powering global-scale search and discovery applications.
  • Demonstrable improvements in grounding accuracy and hallucination reduction across deployed systems.
  • Consistent delivery of sub-100ms inference latency for generation workloads.
  • Adoption of rigorous evaluation metrics that drive continuous model improvement.
  • Effective cross-functional collaboration with engineering, product, and research teams.

RequirementsWhat You’ll Need
  • Strong background in NLP, machine learning, and multimodal AI.
  • Proven hands-on experience in LLM fine-tuning, RAG, distillation, and evaluation of foundation models.
  • Expertise in semantic search and retrieval pipelines (e.g., FAISS, Weaviate, Vespa, Pinecone).
  • Demonstrated ability to deploy models at scale, including distributed inference setups.
  • Solid understanding of evaluation frameworks for ranking, retrieval, and generation.
  • Proficiency in Python, PyTorch/TensorFlow, and modern ML toolkits.
  • Experience in multimodal AI (bridging text, vision, or speech with LLMs).
  • Track record of shipping latency-sensitive AI products.
  • Strong communication skills and the ability to collaborate with cross-functional global teams.
Success Traits

Builder’s mindset · High ownership · Analytical clarity · Collaborative spirit · Global mindset · Growth orientation


BenefitsWhy Join Proximity Works
  • Work directly on frontier AI problems with some of the world’s largest sports, media, and entertainment brands.
  • Be part of a global-first, high-performance engineering culture.
  • Competitive compensation aligned with global markets, with remote-first flexibility.
  • Annual global off-sites with Proxonauts from San Francisco, Dubai, India, and beyond.
  • High autonomy, direct accountability, and the opportunity to ship AI systems at scale.

Top Skills

Faiss
Pinecone
Python
PyTorch
TensorFlow
Vespa
Weaviate

Similar Jobs

3 Hours Ago
Remote
India
Senior level
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
As a Senior Staff Software Engineer at Coinbase, you will lead AI infrastructure projects, write high-quality code in Python and Golang, mentor team members, and enhance system reliability and scalability.
Top Skills: DockerGoMongoDBPostgresPython
3 Hours Ago
Remote
India
Senior level
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Staff Software Engineer will architect and develop the identity platform, mentor junior engineers, and collaborate across teams to define technical roadmaps.
Top Skills: SparkDockerGoGrpcHiveSQL
3 Hours Ago
Remote
India
Expert/Leader
Expert/Leader
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Seeking a Principal Engineer to influence engineering efforts, drive product architecture, mentor team members, and oversee large-scale systems in fintech and blockchain.
Top Skills: BlockchainCryptoEngineering Best PracticesFintechLarge-Scale Systems

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account