Conduct research and development on LLM technologies (RAG, prompt engineering, knowledge-grounded dialogue). Improve foundation model performance via data acquisition, evaluation, SFT, reward modeling, and RL. Prototype downstream AI/agent products for Binance and track academic and industry advances.
Binance is the leading global blockchain ecosystem and cryptocurrency infrastructure provider whose suite of financial products includes the world’s largest digital-asset exchange.
Our mission is to accelerate cryptocurrency adoption and increase the freedom of money.
If you’re looking for a fast-paced, mission-driven organization where opportunities to learn and excel are endless, then Binance is the place for you.
Responsibilities:
- Conduct research on cutting-edge LLM technologies, including but not limited to: large language models and fine-tuning techniques, retrieval-augmented generation (RAG), prompt engineering, and knowledge-grounded dialogue systems.
- Enhance overall performance for foundation models, encompassing data acquisition, model evaluation, SFT, reward modeling, and reinforcement learning.
- Explore new downstream products with AI and AI agent technologies at their core for Binance products.
- Stay up-to-date with the latest academic and industrial advancements in AI, LLMs, etc.
Qualifications:
- Master’s or PhD in Software Development, Computer Science, Computer Engineering, or a related technical discipline.
- At least 3-5 years of relevant industry experience in developing Machine Learning models at scale from inception to business impact
- Research experience in LLMs, multi-modal understanding, vision and language, and other related topics.
- Highly competent in algorithms and programming; strong coding skills in Python, C/C++, etc.
- Experience in NLP and LLM technologies.
- Publications in top-tier venues, such as KDD, CVPR, NeurIPS, ICLR, ICML, EMNLP, ACL, ECCV, ICCV, etc.
Preferred Qualifications
Similar Jobs
Blockchain • Fintech • Software • Cryptocurrency • Metaverse
Develop, deploy, and maintain LLM pipelines and RAG QA/search systems; design and optimize prompts and multi-agent LLM architectures; operate multi‑GPU/cluster inference; build evaluation pipelines for model quality, bias, and hallucination; collaborate with product and CS teams to integrate conversational AI.
Top Skills:
AutogenChain-Of-ThoughtCluster DeploymentCrewaiDpoFew-ShotInference OptimizationLanggraphLlmMulti-Agent Llm ArchitecturesMulti-GpuMulti-Source Knowledge BasesPrompt EngineeringQa/Search SystemsQuantizationRetrieval-Augmented Generation (Rag)Reward Re-RankingSafety FilteringSftSglangTool-CallingVllmZero-Shot
Blockchain • Fintech • Software • Cryptocurrency • Metaverse
Lead end-to-end LLM pipeline for customer service scheduling: data prep, prompt design, RAG systems, multi-agent architectures, multi-GPU deployment, evaluation pipelines, and chatbot integration to improve model quality and decision-making.
Top Skills:
AutogenChain-Of-ThoughtChatbot IntegrationCluster DeploymentCrewaiDpoEvaluation PipelinesHuman FeedbackInference OptimizationLanggraphLlmsMulti-Agent ArchitecturesMulti-GpuPrompt EngineeringQuantizationRetrieval-Augmented Generation (Rag)SftSglangTool-CallingVllm
Blockchain • Fintech • Software • Cryptocurrency • Metaverse
Early-career role contributing to design and development of AI agents, building benchmark datasets, prototyping agent workflows (e.g., LangChain/LangGraph), and collaborating with senior engineers on system integration and deployment.
Top Skills:
LangchainLanggraphLlmLlm Fine-TuningPrompt EngineeringPythonRag
What you need to know about the Chennai Tech Scene
To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.
