CoinMarketCap

LLM Algorithm Engineer

Reposted 20 Days Ago
In-Office or Remote
13 Locations
Mid level
The role involves post-training LLMs, aligning models for function calling and tool use, operating MCP servers for checkpoint routing, and building evaluation pipelines.
The summary above was generated by AI
Job Responsibilities:
1. Perform advanced post-training of large language models (e.g., SFT, RLHF/RLAIF, continual pretraining).
2. Align models for reliable JSON-schema function calls and external tool usage.
3. Design, deploy, and operate Model Context Protocol (MCP) servers that handle checkpoint routing, manage context windows, and enforce safety gates.
4. Run distributed training and inference with DeepSpeed/FSDP, LoRA/QLoRA, and mixed precision, and tune performance on vLLM or Triton clusters.
5. Build offline and live evaluation pipelines for alignment, factuality, grounding, and hallucination detection.

Qualifications
1. Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
2. 3+ years of experience in developing and optimizing large language models.
3. Proven track record in implementing advanced post-training techniques (SFT, RLHF, RLAIF, continual pretraining).
4. Hands-on experience with distributed training frameworks (DeepSpeed, FSDP) and optimization techniques (LoRA, QLoRA, mixed precision).
5. Familiarity with model alignment, JSON-schema function calls, and external tool integration.
6. Experience in building and maintaining evaluation pipelines for model performance assessment.
7. Proficiency in Python and relevant machine learning frameworks (e.g., PyTorch, TensorFlow).
8. Strong understanding of distributed systems and high-performance computing.
9. Experience with model deployment and inference optimization on vLLM or Triton clusters.
10. Knowledge of JSON-schema and API development.

Top Skills

DeepSpeed
FSDP
LoRA
Python
PyTorch
QLoRA
TensorFlow
Triton
vLLM
