Dentsu Creative Logo

Dentsu Creative

Lead AI Developer and DevOps

Posted 2 Days Ago
Be an Early Applicant
In-Office
8 Locations
Expert/Leader
In-Office
8 Locations
Expert/Leader
Lead AI Developer and DevOps position focusing on AI model development, deployment, and infrastructure management using various ML and cloud technologies.
The summary above was generated by AI
The purpose of this role is to lead the collaboration with ML Engineers and DevOps Engineers to formulate AI designs that can be built, tested and deployed through the Route to Live and into Production using continuous integration / deployment.

Job Description:

Model Development & Deployment

Model fine-tuning: Use open-source libraries like DeepSpeed, Hugging Face Transformers, JAX, PyTorch, and TensorFlow to improve model performance Large Language Model Operations (LLMOps)

Model deployment and maintenance: deploying and managing LLMs on cloud platforms

Model training and fine-tuning: training and refining LLMs to improve their performance on specific tasks

work out how to scale LLMs up and down, do blue/green deployments and roll back bad releases

Data Management & Pipeline Operations

Curating and preparing training data, as well as monitoring and maintaining data quality

Data prep and prompt engineering: Iteratively transform, aggregate, and de-duplicate data, and make the data visible and shareable across data teams

Building vector databases to retrieve contextually relevant information

Monitoring & Evaluation

Monitoring and evaluation: tracking LLM performance, identifying errors, and optimizing models

Model monitoring with human feedback: Create model and data monitoring pipelines with alerts both for model drift and for malicious user behavior

Establish monitoring metrics

Infrastructure & DevOps

Continuous integration and delivery (CI/CD), where CI/CD pipelines automate the model development process and streamline testing and deployment

Develop and manage infrastructure for distributed model training (e.g., SageMaker, Ray, Kubernetes). Deploy ML models using containerization (Docker)

Required Technical Skills

Programming & Frameworks

Use open-source libraries like DeepSpeed, Hugging Face Transformers, JAX, PyTorch, and TensorFlow

LLM pipelines, built using tools like LangChain or LlamaIndex

Python programming expertise for ML model development

Experience with containerization technologies (Docker, Kubernetes)

Cloud Platforms & Infrastructure

Familiarity with cloud platforms like AWS, Azure, or GCP, including knowledge of services like EC2, S3, SageMaker, or Google Cloud ML Engine for scalable and efficient model deployment

Deploying large language models on Azure and AWS clouds or services such as Databricks

Experience with distributed training infrastructure

LLM-Specific Technologies

Vector databases for RAG implementations

Prompt engineering and template management

Techniques such as few-shot and chain-of-thought (CoT) prompting enhance the model's accuracy and response quality

Fine-tuning and model customization techniques

Knowlege Graphs

Relevance Engineering

Location:

DGS India - Pune - Baner M- Agile

Brand:

Merkle

Time Type:

Full time

Contract Type:

Permanent

Top Skills

AWS
Azure
Deepspeed
Docker
GCP
Hugging Face Transformers
Jax
Kubernetes
Python
PyTorch
TensorFlow

Similar Jobs

2 Days Ago
In-Office
8 Locations
Senior level
Senior level
AdTech • Marketing Tech
Lead collaboration with ML and DevOps Engineers to develop and deploy AI models, optimize model performance, manage data quality, and automate CI/CD processes for AI solutions.
Top Skills: AWSAzureDatabricksDeepspeedDockerEc2GCPHugging Face TransformersJaxKubernetesLangchainLlamaindexPythonPyTorchS3SagemakerTensorFlowVector Databases
Yesterday
Easy Apply
Remote or Hybrid
3 Locations
Easy Apply
Junior
Junior
AdTech • Cloud • Marketing Tech • Productivity • Software • Analytics • Automation
As an Associate SRE, you'll ensure the reliability of data infrastructure, develop monitoring solutions, build automation tools using Python, and collaborate with Data Engineering teams to implement best practices.
Top Skills: AirflowAnsibleAWSAzureDagsterDatadogDockerGCPKubernetesPrefectPythonSnowflakeTerraform
Yesterday
Hybrid
Delhi, Connaught Place, New Delhi, Delhi, IND
Mid level
Mid level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The Regional Sales Manager will drive new business opportunities and grow existing relationships within enterprise clients, focusing on cybersecurity solutions.
Top Skills: Salesforce

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account