Develop and deploy NLP and GenAI applications/products, conduct ML experiments, and create APIs and UI for user interaction.
You'll be working alongside leading technical experts from all around the world, on a variety of products involving Sequence/token classification, QA/chatbots, translation, semantic/search and summarization, among others.
Responsibilities- Design NLP/LLM/GenAI applications/products by following robust coding practices,
- Explore SoTA models/techniques so that they can be applied for automotive industry usecases
- Conduct ML experiments to train/infer models; if need be, build models that abide by memory & latency restrictions,
- Deploy REST APIs or a minimalistic UI for NLP applications using Docker and Kubernetes tools.
- Showcase NLP/LLM/GenAI applications in the best way possible to users through web frameworks (Dash, Plotly, Streamlit, etc.,)
- Converge multibots into super apps using LLMs with multimodalities.
- Develop agentic workflow using Autogen, Agentbuilder, langgraph
- Build modular AI/ML products that could be consumed at scale.
Education: Bachelor’s or master’s degree in computer science, Engineering, Maths or Science
Performed any modern NLP/LLM courses/open competitions is also welcomed.
Technical Requirements:
Soft Skills:
- Strong communication skills and do excellent teamwork through Git/slack/email/call with multiple team members across geographies.
GenAI Skills:
- Experience in LLM models like PaLM, GPT4, Mistral (open-source models),
- Work through the complete lifecycle of Gen AI model development, from training and testing to deployment and performance monitoring.
- Developing and maintaining AI pipelines with multimodalities like text, image, audio etc.
- Have implemented in real-world Chat bots or conversational agents at scale handling different data sources.
- Experience in developing Image generation/translation tools using any of the latent diffusion models like stable diffusion, Instruct pix2pix.
- Expertise in handling large scale structured and unstructured data.
- Efficiently handled large-scale generative AI datasets and outputs.
ML/DL Skills:
- High familiarity in the use of DL theory/practices in NLP applications
- Comfort level to code in Huggingface, LangChain, Chainlit, Tensorflow and/or Pytorch, Scikit-learn, Numpy and Pandas
- Comfort level to use two/more of open source NLP modules like SpaCy, TorchText, fastai.text, farm-haystack, and others
NLP Skills:
- Knowledge in fundamental text data processing (like use of regex, token/word analysis, spelling correction/noise reduction in text, segmenting noisy unfamiliar sentences/phrases at right places, deriving insights from clustering, etc.,)
- Have implemented in real-world BERT/or other transformer fine-tuned models (Seq classification, NER or QA) from data preparation, model creation and inference till deployment.
Python Project Management Skills
- Familiarity in the use of Docker tools, pipenv/conda/poetry env
- Comfort level in following Python project management best practices (use of setup.py, logging, pytests, relative module imports,sphinx docs,etc.,)
- Familiarity in use of Github (clone, fetch, pull/push,raising issues and PR, etc.,)
Cloud Skills and Computing:
- Use of GCP services like BigQuery, Cloud function, Cloud run, Cloud Build, VertexAI,
- Good working knowledge on other open-source packages to benchmark and derive summary.
- Experience in using GPU/CPU of cloud and on-prem infrastructures.
- Skillset to leverage cloud platform for Data Engineering, Big Data and ML needs.
Deployment Skills:
- Use of Dockers (experience in experimental docker features, docker-compose, etc.,)
- Familiarity with orchestration tools such as airflow, Kubeflow
- Experience in CI/CD, infrastructure as code tools like terraform etc.
- Kubernetes or any other containerization tool with experience in Helm, Argoworkflow, etc.,
- Ability to develop APIs with compliance, ethical, secure and safe AI tools.
UI:
- Good UI skills to visualize and build better applications using Gradio, Dash, Streamlit, React, Django, etc.,
- Deeper understanding of javascript, css, angular, html, etc., is a plus.
Miscellaneous Skills:
Data Engineering:
- Skillsets to perform distributed computing (specifically parallelism and scalability in Data Processing, Modeling and Inferencing through Spark, Dask, RapidsAI or RapidscuDF)
- Ability to build python-based APIs (e.g.: use of FastAPIs/ Flask/ Django for APIs)
- Experience in Elastic Search and Apache Solr is a plus, vector databases.
Top Skills
Apache Solr
BigQuery
Chainlit
Ci/Cd
Cloud Build
Cloud Function
Cloud Run
Django
Docker
Elasticsearch
Fastapi
Flask
GCP
Genai
Huggingface
Kubernetes
Langchain
Llm
Nlp
Numpy
Pandas
PyTorch
Rest Apis
Scikit-Learn
TensorFlow
Vertexai
Similar Jobs
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
The HR Ops Intern will manage onboarding processes, maintain documentation, support induction sessions, and address employee queries, ensuring a seamless experience for new hires.
Top Skills:
ExcelMicrosoft OutlookMicrosoft Word
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
As a Senior Software Engineer, you'll design and maintain AI-enabled applications, focusing on cloud-native solutions, integration, and team collaboration.
Top Skills:
AzureDockerFastapiFlaskGithub ActionsJavaJavaScriptKubernetesNext.JsNode.jsPostgresPythonPyTorchReactTensorFlowTerraformTypescript
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
The Principal Data Engineer supports and operates Oracle and MongoDB Atlas platforms, ensuring operational stability, troubleshooting, and performance optimization in cloud-based environments.
Top Skills:
AWSAzureGCPGitKubernetesMongodb AtlasOraclePowershellPythonShellTerraform
What you need to know about the Chennai Tech Scene
To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.


