iGenius Logo

iGenius

AI Engineer

Job Posted 17 Days Ago Reposted 17 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in India
Mid level
Remote
Hiring Remotely in India
Mid level
The AI Engineer will implement and scale large language models, optimize AI systems for production, collaborate on AI infrastructure, and ensure robust performance and safety measures.
The summary above was generated by AI
Description

We’re looking for a talented AI Engineer to join our team focused on implementing and scaling large language models (LLMs) and generative AI systems. In this role, you will bridge the gap between cutting-edge research and practical applications, turning innovative AI concepts into robust, efficient, and production-ready systems. You will work closely with our research team and data engineers to build and optimize AI solutions that drive our company's products and services.

Key Responsibilities

  • Implement and optimize large language models and generative AI systems for production environments
  • Collaborate with researchers to translate research prototypes into scalable, efficient implementations
  • Design and develop AI infrastructure components for model training, fine-tuning, and inference
  • Optimize AI models for performance, latency, and resource utilization
  • Implement systems for model evaluation, monitoring, and continuous improvement
  • Develop APIs and integration points for AI services within our product ecosystem
  • Troubleshoot complex issues in AI systems and implement solutions
  • Contribute to the development of internal tools and frameworks for AI development
  • Stay current with emerging techniques in AI engineering and LLM deployment
  • Collaborate with data engineers to ensure proper data flow for AI systems
  • Implement safety measures, content filtering, and responsible AI practices
Requirements

Required Skills & Qualifications

  • Bachelor's or Master's degree in Computer Science, Engineering, or related technical field
  • 3+ years of hands-on experience implementing and optimizing machine learning models
  • Strong programming skills in Python and related ML frameworks (PyTorch, TensorFlow)
  • Experience with deploying and scaling AI models in production environments
  • Familiarity with large language models, transformer architectures, and generative AI
  • Knowledge of cloud platforms (AWS, GCP, Azure) and containerization technologies
  • Understanding of software engineering best practices (version control, CI/CD, testing)
  • Experience with ML engineering tools and platforms (MLflow, Kubeflow, etc.)
  • Strong problem-solving skills and attention to detail
  • Ability to collaborate effectively in cross-functional teams

Preferred Qualifications

  • Experience with fine-tuning and prompt engineering for large language models
  • Knowledge of distributed computing and large-scale model training
  • Familiarity with model optimization techniques (quantization, pruning, distillation)
  • Experience with real-time inference systems and low-latency AI services
  • Understanding of AI ethics, bias mitigation, and responsible AI development
  • Experience with model serving platforms (TorchServe, TensorFlow Serving, Triton)
  • Knowledge of vector databases and similarity search for LLM applications
  • Experience with reinforcement learning and RLHF techniques
  • Familiarity with front-end technologies for AI application interfaces
Benefits

Compensation

iGenius offers a competitive compensation structure, including salary, performance-based bonuses, and additional components based on experience. All roles include comprehensive benefits as part of the total compensation package.

About iGenius

iGenius is a deep-tech company specialized in the development of Artificial Intelligence solutions for companies operating in highly regulated industries, including financial services, government, or heavy industry. iGenius’ main product, Unicorn, offers tailored solutions for companies looking to integrate AI safely and effectively, mainly through two proprietary Large Language Models (LLMs). Italia 10B, is a multi-language model optimized for regulated sectors and elevated computational efficiency, while Colosseum 355B, built with latest-generation NVIDIA technology, is fit for mission-critical use cases. In addition to Unicorn, iGenius’ product offer includes Crystal, an AI agent for Decision Intelligence that analyzes business data in natural language and accurately supports strategic, insight-driven decision-making. In December 2024, iGenius joined forces with NVIDIA to build Colosseum – one of the largest AI supercomputers in the world – to support the deployment of its models with unrivaled speed, performance, and efficiency. 

Active in both Europe and the United States, iGenius is one of the leading AI unicorns in the European landscape, and  has attracted Fortune 500 companies, including Allianz and Intesa Sanpaolo. This led Gartner to recognize iGenius as a “Cool Vendor” in the  AI Core Technologies category, as well as mention the company in its Market Guide for Conversational AI. 

Please review our Privacy Policy here . 


Top Skills

AWS
Azure
GCP
Kubeflow
Mlflow
Python
PyTorch
TensorFlow
Tensorflow Serving
Torchserve
Triton

Similar Jobs

9 Hours Ago
Remote or Hybrid
Hyderabad, Telangana, IND
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Develop high-quality, scalable software, mentor colleagues, and enhance products using AI technologies while maintaining best practices.
Top Skills: AWSGen AiGoogleJavaScriptOpen AiXML
4 Days Ago
Remote or Hybrid
Bengaluru, Karnataka, IND
Junior
Junior
Software
As an AI Engineer at Clari, you will design and ship AI-driven micro-services, working collaboratively to build scalable applications for revenue intelligence.
Top Skills: AIElasticsearchFastapiHuggingfaceKafkaLlmOpenaiPythonPyTorchRaySqs
An Hour Ago
In-Office or Remote
Pune, Mahārāshtra, IND
Senior level
Senior level
Digital Media • Gaming • Software
The Senior AI Engineer will develop AI systems for customer support, integrating NLP, image, and audio processing models, ensuring scalability and robustness.
Top Skills: AWSAzureDockerGCPKafkaKubernetesNlpNluPostgresPythonRest ApisSnowflakeYugabytedb

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account