Design and deploy machine learning models, manage lifecycle on GCP, optimize performance, and ensure reliability for AI applications in production.
We are looking for a Machine Leaning Engineer (MLE) to design, build, and optimize our machine learning operation. You will play a crucial role in scaling AI models from research to production, ensuring smooth model deployment, monitoring, and lifecycle management across our Google Cloud Platform (GCP) infrastructure. You'll work closely with data scientists, ML Ops, and data engineers to automate workflows, improve model performance, and ensure reliability for our AI that serves millions of players worldwide.
What You'll Do
- Design, develop, and deploy machine learning models and solutions, leveraging tools such as LangGraph and MLflow for orchestration and lifecycle management.
- Collaborate on building and maintaining scalable data and feature pipeline infrastructure for real time and batch processing using tools like BigQuery, BigTable, Dataflow, Composer(Airflow), PubSub, and Cloud Run to support ML model training and inference.
- Develop and implement robust strategies for model monitoring and observability to detect model drift, bias, and performance degradation, leveraging tools like Vertex AI Model Monitoring and custom dashboards.
- Optimize ML model inference performance to improve latency and cost-efficiency of AI applications.
- Ensure the overall reliability, performance, and scalability of the ML models and data infrastructure platform, including proactive identification and resolution of issues related to model performance and data quality.
- Troubleshoot and resolve complex issues impacting ML models, data pipelines, and production AI systems.
- Ensure AI/ML models and workflows meet data governance, security, and compliance requirements, specifically for real-money gaming.
What We're Looking For
- 1+ years of experience as an ML Engineer, with a focus on developing and deploying machine learning models in production environments.
- Strong experience in Google Cloud Platform (GCP), including services relevant to ML and data infrastructure such as BigQuery, Dataflow, Vertex AI, Cloud Run, and Pub/Sub and Composer (Airflow).
- Solid grasp of containerization (Docker, Kubernetes) and experience with Kubernetes orchestration platforms like GKE for deploying ML services.
- Experience building and deploying scalable data pipelines and machine learning models in production environments.
- Understanding of model monitoring, logging, and observability best practices for ML models and applications.
- Experience in Python and ML frameworks (e.g., TensorFlow, PyTorch, scikit-learn).
- Familiarity with AI orchestration concepts using tools like LangGraph or LangChain is a bonus.
- Bonus experience includes working in gaming, real-time fraud detection, or AI personalization systems and Agentic workflows.
Top Skills
BigQuery
Cloud Run
Composer
Dataflow
Docker
Google Cloud Platform
Kubernetes
Langgraph
Pub/Sub
PyTorch
Scikit-Learn
TensorFlow
Vertex Ai
Similar Jobs
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Lead the design and execution of knowledge management solutions within enterprise transformation programs, ensuring knowledge assets are captured and reused effectively.
Top Skills:
AIAutomationBloomfireBusiness Process ManagementInformation ScienceKnowledge Management
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Compliance QA Analyst II will conduct QA reviews of EDD and screening cases, provide feedback to agents, conduct root cause analysis, collaborate on complaints handling, and ensure compliance with global regulations.
Top Skills:
ExcelGoogle SheetsGoogle SlidesJIRAPowerPointSalesforce Service Cloud
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
The role involves gathering business requirements, leading data migration, integration, supporting system decommissioning, and facilitating Agile processes within risk and compliance workstreams.
Top Skills:
AWSConfluenceJIRAMS Office
What you need to know about the Chennai Tech Scene
To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.


.png)
