The AI Engineer will maintain and enhance AI services on the Sootra platform, optimizing pipelines, managing microservices, and developing feedback systems to improve AI performance.
Role Overview
We are looking for an AI Engineer to maintain and enhance the AI-driven backbone of the Sootra platform. This role involves ensuring production stability of LLM/VLM pipelines, optimizing model interactions, maintaining APIs and queues, and building feedback loops that continuously improve AI outputs.
Responsibilities
- Maintain and optimize LLM- and VLM-powered services for content generation, compliance scoring, and campaign testing.
- Manage and scale Flask/FastAPI microservices, ensuring high uptime and low latency.
- Maintain Dramatiq queues for async AI workflows, campaign generation, and pipeline orchestration.
- Deploy, monitor, and debug Uvicorn/Gunicorn-based hosting in production environments.
- Integrate with OpenRouter and equivalent LLM routing tools to balance cost, latency, and quality.
- Design and refine prompt engineering strategies for reliability, context-awareness, and compliance.
- Build and maintain feedback pipelines for AI model evaluation (human-in-the-loop scoring, automated quality checks, reinforcement).
- Expose and maintain REST APIs for AI services, ensuring secure, versioned endpoints.
- Collaborate with backend/frontend teams to keep microservice architecture aligned and maintainable.
- Track token consumption, latency, and error rates to ensure production-grade performance.
Required Skills
- Programming: Strong in Python, with experience in production-grade codebases.
- Frameworks: Flask (for APIs), FastAPI (optional), Uvicorn/Gunicorn for WSGI/ASGI hosting.
- Queues/Workers: Dramatiq (or Celery/RQ equivalent) for background jobs.
- AI/ML: Hands-on with LLMs and VLMs, including prompt engineering, fine-tuning, and evaluation.
- AI Infrastructure: Familiar with OpenRouter or equivalent LLM/VLM routing & fallback tools.
- Architecture: Experience designing and maintaining microservice architectures.
- APIs: Strong experience with REST API design (auth, rate limiting, documentation).
- Production: Dockerized deployments, CI/CD pipelines, logging/monitoring, error handling.
- Feedback Loops: Building structured evaluation/feedback systems for AI model performance.
- Cloud: AWS/GCP experience preferred (deployment, monitoring, scaling).
Experience
- 3–5 years as an AI Engineer or Python Backend Engineer working with production systems.
- Prior work with SaaS platforms, LLM/VLM integrations, or AI-first products is highly valued.
- Demonstrated ability to maintain AI pipelines in production, not just prototypes.