Orion Innovation Logo

Orion Innovation

Performance Tester – GenAI

Posted 7 Days Ago
Be an Early Applicant
In-Office
Chennai, Tamil Nadu
Senior level
In-Office
Chennai, Tamil Nadu
Senior level
Seeking skilled Performance Tester with strong Generative AI experience to ensure performance of AI applications, including testing and optimizing GenAI projects.
The summary above was generated by AI

Orion Innovation is a premier, award-winning, global business and technology services firm.  Orion delivers game-changing business transformation and product development rooted in digital strategy, experience design, and engineering, with a unique combination of agility, scale, and maturity.  We work with a wide range of clients across many industries including financial services, professional services, telecommunications and media, consumer products, automotive, industrial automation, professional sports and entertainment, life sciences, ecommerce, and education.


Role: Performance Test Engineer – Generative AI

Experience: 5+ years (with hands-on performance testing in GenAI / LLM-based applications)

Role Overview:

We are seeking a skilled and detail-oriented Performance Tester with strong experience in Generative AI (GenAI) projects. The ideal candidate will be responsible for ensuring scalability, reliability, and optimal performance of AI-powered applications, including Large Language Model (LLM) integrations, conversational AI systems, and Retrieval-Augmented Generation (RAG) pipelines. This role requires expertise in performance engineering, cloud platforms, and testing of AI/ML workloads in production environments.

Key Responsibilities

• Performance Strategy & Planning:

  • Define and implement performance testing strategies for GenAI and LLM-based applications.
  • Identify performance bottlenecks across APIs, model inference layers, vector databases, and cloud infrastructure.
  • Establish performance benchmarks, SLAs, and scalability targets for AI-driven systems.

• Performance Testing & Engineering:

  • Design, develop, and execute load, stress, spike, endurance, and scalability tests for GenAI applications.
  • Perform performance testing of LLM-powered APIs (e.g., ChatGPT-like applications) hosted on cloud platforms.
  • Validate latency, throughput, token usage, concurrency handling, and cost-performance trade-offs.
  • Conduct performance validation for RAG pipelines including embedding generation and vector search.
  • Analyze model inference time, GPU/CPU utilization, memory usage, and autoscaling behavior.

• Tools & Automation:

  • Develop automated performance test scripts using tools such as JMeter, LoadRunner, k6, or Gatling.
  • Monitor system performance using APM tools like Dynatrace, AppDynamics, Azure Monitor, or AWS CloudWatch.
  • Integrate performance testing into CI/CD pipelines using Azure DevOps or similar platforms.
  • Create dashboards and reports for performance metrics and trend analysis.

• Cloud & Infrastructure Testing:

  • Conduct performance testing on AI solutions deployed on Azure, AWS, or GCP.
  • Validate autoscaling configurations, containerized deployments (Docker, Kubernetes), and serverless architectures.
  • Assess performance of vector databases such as Chroma, Pinecone, Weaviate, or FAISS under load.

• Collaboration & Optimization:

  • Collaborate with AI engineers, data scientists, DevOps, and architects to optimize model serving and API performance.
  • Recommend improvements in prompt engineering, caching strategies, batching, and parallelization.
  • Support capacity planning and cost optimization for LLM-based applications.

• Governance & Reporting:

  • Document performance test results, bottlenecks, and optimization recommendations.
  • Ensure compliance with security and data privacy standards in performance environments.
  • Present findings to stakeholders and provide actionable insights.

Key Requirements

• Technical Skills:

  • 5+ years of experience in Performance Testing and Engineering.
  • Hands-on experience in performance testing GenAI / LLM-based applications.
  • Experience working with LLM platforms such as OpenAI GPT models, Gemini, Llama 2, Claude, or Grok.
  • Understanding of concepts like tokenization, embeddings, vector search, and RAG architecture.
  • Experience testing AI services hosted on Azure AI Services, Azure ML, AWS Bedrock, or Google Vertex AI.
  • Proficiency in performance testing tools such as JMeter, LoadRunner, k6, or Gatling.
  • Knowledge of API testing tools like Postman or Rest Assured.
  • Familiarity with monitoring tools such as Azure Monitor, AWS CloudWatch, Grafana, or Prometheus.
  • Experience with containerization (Docker) and orchestration (Kubernetes).
  • Basic scripting knowledge in Python or Java for test automation.
  • Understanding of CI/CD pipelines and DevOps practices.

• GenAI-Specific Knowledge:

  • Experience testing conversational AI applications and chatbot performance.
  • Knowledge of inference latency optimization techniques for LLMs.
  • Understanding of GPU-based workloads and performance considerations.
  • Exposure to agentic frameworks like LangChain, Semantic Kernel, AutoGen, or CrewAI (preferred).
  • Experience validating performance of vector databases (Chroma, Pinecone, Weaviate, FAISS).

Qualifications

  • Bachelor’s degree in Computer Science, Information Technology, or related field.
  • 5+ years of experience in performance testing, with at least 2 years in AI/ML or GenAI projects.
  • Experience in testing cloud-native, microservices-based applications.
  • Strong analytical and troubleshooting skills.
  • Excellent communication and stakeholder management skills.

Orion is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, citizenship status, disability status, genetic information, protected veteran status, or any other characteristic protected by law.

Candidate Privacy Policy

Orion Systems Integrators, LLC and its subsidiaries and its affiliates (collectively, “Orion,” “we” or “us”) are committed to protecting your privacy. This Candidate Privacy Policy (orioninc.com) (“Notice”) explains:

  • What information we collect during our application and recruitment process and why we collect it;
  • How we handle that information; and
  • How to access and update that information.

Your use of Orion services is governed by any applicable terms in this notice and our general Privacy Policy.


Top Skills

AWS
Aws Cloudwatch
Azure
Azure Devops
Azure Monitor
Chroma
Claude
Docker
Faiss
Gatling
GCP
Gemini
Grafana
Grok
Java
Jmeter
K6
Kubernetes
Llama 2
Loadrunner
Openai Gpt
Pinecone
Prometheus
Python
Weaviate

Orion Innovation Chennai, Tamil Nadu, IND Office

Ambit IT Park, Ambit Park Road Ambattur Industrial Estate, Chennai, India, 600 058

Similar Jobs

57 Minutes Ago
Remote or Hybrid
4 Locations
Senior level
Senior level
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Lead the design and execution of knowledge management solutions within enterprise transformation programs, ensuring knowledge assets are captured and reused effectively.
Top Skills: AIAutomationBloomfireBusiness Process ManagementInformation ScienceKnowledge Management
57 Minutes Ago
Hybrid
3 Locations
Senior level
Senior level
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Lead the design and development of Adobe Experience Manager solutions, focusing on website customization, API integration, and performance tuning.
Top Skills: Adobe AnalyticsAdobe Experience ManagerAdobe TargetApi IntegrationDispatcherHtlJavaJcrOsgiSling
58 Minutes Ago
Hybrid
Chennai, Tamil Nadu, IND
Junior
Junior
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
The Rep will address consumer queries related to credit reports, ensure SLA compliance, escalate issues, and maintain quality standards in operations.
Top Skills: MS Office

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account