Careerflow.ai Logo

Careerflow.ai

Data Annotation Specialist - Computer Use Agents (CUA) Trajectory Evaluator

Posted 7 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in IN
Mid level
Remote
Hiring Remotely in IN
Mid level
Create, validate, and document step-by-step Computer-Use Agent (CUA) trajectories for technical developer workflows. Break down natural language instructions into reproducible actions, execute and test workflows in Linux using Python/Bash, interact with APIs and browser automation, and collaborate to improve annotation quality and guidelines.
The summary above was generated by AI

Role Overview:

We are looking for skilled professionals to contribute as S2 Annotators, responsible for producing and validating high-quality Computer-Use Agent (CUA) trajectories for developer-adjacent workflows. This includes tasks such as file operations, light scripting, API interactions, and browser automation. This role requires a strong understanding of technical workflows, attention to detail, and the ability to translate natural language instructions into precise, step-by-step executable actions that can be used to train advanced AI systems.

What does day-to-day look like

  • Create detailed, step-by-step positive CUA trajectories for technical tasks (e.g., file manipulation, scripting, API calls, browser-based workflows)

  • Break down natural language instructions into clear, verifiable actions

  • Validate and review trajectories for correctness, completeness, and reproducibility

  • Work within Linux desktop environments to execute and document workflows

  • Use scripting (Python/Bash) to simulate or validate task execution where required

  • Interact with tools and environments involving APIs, terminals, and browser automation

  • Collaborate with internal teams to refine task quality and annotation guidelines

  • Ensure consistency, accuracy, and high-quality standards across all annotations

Requirements

  • 2–5 years of experience in software development, technical support, or similar technical roles

  • Strong familiarity with Linux environments and command-line operations

  • Proficiency in at least one scripting language: Python or Bash

  • Ability to decompose complex instructions into structured, step-by-step workflows

  • Strong attention to detail in documenting technical processes

  • Exposure to LLM-based tools, AI systems, or agentic workflows

  • Basic understanding of APIs, file systems, and developer tooling

  • Familiarity with OpenClaw or similar environments/tools

Nice to have

  • Prior experience in data annotation, RLHF, or SFT labeling workflows

  • Exposure to CI/CD pipelines, REST APIs, or terminal-based automation

  • Experience working with browser automation tools or developer productivity tools

  • Background in evaluating or improving AI-generated outputs

Offer Details:

  • Engagement type: Contractor assignment/freelancer (no medical/paid leave)

  • Duration: 5 weeks

Evaluation Process:

  • Resume screening

  • Take home assessment (60 mins)

Similar Jobs

5 Days Ago
Remote or Hybrid
Senior level
Senior level
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Lead deployment of Industry 4.0 and automation across manufacturing sites: develop digital roadmaps, implement automation/vision/process control solutions, ensure cybersecurity/compliance, drive capability building, and support cross-regional digital transformation and value realization.
Top Skills: Artificial IntelligenceBeckhoffCloud ComputingEdge ComputingIdcIndustrial NetworkingIot PlatformsMachine LearningMachine VisionOpc UaPlcPower AppsPower AutomatePower BIPythonRoboticsRockwellSiemens
20 Days Ago
Easy Apply
Remote
Easy Apply
Mid level
Mid level
Big Data • Fintech • Mobile • Payments • Financial Services
As the CRA Compliance Lead, you will manage compliance strategies, enhance community engagement, analyze consumer complaints, and ensure alignment with regulatory expectations for Affirm Bank.
Yesterday
Remote
Entry level
Entry level
Fintech • Payments • Financial Services
The Business Developer will identify prospects, manage client relationships, assist in achieving sales goals, and support outreach efforts in ATM card services.
Top Skills: MS Office

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account