Careerflow.ai Logo

Careerflow.ai

Data Operations Analyst (AI-Assisted Data Transformation & Structuring)

Posted 7 Days Ago
Remote
Hiring Remotely in IN
Mid level
Remote
Hiring Remotely in IN
Mid level
Independently extract data from CSV and PDF sources, transform and map into strict JSON schemas, build material- and supplier-level records, and validate every record with zero data loss. Use AI tools to assist but manually verify all outputs and follow SOPs and QA checklists.
The summary above was generated by AI

Summary

Looking for a detail-oriented Data Operations Analyst to independently handle data extraction, transformation, and validation across structured (CSV) and unstructured (PDF) datasets. The role involves converting complex raw datasets into highly structured JSON formats using defined schemas, with zero data loss and strict quality standards. AI tools can be used to improve speed and efficiency, but 100% accuracy and manual validation is mandatory.

Key Responsibilities

1. Data Extraction

Extract and organize data from:

  • CSV datasets (NASA materials data)

  • PDF documents (BGS mining directory)

Use tools where needed, while ensuring data accuracy

2. Data Transformation

  • Convert raw data into structured JSON formats

  • Map all fields according to strict schema rules

  • Handle:

    • Complex property grouping (NASA dataset)

    • Supplier + site grouping (BGS dataset)

3. Data Structuring

Build:

  • Material-level JSON records

  • Supplier-level JSON objects

Ensure:

  • Correct categorization

  • Proper grouping of related data

  • Consistent naming conventions

4. Quality Assurance (Critical)

Validate every record before submission

Ensure:

  • No missing data

  • No incorrect mappings

  • No duplicates

Follow strict SOP and validation checklist

5. AI-Assisted Workflow

Use AI tools to:

  • Speed up the transformation

  • Assist in structuring data

    Independently verify all AI-generated outputs

Final accountability for accuracy remains with the analyst

Candidate must be comfortable using AI tools responsibly for productivity, not blindly:

  • ChatGPT – for structuring, reasoning, and transformation assistance

  • Claude – for long document parsing and summarization

  • Microsoft Excel (with AI features like Copilot) – for data handling

  • Adobe Acrobat or similar – for PDF extraction

    Optional:

  • Python (basic scripting)

  • Google Sheets

Key expectation: Ability to use AI as an assistant, not a replacement for thinking

Qualifications

Bachelor’s degree in:

  • Computer Science / Engineering

  • Mathematics / Statistics

  • Or any relevant field with strong data handling experience

Operations / Material Science background is a plus.

Similar Jobs

An Hour Ago
In-Office or Remote
Mid level
Mid level
Information Technology • Software • Financial Services • Quantitative Trading
Software Engineers at Citadel develop, maintain, and support high-performance trading platforms, focusing on custom software solutions and system stability.
Top Skills: C++
An Hour Ago
Remote
India
Senior level
Senior level
Cloud • Information Technology • Productivity • Software • Automation
Technical leader designing and implementing scalable, fault-tolerant backend microservices and Agentic AI systems. Leads architecture, cloud infrastructure, data strategies, incident response, testing, and mentorship to deliver production-grade high-throughput solutions.
Top Skills: Agentic AiAnsibleAWSAzureChaos EngineeringCi/CdCloudFormationContainersDistributed CachingDjangoEksEvent StreamingFastapiFlaskGCPJavaKubernetesLlmsMessage QueuesNoSQLOpensearchPrompt EngineeringPythonRagSlisSlosSpring BootSQLTerraformVector Databases
An Hour Ago
Remote or Hybrid
India
Senior level
Senior level
AdTech • Big Data • Digital Media • Software
Lead technical operations for Magnite in India, focusing on integrations, optimisation, and strategy while guiding clients and stakeholders on ad tech solutions.
Top Skills: APIsGamJavaScriptOpenrtbPrebidPythonSpringserveSQLVast

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account