STERRY Logo

STERRY

Web Scraper / Data Engineer​​​​​​​

Reposted 14 Days Ago
Be an Early Applicant
India
Junior
India
Junior
The role involves building and maintaining automated pipelines for data collection and cleaning from public sources, integrating scrapers with backend systems, and ensuring data accuracy and compliance with platform rules.
The summary above was generated by AI
Web Scraper / Data Engineer
Location: Remote
Job Type: Full-time
About STERRY

At STERRY, we’re not your average Growth Marketing Agency—we’re the rocket fuel behind crowdfunding and e-commerce success. Since day one, we’ve helped clients pull in over $100 million in trackable online revenue. We build strategies that go beyond brand and marketing—we deliver measurable results rooted in online performance
Role Overview
We’re looking for an experienced Web Scraper / Data Engineer to help us build and maintain automated pipelines that collect, clean, and enrich creator and campaign data from public sources. You’ll be responsible for designing reliable, scalable scrapers and integrating them with our backend.
Responsibilities
  • Build scrapers and crawlers to collect creator profile data (followers, engagement, category, contact info, etc.) from social platforms (TikTok, Instagram, YouTube, etc.) and directories
  •  Parse and clean unstructured data into structured datasets (JSON, CSV, or direct to database)
  •  Integrate with APIs (YouTube, TikTok, Instagram, etc.) where possible
  •  Detect and handle rate limits, CAPTCHA, and anti-bot mechanisms
  • Implement and monitor scraping tasks using proxy rotation and headless browsers (Puppeteer, Playwright, Selenium, etc.)
  •  Collaborate with the backend team to feed data into AI recommendation engine
  •  Maintain high data accuracy, freshness, and compliance with platform TOS and privacy rules

Requirements
  •  2+ years experience building web scrapers, crawlers, or data extraction pipelines
  •  Strong Python or Node.js skills (BeautifulSoup, Playwright, Puppeteer, Scrapy, or similar)
  •  Experience with APIs, JSON, REST, and rate-limiting management
  •  Familiarity with databases (MongoDB, PostgreSQL, Firebase, etc.)
  •  Knowledge of proxies, headless browsers, and data scaling infrastructure
  •  Attention to detail and ability to deliver clean, well-documented code
  •  (Bonus) Experience with influencer data, social analytics, or SaaS platforms

What We Offer
  • Flexible working hours (remote-first)
  • Competitive pay (hourly or project-based)
  • Long-term potential to transition into a data engineering role
  • Opportunity to shape the foundation of a fast-growing AI SaaS startup

Top Skills

Beautifulsoup
Firebase
JSON
MongoDB
Node.js
Playwright
Postgres
Puppeteer
Python
Rest
Scrapy

Similar Jobs

An Hour Ago
Remote or Hybrid
India
Senior level
Senior level
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
SailPoint is seeking a Senior Software Engineer to build Python SDKs and frameworks for their big data platform. Responsibilities include designing, delivering, and testing backend services while collaborating with teammates and engaging in product demos and customer support.
Top Skills: AWSDockerEksGrafanaJIRAKafkaKibanaNoSQLPrometheusPythonRedisSQL
3 Hours Ago
Remote or Hybrid
India
Senior level
Senior level
Artificial Intelligence • Hardware • Information Technology • Security • Software • Cybersecurity • Big Data Analytics
The Strategic Territory Director will develop sales strategies in India, oversee the sales lifecycle, manage customer relationships, and conduct market analysis for growth.
Top Skills: CRMManetRf NetworkingTactical Radio TechnologiesWireless Communications
9 Hours Ago
Remote or Hybrid
India
Senior level
Senior level
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
The Software Engineer I role involves implementing AI solutions, utilizing multiple tech stacks, AI frameworks, and developing applications on cloud platforms.
Top Skills: Azure AiCi/CdPythonPyTorchTensorFlow

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account