STERRY

Web Scraper / Data Engineer

Posted 20 Days Ago
India
Junior
The role involves building and maintaining automated pipelines for data collection and cleaning from public sources, integrating scrapers with backend systems, and ensuring data accuracy and compliance with platform rules.
The summary above was generated by AI
Web Scraper / Data Engineer
Location: Remote
Job Type: Full-time
About STERRY

At STERRY, we’re not your average growth marketing agency: we’re the rocket fuel behind crowdfunding and e-commerce success. Since day one, we’ve helped clients pull in over $100 million in trackable online revenue. We build strategies that go beyond brand and marketing and deliver measurable results rooted in online performance.
Role Overview
We’re looking for an experienced Web Scraper / Data Engineer to help us build and maintain automated pipelines that collect, clean, and enrich creator and campaign data from public sources. You’ll be responsible for designing reliable, scalable scrapers and integrating them with our backend.
Responsibilities
  • Build scrapers and crawlers to collect creator profile data (followers, engagement, category, contact info, etc.) from social platforms (TikTok, Instagram, YouTube, etc.) and directories
  • Parse and clean unstructured data into structured datasets (JSON, CSV, or direct to database)
  • Integrate with official APIs (YouTube, TikTok, Instagram, etc.) where possible
  • Detect and handle rate limits, CAPTCHAs, and anti-bot mechanisms
  • Implement and monitor scraping tasks using proxy rotation and headless browsers (Puppeteer, Playwright, Selenium, etc.)
  • Collaborate with the backend team to feed data into our AI recommendation engine
  • Maintain high data accuracy, freshness, and compliance with platform terms of service (ToS) and privacy rules
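
To give candidates a feel for the "parse and clean" responsibility above, here is a minimal Python sketch of turning one raw scraped record into a structured JSON record. It is an illustration only, not STERRY's actual pipeline: the field names (platform, handle, followers, category, email), the helper names parse_count and clean_creator, and the sample record are all hypothetical.

```python
import json
import re
from typing import Optional

# Assumed multipliers for abbreviated display counts such as "12.3K" or "1.2M".
_SUFFIXES = {"K": 1_000, "M": 1_000_000, "B": 1_000_000_000}


def parse_count(text: str) -> Optional[int]:
    """Convert a display count like '12.3K' or '4,567' to an int, or None."""
    match = re.fullmatch(r"\s*([\d,]*\.?\d+)\s*([KMB])?\s*", text, re.IGNORECASE)
    if not match:
        return None
    number = float(match.group(1).replace(",", ""))
    suffix = (match.group(2) or "").upper()
    return round(number * _SUFFIXES.get(suffix, 1))


def clean_creator(raw: dict) -> dict:
    """Normalize one raw record (hypothetical schema) into a structured dict."""
    email = raw.get("email", "").strip().lower()
    # Deliberately loose email check; anything that fails becomes None.
    valid_email = re.fullmatch(r"[^@\s]+@[^@\s]+\.[^@\s]+", email)
    return {
        "platform": raw.get("platform", "").strip().lower(),
        "handle": raw.get("handle", "").strip().lstrip("@").lower(),
        "followers": parse_count(raw.get("followers", "")),
        "category": raw.get("category", "").strip() or None,
        "email": email if valid_email else None,
    }


if __name__ == "__main__":
    # Made-up sample input, as it might look straight off a profile page.
    raw = {
        "platform": " TikTok ",
        "handle": "@Some_Creator",
        "followers": "12.3K",
        "category": "Beauty",
        "email": "n/a",
    }
    print(json.dumps(clean_creator(raw), indent=2))
```

Values that cannot be parsed are stored as None rather than guessed, which keeps missing data visible to whatever consumes the dataset downstream.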

Requirements
  • 2+ years of experience building web scrapers, crawlers, or data extraction pipelines
  • Strong Python or Node.js skills (BeautifulSoup, Playwright, Puppeteer, Scrapy, or similar)
  • Experience with APIs, JSON, REST, and rate-limit handling
  • Familiarity with databases (MongoDB, PostgreSQL, Firebase, etc.)
  • Knowledge of proxies, headless browsers, and infrastructure for scaling data collection
  • Attention to detail and ability to deliver clean, well-documented code
  • (Bonus) Experience with influencer data, social analytics, or SaaS platforms

What We Offer
  • Flexible working hours (remote-first)
  • Competitive pay (hourly or project-based)
  • Long-term potential to transition into a data engineering role
  • Opportunity to shape the foundation of a fast-growing AI SaaS startup

Top Skills

BeautifulSoup
Firebase
JSON
MongoDB
Node.js
Playwright
Postgres
Puppeteer
Python
REST
Scrapy
