Web Scraping Engineer (Data Extraction) at Treehouse Holdings

  Remote | Semi Senior | Full time | Programming

Gross salary: $2,400 - $4,000 USD/month

Requires applying in English

We are Treehouse Holdings, building a large-scale, high-impact data extraction platform using Apify Actors technology. The mission is to provide reliable, one-click access to data from every useful website on the planet. By 2027, we aim to launch 1,000 high-quality Apify Actors that power data products influencing billions of users worldwide. This role is part of a passionate team focused on creating scalable, efficient web scraping solutions that enable new data-driven insights for research, AI training, and market intelligence.

Applications are accepted only through getonbrd.com.

What You Will Do

  • Design, build, and release 10+ production-ready Apify Actors each month using Node.js/TypeScript or Python.
  • Select the optimal crawling technology (Cheerio vs. Playwright/Puppeteer) and fine-tune concurrency for maximum throughput and efficiency.
  • Implement advanced proxy usage, session management, and fingerprint rotation to avoid blocks and CAPTCHAs.
  • Write streamlined Dockerfiles, automate the build process, and deploy images to Apify Cloud.
  • Create clear input/output schemas, automated tests, and comprehensive README documentation to ensure user success on first run.
  • Monitor system logs, troubleshoot and resolve customer issues promptly while continuously improving performance and reliability.
  • Use AI coding assistants such as Cursor and Claude Code daily to scaffold, refactor, and document code faster.

Must-Have Skills

  • At least 2 years of experience building production-grade web scrapers.
  • Strong expertise in Node.js with TypeScript or Python development.
  • Deep knowledge of Crawlee plus Playwright or Puppeteer for building robust scraping actors.
  • Proven skills in designing proxy pools, managing sessions, and rotating fingerprints to bypass anti-scraping measures.
  • Comfortable working with Docker containers, Git version control, and asynchronous/concurrent programming paradigms.
  • Daily, hands-on use of AI coding tools such as Cursor and Claude Code to enhance coding productivity.
  • Good command of English (B2+), able to write clear documentation and communicate asynchronously with a remote team.

We seek a thoughtful engineer who thrives on building scalable systems, solving complex problems quickly, and delivering reliable data extraction products that support multiple downstream applications.

Nice-to-Have Skills

  • Published or commercially deployed Apify Actors demonstrating real-world scraping impact.
  • Experience working with Continuous Integration/Continuous Deployment (CI/CD) pipelines and automated testing suites.
  • Familiarity with REST or GraphQL API design and development.

Compensation & Benefits

  • Competitive salary paid monthly in USD.
  • Learning budget available for courses and conference attendance to support professional growth.
  • Paid time off in addition to public holidays based on your local region.
  • Clear career progression opportunities: Senior Engineer → Staff Engineer → Tech Lead.
  • Fully remote work environment with core hours from 10 a.m. to 4 p.m. Eastern Time (14:00–20:00 UTC during daylight saving time).
  • Async-first culture with minimal meetings and strong ownership of your roadmap supported by the team.

GETONBRD Job ID: 55293

Fully remote: You can work from anywhere in the world.
Pet-friendly: Pets are welcome at the premises.
Flexible hours: Flexible schedule and freedom to attend to family needs or personal errands.
Informal dress code: No dress code is enforced.
Vacation over legal minimum: Treehouse Holdings gives you paid vacation beyond the legal minimum.

Remote work policy

Fully remote

Candidates can reside anywhere in the world.

About Treehouse Holdings

Treehouse Holdings starts, buys, and accelerates tech-enabled service brands. We recruit self-directed engineers and operators worldwide, hand them real ownership over revenue-critical projects, and arm them with the resources of a U.S. company.
