Product Engineer - Scrape (Remote)
ARCHIVED
We can't find an active application page for this role right now. It may reopen or be listed elsewhere. Use Next Steps to search for an active apply link and similar live jobs.
Location: Remote, Americas time zones preferred Compensation: $180K-$290K base + meaningful early-stage equity Company stage: Series A, ~40-person team Employment type: Full-time About the Company We are a fast-growing AI infrastructure and developer tools startup building APIs that help developers extract clean, structured, LLM-ready data from the web. Our product is already used by a large developer community and has reached significant revenue traction quickly. We are backed by top-tier investors and are growing a small, high-ownership engineering team. This is a rare opportunity to join early, own a critical product surface, and directly shape infrastructure used by developers building AI applications. About the Role We are looking for a Product Engineer to own our core web extraction endpoint end-to-end. You will be responsible for making it easier for developers to turn any URL into clean, reliable, structured data. This includes improving scraping reliability, handling JavaScript-heavy sites, reducing noisy output, improving markdown/JSON quality, and making the API feel simple, fast, and dependable. This is not a narrow backend role. You should be comfortable making product decisions, designing API behavior, implementing the work yourself, measuring impact, and iterating quickly without needing a PM to define the roadmap. What You'll DoOwn a flagship developer-facing API endpoint end-to-end across reliability, latency, response quality, and edge cases.Improve scraping and extraction performance across JavaScript-heavy SPAs, dynamic content, anti-bot walls, and other messy web environments.Build and refine structured extraction workflows, including schema-based extraction, JSON output, and prompt-based extraction.Make thoughtful API and developer experience decisions around response formats, errors, documentation, latency, and cost tradeoffs.Dogfood the product, analyze support tickets, GitHub issues, and developer feedback, then turn that signal into shipped improvements.Run fast product experiments: form a hypothesis, instrument it, ship it, measure results, and decide quickly. What We're Looking For3+ years of experience shipping production software, ideally at an early-stage startup or developer tools company.Strong backend engineering ability with full-stack/product instincts.Hands-on experience with web scraping, crawling, browser automation, data extraction, or web infrastructure.Familiarity with tools or concepts such as Playwright, Puppeteer, Chromium, Crawlee, Scrapy, Apify SDK, headless browsers, anti-bot systems, JS rendering, HTML parsing, or stealth techniques.Strong TypeScript and/or Node.js experience.Experience building or operating production APIs under real usage.Good instincts for what makes scraped or extracted data clean, useful, and LLM-ready.Comfort making tradeoffs across quality, latency, reliability, cost, and developer experience.High ownership, fast execution, and the ability to define what "good" looks like without a PM or designer. Nice to HaveExperience building developer-facing APIs, SDKs, or infrastructure products.Open-source contributions to scraping, crawling, browser automation, or data extraction projects.Experience with observability tools such as Datadog, Sentry, OpenTelemetry, Elastic, or similar.Experience building on top of LLMs or designing data pipelines for AI applications.Prior founding engineer, early engineer, or startup experience. Why JoinOwn a core product surface used by a large developer community.Join a small, high-velocity team with direct founder access.Work on hard, real-world infrastructure problems at the intersection of web data, APIs, and AI.Competitive compensation, meaningful equity, and remote flexibility across Americas time zones.Help define the best-in-class developer experience for turning the web into clean, usable AI-ready data.