{"schemaVersion":"jobsearcher.job.v1","id":"bed8c3fe402578c209fcf408","url":"https://jobsearcher.com/jobs/bed8c3fe402578c209fcf408","canonicalUrl":"https://jobsearcher.com/jobs/bed8c3fe402578c209fcf408","title":"AI Engineer","description":"About Sixtyfour\nSixtyfour is a data orchestration engine for company and people data. Our AI agents specialize in enriching detailed datapoints about individuals and organizations, gathering information from both public and proprietary sources. We enable enterprise customers to build custom data pipelines that help them find, enrich, qualify, and act on company‑specific requirements for people and business data.\n\nOur Mission\nTo power how enterprises understand and act on people and companies with world‑class intelligence.\n\nWhat You’ll Do\n\nDesign and ship agentic systems (tool calling, multi‑agent workflows, structured outputs) that reliably fetch, extract, and normalize data across the web and APIs.\n\nOwn robust web scraping: directory crawling, CAPTCHA handling, headless browsers, rotating proxies, anti‑bot evasion, and backoff/retry policies.\n\nDevelop backend services in Python + FastAPI with clean contracts and strong observability.\n\nScale workloads on AWS + Docker (batch/queue workers, autoscaling, fault tolerance, cost control).\n\nParallelize external API requests safely (rate limits, idempotency, circuit breakers, retries, dedupe).\n\nIntegrate third‑party APIs for enrichment and search; model and cache responses; manage schema evolution.\n\nTransform and analyze data using Pandas (or similar) for normalization, QA, and reporting.\n\nPitch in across the stack: billing (Stripe), and occasional front‑end changes to ship end‑to‑end features.\n\nMinimum Requirements\n\nHands‑on experience with agentic architectures (tool calling, structured outputs/JSON, planning/execution loops) and prompt engineering.\n\nProven web scraping expertise: solving CAPTCHAs, session/auth flows, proxy rotation, stealth techniques, and legal/ethical constraints.\n\nAWS + Docker in production (at least two of: ECS/EKS, Lambda, SQS/SNS, Batch, Step Functions, CloudWatch).\n\nBuilding high‑throughput data/IO pipelines with concurrency (asyncio/multiprocessing), resilient retries, and rate‑limit aware scheduling.\n\nIntegrating diverse external APIs (auth patterns, pagination, webhooks); designing stable interfaces and backfills.\n\nStrong data wrangling with Pandas or equivalent; comfort with large CSV/Parquet workflows and memory/perf tuning.\n\nExcellent ownership, product sense, and pragmatic debugging.\n\nNice to have\n\nEntity resolution/record linkage at scale (probabilistic matching, blocking, deduping).\n\nExperience with Langfuse, OpenTelemetry, or similar for tracing/evals; task queues (Celery/RQ), Redis, Postgres.\n\nSearch relevance (BM25/vector/hybrid), embeddings, and retrieval pipelines.\n\nPlaywright/Selenium, stealth browsers, anti‑bot frameworks, CAPTCHA providers.\n\nCI/CD, infrastructure as code (Terraform), and cost/perf observability.\n\nSecurity & compliance basics for data handling and PII.\n\n#J-18808-Ljbffr","company":"SupportFinity","rawCompany":"supportfinity","city":"Millbrae","state":"CA","isRemote":false,"isActive":false,"createdAt":"2026-06-18T03:15:10.954Z","occupations":[{"code":"15-1252.00","title":"Software Developers","slug":"software-developers"},{"code":"15-1299.08","title":"Computer Systems Engineers/Architects","slug":"computer-systems-engineers-architects"},{"code":"15-1243.01","title":"Data Warehousing Specialists","slug":"data-warehousing-specialists"}],"industries":[{"code":"541511","title":"Custom Computer Programming Services","slug":"custom-computer-programming-services"},{"code":"513210","title":"Software Publishers","slug":"software-publishers"},{"code":"541512","title":"Computer Systems Design Services","slug":"computer-systems-design-services"}],"jobPosting":{"@context":"https://schema.org","@type":"JobPosting","title":"AI Engineer","description":"About Sixtyfour\nSixtyfour is a data orchestration engine for company and people data. Our AI agents specialize in enriching detailed datapoints about individuals and organizations, gathering information from both public and proprietary sources. We enable enterprise customers to build custom data pipelines that help them find, enrich, qualify, and act on company‑specific requirements for people and business data.\n\nOur Mission\nTo power how enterprises understand and act on people and companies with world‑class intelligence.\n\nWhat You’ll Do\n\nDesign and ship agentic systems (tool calling, multi‑agent workflows, structured outputs) that reliably fetch, extract, and normalize data across the web and APIs.\n\nOwn robust web scraping: directory crawling, CAPTCHA handling, headless browsers, rotating proxies, anti‑bot evasion, and backoff/retry policies.\n\nDevelop backend services in Python + FastAPI with clean contracts and strong observability.\n\nScale workloads on AWS + Docker (batch/queue workers, autoscaling, fault tolerance, cost control).\n\nParallelize external API requests safely (rate limits, idempotency, circuit breakers, retries, dedupe).\n\nIntegrate third‑party APIs for enrichment and search; model and cache responses; manage schema evolution.\n\nTransform and analyze data using Pandas (or similar) for normalization, QA, and reporting.\n\nPitch in across the stack: billing (Stripe), and occasional front‑end changes to ship end‑to‑end features.\n\nMinimum Requirements\n\nHands‑on experience with agentic architectures (tool calling, structured outputs/JSON, planning/execution loops) and prompt engineering.\n\nProven web scraping expertise: solving CAPTCHAs, session/auth flows, proxy rotation, stealth techniques, and legal/ethical constraints.\n\nAWS + Docker in production (at least two of: ECS/EKS, Lambda, SQS/SNS, Batch, Step Functions, CloudWatch).\n\nBuilding high‑throughput data/IO pipelines with concurrency (asyncio/multiprocessing), resilient retries, and rate‑limit aware scheduling.\n\nIntegrating diverse external APIs (auth patterns, pagination, webhooks); designing stable interfaces and backfills.\n\nStrong data wrangling with Pandas or equivalent; comfort with large CSV/Parquet workflows and memory/perf tuning.\n\nExcellent ownership, product sense, and pragmatic debugging.\n\nNice to have\n\nEntity resolution/record linkage at scale (probabilistic matching, blocking, deduping).\n\nExperience with Langfuse, OpenTelemetry, or similar for tracing/evals; task queues (Celery/RQ), Redis, Postgres.\n\nSearch relevance (BM25/vector/hybrid), embeddings, and retrieval pipelines.\n\nPlaywright/Selenium, stealth browsers, anti‑bot frameworks, CAPTCHA providers.\n\nCI/CD, infrastructure as code (Terraform), and cost/perf observability.\n\nSecurity & compliance basics for data handling and PII.\n\n#J-18808-Ljbffr","datePosted":"2026-06-18T03:15:10.954Z","dateModified":"2026-06-18T03:15:10.954Z","hiringOrganization":{"@type":"Organization","name":"SupportFinity","sameAs":"https://jobsearcher.com"},"jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Millbrae","addressRegion":"CA","addressCountry":"US"}},"identifier":{"@type":"PropertyValue","name":"JobSearcher","value":"bed8c3fe402578c209fcf408"},"url":"https://jobsearcher.com/jobs/bed8c3fe402578c209fcf408"}}