• Built a crowdsourcing platform for collecting and validating long-tail job posting leads across regions and languages.
• Implemented task subscriptions, ranked next-task delivery, dynamic forms, assignment and task routing, worker identity, performance tracking, and downstream ingestion triggers.
• Added quality-control mechanisms including majority voting, cross-validation, and consistency checks before data entered crawling pipelines.
• Enabled hundreds of global workers to complete hundreds of thousands of tasks, contributing to >$10M in customer LTV.
• Built a websites classifier on the publicwww.com dataset — ~750k positively identified sites fed into crawler discovery.