Home / How we collect / Hybrid
Dedicated core for the spine of your dataset. Crowdsourced edge for diversity and long-tail. One unified pipeline.
Hybrid combines a dedicated core team with crowdsource edge collection under a single SOW. Your core handles high-complexity, IP-sensitive tasks on controlled rigs. The crowd fills in scene diversity, geographic spread, and long-tail variants from real homes and workspaces across 12 countries.
Best for
Teams that need lab-quality core data and real-world diversity in the same training set — without managing two vendors.
80/20 typical core/edge split.
14 days SOW to first batch.
1 pipeline unified delivery.
Where production teams land after trying pure dedicated or pure crowdsource and finding both lacking.
A high-quality spine for the bulk of training, plus diversity capture for robustness and long-tail generalization.
Where pure crowdsource is too noisy for the spine and pure dedicated is too slow to scale the edge.
Dedicated for held-out benchmarks. Crowdsourced for fine-tune sprints between releases.
Parallel ramp on both sides. Same SOW, same SLAs, same accountability.
Define what’s spine (high-quality, controlled) and what’s edge (diversity, scale). One SOW covers both.
Dedicated pod begins recruitment, training, and pilot. Default 80% of unit volume.
Crowdsource network onboards for diversity, geographic spread, and long-tail capture.
Both layers operating. Weekly merge into your dataset. Unified QA and dashboard across both.
A dedicated spine and a crowdsourced edge, both reporting into one dataset and one dashboard.
Default ratio. Tunable per program.
A hybrid dataset is only as good as the layer joins. These four pillars hold the seam tight.
Both layers tested against the same acceptance criteria. The spine sets the bar. The edge is measured against it.
Dedicated operators occasionally take crowd tasks. Crowd seniors take spine tasks. Drift detection runs both directions.
Every unit tagged with layer, operator tier, geography, and hardware. Filter your dataset by source at training time.
A single view across both layers. Same accuracy metrics, same throughput counts, same SLAs.
Pick hybrid when you need a dedicated spine for quality and a crowdsourced edge for diversity.
| Criteria | Dedicated | Crowdsource | HybridRECOMMENDED FOR PRODUCTION |
|---|---|---|---|
| IP control | Full exclusivity. Operators NDA’d. | Per-batch NDA. | Mixed — dedicated core, crowdsourced edge. |
| Operator quality | Trained on your spec to a bar you set. | Tier-qualified. Variable. Aggregated. | Best for spine + scale for diversity. |
| Ramp time | 4 weeks SOW to production. | 3 days spec to first units. | 2–3 weeks. Dedicated first, crowd added. |
| Hardware | Your rig in our studio, or our matching rig. | Operator-provided or our standard rigs. | Mix of both per task type. |
| Best for | Surgical, AV, defense, industrial. | Long-tail, scene diversity, geo spread. | Most production foundation models. |
| Pricing | Per FTE-month. | Per task or per hour. | Custom SOW. |
Considering the other two? Dedicated • Crowdsource
FAQ
From the blog
How to Scale Teleop Data Collection Without Losing QualityThe hybrid model for burst scale without sacrificing core quality.
Tell us your spine-vs-edge ratio. We’ll scope both layers in a single SOW. Fourteen days to first batch.