Egocentric
Head, chest, and smart-glass views for human manipulation workflows.
Physical AI DataOps
We collect, enrich, and deliver real-world manipulation data for robotics labs, VLA models, and world-model teams.
200+
in-house operators
180s
review-ready clip units
Ego + Exo
synchronized capture
QA
human verified delivery
Anonymized proof
Capture system
World Archive builds task-specific capture programs across factories, workshops, textile floors, retail aisles, packing stations, and tool-use environments. The output is not a video reel; it is a traceable dataset package.
Head, chest, and smart-glass views for human manipulation workflows.
Fixed side, overhead, and wide scene cameras for world context.
Close-range object contact, tool handling, and fine-grain assembly.
Sensor depth where available, with clearly marked monocular estimates.
Motion traces from hands, wrists, tools, or capture rigs.
Task intent, spoken notes, sub-step captions, and failure annotations.
Enrichment
Automated labeling gets the first pass. Human QA closes the loop. The deliverable is built for model teams that need synchronized media, annotations, provenance, and clip quality in one place.
Boxes and persistent IDs for manipulated objects, tools, and fixtures.
Masks for hands, target objects, tools, and work surfaces.
Frame-level 2D hand landmarks, plus estimated 3D when capture permits.
Reach, grasp, insert, align, press, release, inspect, retry.
Camera pose, scene anchors, depth maps, and calibration metadata.
Blur, occlusion, PII, manipulation density, success, and rejection notes.
Sample dataset pack
Clients can review a compact clip, inspect overlays, and ingest structured labels. Full parent sessions stay available for licensing, but 180 seconds is the clean unit for QA, buyer review, and model-data sampling.
videos/shuttle_001_raw.mp4
180-second egocentric source clip
videos/shuttle_001_overlay.mp4
tracks, masks, depth, and action boundaries
annotations/episodes.jsonl
task, sub-step, success, failure, and timestamps
annotations/hand_keypoints.jsonl
21-point hand landmarks with confidence scores
annotations/objects_coco.json
boxes, masks, object IDs, and frame spans
Task domains
We prioritize workflows with repeated contact, tool use, alignment, grasp correction, inspection, and failure recovery. These are the behaviors physical-AI systems need to observe before deployment.
Pipeline
Our operating loop is intentionally narrow: collect the real task, package it cleanly, annotate the signals that matter, verify quality, and deliver in formats model teams can actually use.
01
Operators record real manipulation tasks with ego, exo, wrist, depth, or IMU rigs.
02
Long sessions are segmented into buyer-reviewable 180-second units with parent-session links.
03
Models draft hand tracks, masks, object IDs, depth estimates, and temporal labels.
04
Human QA checks visibility, privacy, contact states, and task success before delivery.
05
Teams receive MP4, overlay previews, JSONL, Parquet, masks, and QA scorecards.
QA scorecard
Every dataset package includes visibility, privacy, manipulation density, task success, and annotation QA so buyers can decide quickly.
Hand visibility
94%
Manipulation density
87%
PII cleared
Yes
Task success
Pass
Depth source
Sensor / estimated
Annotation QA
Human verified
Delivery formats
| Media | MP4 raw clips, synchronized overlay previews, parent-session references |
|---|---|
| Annotations | JSONL episodes, COCO-style objects, masks, hand landmarks |
| Metadata | Device, location class, operator, task, timestamps, consent, QA |
| Reports | Privacy review, rejection notes, manipulation density, checksums |
Founder-led scoping
Share the target behavior, environment, modality, annotation need, and delivery timeline. We will map the capture protocol and sample pack.