Auto-detect format, list episodes, cameras, action/state dimensions, FPS, and schema. Works with local paths and HuggingFace URIs.
$ forge inspecthf://lerobot/aloha_sim_cube
Quality Scoring
Score every episode 0-10 with 8 research-backed metrics. Detect jerky demos, dead actions, gripper chatter, and idle periods from proprioception alone.
$ forge quality./my_dataset--export report.json
Video Quality
Opt-in camera-stream scoring: blur, exposure, frozen frames, and colorfulness (Tier 0), plus optical-flow motion, smoothness, and shot-cut detection (Tier 1). Composes with filtering.
Find and remove near-duplicate episodes — exact copies, re-encodes, near-identical takes — by perceptual hashing of camera keyframes. Numpy only, no model.
$ forge dedup./dataset./deduped
Episode Segmentation
PELT changepoint detection on proprioception signals. Automatically split episodes into sub-skills, regime changes, and idle periods.
Browser-based viewer with multi-camera support, action/state charts, timeline scrubber, and segment overlay. Zero extra dependencies.
$ forge visualizepusht--segment
Action Tokenization
Turn continuous actions into the discrete tokens VLA models train on. Four built-in strategies (RT-1, OpenVLA, quantile, mu-law) and a comparator that benchmarks them on your data — measure, don't guess.
$ forge lint hf://lerobot/pushtWARNcamera.ambiguous_name
Camera 'image' doesn't say where it is (third-person vs wrist?).
→ Use <modality>.<location>, e.g. 'observation.images.wrist'.WARNtask.missing
Dataset has no language / task instructions.
→ VLA training needs per-episode instructions; add them.INFOcamera.low_resolution
Camera 'image' is 96x96; below 640x480.
INFOcamera.too_few_views
Only 1 camera view(s); HF recommends >= 2.
PASS 206 episodes 0 errors,2 warnings,2 info