Open
Description
π― Task
Lower a cube off a Post-it and then place another cube onto it.
π€ Robot & Data
- Robot: OpenManipulator Sara (6-DOF)
- Camera/Robot Pose: Not fixed β I modified the diffusion code so data transforms run continuously.
π§ What I Tried
- ALoHa with default params
β Ran training script β agent froze in one posture. - VQ Encoder
β Difficult to integrate, only 1 fps output β impractical.
β (ALoHa team: clearer docs would help.) - Temporal Embedding at Inference
β Without it β jerky, inconsistent motions (needs formal testing). - Hyperparameter Sweeps
β LR, scheduler, batch size, chunk size, KL weight, data volume, training steps, encoder on/off.
β All variants eventually got βstuckβ in a single pose. - Simple Task Baseline
β Cubeβbin pick-and-place learned perfectly under constrained environment.
β Used that as a starting point for harder Post-it task.
π Breakthrough
- Hidden Dim increase: 512 β 1024
- Result: significant improvement
- More pick/place behaviors, proper sequencing (lower first cube before touching the next)
- Lesson: Larger hidden dimensions help handle complex tasks.
π Next Steps
- Co-train with varied datasets
- Revisit the native ALoHa platform for better data-edit workflow
βοΈ Contact: [email protected]
Feel free to reach out for questions or tips!
Metadata
Metadata
Assignees
Labels
No labels