Skip to content

ALoHa Training Tips for Engineers (Based on My Experience)Β #1169

Open
@aoml3245

Description

@aoml3245

🎯 Task

Lower a cube off a Post-it and then place another cube onto it.


πŸ€– Robot & Data

  • Robot: OpenManipulator Sara (6-DOF)
  • Camera/Robot Pose: Not fixed β†’ I modified the diffusion code so data transforms run continuously.

πŸ”§ What I Tried

  1. ALoHa with default params
    – Ran training script β†’ agent froze in one posture.
  2. VQ Encoder
    – Difficult to integrate, only 1 fps output β†’ impractical.
    – (ALoHa team: clearer docs would help.)
  3. Temporal Embedding at Inference
    – Without it β†’ jerky, inconsistent motions (needs formal testing).
  4. Hyperparameter Sweeps
    – LR, scheduler, batch size, chunk size, KL weight, data volume, training steps, encoder on/off.
    – All variants eventually got β€œstuck” in a single pose.
  5. Simple Task Baseline
    – Cubeβ†’bin pick-and-place learned perfectly under constrained environment.
    – Used that as a starting point for harder Post-it task.

πŸš€ Breakthrough

  • Hidden Dim increase: 512 β†’ 1024
    • Result: significant improvement
    • More pick/place behaviors, proper sequencing (lower first cube before touching the next)
  • Lesson: Larger hidden dimensions help handle complex tasks.

πŸ”œ Next Steps

  • Co-train with varied datasets
  • Revisit the native ALoHa platform for better data-edit workflow

βœ‰οΈ Contact: [email protected]
Feel free to reach out for questions or tips!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions