HOW TO RUN

This repository is currently maintained by Tianzong Cheng.

I. Environment

i. ZED SDK

Install from StereoLabs. The AI module is not used in this project, so you can choose not to install it to save time.

ii. ultralytics

Simply pip install ultralytics.

iii. FoundationPose

Docker

If you are running this project on a 40-series GPU, use this docker image (refer to this). Note that this is the default.

If you are running the project for the first time, you need to manually run FoundationPose/build_all.sh in the docker environment. This step isn't required later.

Data Prepare

Follow FoundationPose README and download weight files (demo data are not needed).

iv. Grounded SAM 2

You need to create a Python venv for Grounded SAM 2.

cd Grounded-SAM-2
python3 -m venv gdino-env
source gdino_env/bin/activate
pip3 install torch torchvision torchaudio
# Check CUDA path:
# which nvcc
export CUDA_HOME=/usr/local/cuda-12.4
pip install -e .
pip install --no-build-isolation -e grounding_dino
pip install opencv-python supervision pycocotools transformers addict yapf timm

Download SAM 2 and Grounding DINO checkpoints.

cd Grounded-SAM-2
cd checkpoints
bash download_ckpts.sh
cd ..
cd gdino_checkpoints
bash download_ckpts.sh

v. Misc

Download from JBox and extract it to the root folder: assets/mesh and assets/cam_K.txt.

II. Run

Run run.sh under the root directory.

First, 30 frames will be captured to video/ folder at 15 FPS and exported to .png format. Then, the seg/draw_mask_cup.py script will draw the mask of the mug. Finally, run FoundationPose.

You can change how many frames is recorded in ZED/record.py.

The output is under FoundationPose/debug/ob_in_cam, in the format of a homogeneous transformation matrix.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
FoundationPose		FoundationPose
Grounded-SAM-2		Grounded-SAM-2
ZED		ZED
eye-in-hand-calibration		eye-in-hand-calibration
seg		seg
.gitignore		.gitignore
README.md		README.md
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

HOW TO RUN

I. Environment

i. ZED SDK

ii. ultralytics

iii. FoundationPose

Docker

Data Prepare

iv. Grounded SAM 2

v. Misc

II. Run

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

tianzong-cheng/llm_robot_perception

Folders and files

Latest commit

History

Repository files navigation

HOW TO RUN

I. Environment

i. ZED SDK

ii. ultralytics

iii. FoundationPose

Docker

Data Prepare

iv. Grounded SAM 2

v. Misc

II. Run

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages