Skip to content

[CVPR‘ 2025 ] JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration

License

Notifications You must be signed in to change notification settings

LYL1015/JarvisIR

Repository files navigation

[CVPR' 2025] JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration

JarvisIR Logo

1Xiamen University, 2The Hong Kong University of Science and Technology (Guangzhou), 3Bytedance's Pico, 4Tencent, 5Huawei Noah's Ark Lab, 6The Chinese University of Hong Kong

Accepted by CVPR 2025

JarvisIR Demo

JarvisIR Demo

JarvisIR Gradio Demo: Showcasing image restoration capabilities under various adverse weather conditions

📮 Updates

🧭 Navigation


♦️ Overview

JarvisIR (CVPR 2025) is a VLM-powered agent designed to tackle the challenges of vision-centric perception systems under unpredictable and coupled weather degradations. It leverages the VLM as a controller to manage multiple expert restoration models, enabling robust and autonomous operation in real-world conditions. JarvisIR employs a novel two-stage framework consisting of supervised fine-tuning and human feedback alignment, allowing it to effectively fine-tune on large-scale real-world data in an unsupervised manner. Supported by CleanBench, a comprehensive dataset with 150K synthetic and 80K real instruction-response pairs, JarvisIR demonstrates superior decision-making and restoration capabilities, achieving a 50% improvement in the average of all perception metrics on CleanBench-Real.

JarvisIR Teaser

💻 Getting Started

For gradio demo runing, please follow:

For inference and model usage, please follow:

For image degradation data synthesis, please refer to:

For sft training and environment setup preparation, please follow:

🧰 Expert Models

JarvisIR integrates multiple expert restoration models to handle various types of image degradation. To test the performance of individual expert models, please refer to the instructions and scripts provided in ./package/agent_tools/.

Task Model Description
Super-resolution Real-ESRGAN Fast GAN-based model for super-resolution, deblurring, and artifact removal
Denoising SCUNet Hybrid UNet-based model combining convolution and transformer blocks for robust denoising
Deraining UDR-S2Former Uncertainty-aware transformer model for rain streak removal
Img2img-turbo-rain Efficient SD-turbo based model for fast and effective rain removal
Raindrop removal IDT Transformer-based model for de-raining and raindrop removal
Dehazing RIDCP Efficient dehazing model utilizing high-quality codebook priors
KANet Efficient dehazing network using a localization-and-removal pipeline
Desnowing Img2img-turbo-snow Efficient model for removing snow artifacts while preserving natural scene details
Snowmaster Real-world image desnowing via MLLM with multi-model feedback optimization
Low-light enhancement Retinexformer One-stage Retinex-based Transformer for low-light image enhancement
HVICIDNet Lightweight transformer for low-light and exposure correction
LightenDiff Diffusion-based framework for low-light enhancement

🎪 Checklist

  • Release preview inference code and gradio demo
  • Release sft training
  • Release Inference code
  • Release huggingFace online demo
  • Release degradation synthesis code
  • Release mrrhf training code
  • Release CleanBench dataset

🙏 Acknowledgements

We would like to express our gratitude to HuggingGPT, XTuner, and RRHF for their valuable open-source contributions which have provided important technical references for our work.

🤟 Citation

@inproceedings{jarvisir2025,
  title={JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration},
  author={Lin, Yunlong and Lin, Zixu and Chen, Haoyu and Pan, Panwang and Li, Chenxin and Chen, Sixiang and Kairun, Wen and Jin, Yeying and Li, Wenbo and Ding, Xinghao},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2025}
}

About

[CVPR‘ 2025 ] JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •