Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation

Code for the Img2Loc paper, presented at SIGIR 2024.

Installation

Follow the steps below to install and set up the project. If you need help accessing the generated embeddings, please contact us directly.

# Clone the repository
git clone git@github.com:Douglas2Code/Img2Loc.git

# Change to the project directory
cd Img2Loc

# Create a conda environment
conda create -n img2loc python=3.10 -y

# Activate the conda environment
conda activate img2loc

# Install the FAISS library following this guide
https://github.com/facebookresearch/faiss/blob/main/INSTALL.md

# Install the project dependencies
pip install -r requirements.txt

# Download MP16 dataset
http://www.multimediaeval.org/mediaeval2016/placing/

# Generate embeddings using CLIP model
https://github.com/openai/CLIP

# Generate a vector database using FAISS
https://github.com/facebookresearch/faiss/wiki/Getting-started#in-python-1
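The last step above, building a vector database from precomputed embeddings, might look roughly like the sketch below. It is an illustration under stated assumptions, not the code used in this repository: the helper names (`l2_normalize`, `build_index`, `search`), the 512-dimensional random vectors standing in for CLIP image embeddings, and the random coordinates standing in for MP16 metadata are all hypothetical. When FAISS is not importable, the sketch falls back to an exact NumPy search so that it still runs.

```python
import numpy as np

try:
    import faiss  # installed per the FAISS INSTALL.md guide linked above
except ImportError:
    faiss = None  # fall back to brute-force NumPy search (assumption for this sketch)


def l2_normalize(x):
    """Return a contiguous float32 copy of x with unit-length rows."""
    x = np.asarray(x, dtype=np.float32)
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return np.ascontiguousarray(x / np.maximum(norms, 1e-12), dtype=np.float32)


def build_index(embeddings):
    """Build an exact inner-product index over L2-normalized embeddings."""
    vectors = l2_normalize(embeddings)
    if faiss is None:
        return vectors  # the fallback "index" is just the normalized matrix
    index = faiss.IndexFlatIP(vectors.shape[1])
    index.add(vectors)
    return index


def search(index, queries, k=5):
    """Return (scores, ids) of the k most similar database rows per query."""
    q = l2_normalize(queries)
    if faiss is None:
        sims = q @ index.T
        ids = np.argsort(-sims, axis=1)[:, :k]
        return np.take_along_axis(sims, ids, axis=1), ids
    return index.search(q, k)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Placeholder data: random vectors instead of real CLIP embeddings,
    # random (lat, lon) pairs instead of real MP16 coordinates.
    db = rng.normal(size=(1000, 512))
    coords = np.column_stack(
        [rng.uniform(-90, 90, 1000), rng.uniform(-180, 180, 1000)]
    )
    index = build_index(db)
    scores, ids = search(index, db[:1], k=3)
    print("neighbor ids:", ids[0])
    print("neighbor coordinates:", coords[ids[0]])
```

Because the rows are L2-normalized, the inner product equals cosine similarity, so the top result for a query taken from the database is the query itself. With FAISS installed, `faiss.write_index` and `faiss.read_index` can persist and reload the index between runs; the file name is up to you.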

Usage

Run the Streamlit application:

streamlit run app.py --browser.gatherUsageStats false

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contact

Zhongliang Zhou: [email protected]
Jielu Zhang: [email protected]

Citation

If you find this project helpful, please consider citing our work.

@inproceedings{zhou2024img2loc,
  title={Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation},
  author={Zhou, Zhongliang and Zhang, Jielu and Guan, Zihan and Hu, Mengxuan and Lao, Ni and Mu, Lan and Li, Sheng and Mai, Gengchen},
  booktitle={Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval},
  pages={2749--2754},
  year={2024}
}
