Skip to content

s9roll7/speech_bubble_remove_and_copy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Speech Bubble Remove and Copy Tool

This is a tool to remove or copy the speech bubbles.

  • Remove all speech bubbles from the image. (source image / cleaned image)
  • (As you can see from this example, not all speech bubbles can be captured completely. Or maybe the sample images aren't suitable...)

  • Copy all speech bubbles in image A to image B. (source image(A) / base image(B) / base image with speech bubble)

Installation(for windows)

Python 3.10 and git client must be installed

git clone https://github.com/s9roll7/speech_bubble_remove_and_copy.git
cd speech_bubble_remove_and_copy
py -3.10 -m venv venv
venv\Scripts\activate.bat
# Please install torch according to your environment.(https://pytorch.org/get-started/locally/)
python -m pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
pip install -r requirements.txt

How To Use

run launch_cmd.bat

# remove
python main.py remove SRC_IMAGE_DIR_PATH

#!! Create base images based on the clean images.
#!! and put it in BASE_IMAGE_DIR with the "same file name".

# copy
python main.py copy SRC_IMAGE_DIR_PATH BASE_IMAGE_DIR_PATH

or

# remove
#  prepare your project directory
#  my_proj_1/
#        src/ <--- put source images here


#  with comic panel extraction
python main.py remove_proj PROJ_DIR_PATH

or

#  without comic panel extraction
python main.py remove_proj PROJ_DIR_PATH --split=False

# copy
#  prepare your project directory
#  my_proj_1/
#        src/ <--- put source images here
#        base/ <--- put base images here

#!! Create base images based on the clean images.
#!! and put it in "my_proj_1/base" with the "same file name".

python main.py copy_proj PROJ_DIR_PATH

Advanced Settings

Switching models

If you want to include text as well as speech bubbles in the process, you need to use a different model.
Download the "adetailerForTextSpeech" model from civitai, Place it in the following location.
data/models/adetailerForTextSpeech_v20/unwantedV10x.pt
Use the following command.

# remove
python main.py remove_proj PROJ_DIR_PATH --model_type=1

# copy
python main.py copy_proj PROJ_DIR_PATH --model_type=1

If you want to use a different model, edit YOLO_SEG_MODEL_LOCATION(in bubble_tool.py)

Changelog

2024-12-22

Added comic panel extraction function
Changed lama model

Related resources

About

Tool to remove or copy the speech bubbles

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published