
Comic Speech Bubble Detection - ADetailer - (comic_speechbubble_m_yolov8)

Rating: 5 (124 reviews)
Author: mnemic


Updated: Jan 31, 2025

tool

adetailer

Download (19.78 MB)

Verified: 2 years ago


Details

Type	Detection
Stats	199 0
Reviews	Positive (48)
Published	Feb 18, 2024
Base Model	SD 1.5
Hash	AutoV2 997D183A99


A YOLOv8 detection model that detects comic book speech bubbles and sound effects in images.

The model can be used as an ADetailer model (for Automatic1111 / Stable Diffusion), or with other inference scripts to return the bounding boxes of the detected speech bubbles and sound effects.

A small tutorial on how to use the model can be found on this GitHub: https://github.com/MNeMoNiCuZ/yolov8-scripts or in this CivitAI article.
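
As a rough illustration of the inference-script route, here is a minimal sketch using the ultralytics Python package. It is not taken from the linked tutorial. The weights filename, the 0.3 confidence threshold, and the helper names to_detections and detect_speech_bubbles are assumptions made for this example, and the class labels come from whatever names are stored in the model file.

```python
from typing import Dict, List, Sequence

# Assumed local filename for the downloaded weights; adjust to your own path.
WEIGHTS = "comic_speechbubble_m_yolov8_v1.pt"


def to_detections(
    xyxy: Sequence[Sequence[float]],
    confs: Sequence[float],
    class_ids: Sequence[float],
    names: Dict[int, str],
) -> List[dict]:
    """Pack raw box data into plain dicts, highest confidence first."""
    detections = [
        {
            "box": [round(float(v), 1) for v in box],  # x1, y1, x2, y2 in pixels
            "confidence": float(conf),
            # Fall back to the numeric id if the class has no stored name.
            "label": names.get(int(cls), str(int(cls))),
        }
        for box, conf, cls in zip(xyxy, confs, class_ids)
    ]
    return sorted(detections, key=lambda d: d["confidence"], reverse=True)


def detect_speech_bubbles(
    image_path: str, weights: str = WEIGHTS, min_conf: float = 0.3
) -> List[dict]:
    """Run the detector on one image and return its bounding boxes."""
    # Imported lazily so the helper above works without the package installed.
    # Requires: pip install ultralytics
    from ultralytics import YOLO

    model = YOLO(weights)
    result = model.predict(image_path, conf=min_conf, verbose=False)[0]
    boxes = result.boxes
    return to_detections(
        boxes.xyxy.tolist(), boxes.conf.tolist(), boxes.cls.tolist(), result.names
    )
```

Calling detect_speech_bubbles("page.png") (a hypothetical image path) would return a list of dicts with box, confidence, and label keys, which can then be used to crop, mask, or inpaint the detected regions.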

This model is intended for research purposes only. It was trained entirely on the following dataset: yolomanga/speechballoon_comic. However, since that dataset consists entirely of Marvel comic book panels, I doubt the original author can license the images as CC4, and I do not think this model can be used commercially either.

comic_speechbubble_m_yolov8_v1

comic_speechbubble_s_yolov8_v1

Note:

The large preview images may not be from the correct model.

The A1111 screenshots are from the correct version, however.

The medium model performs slightly better in general.