Sign In

Comic Speech Bubble Detection - ADetailer - (comic_speechbubble_m_yolov8)

110

1.4k

61

Updated: Jan 31, 2025

tooladetailer

Type

Detection

Stats

168

0

Reviews

Published

Feb 18, 2024

Base Model

SD 1.5

Hash

AutoV2
997D183A99
Civitai Festive 2024 Contest Winner
MN

mnemic

A Yolov8 detection model that detects comic book speech bubbles and sound effects in images.

The model can be used as an ADetailer model (for Automatic1111 / Stable Diffusion use), or using other inference scripts to return detection bounding boxes of watermarks.

A small tutorial on how to use the model can be found on this Github: https://github.com/MNeMoNiCuZ/yolov8-scripts or this CivitAI article.


This model is only meant for research purposes. The model is entirely trained on the following dataset: yolomanga/speechballoon_comic However, since the dataset is created entirely out of Marvel comic book panels, I think the original author cannot licence the images as CC4. I do not think this model can ba used commercially either.


comic_speechbubble_m_yolov8_v1: image/jpeg

comic_speechbubble_s_yolov8_v1

Note:

The large preview images may not be from the correct model.

The A1111 screenshots are from the correct version however.

The medium model performs slightly better in general.