A YOLOv8-based detector for manga speech bubbles and text boxes. This project uses computer vision and deep learning to automatically detect and classify different types of text elements in manga ...
DEIMv2 is an evolution of the DEIM framework while leveraging the rich features from DINOv3. Our method is designed with various model sizes, from an ultra-light version up to S, M, L, and X, to be ...