imgutils.detect.head

Overview:

Detect human heads (including the entire head) in anime images.

The models are trained on the ani_face_detection dataset with YOLOv8.

[Figure: head detection demo (head_detect_demo.plot.py.svg)]

Overall benchmark of all the head detection models:

[Figure: head detection benchmark (head_detect_benchmark.plot.py.svg)]

detect_heads

imgutils.detect.head.detect_heads(image: str | PathLike | bytes | bytearray | BinaryIO | Image, level: str = 's', max_infer_size=640, conf_threshold: float = 0.3, iou_threshold: float = 0.7) → List[Tuple[Tuple[int, int, int, int], str, float]]
Overview:

Detect human heads in anime images.

Parameters:
  • image – Image to detect.

  • level – The model level to use, either s or n. The n model runs faster with less system overhead, while the s model achieves higher accuracy. The default value is s.

  • max_infer_size – The maximum image size used for model inference. If the image exceeds this limit, it is resized before inference. The default value is 640 pixels.

  • conf_threshold – The confidence threshold. Only detection results with confidence scores above this threshold are returned. The default value is 0.3.

  • iou_threshold – The overlap (IoU) threshold for detected areas. Areas whose overlap exceeds this threshold are discarded. The default value is 0.7.
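
To make the quantity behind iou_threshold concrete, the overlap between two boxes can be written as intersection over union. This is a minimal sketch, not the library's internal implementation: box_iou is a hypothetical helper name, and boxes are assumed to be (x0, y0, x1, y1) tuples like those in the return value.

```python
from typing import Tuple

Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1)


def box_iou(a: Box, b: Box) -> float:
    """Hypothetical helper: intersection-over-union of two boxes."""
    # Corners of the overlapping rectangle (may be empty).
    ix0, iy0 = max(a[0], b[0]), max(a[1], b[1])
    ix1, iy1 = min(a[2], b[2]), min(a[3], b[3])
    # Clamp to zero when the boxes do not overlap.
    inter = max(0, ix1 - ix0) * max(0, iy1 - iy0)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

Under this reading, two boxes that share half of their width, such as (0, 0, 10, 10) and (5, 0, 15, 10), have an IoU of 1/3, which is below the default threshold of 0.7.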

Returns:

A list of detection results. Each item includes the detected area (x0, y0, x1, y1), the target type (always head), and the target confidence score.
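
Because each item is a plain (box, label, score) tuple, the result can be post-processed with ordinary Python. The sketch below shows one way to keep only the boxes above a stricter score, best first. confident_boxes is a hypothetical helper, not part of imgutils, and the sample values are copied from the example output on this page so the snippet runs without a model.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]
Detection = Tuple[Box, str, float]  # (box, label, score)


def confident_boxes(result: List[Detection], min_score: float) -> List[Box]:
    """Hypothetical helper: boxes scoring at least min_score, best first."""
    kept = [(box, score) for box, _label, score in result if score >= min_score]
    kept.sort(key=lambda item: item[1], reverse=True)
    return [box for box, _score in kept]


# Sample data in the documented return format.
sample = [
    ((29, 441, 204, 584), 'head', 0.7874319553375244),
    ((346, 59, 529, 275), 'head', 0.7510495185852051),
    ((606, 51, 895, 336), 'head', 0.6986488103866577),
]
boxes = confident_boxes(sample, 0.75)
```

Filtering after the call is an alternative to raising conf_threshold when the same result list is reused with several cut-offs.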

Examples:
>>> from imgutils.detect import detect_heads, detection_visualize
>>>
>>> image = 'mostima_post.jpg'
>>> result = detect_heads(image)  # detect it
>>> result
[
    ((29, 441, 204, 584), 'head', 0.7874319553375244),
    ((346, 59, 529, 275), 'head', 0.7510495185852051),
    ((606, 51, 895, 336), 'head', 0.6986488103866577)
]
>>>
>>> # visualize it
>>> from matplotlib import pyplot as plt
>>> plt.imshow(detection_visualize(image, result))
>>> plt.show()