imgutils.detect.text
- Overview:
Detect text in images.
Models are hosted on deepghs/text_detection.
This is an overall benchmark of all the text detect models:
Warning
This module has been deprecated and will be removed in the future.
It is recommended to migrate to the
imgutils.ocr.detect_text_with_ocr()
function as soon as possible. This function uses a higher-quality text detection model provided by PaddleOCR, resulting in improved performance and higher efficiency.
detect_text
- imgutils.detect.text.detect_text(image: str | PathLike | bytes | bytearray | BinaryIO | Image, model: str = 'dbnetpp_resnet50_fpnc_1200e_icdar2015', threshold: float = 0.05, max_area_size: int | None = 640)[source]
Detect text regions in the given image using the specified model and threshold.
- Parameters:
image (ImageTyping) – Input image.
model (str) – Model name for DBNET or DBNET++.
threshold (float) – Confidence threshold for text detection.
max_area_size (Optional[int]) – Max area size when doing inference. Default is
640
, which means if the image’s area is over 640x640, it will be resized. When assigned toNone
, it means do not resize in any case.
- Returns:
List of detected text bounding boxes, labels, and scores.
- Return type:
List[Tuple[Tuple[int, int, int, int], str, float]]
Warning
This function is deprecated, and it will be removed from imgutils in the future. Please migrate to
imgutils.ocr.detect_text_with_ocr()
as soon as possible.Deprecated since version 0.2.10: Use the new function
imgutils.ocr.detect_text_with_ocr()
instead