dots.ocr Designed for universal accessibility, it possesses the capability to recognize virtually any human script. Beyond achieving state-of-the-art (SOTA) performance in standard multilingual ...
Abstract: We present GLEE in this work, an object-level foundation model for locating and identifying objects in images and videos. Through a unified framework, GLEE accomplishes detection, ...
Abstract: Object detection plays a pivotal role in intelligent transportation systems. In recent years, the rapid advancements in deep learning have propelled object detection to a new level. However, ...