矩形
计算机科学
对角线的
方向(向量空间)
代表(政治)
独特性
间断(语言学)
顶点(图论)
四边形的
对象(语法)
水平和垂直
算法
计算机视觉
数学
几何学
人工智能
理论计算机科学
图形
物理
热力学
数学分析
有限元法
政治
法学
政治学
作者
Guangtao Nie,Hua Huang
标识
DOI:10.1109/tpami.2022.3191753
摘要
Most existing methods adopt the quadrilateral or rotated rectangle representation to detect multi-oriented objects. Yet, the same oriented object may correspond to several different representations, due to different vertex ordering, or angular periodicity and edge exchangeability. To ensure the uniqueness of the representation, some engineered rules are usually added. This makes these methods suffer from discontinuity problem, resulting in degraded performance for objects around some orientation. In this article, we propose to encode the multi-oriented object with double horizontal rectangles (DHRec) to solve the discontinuity problem. Specifically, for an oriented object, we arrange the horizontal and vertical coordinates of its four vertices in left-right and top-down order, respectively. The first (resp. second) horizontal box is given by two diagonal points with smallest (resp. second) and third (resp. largest) coordinates in both horizontal and vertical dimensions. We then regress three factors given by area ratios between different regions, helping to guide the oriented object decoding from the predicted DHRec. Inherited from the uniqueness of horizontal rectangle representation, the proposed method is free of discontinuity issue, and can accurately detect objects of arbitrary orientation. Extensive experimental results show that the proposed method significantly improves the existing baseline representation, and outperforms state-of-the-art methods. The code is available at: https://github.com/lightbillow/DHRec.
科研通智能强力驱动
Strongly Powered by AbleSci AI