CN114445745A - Method, system, apparatus and storage medium for identifying stacked articles - Google Patents
Method, system, apparatus and storage medium for identifying stacked articles Download PDFInfo
- Publication number
- CN114445745A CN114445745A CN202210090912.7A CN202210090912A CN114445745A CN 114445745 A CN114445745 A CN 114445745A CN 202210090912 A CN202210090912 A CN 202210090912A CN 114445745 A CN114445745 A CN 114445745A
- Authority
- CN
- China
- Prior art keywords
- marked
- items
- area
- stacked
- regions
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Image Analysis (AREA)
Abstract
本公开提供了一种叠放物品的识别方法。该方法通过标记带有叠放物品的图像中的物品,以得到分别对应图像中物品的标记区域;轮换标记区域,以使每个标记区域至少一次被作为参考区域;筛选处理标记区域,以得到对应每个参考区域且检测置信度最高的中间区域;将该参考区域与对应的中间区域所标记的物品识别为叠放物品。通过该方法可有效提高由于行检测失败而漏检的叠放物品的检出率。
The present disclosure provides a method for identifying stacked items. In the method, the items in the image with the stacked items are marked to obtain marked areas corresponding to the items in the image respectively; the marked areas are rotated so that each marked area is used as a reference area at least once; the marked areas are screened and processed to obtain Corresponding to each reference area and detecting the middle area with the highest confidence; the items marked by the reference area and the corresponding middle area are identified as stacked items. This method can effectively improve the detection rate of stacked items missed due to line detection failure.
Description
技术领域technical field
本公开属于计算机视觉中的图像处理的技术领域,尤其涉及叠放物品的识别方法、系统、设备及存储介质。The present disclosure belongs to the technical field of image processing in computer vision, and in particular, relates to a method, system, device and storage medium for recognizing stacked objects.
背景技术Background technique
货架中不乏将物品层叠放置的现象。由于叠放的物品通常与被放置在底部的产品相同,在客观上造成了无效的牌面占用,因此相关厂商在计算商品的品牌牌面占比时,会主动忽略叠放的物品。There is no shortage of stacking items on the shelves. Since the stacked items are usually the same as the products placed at the bottom, which objectively causes invalid card surface occupation, the relevant manufacturers will actively ignore the stacked items when calculating the brand card surface ratio of the product.
现有技术中,叠放识别通常是基于行检测来实现的。现有技术中的行指货架每一层。In the prior art, overlapping identification is usually implemented based on line detection. A row in the prior art refers to each level of a shelf.
发明内容SUMMARY OF THE INVENTION
本公开的一方面提供了一种叠放物品的识别方法。叠放物品的识别方法包括如下步骤:An aspect of the present disclosure provides a method for identifying stacked items. The method for identifying stacked items includes the following steps:
获取带有叠放物品的图像,识别并标记所述图像中的物品,以得到分别对应所述图像中物品的标记区域;acquiring an image with stacked items, identifying and marking the items in the image, to obtain marked areas corresponding to the items in the image respectively;
轮换所述标记区域,以使每个所述标记区域至少一次被作为参考区域;Rotating the marked regions such that each of the marked regions is used as a reference region at least once;
筛选处理所述标记区域,以得到对应每个所述参考区域且检测置信度最高的中间区域;将所述参考区域与对应的所述中间区域所标记的物品识别为叠放物品。Screening and processing the marked areas to obtain an intermediate area corresponding to each of the reference areas and having the highest detection confidence; identifying the items marked by the reference area and the corresponding intermediate areas as stacked items.
根据本公开一实施例,将所述参考区域与对应的所述中间区域所标记的物品识别为叠放物品的步骤进一步包括:According to an embodiment of the present disclosure, the step of identifying the items marked in the reference area and the corresponding intermediate area as stacked items further includes:
复筛处理所述标记区域,以得到对应每个所述中间区域且检测置信度最高的目标区域;Re-screening the marked area to obtain a target area corresponding to each of the intermediate areas and with the highest detection confidence;
比较检测置信度高低,以得到所述中间区域分别对应的参考区域和所述目标区域的检测置信度高低的比较结果;Comparing the detection confidence level to obtain a comparison result of the detection confidence level of the reference area corresponding to the intermediate area and the target area respectively;
识别所述比较结果,以将所述参考区域与对应的所述目标区域所标记的物品识别为叠放物品;其中,所述参考区域的检测置信度不低于所述目标区域的检测置信度。Identifying the comparison result to identify the items marked by the reference area and the corresponding target area as stacked items; wherein the detection confidence of the reference area is not lower than the detection confidence of the target area .
根据本公开一实施例,轮换所述标记区域的步骤之前还包括:According to an embodiment of the present disclosure, before the step of rotating the marked area, the step further includes:
对所述标记区域进行排序,以便按序轮换所述标记区域。The marked regions are sorted so that the marked regions are rotated sequentially.
根据本公开一实施例,所述筛选处理中的第二阈值的取值范围为0.6~1。According to an embodiment of the present disclosure, the value range of the second threshold in the screening process is 0.6˜1.
根据本公开一实施例,将被识别为叠放物品的两个或两个以上的标记区域合并为一个标记区域。According to an embodiment of the present disclosure, two or more marked areas identified as stacked items are combined into one marked area.
本公开的另一方面提供了一种叠放物品的识别系统。叠放物品的识别系统用于实现如前所述的叠放物品的识别方法。该识别系统包括:Another aspect of the present disclosure provides an identification system for stacked items. The identification system for stacked items is used to implement the aforementioned method for identifying stacked items. The identification system includes:
物品标记模块,用于获取带有叠放物品的图像,识别并标记所述图像中的物品,以得到分别对应所述图像中物品的标记区域;an item marking module, used for acquiring an image with stacked items, identifying and marking the items in the image, so as to obtain marked areas corresponding to the items in the image respectively;
区域轮换模块,用于轮换所述标记区域,以使每个所述标记区域至少一次被作为参考区域;an area rotation module, configured to rotate the marked areas so that each of the marked areas is used as a reference area at least once;
筛选处理模块,用于筛选处理所述标记区域,以得到对应每个所述参考区域且检测置信度最高的中间区域;a screening and processing module for screening and processing the marked regions to obtain an intermediate region corresponding to each of the reference regions and having the highest detection confidence;
叠放识别模块,用于将所述参考区域与对应的所述中间区域所标记的物品识别为叠放物品。The stacking identification module is configured to identify the items marked by the reference area and the corresponding intermediate area as stacked items.
本公开的再一方面还提供了一种叠放物品的识别设备。该识别设备包括:Yet another aspect of the present disclosure also provides an identification device for stacked items. The identification device includes:
存储器和处理器,所述存储器中存储有指令,所述存储器和所述处理器通过线路互连;a memory and a processor, the memory having instructions stored therein, the memory and the processor being interconnected by wires;
所述处理器调用所述存储器中的所述指令,实现本公开一实施例中的叠放物品的识别方法。The processor invokes the instructions in the memory to implement the method for recognizing stacked objects in an embodiment of the present disclosure.
本公开的最后一方面则是提供了一种计算机可读存储介质。所述计算机可读存储介质上存储有计算机程序,所述计算机程序被处理器执行时实现本公开一实施例中的叠放物品的识别方法。A final aspect of the present disclosure provides a computer-readable storage medium. A computer program is stored on the computer-readable storage medium, and when the computer program is executed by the processor, implements the method for recognizing stacked objects in an embodiment of the present disclosure.
本公开由于采用以上技术方案,使其与现有技术相比具有以下的优点和积极效果:Compared with the prior art, the present disclosure has the following advantages and positive effects due to the adoption of the above technical solutions:
1)本公开一实施例中的叠放物品的识别方法,通过标记带有叠放物品的图像中的物品,以得到分别对应图像中物品的标记区域;轮换标记区域,以使每个标记区域至少一次被作为参考区域;筛选处理标记区域,以得到对应每个参考区域且检测置信度最高的中间区域;将该参考区域与对应的中间区域所标记的物品识别为叠放物品。相对于现有技术而言,进一步提高了叠放物品的检测成功率。1) The method for recognizing stacked items in an embodiment of the present disclosure is to mark items in an image with stacked items to obtain marked areas corresponding to the items in the image respectively; and rotate the marked areas so that each marked area It is used as a reference area at least once; the marked area is screened and processed to obtain an intermediate area corresponding to each reference area and the detection confidence is the highest; the items marked by the reference area and the corresponding intermediate area are identified as stacked items. Compared with the prior art, the detection success rate of stacked objects is further improved.
2)本公开一实施例中的叠放物品的识别方法,将被识别为叠放物品的两个或两个以上的标记区域合并为一个标记区域,可在计算品牌的牌面占比时,不会因为物品叠放而被重复计算。2) In the method for identifying stacked items in an embodiment of the present disclosure, two or more marked areas identified as stacked items are combined into one marked area. Will not be double-counted for stacking items.
3)本公开一实施例中的叠放物品的识别方法,为了方便轮换标记区域,对标记区域进行了排序,以便按序轮换标记区域,避免重复或遗漏,从而提高物品叠放的识别率。3) In the method for recognizing stacked objects in an embodiment of the present disclosure, in order to facilitate the rotation of the marked regions, the marked regions are sorted, so as to rotate the marked regions in sequence to avoid repetition or omission, thereby improving the recognition rate of object stacking.
附图说明Description of drawings
图1为本公开一实施例中的叠放物品的识别方法流程图;FIG. 1 is a flowchart of a method for identifying stacked objects in an embodiment of the disclosure;
图2为本公开一实施例中的叠放物品的识别系统框图;FIG. 2 is a block diagram of an identification system for stacked objects in an embodiment of the disclosure;
图3为本公开一实施例中的叠放物品的识别设备示意图;FIG. 3 is a schematic diagram of a device for identifying stacked objects in an embodiment of the disclosure;
图4为本公开一实施例中的计算机可读存储介质的示意图。FIG. 4 is a schematic diagram of a computer-readable storage medium in an embodiment of the disclosure.
具体实施方式Detailed ways
以下结合附图和具体实施例对本公开提出的叠放物品的识别方法、系统、设备及存储介质作进一步详细说明。根据下面说明和权利要求书,本公开的优点和特征将更清楚。The method, system, device and storage medium for recognizing stacked articles proposed in the present disclosure will be further described in detail below with reference to the accompanying drawings and specific embodiments. The advantages and features of the present disclosure will become apparent from the following description and claims.
实施例一Example 1
目前的叠放识别是基于行检测的,其中“行”在现有技术中是指货架上的每一层区域。当多个产品处于同一行中且在货架层的垂直方向有一定程度的重叠时,即可视为叠放。Current stack recognition is based on row detection, where "row" in the prior art refers to each level area on the shelf. Stacking occurs when multiple products are in the same row with a certain degree of overlap in the vertical direction of the shelf level.
在具体实施过程中,现有的检测方案存在行检测失败的情况,一般为了拍摄清晰仓储式超市中的大型包装产品或者需要对卷纸等行间距过大的物体时,相机画幅可能会丢失货架的一些特征,例如层板、立柱等,致使现有检测方案中的检测模型无法有效检测到行的特征,最终导致行检测频繁失效。In the specific implementation process, the existing detection scheme has failures in line detection. Generally, in order to clearly photograph large-scale packaged products in warehouse supermarkets or objects with excessive line spacing such as roll paper, the camera frame may be lost on the shelf. Some features, such as laminates, columns, etc., make the detection model in the existing detection scheme unable to effectively detect the features of the row, and eventually lead to frequent failure of row detection.
针对上述问题,本实施例提供了一种叠放物品的识别方法,请参看图1,该叠放物品的识别方法包括以下步骤:In view of the above problems, the present embodiment provides a method for identifying stacked items, please refer to FIG. 1 , and the method for identifying stacked items includes the following steps:
步骤S100:获取带有叠放物品的图像;Step S100: acquiring an image with stacked items;
步骤S200:标记图像中的物品,以得到分别对应图像中物品的标记区域;Step S200: marking the items in the image to obtain marked areas corresponding to the items in the image respectively;
步骤S300:轮换标记区域,以使每个标记区域至少一次被作为参考区域;Step S300: Rotate the marked areas so that each marked area is used as a reference area at least once;
步骤S400:筛选处理标记区域,以得到对应每个参考区域且检测置信度最高的中间区域;Step S400: Screening and processing the marked area to obtain an intermediate area corresponding to each reference area and having the highest detection confidence;
步骤S500:将参考区域与对应的中间区域所标记的物品识别为叠放物品。Step S500: Identify the items marked in the reference area and the corresponding intermediate area as stacked items.
具体的,在步骤S100中,可通过获取实时抓拍的物品叠放场景的现场图像,该现场图像为货架的物品存放图像,包括货架及货架上的物品。Specifically, in step S100, a live image of a scene of stacking items captured in real time can be obtained, where the on-site image is an image of the storage of items on the shelf, including the shelf and the items on the shelf.
在步骤S200中,标记图像中的物品,以得到分别对应图像中物品的标记区域。实际应用中,可将在现场获取的带有叠放物品的图像输入预置的物品识别模型进行物品识别,生成包围单个物品的边界框(即标记区域)。In step S200, the items in the image are marked to obtain marked regions corresponding to the items in the image respectively. In practical applications, images with stacked items obtained on site can be input into a preset item recognition model for item recognition, and a bounding box (ie, a marked area) surrounding a single item can be generated.
其中,物品识别模型可以是MASK R-CNN模型,该MASK R-CNN模型包括ResNet-101网络、RPN网络、ROI Align层和分类网络。该MASK R-CNN模型识别原理如下:The item recognition model may be a MASK R-CNN model, and the MASK R-CNN model includes a ResNet-101 network, an RPN network, an ROI Align layer, and a classification network. The recognition principle of the MASK R-CNN model is as follows:
将图像输入预置的MASK R-CNN模型,通过ResNet-101网络提取图像特征,得到第一特征图;将第一特征图输入RPN网络,并通过预置选择性搜索算法对第一特征图进行预测框提取,得到第一特征图对应的预测框;将第一特征图和所述预测框输入ROI Align层进行预测,得到包含预测框的第二特征图;将第二特征图输入所述全连接层进行分类处理,得到预测框包含物品的预测概率;基于预测框包含物品的预测概率,以得到物品被标记区域标记的图像。Input the image into the preset MASK R-CNN model, extract the image features through the ResNet-101 network, and obtain the first feature map; input the first feature map into the RPN network, and perform the first feature map through the preset selective search algorithm. The prediction frame is extracted to obtain the prediction frame corresponding to the first feature map; the first feature map and the prediction frame are input into the ROI Align layer for prediction, and the second feature map including the prediction frame is obtained; The connection layer performs classification processing to obtain the predicted probability that the predicted frame contains the item; based on the predicted probability that the predicted frame contains the item, the image of the item marked by the marked area is obtained.
在实际应用中,可根据应用场景,选择不同的物品识别模型,并不局限本实施例中的MASK R-CNN模型,也可以是基于YOLO系列的物品识别模型。In practical applications, different item recognition models can be selected according to application scenarios, which is not limited to the MASK R-CNN model in this embodiment, and can also be an item recognition model based on the YOLO series.
此外,还可以采用静态图像检测器对图片中的物品进行标记,生成每个物品的边界框。该静态图像检测器为DeepID-Net或CRAFT静态图像检测器。In addition, a static image detector can be used to label the items in the image and generate bounding boxes for each item. The still image detector is DeepID-Net or CRAFT still image detector.
在步骤S300中,轮换物品的标记区域,以使每个标记区域至少一次被作为参考区域。这里的轮换是指将每个标记区域轮流作为参考区域去进行筛选处理。In step S300, the marked areas of the article are rotated so that each marked area is used as a reference area at least once. The rotation here refers to taking each marked area as a reference area in turn for screening.
为了方便轮换标记区域,可对标记区域进行排序,以便按序轮换标记区域,避免重复或遗漏,从而提高后续物品叠放的识别率。In order to facilitate the rotation of the marked areas, the marked areas can be sorted, so as to rotate the marked areas in sequence, to avoid repetition or omission, thereby improving the recognition rate of subsequent article stacking.
在步骤S400中,筛选处理物品的标记区域,以得到对应每个参考区域且检测置信度最高的中间区域。In step S400, the marked area of the processed item is screened to obtain an intermediate area corresponding to each reference area and having the highest detection confidence.
为了能够更清楚地描述本公开提供的方案内容,本公开在各实施例中统一给出定义,将货架上物品叠放的延伸方向定义为第一方向,而第二方向定义为与第一方向相反的方向。In order to more clearly describe the content of the solution provided by the present disclosure, the present disclosure provides a unified definition in each embodiment, and defines the extending direction of the stacking of items on the shelf as the first direction, and the second direction is defined as the same as the first direction Opposite Direction.
对于本公开各实施例中所涉及的“方向”概念,在实际应用中通常会使用更加具体的方位名词。例如在本公开中,所指出的货架通常是放置在超市、便利店等地面上的常规货架。由于货架的每一层互相平行,并且为了使摆放的物品更加稳定,货架的每一层还会与地面平行。容易理解,超市、便利店等投入商业使用的地面往往可认为是水平状态的,所以在本公开中本领域技术人员可以将货架的每一层也视为水平状态。而相对于货架所在的地面而言,竖直向上的方向,即竖直正方向与物品叠放的延伸方向一致,因此在本公开的各实施例中,上述定义的第一方向可以具体表述为竖直正方向,而第二方向可以具体表述为竖直负方向。For the concept of "direction" involved in various embodiments of the present disclosure, more specific orientation nouns are usually used in practical applications. For example, in the present disclosure, the indicated shelves are usually conventional shelves placed on the ground of supermarkets, convenience stores, and the like. Since each layer of the shelf is parallel to each other, and in order to make the items placed more stable, each layer of the shelf is also parallel to the ground. It is easy to understand that the floors of supermarkets, convenience stores, etc. that are put into commercial use can often be considered to be in a horizontal state, so those skilled in the art can regard each layer of a shelf as a horizontal state in the present disclosure. Relative to the ground where the shelves are located, the vertical upward direction, that is, the vertical positive direction, is consistent with the extending direction of the stacking of items. Therefore, in each embodiment of the present disclosure, the first direction defined above can be specifically expressed as The vertical positive direction, while the second direction can be specifically expressed as the vertical negative direction.
本公开中的筛选处理是指选取与参考区域在第一方向上间距不大于第一阈值的其他标记区域,作为筛选候选区域;继续选取与参考区域垂直于第一方向的平面上的投影重叠度不小于第二阈值,且形状相似的筛选候选区域,作为筛选待定区域;在筛选待定区域中选出检测置信度最高的标记区域,以得到中间区域。形状相似是指标记区域之间的长宽比差距、区域面积大小差距分别不大于第三阈值和第四阈值的区域。The screening process in the present disclosure refers to selecting other marked regions whose distance from the reference region in the first direction is not greater than the first threshold value, as screening candidate regions; and continuing to select the projection overlap degree on the plane perpendicular to the first direction with the reference region. The screening candidate regions that are not less than the second threshold and have similar shapes are used as the screening pending regions; the marked regions with the highest detection confidence are selected from the screening pending regions to obtain the intermediate regions. Similar shape refers to an area in which the difference in aspect ratio and the difference in area size between the marked areas are not greater than the third threshold and the fourth threshold, respectively.
上述第一阈值、第二阈值、第三阈值及第四阈值,可根据实际需要进行设置。结合实际使用效果,在面对常规的超市货架时,本公开中第二阈值的取值范围设定为0.6~1时,既能够带来较精准的检测结果,适用于针对市面多数超市货架所拍摄得到的图像,而且还能降低计算机的处理强度,提高识别效率。第三阈值和/或第四阈值的值设定的越大时,可以认为选择出的标记区域之间的形状越趋于一致。The above-mentioned first threshold, second threshold, third threshold and fourth threshold may be set according to actual needs. Combined with the actual use effect, when faced with conventional supermarket shelves, when the value range of the second threshold in the present disclosure is set to 0.6 to 1, it can bring more accurate detection results, and is suitable for most supermarket shelves in the market. The image obtained by shooting can also reduce the processing intensity of the computer and improve the recognition efficiency. When the value of the third threshold and/or the fourth threshold is set larger, it can be considered that the shapes of the selected marked regions tend to be more consistent.
基于对“方向”的定义,本实施例中,可先将包含物品的标记框按竖直方向进行排序,以构建标记区域的连通图;在连通图中轮换标记区域,以使至少一个标记区域被作为参考区域。在连通图中轮换标记区域时,也可以使每个标记区域至少一次被作为参考区域。Based on the definition of "direction", in this embodiment, the marked boxes containing items may be sorted in the vertical direction to construct a connected graph of marked areas; the marked areas are rotated in the connected graph so that at least one marked area is used as a reference area. When the labeled regions are rotated in the connectivity graph, each labeled region can also be used as a reference region at least once.
下面以物品边界框a为例说明步骤S400:Step S400 is described below by taking the item bounding box a as an example:
对于每一个框a(参考区域),沿竖直正方向,寻找距离框a小于第一阈值的物品边界框,作为筛选候选区域。当边界框为矩形时,该距离为下方矩形框的上顶边与上方矩形框的下底边之间的间距。而第一阈值可根据不同物品高度自行调整,如根据当前边界框的大小(只用框的高度来表征框的大小)乘以一个根据实际数据而合理设定的系数进行调整。例如,摆放牙膏的行距会比摆放卷纸的行距小。For each box a (reference area), along the vertical positive direction, find the bounding box of the item whose distance from the box a is less than the first threshold, as a screening candidate area. When the bounding box is a rectangle, the distance is the distance between the upper top edge of the lower rectangle and the lower bottom edge of the upper rectangle. The first threshold can be adjusted according to the height of different items, for example, according to the size of the current bounding box (only the height of the box is used to represent the size of the box) multiplied by a coefficient reasonably set according to the actual data. For example, the line spacing of toothpaste will be smaller than the line spacing of roll paper.
从筛选候选区域中,筛选与标记物品在水平方向上的重叠度大于第二阈值,且形状相似的标记物品,作为筛选待定区域。在筛选待定区域中选出检测置信度最高的标记区域,以得到中间区域。From the screening candidate area, the marked items whose overlapping degree in the horizontal direction and the marked item is greater than the second threshold and are similar in shape are selected as the screening pending area. In the screening undetermined area, the marked area with the highest detection confidence is selected to obtain the intermediate area.
即,从上述筛选候选区域中,挑选出与框a在水平方向重叠度大于第二阈值,且形状相似的物品边界框,作为筛选待定区域。在筛选待定区域中选出检测置信度最高的标记区域,以得到中间区域a_i。That is, from the above-mentioned screening candidate regions, an item bounding box with a horizontal overlap with the frame a greater than the second threshold and a similar shape is selected as the screening pending region. Select the marked area with the highest detection confidence in the undetermined area to be screened to obtain the intermediate area a_i.
需要指出的是,这里的水平方向的重叠可以理解为在水平面上投影的重叠。所有物品都向同一个水平面上投影(即沿竖直方向投影)的话,一般情况下只有与目标物品也就是a处于上下罗列的状态时才会出现较大的水平方向重叠。因此设置第二阈值,可以筛选出重叠度较高的物品。经过分析,在本公开的实施例中将第二阈值设置为0.7能够获得较理想的筛选结果。It should be pointed out that the overlap in the horizontal direction here can be understood as the overlap of projections on the horizontal plane. If all items are projected on the same horizontal plane (that is, projected in the vertical direction), in general, there will be a large horizontal overlap only when the target item, that is, a, is in a state of being listed up and down. Therefore, by setting a second threshold, items with a high degree of overlap can be filtered out. After analysis, in the embodiment of the present disclosure, setting the second threshold to 0.7 can obtain an ideal screening result.
本公开中的重叠度用来衡量两个或多个物体之间的重叠程度。一般可以认为重叠度的值处于0~1之间,0说明完全不重叠,1表示完全重叠。The degree of overlap in this disclosure is used to measure the degree of overlap between two or more objects. Generally, it can be considered that the value of the overlap degree is between 0 and 1, where 0 means no overlap at all, and 1 means complete overlap.
在步骤S500中,将参考区域与对应的中间区域所标记的物品识别为叠放物品。In step S500, the items marked by the reference area and the corresponding intermediate area are identified as stacked items.
具体的,复筛处理标记区域,以得到对应每个中间区域且检测置信度最高的目标区域;比较检测置信度高低,以得到中间区域分别对应的参考区域和目标区域的检测置信度高低的比较结果;识别该比较结果,以将参考区域与对应的目标区域所标记的物品识别为叠放物品;其中,该参考区域的检测置信度不低于目标区域的检测置信度。Specifically, the marked area is re-screened to obtain the target area corresponding to each intermediate area and the detection confidence is the highest; the detection confidence is compared to obtain the comparison of the detection confidence of the reference area and the target area corresponding to the intermediate area respectively. Result: Identify the comparison result to identify the items marked by the reference area and the corresponding target area as stacked items; wherein, the detection confidence of the reference area is not lower than the detection confidence of the target area.
这里的复筛处理是指选取与中间区域在第二方向上间距不大于第一阈值的其他标记区域,作为复筛候选区域;继续选取与中间区域垂直于第二方向的平面上的投影重叠度不小于第二阈值,且形状相似的复筛候选区域,以作为复筛待定区域;在复筛待定区域中选出检测置信度最高的标记区域,以得到目标区域。The re-screening process here refers to selecting other marked areas whose distance from the middle area in the second direction is not greater than the first threshold value, as the re-screening candidate area; continue to select the projection overlap degree on the plane perpendicular to the second direction with the middle area The re-screening candidate regions that are not less than the second threshold and have similar shapes are used as the re-screening pending regions; the marked regions with the highest detection confidence are selected from the re-screening pending regions to obtain the target regions.
其中,形状相似是与上述步骤S400中筛选处理中的形状相似要求一致。The shape similarity is consistent with the shape similarity requirement in the screening process in the above step S400.
在实际应用中,从中间区域a_i出发,沿竖直负方向,选取与其间距不大于第一阈值的其他标记区域,作为复筛候选区域;继续选取与中间区域竖直负方向的平面上的投影重叠度不小于第二阈值,且形状相似的复筛候选区域,作为复筛待定区域;在复筛待定区域中选出检测置信度最高的标记区域,得到目标区域a_j。In practical applications, starting from the middle area a_i, along the vertical negative direction, select other marked areas with a distance not greater than the first threshold as the candidate area for re-screening; continue to select the projection on the plane in the vertical negative direction with the middle area The re-screening candidate regions with a degree of overlap not less than the second threshold and similar shapes are used as re-screening pending regions; the marked regions with the highest detection confidence are selected from the re-screening pending regions to obtain target regions a_j.
本公开中的竖直负方向与竖直正方向是一个相对的概念,即步骤S400中是竖直正方向,那么此步骤中必然与步骤S400中的方向相反,即竖直负方向。The vertical negative direction and the vertical positive direction in the present disclosure are relative concepts, that is, the vertical positive direction is in step S400, then this step must be opposite to the vertical direction in step S400, that is, the vertical negative direction.
经过上述多个步骤的筛选和排除,再继续验证检测置信度,此时若检测置信度不低于a_j,则可以认为a~a_i之间属于同一组叠放物品。After the above-mentioned multiple steps of screening and exclusion, the detection confidence is continued to be verified. At this time, if the detection confidence is not lower than a_j, it can be considered that a to a_i belong to the same group of stacked items.
识别出叠放物品之后,将被识别为叠放物品的两个或两个以上的标记区域合并为一个标记区域,可在计算品牌的牌面占比时,不会因为物品叠放而被重复计算,可以自动忽略掉被叠放的物品,从而在提高计算速度的同时,提高牌面占比准确率。After the stacked items are identified, the two or more marked areas identified as stacked items are combined into one marked area, so that when calculating the proportion of the brand's card surface, it will not be repeated due to the stacking of items Calculation, the stacked items can be automatically ignored, so as to improve the calculation speed and improve the accuracy of the card face ratio.
基于上述说明可知,通过本公开提供的叠放物品的识别方法能够有效减少由于行检测失败而出现的漏检问题。Based on the above description, it can be seen that the method for identifying stacked objects provided by the present disclosure can effectively reduce the problem of missed detection due to failure of line detection.
实施例二Embodiment 2
本实施例提供了一种叠放物品的识别系统,请参看图2,该叠放物品的识别系统包括:This embodiment provides an identification system for stacked items, please refer to FIG. 2 , the identification system for stacked items includes:
物品标记模块110,用于获取带有叠放物品的图像,识别并标记图像中的物品,以得到分别对应图像中物品的标记区域;The
区域轮换模块120,用于轮换标记区域,以使每个标记区域至少一次被作为参考区域;an
筛选处理模块130,用于筛选处理标记区域,以得到对应每个参考区域且检测置信度最高的中间区域;The
叠放识别模块140,用于将参考区域与对应的中间区域所标记的物品识别为叠放物品。The stacking
该叠放物品的识别系统还包括复筛处理单元和比较单元。其中,复筛处理单元用于复筛处理标记区域,以得到对应每个中间区域且检测置信度最高的目标区域;比较检测置信度高低,以得到中间区域分别对应的参考区域和目标区域的检测置信度高低的比较结果;识别该比较结果,以将参考区域与对应的目标区域之间所标记的物品识别为叠放物品;其中,该参考区域的检测置信度不低于目标区域的检测置信度。The identification system for stacked items further includes a re-screening processing unit and a comparing unit. Among them, the re-screening processing unit is used to re-screen and process the marked area to obtain the target area corresponding to each intermediate area and the highest detection confidence; compare the detection confidence level to obtain the detection of the reference area and the target area corresponding to the intermediate area respectively The comparison result of the confidence level; identify the comparison result to identify the item marked between the reference area and the corresponding target area as a stacked item; wherein, the detection confidence level of the reference area is not lower than the detection confidence level of the target area Spend.
比较单元用于比较检测置信度高低,以得到中间区域分别对应的参考区域和目标区域的检测置信度高低的比较结果。The comparison unit is used for comparing the detection confidence level, so as to obtain a comparison result of the detection confidence level of the reference area and the target area corresponding to the intermediate area respectively.
另外,叠放识别模块140还能够用于识别由比较单元所得出的比较结果,以将参考区域与对应的目标区域所标记的物品识别为叠放物品。需要注意的是,叠放识别模块140在比较结果为参考区域的检测置信度不低于目标区域的检测置信度的条件下,识别出本段所指的叠放物品。In addition, the
本实施例所提供的叠放物品的识别系统,相对现有技术而言通过利用对物品间距和检测置信度的筛选,能够更快速地获取叠放物品的有效特征,从而提高了系统在识别叠放物品时的执行效率;另外,还通过对标记区域轮换、排序等步骤,有序进行相应的识别操作,避免了重复和遗漏识别的问题,进而降低了对计算资源的占用比例。Compared with the prior art, the identification system for stacked items provided in this embodiment can obtain the effective features of stacked items more quickly by screening the item spacing and detection confidence, thereby improving the system's ability to identify stacking items. In addition, through the steps of rotating and sorting the marked area, the corresponding identification operations are carried out in an orderly manner, which avoids the problems of duplication and omission of identification, thereby reducing the occupation ratio of computing resources.
实施例三Embodiment 3
本实施例提供了一种叠放物品的识别设备。请参看图3,该叠放物品的识别设备500可因配置或性能不同而产生比较大的差异,可以包括一个或一个以上处理器(centralprocessing units,CPU)510(例如,一个或一个以上处理器)和存储器520,一个或一个以上存储应用程序533或数据532的存储介质530(例如一个或一个以上海量存储设备)。其中,存储器520和存储介质530可以是短暂存储或持久存储。存储在存储介质530的程序可以包括一个或一个以上模块(图示没标出),每个模块可以包括对叠放物品的识别设备500中的一系列指令操作。This embodiment provides an identification device for stacked articles. Referring to FIG. 3 , the
进一步,处理器510可以设置为与存储介质530通信,在叠放物品的识别设备500上执行存储介质530中的一系列指令操作。Further, the
叠放物品的识别设备500还可以包括一个或一个以上电源540,一个或一个以上有线或无线的网络接口550,一个或一个以上输入输出接口560,和/或,一个或一个以上操作系统531,例如Windows Serve、Vista等等。The
本领域技术人员可以理解,图3示出的叠放物品的识别设备结构并不构成对叠放物品的识别设备的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。Those skilled in the art can understand that the structure of the identification device for stacked objects shown in FIG. 3 does not constitute a limitation on the identification device for stacked objects, and may include more or less components than those shown in the figure, or combine some components , or a different component arrangement.
本公开的另一实施例还提供了一种计算机可读存储介质。Another embodiment of the present disclosure also provides a computer-readable storage medium.
图4描述了根据实施例中用于实现前述实施例所公开的识别方法的程序存储产品600。该计算机可读存储介质可以为非易失性计算机可读存储介质,该计算机可读存储介质也可以为易失性计算机可读存储介质。该计算机可读存储介质中存储有指令,当该指令在计算机上运行时,使得计算机执行实施例一中的叠放物品的识别方法的步骤。FIG. 4 depicts a
叠放物品的识别方法如果以程序指令的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本公开的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件的形式体现出来,该计算机软件存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本公开各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(Read-only memory,ROM)、随机存取存储器(Random access memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。The method for identifying the stacked items, if implemented in the form of program instructions and sold or used as a stand-alone product, may be stored in a computer-readable storage medium. Based on this understanding, the technical solutions of the present disclosure can be embodied in the form of software, in essence or in part that contributes to the prior art, or all or part of the technical solutions. The computer software is stored in a storage medium, including Several instructions are used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in various embodiments of the present disclosure. The aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (ROM), random access memory (RAM), magnetic disk or optical disk and other media that can store program codes .
所属领域的技术人员可以清楚地了解到,为描述得方便和简洁,上述描述的系统及设备的具体执行的识别内容,可以参考前述方法实施例中的对应过程。Those skilled in the art can clearly understand that, for the convenience and brevity of the description, for the identification content of the specific execution of the system and device described above, reference may be made to the corresponding process in the foregoing method embodiments.
上面结合附图对本公开的实施方式作了详细说明,但是本公开并不限于上述实施方式。即使对本公开作出各种变化,倘若这些变化属于本公开权利要求及其等同技术的范围之内,则仍落入在本公开的保护范围之中。The embodiments of the present disclosure are described above in detail with reference to the accompanying drawings, but the present disclosure is not limited to the above-mentioned embodiments. Even if various changes are made to the present disclosure, if such changes fall within the scope of the claims of the present disclosure and their equivalents, they still fall within the protection scope of the present disclosure.
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210090912.7A CN114445745B (en) | 2022-01-26 | 2022-01-26 | Method, system, device and storage medium for identifying stacked objects |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210090912.7A CN114445745B (en) | 2022-01-26 | 2022-01-26 | Method, system, device and storage medium for identifying stacked objects |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN114445745A true CN114445745A (en) | 2022-05-06 |
| CN114445745B CN114445745B (en) | 2025-08-05 |
Family
ID=81370288
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202210090912.7A Active CN114445745B (en) | 2022-01-26 | 2022-01-26 | Method, system, device and storage medium for identifying stacked objects |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN114445745B (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN115861777A (en) * | 2022-11-01 | 2023-03-28 | 青岛海尔电冰箱有限公司 | Object identification method, storage medium and refrigeration equipment |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9996890B1 (en) * | 2017-07-14 | 2018-06-12 | Synapse Technology Corporation | Detection of items |
| WO2019000653A1 (en) * | 2017-06-30 | 2019-01-03 | 清华大学深圳研究生院 | Image target identification method and apparatus |
| US20190279017A1 (en) * | 2018-03-09 | 2019-09-12 | Ricoh Co., Ltd. | User Interface for Object Detection and Labeling |
| CN111145430A (en) * | 2019-12-27 | 2020-05-12 | 北京每日优鲜电子商务有限公司 | Method and device for detecting commodity placing state and computer storage medium |
| US20210042592A1 (en) * | 2019-08-08 | 2021-02-11 | Toyota Jidosha Kabushiki Kaisha | Object detection device, object detection method, and computer program for object detection |
| US20210174299A1 (en) * | 2019-12-09 | 2021-06-10 | Shanghai Clobotics Technology Co., Ltd. | Method, system and device for collecting sales information of commodities in cabinet, and storage medium |
| JP2021096635A (en) * | 2019-12-17 | 2021-06-24 | 日本電気株式会社 | Image processing system, image processing method, and program |
-
2022
- 2022-01-26 CN CN202210090912.7A patent/CN114445745B/en active Active
Patent Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2019000653A1 (en) * | 2017-06-30 | 2019-01-03 | 清华大学深圳研究生院 | Image target identification method and apparatus |
| US9996890B1 (en) * | 2017-07-14 | 2018-06-12 | Synapse Technology Corporation | Detection of items |
| US20190279017A1 (en) * | 2018-03-09 | 2019-09-12 | Ricoh Co., Ltd. | User Interface for Object Detection and Labeling |
| US20210042592A1 (en) * | 2019-08-08 | 2021-02-11 | Toyota Jidosha Kabushiki Kaisha | Object detection device, object detection method, and computer program for object detection |
| US20210174299A1 (en) * | 2019-12-09 | 2021-06-10 | Shanghai Clobotics Technology Co., Ltd. | Method, system and device for collecting sales information of commodities in cabinet, and storage medium |
| JP2021096635A (en) * | 2019-12-17 | 2021-06-24 | 日本電気株式会社 | Image processing system, image processing method, and program |
| CN111145430A (en) * | 2019-12-27 | 2020-05-12 | 北京每日优鲜电子商务有限公司 | Method and device for detecting commodity placing state and computer storage medium |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN115861777A (en) * | 2022-11-01 | 2023-03-28 | 青岛海尔电冰箱有限公司 | Object identification method, storage medium and refrigeration equipment |
Also Published As
| Publication number | Publication date |
|---|---|
| CN114445745B (en) | 2025-08-05 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN107833213B (en) | A Weakly Supervised Object Detection Method Based on False-True Value Adaptive Method | |
| CN109726658B (en) | Crowd counting and positioning method and system, electronic terminal and storage medium | |
| Lim et al. | Real-time image-based 6-dof localization in large-scale environments | |
| CN106716450B (en) | Image-Based Feature Detection Using Edge Vectors | |
| US9767604B2 (en) | Image analysis method by analyzing point cloud using hierarchical search tree | |
| CN103207898B (en) | A kind of similar face method for quickly retrieving based on local sensitivity Hash | |
| US8548256B2 (en) | Method for fast scene matching | |
| CN111199562B (en) | System and method for rapid object detection in robotic picking | |
| CN111310662B (en) | A method and system for flame detection and recognition based on integrated deep network | |
| US8200010B1 (en) | Image segmentation by clustering web images | |
| US8649612B1 (en) | Parallelizing cascaded face detection | |
| CN110598715B (en) | Image recognition method, device, computer equipment and readable storage medium | |
| CN111428743B (en) | Commodity identification method, commodity processing device and electronic equipment | |
| CN114821102A (en) | Intensive citrus quantity detection method, equipment, storage medium and device | |
| CN113033706B (en) | Multi-source two-stage dish identification method based on visual detection and re-identification | |
| US20250139576A1 (en) | Determining dimensions of an item using point cloud information | |
| CN107730553B (en) | A Weakly Supervised Object Detection Method Based on Pseudo-truth Search | |
| US12200391B2 (en) | Corner detection method and corner detection device | |
| Sharma | Object detection and recognition using Amazon Rekognition with Boto3 | |
| CN114255377A (en) | Differential commodity detection and classification method for intelligent container | |
| CN110069980A (en) | Product polymerization and device, electronic equipment based on image | |
| CN111932582A (en) | Target tracking method and device in video image | |
| CN116342525A (en) | SOP chip pin defect detection method and system based on Lenet-5 model | |
| CN111373393B (en) | Image retrieval method and device, and image library generation method and device | |
| CN114445745A (en) | Method, system, apparatus and storage medium for identifying stacked articles |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |