CN111539390A

CN111539390A - Small target image identification method, equipment and system based on Yolov3

Info

Publication number: CN111539390A
Application number: CN202010573564.XA
Authority: CN
Inventors: 宋梦; 刘奔; 汪明润; 王华文
Original assignee: Shenzhen Teamway Electric Co ltd
Current assignee: Shenzhen Teamway Electric Co ltd
Priority date: 2020-06-22
Filing date: 2020-06-22
Publication date: 2020-08-14

Abstract

The invention discloses a small target image recognition method, equipment and system based on Yolov3, and belongs to the technical field of image recognition. According to the method, the area loss is added, so that the defect of the traditional coordinate independent loss summation is reduced, and the prediction precision of the loss function is improved, so that the small target identification precision is further improved; by weakening the network prediction layer with large characteristic size, enhancing the network prediction layer with small characteristic size, highlighting the identification depth of the small target layer and improving the identification precision of the small target; and after the preset time period, data backflow iteration is carried out on the recognition model, so that the small target recognition precision is further improved.

Description

Small target image identification method, equipment and system based on Yolov3

Technical Field

The invention relates to the technical field of image recognition, in particular to a small target image recognition method, equipment and system based on Yolov 3.

Background

In recent years, with the development of deep learning technology, the image features and classification are gradually extracted and classified by using a novel convolutional neural network instead of the traditional manual method; under the scene that the novel convolutional neural network technology is continuously popularized, in order to meet the monitoring scene and the identification requirement, the accuracy of small target identification in the image is required.

The prior art provides a small target image identification method, which comprises the steps of firstly confirming the type of a target to be detected, obtaining the ratio of the width and height of all target labeled frames to the width and height of an original image, and clustering 9 anchor frames with different sizes by using K-means to realize the identification of the small target image.

However, when the method provided by the prior art is used, the small target layer identification depth is not highlighted, so that the accuracy and precision of small target image identification are low.

Disclosure of Invention

In order to solve the problems in the prior art, the invention provides a method, a device and a system for identifying a small target image based on Yolov3, wherein the method comprises the following steps:

in one aspect, a small target image recognition method based on Yolov3 is provided, and includes:

setting the labeling category of the small target image to be recognized in the recognition model;

setting an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient;

setting a loss function as a weighting function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss coefficient and the category loss coefficient;

training the recognition model according to the labeling type and the loss function until the training loss stops decreasing;

and identifying the small target image to be identified according to the identification model.

Optionally, the training of the recognition model according to the labeling category and the loss function is performed until training loss stops decreasing, and the method further includes:

weakening the network prediction layer with large characteristic size and enhancing the network prediction layer with small characteristic size.

Optionally, after the small target image to be recognized is recognized according to the recognition model, the method further includes:

and after a preset time period, performing data reflow iteration on the recognition model.

Optionally, the weakening of the network prediction layer with a large feature size and the enhancing of the network prediction layer with a small feature size include:

setting a second block x 2 of the network layer prediction layer to block x 6;

setting a third block 8 of the network layer prediction layer as block 10;

setting a fourth block 8 of the network layer prediction layer as block 4;

the fifth block 4 of the network layer prediction layer is set to block 2.

Optionally, the setting of the annotation category of the small target image to be identified in the identification model further includes:

and acquiring the labeling category of the small target image to be identified, which is customized by the user through the picture/video.

In another aspect, a Yolov 3-based small target image recognition device is provided, including:

the setting module is used for setting the labeling category of the small target image to be recognized in the recognition model;

the optimization module is used for setting an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient;

the optimization module is further used for setting a loss function as a weighted function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss coefficient and the category loss coefficient;

the training module is used for training the recognition model according to the labeling type and the loss function until the training loss stops decreasing;

and the identification module is used for identifying the small target image to be identified according to the identification model.

Optionally, the optimization module is further configured to:

Optionally, the optimization module is further specifically configured to:

setting a second block x 2 of the network layer prediction layer to block x 6;

setting a third block 8 of the network layer prediction layer as block 10;

setting a fourth block 8 of the network layer prediction layer as block 4;

the fifth block 4 of the network layer prediction layer is set to block 2.

Optionally, the setting module is specifically configured to:

In another aspect, a Yolov 3-based small object image recognition device is provided, the device comprising a processor and a memory connected to the processor, the memory storing a set of program codes, the processor executing the program codes stored in the memory to:

Optionally, the processor executes the program code stored in the memory to further implement the following operations:

setting a second block x 2 of the network layer prediction layer to block x 6;

setting a third block 8 of the network layer prediction layer as block 10;

setting a fourth block 8 of the network layer prediction layer as block 4;

the fifth block 4 of the network layer prediction layer is set to block 2.

In another aspect, a Yolov 3-based small target image recognition system is provided, including:

the setting equipment is used for setting the labeling category of the small target image to be recognized in the recognition model;

the optimization equipment is used for setting an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient;

the optimization equipment is further used for setting a loss function as a weighted function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss coefficient and the category loss coefficient;

the training equipment is used for training the recognition model according to the labeling type and the loss function until the training loss stops decreasing;

and the identification equipment is used for identifying the small target image to be identified according to the identification model.

Optionally, the optimization device is further configured to:

Optionally, the optimization device is further specifically configured to:

setting a second block x 2 of the network layer prediction layer to block x 6;

setting a third block 8 of the network layer prediction layer as block 10;

setting a fourth block 8 of the network layer prediction layer as block 4;

the fifth block 4 of the network layer prediction layer is set to block 2.

Optionally, the setting device is specifically configured to:

The invention provides a method, a device and a system for identifying a small target image based on Yolov3, wherein the method comprises the following steps: setting the labeling category of the small target image to be recognized in the recognition model; setting an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient; setting a loss function as a weighting function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss coefficient and the category loss coefficient; training the recognition model according to the labeling type and the loss function until the training loss stops decreasing; and identifying the small target image to be identified according to the identification model.

The technical scheme provided by the invention has the following beneficial effects:

1. by adding area loss, the defect of the traditional coordinate independent loss summation is reduced, and the prediction precision of the loss function is improved, so that the small target identification precision is further improved;

2. by weakening the network prediction layer with large characteristic size, enhancing the network prediction layer with small characteristic size, highlighting the identification depth of the small target layer and improving the identification precision of the small target;

3. and after the preset time period, data backflow iteration is carried out on the recognition model, so that the small target recognition precision is further improved.

Drawings

In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts.

FIG. 1 is a flowchart of a method for identifying a small target image based on Yolov3 according to an embodiment of the present invention;

FIG. 2 is a flowchart of a method for identifying a small target image based on Yolov3 according to an embodiment of the present invention;

FIG. 3 is a schematic structural diagram of a small target image recognition device based on Yolov3 according to an embodiment of the present invention;

FIG. 4 is a schematic structural diagram of a small target image recognition device based on Yolov3 according to an embodiment of the present invention;

fig. 5 is a schematic diagram of a Yolov 3-based small target image recognition system according to an embodiment of the present invention.

Detailed Description

The present invention will be described in further detail with reference to the accompanying drawings.

In order to make the objects, technical solutions and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

In addition, the term "and/or" herein is only one kind of association relationship describing an associated object, and means that there may be three kinds of relationships, for example, a and/or B, which may mean: a exists alone, A and B exist simultaneously, and B exists alone. In addition, the character "/" herein generally indicates that the former and latter related objects are in an "or" relationship, unless otherwise specified.

The Yolov 3-based small target image identification method provided by the embodiment of the invention is mainly applied to monitoring scenes, social scenes based on videos/pictures, beautification scenes or processing scenes of the videos/pictures, and in addition, the method provided by the embodiment of the invention can also be applied to other scenes, and the embodiment of the invention does not limit specific scenes.

To facilitate the understanding of the method described in the embodiment of the present invention by those skilled in the art, the existing Yolov3 described in the embodiment of the present invention may be:

the loss function includes at least: coordinate loss, width and height loss, confidence loss and category loss;

the network prediction layer is as follows:

network layer predicting layer second block x 2;

the third block of the network layer prediction layer 8;

network layer predicting layer fourth block 8;

the network layer predicts the fifth block 4 of the layer.

Example one

The embodiment of the invention provides a small target image identification method based on Yolov3, and as shown in fig. 1, the method comprises the following steps:

101. and setting the labeling type of the small target image to be recognized in the recognition model.

Specifically, the labeling category of the small target image to be identified, which is customized by the user through the picture/video, is obtained.

102. And setting a loss function in the recognition model according to the area loss.

Specifically, an area loss coefficient, a confidence coefficient and a category loss coefficient are set;

and setting the loss function as a weighted function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss and the category loss coefficient.

103. Weakening the network prediction layer with large characteristic size and enhancing the network prediction layer with small characteristic size.

Specifically, a second block x 2 of the network layer prediction layer is set as block x 6;

setting a third block 8 of the network layer prediction layer as block 10;

setting a fourth block 8 of the network layer prediction layer as block 4;

the fifth block 4 of the network layer prediction layer is set to block 2.

104. And training the recognition model according to the labeling type and the loss function until the training loss stops decreasing.

105. And identifying the small target image to be identified according to the identification model.

106. And after a preset time period, performing data reflow iteration on the recognition model.

Example two

An embodiment of the present invention provides a small target image identification method based on Yolov3, and as shown in fig. 2, the method includes:

201. and setting the labeling type of the small target image to be recognized in the recognition model.

Specifically, selecting the labeling category of the small target image from the small target data set;

and setting the annotation category as the annotation category of the small target image to be identified in the identification model.

In practical applications, in order to improve the user experience, the process described in step 201 may further be:

in a social scene, acquiring a labeling category of a small target image to be identified, which is customized by a user through a picture/video, wherein the process specifically comprises the following steps:

identifying a small target image selected by a user through a gesture in a picture/video;

identifying the annotation category of the small target image, wherein the gesture of the user at least comprises inputting a track at least comprising the small target image on an interface displaying pictures/videos; the identification process can be realized through a pre-configured identification model, the identification model can be pre-trained by the method of the embodiment of the invention, or can be realized through the existing identification algorithm or the identification model;

displaying the small target image and the recognition result to a user;

and after the user inputs the confirmation information, setting the annotation category of the small target image as the annotation category of the small target image to be identified.

Optionally, in a social scene or a beautification scene or a processing scene of a video/picture, the annotation category of the small target image to be identified, which is customized by the user through the picture/video, is obtained, and the process specifically includes:

acquiring a picture/video input by a user, wherein the picture/video at least comprises a small target image required by the user;

and identifying the small target image and the labeling type of the small target image in the same way as the steps, and the detailed description is omitted here.

And displaying the small target image and the recognition result to the user, and setting the annotation category of the small target image as the annotation category of the small target image to be recognized after the user inputs the confirmation information.

In addition, in the monitoring scenario, step 201 may also be actively triggered after the monitored area is changed after the installation of the monitoring device is completed, so that when the target in the monitoring scenario changes, the changed small target identification may be implemented, thereby improving the reliability in the monitoring scenario.

After the installation of the monitoring device is completed, the labeling category of the small target image to be recognized in the recognition model is periodically set, and the process may be:

after a preset time period, all small target images in the monitored area and the labeling types of all the small target images are acquired, and if the labeling types are different from the previous labeling types, step 201 and the subsequent steps are triggered.

202. An area loss coefficient, a confidence loss coefficient, and a category loss coefficient are set.

Specifically, the setting process may be set through an empirical value, or may be set through a training result or a recognition result, and the specific setting manner is not limited in the embodiment of the present invention.

The process of setting the area loss coefficient, the confidence coefficient and the category loss coefficient according to the training result or the recognition result can also be setting the area loss coefficient, the confidence coefficient and the category loss coefficient according to the training result or the recognition result; the process of setting the area loss coefficient, the confidence coefficient and the category loss coefficient according to the training result or the recognition result may be:

respectively setting an area loss coefficient interval, a confidence coefficient interval and a category loss coefficient interval, respectively randomly selecting values in the intervals to form a plurality of area loss coefficient, confidence coefficient and category loss coefficient value groups, and executing the following steps for any group of values:

setting the value group as an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient, and executing the processes from the step 201 to the step 206 to obtain a trained recognition model;

inputting test data into the identification model to obtain a test result;

continuing to execute the steps on the next group of numerical values until all the numerical values in the interval complete the steps to obtain a plurality of test results;

and evaluating the test result, and setting the numerical value group with the test result which is most consistent with the expectation as an area loss coefficient, a confidence coefficient and a category loss coefficient.

203. And setting the loss function as a weighted function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss and the category loss coefficient.

Specifically, the loss function may be specifically:

wherein, therein

In the interest of a loss of area of the frame,

in order to be a loss of confidence,

is a category loss;

the area loss coefficient, the confidence coefficient loss coefficient and the category loss coefficient are respectively;

in addition, in practical application, the area loss in the embodiment of the present invention may be obtained by coordinate loss and width-height loss, and the embodiment of the present invention does not limit a specific obtaining manner;

it should be noted that steps 202 to 203 are processes for implementing the setting of the loss function in the recognition model according to the area loss, and the processes may be implemented in other ways besides the ways described in the above steps, and the embodiment of the present invention is not limited to the specific ways.

The loss function in the existing Yolov3 algorithm is the summation of coordinate loss and width and height loss, and the width and height loss is calculated independently, so that the recognition precision is low, the reliability and the accuracy of small target recognition are affected, and through area loss, the problems are avoided, end-to-end loss evaluation is realized, and the reliability and the accuracy of small target recognition are further improved.

204. And enhancing the network prediction layer with small feature size.

the third block 8 of the network layer prediction layer is set to block 10.

205. And weakening the network prediction layer with large feature size.

Specifically, a fourth block × 8 of the network layer prediction layer is set as a block × 4;

the fifth block 4 of the network layer prediction layer is set to block 2.

It should be noted that the manner of enhancing the network prediction layer with small feature size and weakening the network prediction layer with large feature size described in step 204 to step 205 is merely exemplary, and in practical applications, the process is implemented by the training result or the recognition result.

By weakening the network prediction layer with large characteristic size, enhancing the network prediction layer with small characteristic size, and utilizing the advantages of the network prediction layer with small characteristic size in small target image recognition, the reliability of small target image recognition is further strengthened.

206. And training the recognition model according to the labeling type and the loss function until the training loss stops decreasing.

Specifically, a training parameter is set, and the recognition model is trained according to the training parameter, the label category and the loss function until the training loss stops decreasing.

207. And identifying the small target image to be identified according to the identification model.

Specifically, a picture containing a small target image to be recognized is input to the recognition model, and a recognition result is output.

208. And after a preset time period, performing data reflow iteration on the recognition model.

Specifically, the data reflow iteration mode includes training the recognition model again.

Besides performing periodic iteration according to a preset time period, the iterative processing method may further include the following steps:

the manner of actively triggering after the monitored area changes and detecting whether the monitored area changes is the same as the manner described in step 201, and is not described herein again.

The method can also be triggered actively by the user, and the active triggering mode of the user comprises clicking a virtual key or periodically carrying out testing through a testing image and then actively triggering.

EXAMPLE III

An embodiment of the present invention provides a Yolov 3-based small target image recognition device 3, and as shown in fig. 3, the device includes:

the setting module 31 is used for setting the labeling category of the small target image to be recognized in the recognition model;

an optimization module 32, configured to set an area loss coefficient, a confidence loss coefficient, and a category loss coefficient;

the optimization module 32 is further configured to set the loss function as a weighted function of the area loss, the confidence coefficient loss, and the category loss according to the area loss coefficient, the confidence coefficient loss, and the category loss coefficient;

the training module 33 is configured to train the recognition model according to the labeled category and the loss function until the training loss stops decreasing;

and the identification module 34 is used for identifying the small target image to be identified according to the identification model.

Optionally, the optimization module 32 is further configured to:

Optionally, the optimization module 32 is further specifically configured to:

setting a second block x 2 of the network layer prediction layer to block x 6;

setting a third block 8 of the network layer prediction layer as block 10;

setting a fourth block 8 of the network layer prediction layer as block 4;

the fifth block 4 of the network layer prediction layer is set to block 2.

Optionally, the setting module 31 is specifically configured to:

and acquiring the annotation category of the small target image to be identified, which is customized by the user through the picture/video.

Example four

An embodiment of the present invention provides a Yolov 3-based small target image recognition device 4, which is shown in fig. 4, and includes a processor 41 and a memory 42 connected to the processor, where the memory 42 is used to store a set of program codes, and the processor 41 executes the program codes stored in the memory 42 to implement the following operations:

setting a loss function in the recognition model according to the area loss;

Optionally, the processor 41 executes the program code stored in the memory 42 for implementing the following operations:

setting a second block x 2 of the network layer prediction layer to block x 6;

setting a third block 8 of the network layer prediction layer as block 10;

setting a fourth block 8 of the network layer prediction layer as block 4;

the fifth block 4 of the network layer prediction layer is set to block 2.

EXAMPLE five

An embodiment of the present invention provides a Yolov 3-based small target image recognition system, and as shown in fig. 5, the system includes:

the setting device 51 is used for setting the labeling category of the small target image to be recognized in the recognition model;

an optimization device 52 for setting an area loss coefficient, a confidence loss coefficient, and a category loss coefficient;

the optimization device 52 is further configured to set the loss function as a weighted function of the area loss, the confidence loss, and the category loss according to the area loss coefficient, the confidence loss coefficient, and the category loss coefficient;

the training device 53 is used for training the recognition model according to the labeled category and the loss function until the training loss stops decreasing;

and the recognition device 54 is used for recognizing the small target image to be recognized according to the recognition model.

Optionally, the optimizing device 52 is further configured to:

Optionally, the optimization device 52 is specifically configured to:

Optionally, the optimization device 52 is further specifically configured to:

setting a second block x 2 of the network layer prediction layer to block x 6;

setting a third block 8 of the network layer prediction layer as block 10;

setting a fourth block 8 of the network layer prediction layer as block 4;

the fifth block 4 of the network layer prediction layer is set to block 2.

Optionally, the setting device 51 is specifically configured to:

The embodiment of the invention provides a small target image identification method, equipment and system based on Yolov3, which reduce the defect of the traditional independent loss summation of coordinates by adding area loss, improve the prediction precision of a loss function and further improve the identification precision of a small target; by weakening the network prediction layer with large characteristic size, enhancing the network prediction layer with small characteristic size, highlighting the identification depth of the small target layer and improving the identification precision of the small target; and after the preset time period, data backflow iteration is carried out on the recognition model, so that the small target recognition precision is further improved.

All the above-mentioned optional technical solutions can be combined arbitrarily to form the optional embodiments of the present invention, and are not described herein again.

It should be noted that: when the small target image recognition device and the small target image recognition system provided by the above embodiments execute the small target image recognition method, only the division of the above functional modules is taken as an example, and in practical applications, the above function distribution may be completed by different functional modules according to needs, that is, the internal structure of the device is divided into different functional modules, so as to complete all or part of the above described functions. In addition, the small target image recognition method, device and system embodiments provided by the above embodiments belong to the same concept, and specific implementation processes thereof are detailed in the method embodiments and are not described herein again.

It will be understood by those skilled in the art that all or part of the steps for implementing the above embodiments may be implemented by hardware, or may be implemented by a program instructing relevant hardware, where the program may be stored in a computer-readable storage medium, and the above-mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk, etc.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims

1. A small target image identification method based on Yolov3 is characterized by comprising the following steps:

2. The method of claim 1, wherein the training of the recognition model according to the label class and the loss function is performed until training loss stops decreasing, the method further comprising:

3. The method according to claim 1 or 2, wherein after the small target image to be recognized is recognized according to the recognition model, the method further comprises:

4. The method of claim 3, wherein weakening the network prediction layer with large feature size and enhancing the network prediction layer with small feature size comprises:

setting a second block x 2 of the network layer prediction layer to block x 6;

setting a third block 8 of the network layer prediction layer as block 10;

setting a fourth block 8 of the network layer prediction layer as block 4;

the fifth block 4 of the network layer prediction layer is set to block 2.

5. The method according to claim 4, wherein the setting of the label category of the small target image to be identified in the identification model further comprises:

6. A Yolov 3-based small object image recognition device, the device comprising:

7. The apparatus of claim 6, wherein the optimization module is further configured to:

8. The apparatus of claim 6 or 7, wherein the optimization module is further configured to:

9. The device of claim 8, wherein the setup module is specifically configured to:

10. A Yolov 3-based small object image recognition system, the system comprising: