[go: up one dir, main page]

CN111539390A - Small target image identification method, equipment and system based on Yolov3 - Google Patents

Small target image identification method, equipment and system based on Yolov3 Download PDF

Info

Publication number
CN111539390A
CN111539390A CN202010573564.XA CN202010573564A CN111539390A CN 111539390 A CN111539390 A CN 111539390A CN 202010573564 A CN202010573564 A CN 202010573564A CN 111539390 A CN111539390 A CN 111539390A
Authority
CN
China
Prior art keywords
loss
coefficient
setting
category
small target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010573564.XA
Other languages
Chinese (zh)
Inventor
宋梦
刘奔
汪明润
王华文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Teamway Electric Co ltd
Original Assignee
Shenzhen Teamway Electric Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Teamway Electric Co ltd filed Critical Shenzhen Teamway Electric Co ltd
Priority to CN202010573564.XA priority Critical patent/CN111539390A/en
Publication of CN111539390A publication Critical patent/CN111539390A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a small target image recognition method, equipment and system based on Yolov3, and belongs to the technical field of image recognition. According to the method, the area loss is added, so that the defect of the traditional coordinate independent loss summation is reduced, and the prediction precision of the loss function is improved, so that the small target identification precision is further improved; by weakening the network prediction layer with large characteristic size, enhancing the network prediction layer with small characteristic size, highlighting the identification depth of the small target layer and improving the identification precision of the small target; and after the preset time period, data backflow iteration is carried out on the recognition model, so that the small target recognition precision is further improved.

Description

Small target image identification method, equipment and system based on Yolov3
Technical Field
The invention relates to the technical field of image recognition, in particular to a small target image recognition method, equipment and system based on Yolov 3.
Background
In recent years, with the development of deep learning technology, the image features and classification are gradually extracted and classified by using a novel convolutional neural network instead of the traditional manual method; under the scene that the novel convolutional neural network technology is continuously popularized, in order to meet the monitoring scene and the identification requirement, the accuracy of small target identification in the image is required.
The prior art provides a small target image identification method, which comprises the steps of firstly confirming the type of a target to be detected, obtaining the ratio of the width and height of all target labeled frames to the width and height of an original image, and clustering 9 anchor frames with different sizes by using K-means to realize the identification of the small target image.
However, when the method provided by the prior art is used, the small target layer identification depth is not highlighted, so that the accuracy and precision of small target image identification are low.
Disclosure of Invention
In order to solve the problems in the prior art, the invention provides a method, a device and a system for identifying a small target image based on Yolov3, wherein the method comprises the following steps:
in one aspect, a small target image recognition method based on Yolov3 is provided, and includes:
setting the labeling category of the small target image to be recognized in the recognition model;
setting an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient;
setting a loss function as a weighting function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss coefficient and the category loss coefficient;
training the recognition model according to the labeling type and the loss function until the training loss stops decreasing;
and identifying the small target image to be identified according to the identification model.
Optionally, the training of the recognition model according to the labeling category and the loss function is performed until training loss stops decreasing, and the method further includes:
weakening the network prediction layer with large characteristic size and enhancing the network prediction layer with small characteristic size.
Optionally, after the small target image to be recognized is recognized according to the recognition model, the method further includes:
and after a preset time period, performing data reflow iteration on the recognition model.
Optionally, the weakening of the network prediction layer with a large feature size and the enhancing of the network prediction layer with a small feature size include:
setting a second block x 2 of the network layer prediction layer to block x 6;
setting a third block 8 of the network layer prediction layer as block 10;
setting a fourth block 8 of the network layer prediction layer as block 4;
the fifth block 4 of the network layer prediction layer is set to block 2.
Optionally, the setting of the annotation category of the small target image to be identified in the identification model further includes:
and acquiring the labeling category of the small target image to be identified, which is customized by the user through the picture/video.
In another aspect, a Yolov 3-based small target image recognition device is provided, including:
the setting module is used for setting the labeling category of the small target image to be recognized in the recognition model;
the optimization module is used for setting an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient;
the optimization module is further used for setting a loss function as a weighted function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss coefficient and the category loss coefficient;
the training module is used for training the recognition model according to the labeling type and the loss function until the training loss stops decreasing;
and the identification module is used for identifying the small target image to be identified according to the identification model.
Optionally, the optimization module is further configured to:
weakening the network prediction layer with large characteristic size and enhancing the network prediction layer with small characteristic size.
Optionally, the optimization module is further configured to:
and after a preset time period, performing data reflow iteration on the recognition model.
Optionally, the optimization module is further specifically configured to:
setting a second block x 2 of the network layer prediction layer to block x 6;
setting a third block 8 of the network layer prediction layer as block 10;
setting a fourth block 8 of the network layer prediction layer as block 4;
the fifth block 4 of the network layer prediction layer is set to block 2.
Optionally, the setting module is specifically configured to:
and acquiring the labeling category of the small target image to be identified, which is customized by the user through the picture/video.
In another aspect, a Yolov 3-based small object image recognition device is provided, the device comprising a processor and a memory connected to the processor, the memory storing a set of program codes, the processor executing the program codes stored in the memory to:
setting the labeling category of the small target image to be recognized in the recognition model;
setting an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient;
setting a loss function as a weighting function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss coefficient and the category loss coefficient;
training the recognition model according to the labeling type and the loss function until the training loss stops decreasing;
and identifying the small target image to be identified according to the identification model.
Optionally, the processor executes the program code stored in the memory to further implement the following operations:
weakening the network prediction layer with large characteristic size and enhancing the network prediction layer with small characteristic size.
Optionally, the processor executes the program code stored in the memory to further implement the following operations:
and after a preset time period, performing data reflow iteration on the recognition model.
Optionally, the processor executes the program code stored in the memory to further implement the following operations:
setting a second block x 2 of the network layer prediction layer to block x 6;
setting a third block 8 of the network layer prediction layer as block 10;
setting a fourth block 8 of the network layer prediction layer as block 4;
the fifth block 4 of the network layer prediction layer is set to block 2.
Optionally, the processor executes the program code stored in the memory to further implement the following operations:
and acquiring the labeling category of the small target image to be identified, which is customized by the user through the picture/video.
In another aspect, a Yolov 3-based small target image recognition system is provided, including:
the setting equipment is used for setting the labeling category of the small target image to be recognized in the recognition model;
the optimization equipment is used for setting an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient;
the optimization equipment is further used for setting a loss function as a weighted function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss coefficient and the category loss coefficient;
the training equipment is used for training the recognition model according to the labeling type and the loss function until the training loss stops decreasing;
and the identification equipment is used for identifying the small target image to be identified according to the identification model.
Optionally, the optimization device is further configured to:
weakening the network prediction layer with large characteristic size and enhancing the network prediction layer with small characteristic size.
Optionally, the optimization device is further configured to:
and after a preset time period, performing data reflow iteration on the recognition model.
Optionally, the optimization device is further specifically configured to:
setting a second block x 2 of the network layer prediction layer to block x 6;
setting a third block 8 of the network layer prediction layer as block 10;
setting a fourth block 8 of the network layer prediction layer as block 4;
the fifth block 4 of the network layer prediction layer is set to block 2.
Optionally, the setting device is specifically configured to:
and acquiring the labeling category of the small target image to be identified, which is customized by the user through the picture/video.
The invention provides a method, a device and a system for identifying a small target image based on Yolov3, wherein the method comprises the following steps: setting the labeling category of the small target image to be recognized in the recognition model; setting an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient; setting a loss function as a weighting function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss coefficient and the category loss coefficient; training the recognition model according to the labeling type and the loss function until the training loss stops decreasing; and identifying the small target image to be identified according to the identification model.
The technical scheme provided by the invention has the following beneficial effects:
1. by adding area loss, the defect of the traditional coordinate independent loss summation is reduced, and the prediction precision of the loss function is improved, so that the small target identification precision is further improved;
2. by weakening the network prediction layer with large characteristic size, enhancing the network prediction layer with small characteristic size, highlighting the identification depth of the small target layer and improving the identification precision of the small target;
3. and after the preset time period, data backflow iteration is carried out on the recognition model, so that the small target recognition precision is further improved.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts.
FIG. 1 is a flowchart of a method for identifying a small target image based on Yolov3 according to an embodiment of the present invention;
FIG. 2 is a flowchart of a method for identifying a small target image based on Yolov3 according to an embodiment of the present invention;
FIG. 3 is a schematic structural diagram of a small target image recognition device based on Yolov3 according to an embodiment of the present invention;
FIG. 4 is a schematic structural diagram of a small target image recognition device based on Yolov3 according to an embodiment of the present invention;
fig. 5 is a schematic diagram of a Yolov 3-based small target image recognition system according to an embodiment of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings.
In order to make the objects, technical solutions and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In addition, the term "and/or" herein is only one kind of association relationship describing an associated object, and means that there may be three kinds of relationships, for example, a and/or B, which may mean: a exists alone, A and B exist simultaneously, and B exists alone. In addition, the character "/" herein generally indicates that the former and latter related objects are in an "or" relationship, unless otherwise specified.
The Yolov 3-based small target image identification method provided by the embodiment of the invention is mainly applied to monitoring scenes, social scenes based on videos/pictures, beautification scenes or processing scenes of the videos/pictures, and in addition, the method provided by the embodiment of the invention can also be applied to other scenes, and the embodiment of the invention does not limit specific scenes.
To facilitate the understanding of the method described in the embodiment of the present invention by those skilled in the art, the existing Yolov3 described in the embodiment of the present invention may be:
the loss function includes at least: coordinate loss, width and height loss, confidence loss and category loss;
the network prediction layer is as follows:
network layer predicting layer second block x 2;
the third block of the network layer prediction layer 8;
network layer predicting layer fourth block 8;
the network layer predicts the fifth block 4 of the layer.
Example one
The embodiment of the invention provides a small target image identification method based on Yolov3, and as shown in fig. 1, the method comprises the following steps:
101. and setting the labeling type of the small target image to be recognized in the recognition model.
Specifically, the labeling category of the small target image to be identified, which is customized by the user through the picture/video, is obtained.
102. And setting a loss function in the recognition model according to the area loss.
Specifically, an area loss coefficient, a confidence coefficient and a category loss coefficient are set;
and setting the loss function as a weighted function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss and the category loss coefficient.
103. Weakening the network prediction layer with large characteristic size and enhancing the network prediction layer with small characteristic size.
Specifically, a second block x 2 of the network layer prediction layer is set as block x 6;
setting a third block 8 of the network layer prediction layer as block 10;
setting a fourth block 8 of the network layer prediction layer as block 4;
the fifth block 4 of the network layer prediction layer is set to block 2.
104. And training the recognition model according to the labeling type and the loss function until the training loss stops decreasing.
105. And identifying the small target image to be identified according to the identification model.
106. And after a preset time period, performing data reflow iteration on the recognition model.
Example two
An embodiment of the present invention provides a small target image identification method based on Yolov3, and as shown in fig. 2, the method includes:
201. and setting the labeling type of the small target image to be recognized in the recognition model.
Specifically, selecting the labeling category of the small target image from the small target data set;
and setting the annotation category as the annotation category of the small target image to be identified in the identification model.
In practical applications, in order to improve the user experience, the process described in step 201 may further be:
in a social scene, acquiring a labeling category of a small target image to be identified, which is customized by a user through a picture/video, wherein the process specifically comprises the following steps:
identifying a small target image selected by a user through a gesture in a picture/video;
identifying the annotation category of the small target image, wherein the gesture of the user at least comprises inputting a track at least comprising the small target image on an interface displaying pictures/videos; the identification process can be realized through a pre-configured identification model, the identification model can be pre-trained by the method of the embodiment of the invention, or can be realized through the existing identification algorithm or the identification model;
displaying the small target image and the recognition result to a user;
and after the user inputs the confirmation information, setting the annotation category of the small target image as the annotation category of the small target image to be identified.
Optionally, in a social scene or a beautification scene or a processing scene of a video/picture, the annotation category of the small target image to be identified, which is customized by the user through the picture/video, is obtained, and the process specifically includes:
acquiring a picture/video input by a user, wherein the picture/video at least comprises a small target image required by the user;
and identifying the small target image and the labeling type of the small target image in the same way as the steps, and the detailed description is omitted here.
And displaying the small target image and the recognition result to the user, and setting the annotation category of the small target image as the annotation category of the small target image to be recognized after the user inputs the confirmation information.
In addition, in the monitoring scenario, step 201 may also be actively triggered after the monitored area is changed after the installation of the monitoring device is completed, so that when the target in the monitoring scenario changes, the changed small target identification may be implemented, thereby improving the reliability in the monitoring scenario.
After the installation of the monitoring device is completed, the labeling category of the small target image to be recognized in the recognition model is periodically set, and the process may be:
after a preset time period, all small target images in the monitored area and the labeling types of all the small target images are acquired, and if the labeling types are different from the previous labeling types, step 201 and the subsequent steps are triggered.
202. An area loss coefficient, a confidence loss coefficient, and a category loss coefficient are set.
Specifically, the setting process may be set through an empirical value, or may be set through a training result or a recognition result, and the specific setting manner is not limited in the embodiment of the present invention.
The process of setting the area loss coefficient, the confidence coefficient and the category loss coefficient according to the training result or the recognition result can also be setting the area loss coefficient, the confidence coefficient and the category loss coefficient according to the training result or the recognition result; the process of setting the area loss coefficient, the confidence coefficient and the category loss coefficient according to the training result or the recognition result may be:
respectively setting an area loss coefficient interval, a confidence coefficient interval and a category loss coefficient interval, respectively randomly selecting values in the intervals to form a plurality of area loss coefficient, confidence coefficient and category loss coefficient value groups, and executing the following steps for any group of values:
setting the value group as an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient, and executing the processes from the step 201 to the step 206 to obtain a trained recognition model;
inputting test data into the identification model to obtain a test result;
continuing to execute the steps on the next group of numerical values until all the numerical values in the interval complete the steps to obtain a plurality of test results;
and evaluating the test result, and setting the numerical value group with the test result which is most consistent with the expectation as an area loss coefficient, a confidence coefficient and a category loss coefficient.
203. And setting the loss function as a weighted function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss and the category loss coefficient.
Specifically, the loss function may be specifically:
Figure 445339DEST_PATH_IMAGE001
wherein, therein
Figure 503425DEST_PATH_IMAGE002
In the interest of a loss of area of the frame,
Figure 548742DEST_PATH_IMAGE003
in order to be a loss of confidence,
Figure 209530DEST_PATH_IMAGE004
is a category loss;
Figure 23902DEST_PATH_IMAGE005
the area loss coefficient, the confidence coefficient loss coefficient and the category loss coefficient are respectively;
in addition, in practical application, the area loss in the embodiment of the present invention may be obtained by coordinate loss and width-height loss, and the embodiment of the present invention does not limit a specific obtaining manner;
it should be noted that steps 202 to 203 are processes for implementing the setting of the loss function in the recognition model according to the area loss, and the processes may be implemented in other ways besides the ways described in the above steps, and the embodiment of the present invention is not limited to the specific ways.
The loss function in the existing Yolov3 algorithm is the summation of coordinate loss and width and height loss, and the width and height loss is calculated independently, so that the recognition precision is low, the reliability and the accuracy of small target recognition are affected, and through area loss, the problems are avoided, end-to-end loss evaluation is realized, and the reliability and the accuracy of small target recognition are further improved.
204. And enhancing the network prediction layer with small feature size.
Specifically, a second block x 2 of the network layer prediction layer is set as block x 6;
the third block 8 of the network layer prediction layer is set to block 10.
205. And weakening the network prediction layer with large feature size.
Specifically, a fourth block × 8 of the network layer prediction layer is set as a block × 4;
the fifth block 4 of the network layer prediction layer is set to block 2.
It should be noted that the manner of enhancing the network prediction layer with small feature size and weakening the network prediction layer with large feature size described in step 204 to step 205 is merely exemplary, and in practical applications, the process is implemented by the training result or the recognition result.
By weakening the network prediction layer with large characteristic size, enhancing the network prediction layer with small characteristic size, and utilizing the advantages of the network prediction layer with small characteristic size in small target image recognition, the reliability of small target image recognition is further strengthened.
206. And training the recognition model according to the labeling type and the loss function until the training loss stops decreasing.
Specifically, a training parameter is set, and the recognition model is trained according to the training parameter, the label category and the loss function until the training loss stops decreasing.
207. And identifying the small target image to be identified according to the identification model.
Specifically, a picture containing a small target image to be recognized is input to the recognition model, and a recognition result is output.
208. And after a preset time period, performing data reflow iteration on the recognition model.
Specifically, the data reflow iteration mode includes training the recognition model again.
Besides performing periodic iteration according to a preset time period, the iterative processing method may further include the following steps:
the manner of actively triggering after the monitored area changes and detecting whether the monitored area changes is the same as the manner described in step 201, and is not described herein again.
The method can also be triggered actively by the user, and the active triggering mode of the user comprises clicking a virtual key or periodically carrying out testing through a testing image and then actively triggering.
EXAMPLE III
An embodiment of the present invention provides a Yolov 3-based small target image recognition device 3, and as shown in fig. 3, the device includes:
the setting module 31 is used for setting the labeling category of the small target image to be recognized in the recognition model;
an optimization module 32, configured to set an area loss coefficient, a confidence loss coefficient, and a category loss coefficient;
the optimization module 32 is further configured to set the loss function as a weighted function of the area loss, the confidence coefficient loss, and the category loss according to the area loss coefficient, the confidence coefficient loss, and the category loss coefficient;
the training module 33 is configured to train the recognition model according to the labeled category and the loss function until the training loss stops decreasing;
and the identification module 34 is used for identifying the small target image to be identified according to the identification model.
Optionally, the optimization module 32 is further configured to:
weakening the network prediction layer with large characteristic size and enhancing the network prediction layer with small characteristic size.
Optionally, the optimization module 32 is further configured to:
and after a preset time period, performing data reflow iteration on the recognition model.
Optionally, the optimization module 32 is further specifically configured to:
setting a second block x 2 of the network layer prediction layer to block x 6;
setting a third block 8 of the network layer prediction layer as block 10;
setting a fourth block 8 of the network layer prediction layer as block 4;
the fifth block 4 of the network layer prediction layer is set to block 2.
Optionally, the setting module 31 is specifically configured to:
and acquiring the annotation category of the small target image to be identified, which is customized by the user through the picture/video.
Example four
An embodiment of the present invention provides a Yolov 3-based small target image recognition device 4, which is shown in fig. 4, and includes a processor 41 and a memory 42 connected to the processor, where the memory 42 is used to store a set of program codes, and the processor 41 executes the program codes stored in the memory 42 to implement the following operations:
setting the labeling category of the small target image to be recognized in the recognition model;
setting a loss function in the recognition model according to the area loss;
training the recognition model according to the labeling type and the loss function until the training loss stops decreasing;
and identifying the small target image to be identified according to the identification model.
Optionally, the processor 41 executes the program code stored in the memory 42 for implementing the following operations:
weakening the network prediction layer with large characteristic size and enhancing the network prediction layer with small characteristic size.
Optionally, the processor 41 executes the program code stored in the memory 42 for implementing the following operations:
and after a preset time period, performing data reflow iteration on the recognition model.
Optionally, the processor 41 executes the program code stored in the memory 42 for implementing the following operations:
setting a second block x 2 of the network layer prediction layer to block x 6;
setting a third block 8 of the network layer prediction layer as block 10;
setting a fourth block 8 of the network layer prediction layer as block 4;
the fifth block 4 of the network layer prediction layer is set to block 2.
Optionally, the processor 41 executes the program code stored in the memory 42 for implementing the following operations:
and acquiring the annotation category of the small target image to be identified, which is customized by the user through the picture/video.
EXAMPLE five
An embodiment of the present invention provides a Yolov 3-based small target image recognition system, and as shown in fig. 5, the system includes:
the setting device 51 is used for setting the labeling category of the small target image to be recognized in the recognition model;
an optimization device 52 for setting an area loss coefficient, a confidence loss coefficient, and a category loss coefficient;
the optimization device 52 is further configured to set the loss function as a weighted function of the area loss, the confidence loss, and the category loss according to the area loss coefficient, the confidence loss coefficient, and the category loss coefficient;
the training device 53 is used for training the recognition model according to the labeled category and the loss function until the training loss stops decreasing;
and the recognition device 54 is used for recognizing the small target image to be recognized according to the recognition model.
Optionally, the optimizing device 52 is further configured to:
weakening the network prediction layer with large characteristic size and enhancing the network prediction layer with small characteristic size.
Optionally, the optimizing device 52 is further configured to:
and after a preset time period, performing data reflow iteration on the recognition model.
Optionally, the optimization device 52 is specifically configured to:
setting an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient;
and setting the loss function as a weighted function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss and the category loss coefficient.
Optionally, the optimization device 52 is further specifically configured to:
setting a second block x 2 of the network layer prediction layer to block x 6;
setting a third block 8 of the network layer prediction layer as block 10;
setting a fourth block 8 of the network layer prediction layer as block 4;
the fifth block 4 of the network layer prediction layer is set to block 2.
Optionally, the setting device 51 is specifically configured to:
and acquiring the annotation category of the small target image to be identified, which is customized by the user through the picture/video.
The embodiment of the invention provides a small target image identification method, equipment and system based on Yolov3, which reduce the defect of the traditional independent loss summation of coordinates by adding area loss, improve the prediction precision of a loss function and further improve the identification precision of a small target; by weakening the network prediction layer with large characteristic size, enhancing the network prediction layer with small characteristic size, highlighting the identification depth of the small target layer and improving the identification precision of the small target; and after the preset time period, data backflow iteration is carried out on the recognition model, so that the small target recognition precision is further improved.
All the above-mentioned optional technical solutions can be combined arbitrarily to form the optional embodiments of the present invention, and are not described herein again.
It should be noted that: when the small target image recognition device and the small target image recognition system provided by the above embodiments execute the small target image recognition method, only the division of the above functional modules is taken as an example, and in practical applications, the above function distribution may be completed by different functional modules according to needs, that is, the internal structure of the device is divided into different functional modules, so as to complete all or part of the above described functions. In addition, the small target image recognition method, device and system embodiments provided by the above embodiments belong to the same concept, and specific implementation processes thereof are detailed in the method embodiments and are not described herein again.
It will be understood by those skilled in the art that all or part of the steps for implementing the above embodiments may be implemented by hardware, or may be implemented by a program instructing relevant hardware, where the program may be stored in a computer-readable storage medium, and the above-mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk, etc.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims (10)

1. A small target image identification method based on Yolov3 is characterized by comprising the following steps:
setting the labeling category of the small target image to be recognized in the recognition model;
setting an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient;
setting a loss function as a weighting function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss coefficient and the category loss coefficient;
training the recognition model according to the labeling type and the loss function until the training loss stops decreasing;
and identifying the small target image to be identified according to the identification model.
2. The method of claim 1, wherein the training of the recognition model according to the label class and the loss function is performed until training loss stops decreasing, the method further comprising:
weakening the network prediction layer with large characteristic size and enhancing the network prediction layer with small characteristic size.
3. The method according to claim 1 or 2, wherein after the small target image to be recognized is recognized according to the recognition model, the method further comprises:
and after a preset time period, performing data reflow iteration on the recognition model.
4. The method of claim 3, wherein weakening the network prediction layer with large feature size and enhancing the network prediction layer with small feature size comprises:
setting a second block x 2 of the network layer prediction layer to block x 6;
setting a third block 8 of the network layer prediction layer as block 10;
setting a fourth block 8 of the network layer prediction layer as block 4;
the fifth block 4 of the network layer prediction layer is set to block 2.
5. The method according to claim 4, wherein the setting of the label category of the small target image to be identified in the identification model further comprises:
and acquiring the labeling category of the small target image to be identified, which is customized by the user through the picture/video.
6. A Yolov 3-based small object image recognition device, the device comprising:
the setting module is used for setting the labeling category of the small target image to be recognized in the recognition model;
the optimization module is used for setting an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient;
the optimization module is further used for setting a loss function as a weighted function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss coefficient and the category loss coefficient;
the training module is used for training the recognition model according to the labeling type and the loss function until the training loss stops decreasing;
and the identification module is used for identifying the small target image to be identified according to the identification model.
7. The apparatus of claim 6, wherein the optimization module is further configured to:
weakening the network prediction layer with large characteristic size and enhancing the network prediction layer with small characteristic size.
8. The apparatus of claim 6 or 7, wherein the optimization module is further configured to:
and after a preset time period, performing data reflow iteration on the recognition model.
9. The device of claim 8, wherein the setup module is specifically configured to:
and acquiring the labeling category of the small target image to be identified, which is customized by the user through the picture/video.
10. A Yolov 3-based small object image recognition system, the system comprising:
the setting equipment is used for setting the labeling category of the small target image to be recognized in the recognition model;
the optimization equipment is used for setting an area loss coefficient, a confidence coefficient loss coefficient and a category loss coefficient;
the optimization equipment is further used for setting a loss function as a weighted function of the area loss, the confidence coefficient loss and the category loss according to the area loss coefficient, the confidence coefficient loss coefficient and the category loss coefficient;
the training equipment is used for training the recognition model according to the labeling type and the loss function until the training loss stops decreasing;
and the identification equipment is used for identifying the small target image to be identified according to the identification model.
CN202010573564.XA 2020-06-22 2020-06-22 Small target image identification method, equipment and system based on Yolov3 Pending CN111539390A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010573564.XA CN111539390A (en) 2020-06-22 2020-06-22 Small target image identification method, equipment and system based on Yolov3

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010573564.XA CN111539390A (en) 2020-06-22 2020-06-22 Small target image identification method, equipment and system based on Yolov3

Publications (1)

Publication Number Publication Date
CN111539390A true CN111539390A (en) 2020-08-14

Family

ID=71980840

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010573564.XA Pending CN111539390A (en) 2020-06-22 2020-06-22 Small target image identification method, equipment and system based on Yolov3

Country Status (1)

Country Link
CN (1) CN111539390A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112733747A (en) * 2021-01-14 2021-04-30 哈尔滨市科佳通用机电股份有限公司 Identification method, system and device for relieving falling fault of valve pull rod
CN113591662A (en) * 2021-07-24 2021-11-02 深圳市铁越电气有限公司 Method, system and storage medium for recognizing smoking calling behavior
CN114092829A (en) * 2021-11-22 2022-02-25 南京华苏科技有限公司 Small target detection method based on improved YOLOV4 in high-altitude operation scene

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109711381A (en) * 2019-01-04 2019-05-03 北京环境特性研究所 Target identification method, device and the computer equipment of remote sensing images
CN109711437A (en) * 2018-12-06 2019-05-03 武汉三江中电科技有限责任公司 A kind of transformer part recognition methods based on YOLO network model
CN110766098A (en) * 2019-11-07 2020-02-07 中国石油大学(华东) Traffic scene small target detection method based on improved YOLOv3

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109711437A (en) * 2018-12-06 2019-05-03 武汉三江中电科技有限责任公司 A kind of transformer part recognition methods based on YOLO network model
CN109711381A (en) * 2019-01-04 2019-05-03 北京环境特性研究所 Target identification method, device and the computer equipment of remote sensing images
CN110766098A (en) * 2019-11-07 2020-02-07 中国石油大学(华东) Traffic scene small target detection method based on improved YOLOv3

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
徐义鎏 等: "改进损失函数的 Yolov3车型检测算法", 《信息通信》 *
袁铭阳 等: "一种基于改进 YOLOv3 的遥感影像飞机目标检测算法", 《测绘科学技术学报》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112733747A (en) * 2021-01-14 2021-04-30 哈尔滨市科佳通用机电股份有限公司 Identification method, system and device for relieving falling fault of valve pull rod
CN113591662A (en) * 2021-07-24 2021-11-02 深圳市铁越电气有限公司 Method, system and storage medium for recognizing smoking calling behavior
CN113591662B (en) * 2021-07-24 2024-07-09 深圳市铁越电气有限公司 Recognition method, system and storage medium for smoking and calling behaviors
CN114092829A (en) * 2021-11-22 2022-02-25 南京华苏科技有限公司 Small target detection method based on improved YOLOV4 in high-altitude operation scene

Similar Documents

Publication Publication Date Title
KR102635987B1 (en) Method, apparatus, device and storage medium for training an image semantic segmentation network
CN112712051B (en) Object tracking method, device, computer equipment and storage medium
CN112381104B (en) Image recognition method, device, computer equipment and storage medium
CN110781960B (en) Training method, classification method, device and equipment of video classification model
CN111985385A (en) Behavior detection method, device and equipment
CN111061898A (en) Image processing method, device, computer equipment and storage medium
CN115713715B (en) Human behavior recognition method and recognition system based on deep learning
CN113065379B (en) Image detection method and device integrating image quality and electronic equipment
CN111652181B (en) Target tracking method and device and electronic equipment
CN111539390A (en) Small target image identification method, equipment and system based on Yolov3
CN113762382B (en) Model training and scene recognition method, device, equipment and medium
CN111708913A (en) Label generation method and device and computer readable storage medium
CN115100739B (en) Man-machine behavior detection method, system, terminal device and storage medium
CN113705293A (en) Image scene recognition method, device, equipment and readable storage medium
CN110427998A (en) Model training, object detection method and device, electronic equipment, storage medium
CN111432206A (en) Artificial intelligence-based video resolution processing method, device and electronic device
CN111950507B (en) Data processing and model training method, device, equipment and medium
CN115830697A (en) Student classroom behavior identification method, device, equipment and storage medium
CN112784691A (en) Target detection model training method, target detection method and device
CN117173624A (en) Object recognition model learning method in computing device
CN109740527B (en) A kind of image processing method in video frame
CN115049675A (en) Generation area determination and light spot generation method, apparatus, medium, and program product
CN112733883B (en) Point supervision target detection method
CN112241736A (en) Text detection method and device
CN113192108A (en) Human-in-loop training method for visual tracking model and related device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200814