[go: up one dir, main page]

CN110827354B - A train positioning method based on the counting of the poles of the wayside grid - Google Patents

A train positioning method based on the counting of the poles of the wayside grid Download PDF

Info

Publication number
CN110827354B
CN110827354B CN201911057378.4A CN201911057378A CN110827354B CN 110827354 B CN110827354 B CN 110827354B CN 201911057378 A CN201911057378 A CN 201911057378A CN 110827354 B CN110827354 B CN 110827354B
Authority
CN
China
Prior art keywords
power grid
poles
train
convolution
pole
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201911057378.4A
Other languages
Chinese (zh)
Other versions
CN110827354A (en
Inventor
戴胜华
郑子缘
李洁
梁瑶
曹景铭
谢旭旭
李正交
周兴
卢建成
习家宁
王宇琦
李紫玉
郑�硕
胡泓景
时晓杰
曹梓恒
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Jiaotong University
Original Assignee
Beijing Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Jiaotong University filed Critical Beijing Jiaotong University
Priority to CN201911057378.4A priority Critical patent/CN110827354B/en
Publication of CN110827354A publication Critical patent/CN110827354A/en
Application granted granted Critical
Publication of CN110827354B publication Critical patent/CN110827354B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • G06T7/73Determining position or orientation of objects or cameras using feature-based methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30236Traffic on road, railway or crossing
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30242Counting objects in image

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Image Analysis (AREA)

Abstract

本发明提供了一种基于轨旁电网线杆计数的列车定位方法,包括:针对轨旁电网线杆,采用基于卷积神经网络的视觉几何组神经网络分类器算法训练出分类器;通过安装在列车车头中部的高速摄像机采集列车的运行视频;采用图像识别算法识别出所述运行视频的轨道线和消失点;根据所述训练好的分类器,识别出所述消失点右侧的电网线杆,并采用电网线杆计数算法记录当前识别到的电网线杆的总数量;根据当前电网线杆的总数量确定列车的当前位置。本方法可以改善现有列车定位方法成本高昂并且需要定期维护的情况,减轻铁路运输行业的经济负担。

Figure 201911057378

The present invention provides a train positioning method based on the counting of wayside power grid poles. The high-speed camera in the middle of the train head collects the running video of the train; image recognition algorithm is used to identify the track line and the vanishing point of the running video; according to the trained classifier, the power grid pole on the right side of the vanishing point is identified , and use the grid pole counting algorithm to record the total number of currently identified grid poles; determine the current position of the train according to the current total number of grid poles. The method can improve the situation that the existing train positioning methods are expensive and require regular maintenance, and reduce the economic burden of the railway transportation industry.

Figure 201911057378

Description

基于轨旁电网线杆计数的列车定位方法A train positioning method based on the counting of the poles of the wayside grid

技术领域technical field

本发明涉及轨道交通控制技术领域,尤其涉及一种基于轨旁电网线杆计数的列车定位方法。The invention relates to the technical field of rail traffic control, in particular to a train positioning method based on trackside grid line pole counting.

背景技术Background technique

随着现代铁路运输行业的发展,铁路运行压力日益增长,高速铁路飞速发展,为保证铁路行车安全,列车控制系统中的列车定位技术是必不可少的。现有列车定位方式主要是以测速定位,应答器定位为主,轨道电路定位为辅助。其中应答器定位方式定位精度最高,可以修正测速定位过程中列车的累计误差,但应答器成本高昂,且需要定期维护,耗费了大量的人力物力,给铁路运输行业带来了严重的经济负担和维护工作量。当前有学者正在研究几类新型的列车定位方式,如卫星定位,感应环线定位,无线信号强度定位,多定位方式融合定位等,但容易受到外界环境干扰,定位精度并不高。在这种情况下,探索新一代低成本,高可靠性的列车定位技术,具有很大的实际意义。With the development of the modern railway transportation industry, the pressure of railway operation is increasing day by day, and the high-speed railway is developing rapidly. In order to ensure the safety of railway traffic, the train positioning technology in the train control system is indispensable. The existing train positioning methods are mainly based on speed measurement positioning, transponder positioning, and track circuit positioning as auxiliary. Among them, the transponder positioning method has the highest positioning accuracy and can correct the cumulative error of the train during the speed measurement and positioning process. However, the transponder is expensive and requires regular maintenance, which consumes a lot of manpower and material resources, and brings a serious economic burden to the railway transportation industry. maintenance workload. At present, some scholars are studying several new types of train positioning methods, such as satellite positioning, induction loop positioning, wireless signal strength positioning, multi-positioning method fusion positioning, etc., but they are easily interfered by the external environment and the positioning accuracy is not high. In this case, it is of great practical significance to explore a new generation of low-cost, high-reliability train positioning technology.

发明内容SUMMARY OF THE INVENTION

本发明提供了一种基于轨旁电网线杆计数的列车定位方法,以改善现有列车定位方法成本高昂并且需要定期维护的情况,减轻铁路运输行业的经济负担。The present invention provides a train positioning method based on the counting of the line poles of the trackside power grid, so as to improve the situation that the existing train positioning method is expensive and requires regular maintenance, and reduces the economic burden of the railway transportation industry.

为了实现上述目的,本发明采取了如下技术方案。In order to achieve the above objects, the present invention adopts the following technical solutions.

本发明提供了一种基于轨旁电网线杆计数的列车定位方法,包括:The invention provides a method for locating a train based on the counting of the poles of a trackside power grid, comprising:

针对轨旁电网线杆,采用基于卷积神经网络的视觉几何组神经网络分类器算法训练出分类器;Aiming at the wayside power grid poles, the classifier is trained by using the visual geometry group neural network classifier algorithm based on convolutional neural network;

通过安装在列车车头的高速摄像机采集列车的运行视频;The running video of the train is collected by the high-speed camera installed on the front of the train;

采用图像识别算法识别出所述运行视频的轨道线和消失点;Using an image recognition algorithm to identify the track line and the vanishing point of the running video;

根据所述训练好的分类器,识别出所述消失点右侧的电网线杆,并采用电网线杆计数算法记录当前识别到的电网线杆的总数量;According to the trained classifier, identify the power grid pole on the right side of the vanishing point, and record the total number of the currently identified power grid poles by using the power grid pole counting algorithm;

根据当前电网线杆的总数量确定列车的当前位置。Determine the current position of the train based on the total number of current grid poles.

优选地,基于卷积神经网络的视觉几何组神经网络分类器由5层卷积层、3层全连接层和归一化指数输出层组成,其中卷积层使用3*3的卷积核,层与层之间使用2*2最大池化层分开,所有隐含层的激活单元都采用线性整流函数。Preferably, the visual geometry group neural network classifier based on convolutional neural network consists of 5 layers of convolution layers, 3 layers of fully connected layers and normalized exponential output layer, wherein the convolution layer uses 3*3 convolution kernels, The layers are separated by a 2*2 max pooling layer, and the activation units of all hidden layers use a linear rectification function.

优选地,针对轨旁电网线杆,采用基于卷积神经网络的视觉几何组神经网络分类器算法训练出分类器,包括:正样本为前景为轨旁电网线杆的图片,负样本指仅包含轨旁电网线杆局部信息或不包含轨旁电网线杆的图片,将正样本图片归入训练集和正例测试集,负样本图片归入反例测试集。Preferably, for the trackside power grid poles, a classifier is trained by using the visual geometry group neural network classifier algorithm based on convolutional neural network, including: the positive sample is a picture of the trackside power grid pole in the foreground, and the negative sample refers to only containing For the local information of the trackside power grid poles or the pictures that do not contain the trackside power grid poles, the positive sample pictures are included in the training set and the positive example test set, and the negative sample pictures are included in the negative example test set.

通过安装在列车车头的高速摄像机采集列车的运行视频,包括:所述的高速摄像机安装于列车车头中部。The running video of the train is collected by a high-speed camera installed at the head of the train, including: the high-speed camera is installed in the middle of the head of the train.

优选地,采用图像识别算法识别出所述运行视频的轨道线和消失点,包括:Preferably, an image recognition algorithm is used to identify the track line and the vanishing point of the running video, including:

将采集到的图像通过高斯滤波器滤除高频成分;The collected image is filtered by Gaussian filter to remove high frequency components;

利用Canny算法进行边缘检测,对检测结果加一个梯形窗,滤出前景;Use Canny algorithm for edge detection, add a trapezoidal window to the detection result, and filter out the foreground;

对前景进行霍夫变换,找到直线,其中越长的直线在计算斜率均值时赋予越高的权重;Perform Hough transform on the foreground to find a straight line, where the longer straight line is given a higher weight when calculating the mean slope;

对找到的直线根据斜率正负区分左右道轨;Distinguish the left and right tracks according to the positive and negative slope of the found straight line;

分别迭代计算各条所述直线的斜率与斜率均值的差,移除差值大于预定值的直线;Iteratively calculate the difference between the slope of each of the straight lines and the mean value of the slope, and remove the straight line whose difference is greater than a predetermined value;

分别对剩余左右轨道线的直线集合做线性回归得到最终左右轨道线,最终左右轨道线的交点即为消失点。Perform linear regression on the straight line sets of the remaining left and right orbital lines to obtain the final left and right orbital lines, and the intersection of the final left and right orbital lines is the vanishing point.

优选地,根据所述训练好的分类器,识别出所述消失点右侧的电网线杆,并记录当前识别到的电网线杆的总数量,包括以下步骤:Preferably, according to the trained classifier, identify the power grid poles on the right side of the vanishing point, and record the total number of the currently identified power grid poles, including the following steps:

对当前采集到的图片从右往左,从上往下,依次用112×112×3的窗取出包含完整线杆的图片,将所述包含完整线杆的图片,经64个3×3的卷积核作两次卷积加线性整流,卷积后的尺寸变为112×112×64;采用尺寸为2×2的池化单元作最大化池化,池化后的尺寸变为64×64×64;经128个3×3的卷积核作两次卷积加线性整流,尺寸变为64×64×128;作2×2的最大池化,尺寸变为32×32×128;经256个3×3的卷积核作三次卷积加线性整流,尺寸变为32×32×256;作2×2的最大池化,尺寸变为16×16×256;经512个3×3的卷积核作三次卷积加线性整流,尺寸变为16×16×512;作2×2的最大池化,尺寸变为8×8×512;经512个3×3的卷积核作三次卷积加线性整流,尺寸变为8×8×512;作2×2的最大池化,尺寸变为4×4×512;与两层1×1×4096,一层1×1×1000进行全连接线性整流;通过归一化指数函数输出1000个预测结果;所述预测结果中第一个是电网线杆且概率大于98%,则认为识别到电网线杆;将该区域在原图用经过长宽调整的矩形框标出,并记录矩形框左上角横纵坐标以及宽高;将窗的位置按顺序移动一定区域。For the currently collected pictures, from right to left, from top to bottom, use the 112×112×3 window to take out the picture containing the complete line rod in turn, and put the picture containing the complete line rod through 64 3×3 windows. The convolution kernel performs two convolutions and linear rectification, and the size after convolution becomes 112×112×64; the pooling unit with size 2×2 is used for maximum pooling, and the size after pooling becomes 64× 64×64; after 128 convolution kernels of 3×3 do two convolutions and linear rectification, the size becomes 64×64×128; for 2×2 max pooling, the size becomes 32×32×128; After 256 3 × 3 convolution kernels are used for cubic convolution and linear rectification, the size becomes 32 × 32 × 256; after 2 × 2 maximum pooling, the size becomes 16 × 16 × 256; after 512 3 × The convolution kernel of 3 is used for cubic convolution and linear rectification, and the size becomes 16×16×512; for 2×2 maximum pooling, the size becomes 8×8×512; after 512 3×3 convolution kernels For cubic convolution and linear rectification, the size becomes 8×8×512; for 2×2 maximum pooling, the size becomes 4×4×512; with two layers of 1×1×4096, one layer of 1×1× 1000 is fully connected linear rectification; 1000 prediction results are output through the normalized exponential function; the first one of the prediction results is a power grid pole and the probability is greater than 98%, it is considered that the power grid pole is identified; this area is in the original image Mark with a rectangular frame adjusted by length and width, and record the horizontal and vertical coordinates and width and height of the upper left corner of the rectangular frame; move the position of the window by a certain area in sequence.

优选地,记录当前识别到的电网线杆的总数量,包括:Preferably, the total number of currently identified grid poles is recorded, including:

对每一帧图像,分类器将以从右至左,从上至下的顺序在原始图像查找电网线杆;For each frame of image, the classifier will look for power grid poles in the original image in order from right to left and top to bottom;

每找到一个电网线杆时,分类器返回当前电网线杆左上角的横纵坐标以及宽高信息;Each time a power grid pole is found, the classifier returns the horizontal and vertical coordinates and width and height information of the upper left corner of the current power grid pole;

当找到的电网线杆个数达到三个或者已经扫描到消失点的横坐标时结束查找;The search ends when the number of grid poles found reaches three or the abscissa of the vanishing point has been scanned;

如果当前帧最右侧的电网线杆在下一帧仍然在图像采集范围内时,扫描到的最大横坐标将比前一帧大;当最右侧电网线杆已经在摄像头视野中消失时,最大的横坐标值将会发生跳变,变为原本第二大的横坐标值,如果连续5帧这个跳变过程都没有发生反复,则认为一个电网线杆已经离开了摄像头视野,此时电网线杆计数值增加1。If the power grid pole on the far right of the current frame is still within the image acquisition range in the next frame, the maximum abscissa scanned will be larger than the previous frame; The abscissa value will jump and become the second largest abscissa value. If the jumping process does not repeat for 5 consecutive frames, it is considered that a power grid pole has left the camera's field of view. At this time, the power grid line The rod count value is incremented by 1.

优选地,根据当前电网线杆的总数量确定列车的当前位置,包括:Preferably, the current position of the train is determined according to the total number of current grid poles, including:

通过数据库存储电网线杆数量-运行位置对应关系,根据数据库存储的电网线杆数量-运行位置对应关系查询当前电网线杆的总数量确定列车的当前位置。Store the corresponding relationship between the number of grid poles and the operating position in the database, and query the total number of current grid poles to determine the current position of the train according to the corresponding relationship between the number of grid poles and the operating position stored in the database.

由上述本发明的基于轨旁电网线杆计数的列车定位方法提供的技术方案可以看出,本发明方法考虑到列车轨旁电网线杆位置固定,只要能够对其进行准确计数,就能够确定列车位置,以此可以改善现有列车定位方法成本高昂并且需要定期维护的情况,减轻铁路运输行业的经济负担。It can be seen from the technical solution provided by the above-mentioned method for locating the train based on the counting of the trackside power grid poles of the present invention, the method of the present invention takes into account the fixed position of the train trackside power grid poles, so long as it can be accurately counted, the train can be determined. location, which can improve existing train positioning methods that are expensive and require regular maintenance, reducing the financial burden on the rail transport industry.

本发明附加的方面和优点将在下面的描述中部分给出,这些将从下面的描述中变得明显,或通过本发明的实践了解到。Additional aspects and advantages of the present invention will be set forth in part in the following description, which will be apparent from the following description, or may be learned by practice of the present invention.

附图说明Description of drawings

为了更清楚地说明本发明实施例的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions of the embodiments of the present invention more clearly, the following briefly introduces the accompanying drawings used in the description of the embodiments. Obviously, the drawings in the following description are only some embodiments of the present invention. For those of ordinary skill in the art, other drawings can also be obtained from these drawings without any creative effort.

图1为本实施例提供了一种基于轨旁电网线杆计数的列车定位方法流程示意图;FIG. 1 provides a schematic flowchart of a method for locating a train based on the counting of poles in a wayside power grid in the present embodiment;

图2为本实施例的图像识别算法的识别过程示意图;2 is a schematic diagram of a recognition process of the image recognition algorithm of the present embodiment;

图3为采用消失点检测算法识别电网线杆的流程示意图;Fig. 3 is the schematic flow chart of adopting the vanishing point detection algorithm to identify the power grid pole;

图4为采用电网线杆计数算法记录当前识别到的电网线杆的总数量过程示意图。FIG. 4 is a schematic diagram of a process of recording the total number of currently identified power grid poles by using a grid pole counting algorithm.

具体实施方式Detailed ways

下面详细描述本发明的实施方式,所述实施方式的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施方式是示例性的,仅用于解释本发明,而不能解释为对本发明的限制。Embodiments of the present invention are described in detail below, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals refer to the same or similar elements or elements having the same or similar functions throughout. The embodiments described below with reference to the accompanying drawings are exemplary and are only used to explain the present invention, but not to be construed as a limitation of the present invention.

本技术领域技术人员可以理解,除非特意声明,这里使用的单数形式“一”、“一个”、“所述”和“该”也可包括复数形式。应该进一步理解的是,本发明的说明书中使用的措辞“包括”是指存在所述特征、整数、步骤、操作、元件和/或组件,但是并不排除存在或添加一个或多个其他特征、整数、步骤、操作、元件、组件和/或它们的组。应该理解,这里使用的措辞“和/或”包括一个或更多个相关联的列出项的任一单元和全部组合。It will be understood by those skilled in the art that the singular forms "a", "an", "the" and "the" as used herein can include the plural forms as well, unless expressly stated otherwise. It should be further understood that the word "comprising" used in the description of the present invention refers to the presence of stated features, integers, steps, operations, elements and/or components, but does not exclude the presence or addition of one or more other features, Integers, steps, operations, elements, components and/or groups thereof. It will be understood that the term "and/or" as used herein includes any and all combinations of one or more of the associated listed items.

本技术领域技术人员可以理解,除非另外定义,这里使用的所有术语(包括技术术语和科学术语)具有与本发明所属领域中的普通技术人员的一般理解相同的意义。还应该理解的是,诸如通用字典中定义的那些术语应该被理解为具有与现有技术的上下文中的意义一致的意义,并且除非像这里一样定义,不会用理想化或过于正式的含义来解释。It will be understood by those skilled in the art that, unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It should also be understood that terms such as those defined in general dictionaries should be understood to have meanings consistent with their meanings in the context of the prior art and, unless defined as herein, are not to be taken in an idealized or overly formal sense. explain.

为便于对本发明实施例的理解,下面将结合附图以具体实施例为例做进一步的解释说明。In order to facilitate the understanding of the embodiments of the present invention, the following will take specific embodiments as examples for further explanation and description in conjunction with the accompanying drawings.

实施例Example

图1为本实施例提供了一种基于轨旁电网线杆计数的列车定位方法流程示意图,包括:FIG. 1 provides a schematic flowchart of a method for locating a train based on the counting of the poles of a trackside power grid for this embodiment, including:

S1针对轨旁电网线杆,采用基于卷积神经网络(Convolutional NeuralNetworks,CNN)的视觉几何组(Visual Geometry Group,VGG16)神经网络分类器算法训练出分类器。S1 uses the Visual Geometry Group (VGG16) neural network classifier algorithm based on Convolutional Neural Networks (CNN) to train the classifier for the trackside power grid poles.

卷积神经网络是一类包含卷积计算和深度结构的前馈神经网络,它在图像识别领域非常高效。VGG是牛津大学计算机视觉组和DeepMind公司共同研发一种深度卷积网络,它在图像分类领域有极高的成就。VGG16是VGG中被广泛应用的一类网络结构。Convolutional Neural Networks are a class of feedforward neural networks that contain convolutional computations and deep structures, which are very efficient in the field of image recognition. VGG is a deep convolutional network jointly developed by Oxford University's Computer Vision Group and DeepMind, which has achieved extremely high achievements in the field of image classification. VGG16 is a widely used network structure in VGG.

基于卷积神经网络的视觉几何组神经网络分类器由5层卷积层、3层全连接层和归一化指数输出层组成,其中卷积层使用3*3的卷积核,层与层之间使用2*2最大池化层分开,所有隐含层的激活单元都采用线性整流函数。The visual geometry group neural network classifier based on convolutional neural network consists of 5 layers of convolution layers, 3 layers of fully connected layers and normalized index output layer, in which the convolution layer uses 3*3 convolution kernels, layers and layers A 2*2 max pooling layer is used to separate them, and the activation units of all hidden layers use a linear rectification function.

训练分类器需要事先准备好样本。本实施例中的正样本为前景为轨旁电网线杆的图片,负样本指仅包含轨旁电网线杆局部信息或不包含轨旁电网线杆的图片,将正样本图片归入训练集和正例测试集,负样本图片归入反例测试集。本实施例具体采集正样本600张,负样本1600张。将上述样本尺寸均调整为112px*112px,不做灰度化处理,在正样本图片中,500张正样本图片归入训练集,100张正样本图片归入正例测试集,负样本归入反例测试集。Training a classifier requires preparing samples in advance. In this embodiment, the positive samples are pictures with the trackside power grid poles in the foreground, and the negative samples refer to pictures that only contain the local information of the trackside power grid poles or do not include the trackside power grid lines and poles. The positive sample pictures are classified into the training set and the positive samples Example test set, negative sample pictures are included in the negative example test set. This embodiment specifically collects 600 positive samples and 1600 negative samples. Adjust the size of the above samples to 112px*112px without grayscale processing. Among the positive sample images, 500 positive sample images are included in the training set, 100 positive sample images are included in the positive sample test set, and negative samples are included in the training set. Counterexample test set.

初始化神经网络以及相关参数后,将步骤准备好的正负样本输入神经网络,将相关参数调试好后,经过15轮训练,即可得到生成好的针对电网线杆的CNN-VGG16分类器。After initializing the neural network and related parameters, input the positive and negative samples prepared in the steps into the neural network, and after debugging the related parameters, after 15 rounds of training, the generated CNN-VGG16 classifier for power grid poles can be obtained.

S2通过安装在列车车头的高速摄像机采集列车的运行视频。S2 collects the running video of the train through a high-speed camera installed at the head of the train.

本实施例中采用的列车运行图像是由安装于运行着的轨道列车头车头前方中部的高速摄像头采集而来,从图像中可以清晰的分辨列车轨道线以及左右两边的电网线杆。The train running image used in this embodiment is collected by a high-speed camera installed in the front and middle of the head of the running rail locomotive. From the image, the train track line and the power grid poles on the left and right sides can be clearly distinguished.

S3采用图像识别算法识别出所述运行视频的轨道线和消失点。S3 uses an image recognition algorithm to identify the track line and vanishing point of the running video.

图2为本实施例的图像识别算法的识别过程示意图,参照图2具体包括:FIG. 2 is a schematic diagram of the recognition process of the image recognition algorithm of the present embodiment, and with reference to FIG. 2, it specifically includes:

S31将采集到的图像通过高斯滤波器滤除高频成分。S31 filters out the high-frequency components of the collected image through a Gaussian filter.

S32利用Canny算法进行边缘检测,对检测结果加一个梯形窗,滤出前景。S32 uses the Canny algorithm for edge detection, and adds a trapezoidal window to the detection result to filter out the foreground.

S33对前景进行霍夫变换,找到直线,其中越长的直线在计算斜率均值时赋予越高的权重。S33 performs Hough transform on the foreground, and finds a straight line, wherein a longer straight line is given a higher weight when calculating the mean value of the slope.

S34对找到的直线根据斜率正负区分左右道轨。划分某条线属于左道轨或右道轨。S34 discriminates the left and right tracks according to the positive and negative slope of the straight line found. Divide a line to belong to the left track or the right track.

S35分别迭代计算各条所述直线的斜率与斜率均值的差,移除差值大于预定值的直线;S35 iteratively calculates the difference between the slope of each of the straight lines and the mean value of the slope, and removes the straight line whose difference is greater than a predetermined value;

S36分别对剩余左右轨道线的直线集合做线性回归得到最终左右轨道线,最终左右轨道线的交点即为消失点。S36 performs linear regression on the set of straight lines of the remaining left and right orbital lines to obtain the final left and right orbital lines, and the intersection of the final left and right orbital lines is the vanishing point.

S4根据所述训练好的分类器,识别出所述消失点右侧的电网线杆,并记录当前识别到的电网线杆的总数量。S4 identifies the power grid poles on the right side of the vanishing point according to the trained classifier, and records the total number of the currently identified power grid lines.

图3为采用消失点检测算法识别电网线杆的流程示意图,参照图3,具体包括:对当前采集到的图片从右往左,从上往下,依次用112x112x3的窗取出包含完整线杆的图片,将该包含完整线杆的图片经64个3x3的卷积核作两次卷积加线性整流,卷积后的尺寸变为112×112×64;采用尺寸为2×2的池化单元作最大化池化,池化后的尺寸变为64×64×64;经128个3×3的卷积核作两次卷积加线性整流,尺寸变为64×64×128;作2×2的最大池化,尺寸变为32×32×128;经256个3×3的卷积核作三次卷积加线性整流,尺寸变为32×32×256;作2×2的最大池化,尺寸变为16×16×256;经512个3×3的卷积核作三次卷积加线性整流,尺寸变为16×16×512;作2×2的最大池化,尺寸变为8×8×512;经512个3×3的卷积核作三次卷积加线性整流,尺寸变为8×8×512;作2×2的最大池化,尺寸变为4×4×512;与两层1×1×4096,一层1×1×1000进行全连接线性整流(共三层);通过归一化指数函数输出1000个预测结果;所述预测结果中第一个是电网线杆且概率大于98%,则认为识别到电网线杆;将该区域在原图用经过长宽调整的矩形框标出,并记录矩形框左上角横纵坐标以及宽高;将窗的位置按顺序移动一定区域,以避免重复采集。Figure 3 is a schematic diagram of the process of identifying power grid poles using the vanishing point detection algorithm. Referring to Figure 3, it specifically includes: from right to left, from top to bottom, sequentially using a 112x112x3 window to extract the complete line pole from the currently collected picture. Picture, the picture containing the complete line is subjected to 64 3x3 convolution kernels for twice convolution and linear rectification, and the size after convolution becomes 112 × 112 × 64; using a pooling unit with a size of 2 × 2 For maximum pooling, the size after pooling becomes 64×64×64; after 128 3×3 convolution kernels do twice convolution and linear rectification, the size becomes 64×64×128; make 2× 2 maximum pooling, the size becomes 32×32×128; after 256 3×3 convolution kernels are used for cubic convolution and linear rectification, the size becomes 32×32×256; 2×2 maximum pooling , the size becomes 16×16×256; after 512 3×3 convolution kernels are used for cubic convolution and linear rectification, the size becomes 16×16×512; for 2×2 maximum pooling, the size becomes 8 ×8×512; 512 3×3 convolution kernels are used for cubic convolution and linear rectification, and the size becomes 8×8×512; after 2×2 maximum pooling, the size becomes 4×4×512; Fully connected linear rectification with two layers of 1×1×4096 and one layer of 1×1×1000 (three layers in total); output 1000 prediction results through the normalized exponential function; the first of the prediction results is the power grid line If the pole and the probability is greater than 98%, it is considered that the power grid pole is recognized; mark the area in the original image with a rectangular frame adjusted by length and width, and record the horizontal and vertical coordinates and width and height of the upper left corner of the rectangular frame; place the positions of the windows in order Move a certain area to avoid duplicate acquisitions.

采用电网线杆计数算法记录当前识别到的电网线杆的总数量,使得每个电网线杆被识别后仅记录一次,记录当前识别到的电网线杆的总数量,图4为采用电网线杆计数算法记录当前识别到的电网线杆的总数量过程示意图,参照图4,具体过程包括:The grid pole counting algorithm is used to record the total number of currently recognized grid poles, so that each grid pole is only recorded once after being identified, and the total number of currently recognized grid poles is recorded. Figure 4 shows the use of grid poles. A schematic diagram of the process of recording the total number of power grid poles currently identified by the counting algorithm. Referring to Figure 4, the specific process includes:

对每一帧图像,分类器将以从右至左,从上至下的顺序在原始图像查找电网线杆;For each frame of image, the classifier will look for power grid poles in the original image in order from right to left and top to bottom;

每找到一个电网线杆时,分类器返回当前电网线杆左上角的横纵坐标以及宽高信息;Each time a power grid pole is found, the classifier returns the horizontal and vertical coordinates and width and height information of the upper left corner of the current power grid pole;

当找到的电网线杆个数达到三个或者已经扫描到消失点的横坐标时结束查找;The search ends when the number of grid poles found reaches three or the abscissa of the vanishing point has been scanned;

如果当前帧最右侧的电网线杆在下一帧仍然在图像采集范围内时,扫描到的最大横坐标将比前一帧大;当最右侧电网线杆已经在摄像头视野中消失时,最大的横坐标值将会发生跳变,变为原本第二大的横坐标值,如果连续5帧这个跳变过程都没有发生反复,则认为一个电网线杆已经离开了摄像头视野,此时电网线杆计数值增加1。If the power grid pole on the far right of the current frame is still within the image acquisition range in the next frame, the maximum abscissa scanned will be larger than the previous frame; The abscissa value will jump and become the second largest abscissa value. If the jumping process does not repeat for 5 consecutive frames, it is considered that a power grid pole has left the camera's field of view. At this time, the power grid line The rod count value is incremented by 1.

S5根据当前电网线杆的总数量确定列车的当前位置。S5 determines the current position of the train according to the total number of current grid poles.

通过数据库存储电网线杆数量-运行位置对应关系,根据数据库存储的电网线杆数量-运行位置对应关系查询当前电网线杆的总数量确定列车的当前位置。Store the corresponding relationship between the number of grid poles and the operating position in the database, and query the total number of current grid poles to determine the current position of the train according to the corresponding relationship between the number of grid poles and the operating position stored in the database.

本实施例以京津城际线路录制的十段包含牵引工况,制动工况,惰行工况,巡航工况,直行轨道,转弯轨道的测试视频,依次对每段视频进行处理,采用本实施例方法进行处理,列车运行图像由安置于在京津城际线路运行的轨道列车头部的高速摄像头采集而来,线路右侧电网线杆堆叠较少,容易分辨,故使用右侧电网线杆进行计数。利用主频2.0GHz的CPU i7-8550U,内存16G,显卡GTX1060的计算机,安装pycharm集成开发环境,搭建python3.7开发环境以及opencv+tensorflow框架。基于训练好的针对电网线杆的CNN-VGG16分类器,添加消失点检测程序以及电网线杆计数程序,对采集到的视频进行处理。处理完成后记录误差值,最终结果显示,其定位误差在9m内,符合实用要求。This embodiment uses ten test videos recorded on the Beijing-Tianjin intercity line, including traction conditions, braking conditions, coasting conditions, cruising conditions, straight tracks, and turning tracks. For processing by the embodiment method, the train running image is collected by a high-speed camera installed at the head of the rail train running on the Beijing-Tianjin intercity line. rod to count. Using a computer with a main frequency of 2.0GHz CPU i7-8550U, 16G memory, and a graphics card GTX1060, install the pycharm integrated development environment, build the python3.7 development environment and the opencv+tensorflow framework. Based on the trained CNN-VGG16 classifier for power grid poles, a vanishing point detection program and a power grid pole counting program are added to process the collected video. After the processing is completed, the error value is recorded, and the final result shows that the positioning error is within 9m, which meets the practical requirements.

本领域技术人员应能理解上述输入框的应用类型仅为举例,其他现有的或今后可能出现的输入框应用类型如可适用于本发明实施例,也应包含在本发明保护范围以内,并在此以引用方式包含于此。Those skilled in the art should understand that the above-mentioned application types of input boxes are only examples, and other existing or possible future application types of input boxes, if applicable to the embodiments of the present invention, should also be included within the protection scope of the present invention, and It is hereby incorporated by reference.

本领域普通技术人员可以理解:附图只是一个实施例的示意图,附图中的模块或流程并不一定是实施本发明所必须的。Those of ordinary skill in the art can understand that the accompanying drawing is only a schematic diagram of an embodiment, and the modules or processes in the accompanying drawing are not necessarily necessary to implement the present invention.

通过以上的实施方式的描述可知,本领域的技术人员可以清楚地了解到本发明可借助软件加必需的通用硬件平台的方式来实现。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品可以存储在存储介质中,如ROM/RAM、磁碟、光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例或者实施例的某些部分所述的方法。From the description of the above embodiments, those skilled in the art can clearly understand that the present invention can be implemented by means of software plus a necessary general hardware platform. Based on this understanding, the technical solutions of the present invention can be embodied in the form of software products in essence or the parts that make contributions to the prior art. The computer software products can be stored in storage media, such as ROM/RAM, magnetic disks, etc. , CD, etc., including several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in various embodiments or some parts of the embodiments of the present invention.

以上所述,仅为本发明较佳的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到的变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应该以权利要求的保护范围为准。The above description is only a preferred embodiment of the present invention, but the protection scope of the present invention is not limited to this. Substitutions should be covered within the protection scope of the present invention. Therefore, the protection scope of the present invention should be subject to the protection scope of the claims.

Claims (7)

1. A train positioning method based on trackside power grid line pole counting is characterized by comprising the following steps:
aiming at a trackside power grid line pole, training a classifier by adopting a visual geometry group neural network classifier algorithm based on a convolutional neural network;
collecting a running video of the train through a high-speed camera arranged at the head of the train;
identifying the track line and the vanishing point of the running video by adopting an image identification algorithm, wherein the method comprises the following steps of:
filtering high-frequency components of the acquired image through a Gaussian filter;
performing edge detection by using a Canny algorithm, adding a trapezoidal window to a detection result, and filtering out a foreground;
carrying out Hough transform on the foreground to find straight lines, wherein longer straight lines are endowed with higher weight when calculating the slope average value;
distinguishing left and right tracks of the found straight line according to the positive and negative slopes;
respectively and iteratively calculating the difference between the slope of each straight line and the mean value of the slopes, and removing the straight lines with the difference value larger than a preset value;
respectively performing linear regression on the straight line set of the left and right track lines to obtain a final left and right track line, wherein the intersection point of the final left and right track lines is a vanishing point;
according to the trained classifier, identifying the power grid poles on the right side of the vanishing point, and recording the total number of the currently identified power grid poles by adopting a power grid pole counting algorithm;
and determining the current position of the train according to the total number of the current power grid poles.
2. The method of claim 1, wherein the visual geometry group neural network classifier based on convolutional neural network is composed of 5 convolutional layers, 3 fully-connected layers and normalized exponential output layers, wherein the convolutional layers use 3x3 convolutional kernels, the layers are separated by 2x 2 maximal pooling layers, and the activation units of all hidden layers use linear rectification function.
3. The method according to claim 1, wherein for the trackside power grid mast, training a classifier by using a visual geometry group neural network classifier algorithm based on a convolutional neural network comprises: the positive sample is a picture with the foreground of the trackside power grid pole, the negative sample is a picture only containing trackside power grid pole local information or not containing the trackside power grid pole, the positive sample picture is classified into a training set and a positive example test set, and the negative sample picture is classified into a negative example test set.
4. The method as claimed in claim 1, wherein the step of acquiring the running video of the train by the high-speed camera installed at the head of the train comprises the following steps: the high-speed camera is arranged in the middle of the train head.
5. The method of claim 1, wherein the step of identifying the grid poles to the right of the vanishing point and recording the total number of currently identified grid poles according to the trained classifier comprises the steps of:
sequentially taking out pictures containing complete lines and poles by using 112 multiplied by 3 windows from right to left and from top to bottom for the currently collected pictures, carrying out convolution twice and linear rectification on the pictures containing the complete lines and poles by 64 3 multiplied by 3 convolution kernels, wherein the size after the convolution is changed into 112 multiplied by 64; the pooling unit with the size of 2 multiplied by 2 is adopted for maximizing pooling, and the size after pooling is changed into 64 multiplied by 64; performing two times of convolution and linear rectification by 128 convolution kernels of 3 × 3, wherein the size of the obtained product is 64 × 64 × 128; making 2X 2 maximal pooling, and changing the size to 32X 128; carrying out three times of convolution and linear rectification by 256 convolution kernels of 3 × 3, wherein the size of the convolution kernels is changed into 32 × 32 × 256; making 2 × 2 maximal pooling, and changing the size to 16 × 16 × 256; carrying out three times of convolution and linear rectification by 512 convolution kernels of 3 × 3, wherein the size of the obtained product is changed into 16 × 16 × 512; making 2 × 2 maximal pooling, and changing the size to 8 × 8 × 512; carrying out three times of convolution and linear rectification by 512 convolution kernels of 3 × 3, wherein the size of the convolution kernels is 8 × 8 × 512; making 2X 2 maximal pooling, and changing the size to 4X 512; carrying out full-connection linear rectification with two layers of 1 multiplied by 4096 and one layer of 1 multiplied by 1000; outputting 1000 prediction results through a normalized exponential function; the first one of the prediction results is a power grid pole, and the probability is greater than 98%, the power grid pole is considered to be identified; marking the identified area of the power grid pole on an original picture by using a rectangular frame with the length and width adjusted, and recording the horizontal and vertical coordinates and the width and height of the upper left corner of the rectangular frame; the position of the window is sequentially moved by a certain area.
6. The method of claim 1, wherein said recording the total number of currently identified grid poles comprises:
for each frame of image, the classifier searches for the power grid poles in the original image from right to left and from top to bottom;
when a power grid pole is found, the classifier returns the horizontal and vertical coordinates and width and height information of the upper left corner of the current power grid pole;
finishing searching when the number of the found power grid poles reaches three or the abscissa of the vanishing point is scanned;
if the power grid pole at the rightmost side of the current frame is still in the image acquisition range of the next frame, the scanned maximum abscissa is larger than that of the previous frame; when the rightmost power grid pole disappears in the camera view, the maximum abscissa value jumps to be the original second largest abscissa value, if the jumping process of 5 continuous frames does not repeat, a power grid pole is considered to leave the camera view, and the power grid pole count value is increased by 1.
7. The method of claim 1, wherein determining the current location of the train based on the total number of current grid poles comprises:
and storing the corresponding relation of the number of the electric network poles and the operation position through a database, and inquiring the total number of the current electric network poles according to the corresponding relation of the number of the electric network poles and the operation position stored in the database to determine the current position of the train.
CN201911057378.4A 2019-11-01 2019-11-01 A train positioning method based on the counting of the poles of the wayside grid Expired - Fee Related CN110827354B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911057378.4A CN110827354B (en) 2019-11-01 2019-11-01 A train positioning method based on the counting of the poles of the wayside grid

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911057378.4A CN110827354B (en) 2019-11-01 2019-11-01 A train positioning method based on the counting of the poles of the wayside grid

Publications (2)

Publication Number Publication Date
CN110827354A CN110827354A (en) 2020-02-21
CN110827354B true CN110827354B (en) 2022-07-22

Family

ID=69551858

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911057378.4A Expired - Fee Related CN110827354B (en) 2019-11-01 2019-11-01 A train positioning method based on the counting of the poles of the wayside grid

Country Status (1)

Country Link
CN (1) CN110827354B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111310700B (en) * 2020-02-27 2024-02-13 电子科技大学 Intermediate frequency sampling sequence processing method for radiation source fingerprint feature recognition
CN116161081B (en) * 2023-04-18 2023-07-07 中铁电气化局集团有限公司 Train positioning system and method based on trackside power grid pole counting

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5803411A (en) * 1996-10-21 1998-09-08 Abb Daimler-Benz Transportation (North America) Inc. Method and apparatus for initializing an automated train control system
US8970701B2 (en) * 2011-10-21 2015-03-03 Mesa Engineering, Inc. System and method for predicting vehicle location
CN103512762B (en) * 2012-06-29 2016-12-21 北京华兴致远科技发展有限公司 Image processing method, device and train failure detection system
CN106515765B (en) * 2016-09-27 2018-11-02 中车青岛四方机车车辆股份有限公司 Track train and its plugging device of vehicle end connection
CN106705907A (en) * 2016-12-12 2017-05-24 交控科技股份有限公司 Rail-side device aligning device and aligning method
CN107878511B (en) * 2016-12-27 2019-01-29 比亚迪股份有限公司 The localization method and system of train and trackside element
CN109664919A (en) * 2017-10-17 2019-04-23 株洲中车时代电气股份有限公司 A kind of train locating method and positioning system
CN109614898B (en) * 2018-11-29 2023-08-08 通号通信信息集团有限公司 Intelligent judging method for train running direction detection
CN109767427A (en) * 2018-12-25 2019-05-17 北京交通大学 Method for detecting defects of train track fasteners

Also Published As

Publication number Publication date
CN110827354A (en) 2020-02-21

Similar Documents

Publication Publication Date Title
Wei et al. Multi-target defect identification for railway track line based on image processing and improved YOLOv3 model
CN111460927B (en) Method for extracting structured information of house property evidence image
CN114612795A (en) Target recognition method in road scene based on lidar point cloud
CN107680095A (en) The electric line foreign matter detection of unmanned plane image based on template matches and optical flow method
CN113569756A (en) Abnormal behavior detection and positioning method, system, terminal equipment and readable storage medium
CN110827354B (en) A train positioning method based on the counting of the poles of the wayside grid
CN103853794B (en) Pedestrian retrieval method based on part association
Lu et al. Robust and online vehicle counting at crowded intersections
CN104978567A (en) Vehicle detection method based on scenario classification
CN111598855B (en) 2C equipment high-speed rail contact net dropper defect detection method based on deep learning and transfer learning
CN109543498B (en) Lane line detection method based on multitask network
CN116030396B (en) An Accurate Segmentation Method for Video Structured Extraction
CN113506269A (en) Turnout and non-turnout rail fastener positioning method based on deep learning
CN116363554A (en) A monitoring video key frame extraction method, system, medium, device and terminal
CN116704476B (en) Traffic sign detection method based on improved Yolov-tini algorithm
CN105184229A (en) Online learning based real-time pedestrian detection method in dynamic scene
CN107808524A (en) A kind of intersection vehicle checking method based on unmanned plane
CN118469986A (en) Transformer substation equipment defect detection method and system based on improvement YOLOv9
CN110991447A (en) Train number accurate positioning and identification method based on deep learning
CN117952946A (en) Rail surface defect detection method and device, electronic equipment and storage medium
CN118691795A (en) Intelligent inspection method of transmission lines based on image recognition and deep learning technology
Zhu et al. Corner guided instance segmentation network for power lines and transmission towers detection
CN111914712A (en) A method and system for target detection in railway ground track scene
CN113033443B (en) A UAV-based automatic inspection method for pedestrian crossing facilities in the whole road network
CN111311744B (en) Candidate frame filtering, fusion and automatic updating method, system and storage medium for identifying pulmonary nodules

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20220722