[go: up one dir, main page]

CN116540921A - Network data storage method, related system, and storage medium - Google Patents

Network data storage method, related system, and storage medium Download PDF

Info

Publication number
CN116540921A
CN116540921A CN202210093050.3A CN202210093050A CN116540921A CN 116540921 A CN116540921 A CN 116540921A CN 202210093050 A CN202210093050 A CN 202210093050A CN 116540921 A CN116540921 A CN 116540921A
Authority
CN
China
Prior art keywords
grids
grid
subspace
data
subspaces
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210093050.3A
Other languages
Chinese (zh)
Inventor
原朝
王昊
周银生
黄骞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN202210093050.3A priority Critical patent/CN116540921A/en
Priority to PCT/CN2022/114359 priority patent/WO2023142466A1/en
Publication of CN116540921A publication Critical patent/CN116540921A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/0608Saving storage space on storage systems
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0638Organizing or formatting or addressing of data
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/067Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W24/00Supervisory, monitoring or testing arrangements
    • H04W24/02Arrangements for optimising operational condition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W24/00Supervisory, monitoring or testing arrangements
    • H04W24/10Scheduling measurement reports ; Arrangements for measurement reports
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W8/00Network data management
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Image Analysis (AREA)

Abstract

本申请实施例提供一种网络数据存储方法及相关系统、存储介质,该方法包括:获取多个栅格对应的全量MR数据,并根据所述全量MR数据将所述多个栅格划分为P个子空间,P为不小于1的整数;根据所述P个子空间的全量MR数据,确定所述P个子空间的价值级别;根据所述P个子空间的价值级别对所述P个子空间中的栅格进行融合处理,以得到所述每个子空间的融合后的栅格;根据所述融合后的栅格存储所述全量MR数据。采用该手段,本发明采用融合后的栅格存储数据,在不丢失数据本身信息、存储全量数据的前提下,栅格总数有所减少,节省了存储空间。

An embodiment of the present application provides a network data storage method, a related system, and a storage medium. The method includes: acquiring full MR data corresponding to multiple grids, and dividing the multiple grids into P subspaces, P is an integer not less than 1; according to the full amount of MR data of the P subspaces, determine the value level of the P subspaces; The grid is fused to obtain the fused grid of each subspace; the full amount of MR data is stored according to the fused grid. By adopting this method, the present invention uses the fused grid to store data. On the premise of not losing the information of the data itself and storing the full amount of data, the total number of grids is reduced and the storage space is saved.

Description

网络数据存储方法及相关系统、存储介质Network data storage method, related system, and storage medium

技术领域technical field

本申请涉及数据处理技术领域,尤其涉及一种网络数据存储方法及相关系统、存储介质。The present application relates to the technical field of data processing, and in particular to a network data storage method, a related system, and a storage medium.

背景技术Background technique

在无线通信网络规划优化领域,用户终端设备以固定周期向基站上报网络测量数据报告。为提取、存储这些报告中的数据并表征网络在地表空间上连续变化的信息,常将地表空间切分为无缝无重叠的等大栅格,并以栅格为基本单位来存储对应空间范围的网络测量数据,进而通过数据聚合支撑面向运营商客户所需的网络规划优化分析。依据分析的结果,维优工程师对网络进行优化来提升网络用户的使用体验。In the field of wireless communication network planning and optimization, user terminal equipment reports network measurement data reports to the base station at a fixed period. In order to extract and store the data in these reports and represent the information of continuous changes in the surface space of the network, the surface space is often divided into seamless and non-overlapping equal-sized grids, and the grid is used as the basic unit to store the corresponding spatial range Network measurement data, and then through data aggregation to support network planning optimization analysis required by operator customers. According to the analysis results, Weiyou engineers optimize the network to improve the experience of network users.

随着5G通信技术及对应智能终端的普及,无线网络测量数据量正以指数级飞速增加,比如,5G通信使得单位面积的物联网设备数量可以达到4G通信的100倍以上,海量物联网的传感器将产生海量的数据。With the popularity of 5G communication technology and corresponding smart terminals, the amount of wireless network measurement data is increasing exponentially. For example, 5G communication enables the number of IoT devices per unit area to reach more than 100 times that of 4G communication. Massive IoT sensors Massive amounts of data will be generated.

针对无线通信网络规划优化领域,数据量的激增带来了一系列矛盾:海量数据为高层次的数据分析提供丰富的数据源,使分析结果更加精确,能良好的面向客户解决网络问题。另一方面也面临着数据治理、数据存储等方向的高开销。鉴于海量数据面向高质量网络的重要地位,如何降低栅格存储海量数据开销,并在其中取其精华,去其糟粕已成为新时代网络规划优化领域的重要课题,只有应对好数据治理、存储方向的挑战,才能充分利用海量数据提供的物质基础。For the field of wireless communication network planning and optimization, the surge in data volume has brought about a series of contradictions: massive data provides rich data sources for high-level data analysis, making the analysis results more accurate, and can solve network problems well for customers. On the other hand, it also faces high overhead in data governance and data storage. In view of the important position of massive data in high-quality networks, how to reduce the cost of raster storage of massive data and extract the essence from it has become an important topic in the field of network planning and optimization in the new era. In order to make full use of the material basis provided by massive data.

现有技术在存储网络数据时,使用等大栅格存储,在数据存储模块中为一个栅格对应一条数据记录,考虑到无线网络交付场景包括密集城区、一般城区、郊区、城镇乡村等,在城镇乡村或郊区等无线网络数据较为稀疏的区域,存在单个50m尺寸栅格内部并无或很少无线网络数据的情况,但该删格的经纬度仍然作为一条记录存储,这样造成存储资源的极大浪费。随着数据量飞速增加,现有技术采用数据采样的解决方式来缩减数据量,通过从全量数据中提取出少部分数据代替全量数据进行后续网络指标聚合处理,以此应对数据量飞速增加的问题,进而减少硬件开销及提升查询性能,但是采用数据采样的方式会导致网络规划优化精度下降、网络规划优化效率下降。When storing network data in the prior art, equal-sized grids are used for storage. In the data storage module, one grid corresponds to one data record. Considering that wireless network delivery scenarios include dense urban areas, general urban areas, suburban areas, urban and rural areas, etc., in the In areas where wireless network data is relatively sparse, such as towns, villages or suburbs, there may be no or very little wireless network data inside a single 50m grid, but the latitude and longitude of the deleted grid is still stored as a record, which causes a huge storage resource waste. With the rapid increase in the amount of data, the existing technology adopts the solution of data sampling to reduce the amount of data, by extracting a small part of data from the full amount of data instead of the full amount of data for subsequent network index aggregation processing, so as to deal with the problem of rapid increase in the amount of data , thereby reducing hardware overhead and improving query performance, but the use of data sampling will lead to a decrease in the accuracy of network planning and optimization, and a decrease in the efficiency of network planning and optimization.

发明内容Contents of the invention

本申请公开了一种网络数据存储方法及相关系统、存储介质,可以在存储全量数据的情况小同时减少数据存储占用空间。The present application discloses a network data storage method, a related system, and a storage medium, which can save a full amount of data while reducing the space occupied by data storage.

第一方面,本申请实施例提供一种网络数据存储方法,包括:获取多个栅格对应的全量MR数据,并根据所述全量MR数据将所述多个栅格划分为P个子空间,P为不小于1的整数;根据所述P个子空间的全量MR数据,确定所述P个子空间的价值级别;根据所述P个子空间的价值级别对所述P个子空间中的栅格进行融合处理,以得到所述每个子空间的融合后的栅格;根据所述融合后的栅格存储所述全量MR数据。In the first aspect, the embodiment of the present application provides a method for storing network data, including: acquiring the full amount of MR data corresponding to multiple grids, and dividing the multiple grids into P subspaces according to the full amount of MR data, P is an integer not less than 1; determine the value levels of the P subspaces according to the full MR data of the P subspaces; perform fusion processing on the grids in the P subspaces according to the value levels of the P subspaces , to obtain the fused grid of each subspace; store the full amount of MR data according to the fused grid.

本申请实施例,通过将多个栅格划分为P个子空间,并根据所述P个子空间的全量MR数据确定该P个子空间的价值级别,进而根据该P个子空间的价值级别对P个子空间中的栅格进行融合处理,然后根据融合后的栅格存储全量MR数据。采用该手段,基于不同价值级别,将不同子空间的多个原来的栅格分别进行了合并,使得栅格数量减少,在存储全量MR数据时占用的存储空间较少,相较于现有技术中采用数量极多的等大栅格存储MR数据,本发明采用融合后的栅格存储数据,在不丢失数据本身信息、存储全量数据的前提下,栅格总数有所减少,节省了存储空间。In this embodiment of the present application, multiple grids are divided into P subspaces, and the value levels of the P subspaces are determined according to the full MR data of the P subspaces, and then the P subspaces are evaluated according to the value levels of the P subspaces. The raster in is fused, and then the full amount of MR data is stored according to the fused raster. Using this method, multiple original grids of different subspaces are merged based on different value levels, so that the number of grids is reduced, and less storage space is occupied when storing the full amount of MR data. Compared with the existing technology A very large number of equal-sized grids are used to store MR data. The present invention uses fused grids to store data. On the premise of not losing the information of the data itself and storing the full amount of data, the total number of grids is reduced, saving storage space .

作为一种可选的实现方式,所述根据所述全量MR数据将所述多个栅格划分为P个子空间,包括:As an optional implementation manner, the dividing the plurality of grids into P subspaces according to the full amount of MR data includes:

S1、根据每个所述栅格内的MR数据的数据量,确定每个所述栅格的数据密度;S1. Determine the data density of each grid according to the amount of MR data in each grid;

S2、根据每个所述栅格的数据密度确定所述多个栅格中任意两个栅格之间的不相似度值;S2. Determine the dissimilarity value between any two grids in the plurality of grids according to the data density of each grid;

S3、将所述多个栅格中任意两个栅格之间不相似度值最小的两个栅格记为同组栅格;S3. Record the two grids with the smallest dissimilarity value between any two grids among the plurality of grids as the same group of grids;

S4、计算包含所述同组栅格的所述多个栅格中任意两个栅格之间的不相似度值;S4. Calculating a dissimilarity value between any two grids in the plurality of grids including the same group of grids;

S5、重复执行步骤S3-S4,直到所述多个栅格中的每个栅格均为同组栅格中的栅格,得到P个同组栅格,所述P个子空间与P个同组栅格对应,所述P个子空间中每个子空间包含的栅格的数量不小于第一预设值。S5. Steps S3-S4 are repeatedly executed until each of the multiple grids is a grid in the same group of grids, and P grids of the same group are obtained, and the P subspaces are the same as the P subspaces of the same group. Corresponding to a group of grids, the number of grids contained in each of the P subspaces is not less than a first preset value.

该方案基于空间邻近性,将整块电信网格区域划分多个子空间,以便为后续价值分析、多尺度栅格存储创造空间载体,并提供数据支撑、理论支撑。以子空间为单位,对其价值进行排序,为多尺度栅格存储提供理论基础。Based on spatial proximity, the scheme divides the entire telecommunications grid area into multiple subspaces to create space carriers for subsequent value analysis and multi-scale grid storage, and provide data support and theoretical support. Take the subspace as the unit, sort its value, and provide a theoretical basis for multi-scale raster storage.

作为一种可选的实现方式,所述方法还包括:若存在子空间A,其中所述子空间A包含的栅格的数量小于所述第一预设值,则从所述P个子空间中获取子空间B,所述子空间B与所述子空间A的不相似度值差值小于第二预设值;将所述子空间A合并至所述子空间B中,以更新所述子空间B。As an optional implementation, the method further includes: if there is a subspace A, wherein the number of grids contained in the subspace A is less than the first preset value, selecting from the P subspaces Obtaining subspace B, the difference of the dissimilarity value between the subspace B and the subspace A is less than a second preset value; merging the subspace A into the subspace B to update the subspace space B.

该手段可以让生成的子空间的范围满足实际业务分布范围,更贴合业务自身的需要。This method can make the range of the generated subspace meet the actual business distribution range, and better meet the needs of the business itself.

作为一种可选的实现方式,所述根据所述P个子空间的全量MR数据,确定所述P个子空间的价值级别,包括:根据所述P个子空间中每个子空间所包含的栅格的数据密度和每个子空间包含的栅格的数量确定所述P个子空间中每个子空间的数据密度平均值;根据所述P个子空间中每个子空间的数据密度平均值得到所述P个子空间的价值级别,其中,所述数据密度平均值越大,所述子空间的价值级别越高。As an optional implementation manner, the determining the value level of the P subspaces according to the full amount of MR data of the P subspaces includes: The data density and the number of grids contained in each subspace determine the average data density of each of the P subspaces; obtain the P subspaces according to the average data density of each of the P subspaces. A value level, wherein the greater the average value of the data density, the higher the value level of the subspace.

通过基于数据密度平均值确定子空间的价值级别,进而便于基于子空间的价值级别确定不同的融合策略。相较于现有技术中不对地理空间进行区分,对所有地理空间都采取同样的数据存储、处理策略,本方案引入了“价值”的概念,对地理空间进行区分,如设立“高、中、低、无价值区域”等,可以针对各种区域数据特点进行不同的处理。By determining the value level of the subspace based on the average value of the data density, it is convenient to determine different fusion strategies based on the value level of the subspace. Compared with the existing technology that does not distinguish between geographical spaces and adopts the same data storage and processing strategy for all geographical spaces, this solution introduces the concept of "value" to distinguish geographical spaces, such as setting up "high, medium, Low and valueless areas", etc., can be processed differently according to the characteristics of various regional data.

可选的,所述方法还包括:根据每个所述栅格的栅格中心点经纬度得到每个所述栅格的编码;所述根据所述P个子空间的价值级别对所述P个子空间中的栅格进行融合处理,以得到所述每个子空间的融合后的栅格,包括:根据所述P个子空间的价值级别确定所述P个子空间中每个子空间的栅格尺寸上限;根据每个所述栅格的编码,将所述每个子空间中的栅格集合进行融合,以得到所述每个子空间的融合后的栅格,其中,每个子空间中融合后的栅格尺寸不大于该子空间的栅格尺寸上限,所述每个子空间中的栅格集合中的栅格的数量为预设数值,且所述栅格集合中的栅格的编码仅最后一位不相同,所述栅格集合中的栅格为所述每个子空间中的栅格,和/或,所述栅格集合中的栅格为根据所述每个子空间中的栅格进行融合得到的;所述根据所述融合后的栅格存储所述全量MR数据,包括:根据所述融合后的栅格的编码存储所述全量MR数据,所述融合后的栅格的编码为将所述栅格集合中的栅格的编码中不相同的最后一位删除得到的。Optionally, the method further includes: obtaining the code of each grid according to the latitude and longitude of the grid center point of each grid; The grids in the grid are fused to obtain the fused grid of each subspace, including: determining the upper limit of the grid size of each subspace in the P subspaces according to the value level of the P subspaces; Encoding of each grid, fusing the set of grids in each subspace to obtain a fused grid of each subspace, wherein the size of the fused grid in each subspace is different greater than the upper limit of the grid size of the subspace, the number of grids in the grid set in each subspace is a preset value, and only the last bit of the encoding of the grids in the grid set is different, The grids in the grid set are the grids in each subspace, and/or, the grids in the grid set are fused according to the grids in each subspace; Storing the full amount of MR data according to the fused grid includes: storing the full amount of MR data according to the encoding of the fused grid, where the encoding of the fused grid is the grid The encoding of the rasters in the collection is not the same as the last bit obtained by deleting.

例如,价值级别越高,栅格尺寸上限越低(小)。For example, the higher the value level, the lower (smaller) the maximum grid size.

本申请所述的栅格集合,可以理解为,将编码仅最后一位不相同的预设数量的栅格确定为一个栅格集合。该预设数量例如为4个等。每个子空间可以包含多个栅格集合。The grid set described in this application can be understood as determining a preset number of grids whose codes differ only in the last bit as a grid set. The preset number is, for example, 4 and so on. Each subspace can contain multiple collections of rasters.

采用该手段,基于不同价值级别,将不同子空间的多个原来的栅格分别进行了合并,使得栅格数量减少,在存储全量MR数据时占用的存储空间较少,相较于现有技术中采用数量极多的等大栅格存储MR数据,本发明采用融合后的栅格存储数据,在不丢失数据本身信息、存储全量数据的前提下,栅格总数有所减少,节省了存储空间。Using this method, multiple original grids of different subspaces are merged based on different value levels, so that the number of grids is reduced, and less storage space is occupied when storing the full amount of MR data. Compared with the existing technology A very large number of equal-sized grids are used to store MR data. The present invention uses fused grids to store data. On the premise of not losing the information of the data itself and storing the full amount of data, the total number of grids is reduced, saving storage space .

进一步地,所述方法还包括:当所述P个子空间中的任一子空间C中的MR数据变化量超出预设变化量,根据所述子空间C中变化后的MR数据,更新所述子空间C的价值级别。其中,在同一电信网格下,单个子空间的数据量会随着时间变化,因此可应用例如时间序列异常检测判定是否需要更新现有价值模式。通过检测当前价值模式是否适用于新数据,避免重复计算增加资源开销,保障业务响应速度,且同时保证业务精确度。Further, the method further includes: when the variation of MR data in any subspace C of the P subspaces exceeds a preset variation, updating the The value level of subspace C. Among them, under the same telecommunications grid, the amount of data in a single subspace will change with time, so for example, time series anomaly detection can be applied to determine whether the existing value model needs to be updated. By detecting whether the current value model is applicable to new data, it avoids repeated calculations that increase resource overhead, ensures business response speed, and at the same time ensures business accuracy.

第二方面,本申请提供了一种网络数据存储装置,包括:处理模块,用于获取多个栅格对应的全量MR数据,并根据所述全量MR数据将所述多个栅格划分为P个子空间,P为不小于1的整数;确定模块,用于根据所述P个子空间的全量MR数据,确定所述P个子空间的价值级别;融合模块,用于根据所述P个子空间的价值级别对所述P个子空间中的栅格进行融合处理,以得到所述每个子空间的融合后的栅格;存储模块,用于根据所述融合后的栅格存储所述全量MR数据。In a second aspect, the present application provides a network data storage device, including: a processing module, configured to obtain full MR data corresponding to multiple grids, and divide the multiple grids into P subspaces, P is an integer not less than 1; the determination module is used to determine the value level of the P subspaces according to the full amount of MR data of the P subspaces; the fusion module is used to determine the value level of the P subspaces according to the value of the P subspaces The level performs fusion processing on the grids in the P subspaces to obtain a fused grid of each subspace; a storage module is configured to store the full amount of MR data according to the fused grids.

本申请实施例,通过将多个栅格划分为P个子空间,并根据所述P个子空间的全量MR数据确定该P个子空间的价值级别,进而根据该P个子空间的价值级别对P个子空间中的栅格进行融合处理,然后根据融合后的栅格存储全量MR数据。采用该手段,基于不同价值级别,将不同子空间的多个原来的栅格分别进行了合并,使得栅格数量减少,在存储全量MR数据时占用的存储空间较少,相较于现有技术中采用数量极多的等大栅格存储MR数据,本发明采用融合后的栅格存储数据,在不丢失数据本身信息、存储全量数据的前提下,栅格总数有所减少,节省了存储空间。In this embodiment of the present application, multiple grids are divided into P subspaces, and the value levels of the P subspaces are determined according to the full MR data of the P subspaces, and then the P subspaces are evaluated according to the value levels of the P subspaces. The raster in is fused, and then the full amount of MR data is stored according to the fused raster. Using this method, multiple original grids of different subspaces are merged based on different value levels, so that the number of grids is reduced, and less storage space is occupied when storing the full amount of MR data. Compared with the existing technology A very large number of equal-sized grids are used to store MR data. The present invention uses fused grids to store data. On the premise of not losing the information of the data itself and storing the full amount of data, the total number of grids is reduced, saving storage space .

可选的,所述处理模块,用于执行以下步骤:Optionally, the processing module is configured to perform the following steps:

S1、根据每个所述栅格内的MR数据的数据量,确定每个所述栅格的数据密度;S1. Determine the data density of each grid according to the amount of MR data in each grid;

S2、根据每个所述栅格的数据密度确定所述多个栅格中任意两个栅格之间的不相似度值;S2. Determine the dissimilarity value between any two grids in the plurality of grids according to the data density of each grid;

S3、将所述多个栅格中任意两个栅格之间不相似度值最小的两个栅格记为同组栅格;S3. Record the two grids with the smallest dissimilarity value between any two grids among the plurality of grids as the same group of grids;

S4、计算包含所述同组栅格的所述多个栅格中任意两个栅格之间的不相似度值;S4. Calculating a dissimilarity value between any two grids in the plurality of grids including the same group of grids;

S5、重复执行步骤S3-S4,直到所述多个栅格中的每个栅格均为同组栅格中的栅格,得到P个同组栅格,所述P个子空间与P个同组栅格对应,所述P个子空间中每个子空间包含的栅格的数量不小于第一预设值。S5. Steps S3-S4 are repeatedly executed until each of the multiple grids is a grid in the same group of grids, and P grids of the same group are obtained, and the P subspaces are the same as the P subspaces of the same group. Corresponding to a group of grids, the number of grids contained in each of the P subspaces is not less than a first preset value.

其中,所述处理模块,还用于:若存在子空间A,其中所述子空间A包含的栅格的数量小于所述第一预设值,则从所述P个子空间中获取子空间B,所述子空间B与所述子空间A的不相似度值差值小于第二预设值;将所述子空间A合并至所述子空间B中,以更新所述子空间B。Wherein, the processing module is further configured to: if there is a subspace A, wherein the number of grids contained in the subspace A is less than the first preset value, then obtain the subspace B from the P subspaces , the difference of the dissimilarity value between the subspace B and the subspace A is smaller than a second preset value; the subspace A is merged into the subspace B, so as to update the subspace B.

进一步对,所述确定模块,用于:根据所述P个子空间中每个子空间所包含的栅格的数据密度和每个子空间包含的栅格的数量确定所述P个子空间中每个子空间的数据密度平均值;根据所述P个子空间中每个子空间的数据密度平均值得到所述P个子空间的价值级别,其中,所述数据密度平均值越大,所述子空间的价值级别越高。Further, the determination module is configured to: determine the data density of each of the P subspaces according to the data density of the grids contained in each of the P subspaces and the number of grids contained in each subspace. The average value of data density; the value level of the P subspaces is obtained according to the average data density of each subspace in the P subspaces, wherein the greater the average value of the data density, the higher the value level of the subspace .

可选的,所述装置还包括编码模块,用于:根据每个所述栅格的栅格中心点经纬度得到每个所述栅格的编码;所述融合模块,用于:根据所述P个子空间的价值级别确定所述P个子空间中每个子空间的栅格尺寸上限;根据每个所述栅格的编码,将所述每个子空间中的栅格集合进行融合,以得到所述每个子空间的融合后的栅格,其中,每个子空间中融合后的栅格尺寸不大于该子空间的栅格尺寸上限,所述每个子空间中的栅格集合中的栅格的数量为预设数值,且所述栅格集合中的栅格的编码仅最后一位不相同,所述栅格集合中的栅格为所述每个子空间中的栅格,和/或,所述栅格集合中的栅格为根据所述每个子空间中的栅格进行融合得到的;所述存储模块,用于:根据所述融合后的栅格的编码存储所述全量MR数据,所述融合后的栅格的编码为将所述栅格集合中的栅格的编码中不相同的最后一位删除得到的。Optionally, the device further includes an encoding module, configured to: obtain the encoding of each grid according to the latitude and longitude of the grid center point of each grid; the fusion module, configured to: according to the P The value level of each subspace determines the upper limit of the grid size of each subspace in the P subspaces; according to the encoding of each grid, the grid set in each subspace is fused to obtain each The fused grids of subspaces, wherein the fused grid size in each subspace is not greater than the upper limit of the grid size of the subspace, and the number of grids in the grid set in each subspace is preset Set a value, and only the last bit of the code of the grid in the grid set is different, the grid in the grid set is the grid in each subspace, and/or, the grid The grids in the set are fused according to the grids in each subspace; the storage module is configured to: store the full amount of MR data according to the encoding of the fused grids, and the fused The code of the grid is obtained by deleting the last bit that is different from the codes of the grids in the grid set.

进一步地,所述装置还包括更新模块,用于:当所述P个子空间中的任一子空间C中的MR数据变化量超出预设变化量,根据所述子空间C中变化后的MR数据,更新所述子空间C的价值级别。Further, the device further includes an update module, configured to: when the change amount of MR data in any subspace C of the P subspaces exceeds a preset change amount, according to the changed MR data in the subspace C data, update the value level of the subspace C.

第三方面,本申请提供了一种网络数据存储装置,包括处理器和存储器;其中,所述存储器用于存储程序代码,所述处理器用于调用所述程序代码,以执行如第一方面任一实现方式提供的方法。In a third aspect, the present application provides a network data storage device, including a processor and a memory; wherein, the memory is used to store program codes, and the processor is used to call the program codes to execute any A method provided by an implementation.

第四方面,本申请提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行以实现如第一方面任一实现方式提供的方法。In a fourth aspect, the present application provides a computer-readable storage medium, where the computer-readable storage medium stores a computer program, and the computer program is executed by a processor to implement the method provided in any implementation manner of the first aspect.

第五方面,本申请提供了一种计算机程序产品,当计算机程序产品在计算机上运行时,使得所述计算机执行如第一方面任一实现方式提供的方法。In a fifth aspect, the present application provides a computer program product, which, when running on a computer, causes the computer to execute the method provided in any implementation manner of the first aspect.

可以理解地,上述提供的第二方面所述的装置、第三方面所述的装置、第四方面所述的计算机存储介质或者第五方面所述的计算机程序产品均用于执行第一方面中任一所提供的方法。因此,其所能达到的有益效果可参考对应方法中的有益效果,此处不再赘述。It can be understood that the device described in the second aspect, the device described in the third aspect, the computer storage medium described in the fourth aspect, or the computer program product described in the fifth aspect provided above are all used to execute the Either of the provided methods. Therefore, the beneficial effects that it can achieve can refer to the beneficial effects in the corresponding method, and will not be repeated here.

附图说明Description of drawings

下面对本申请实施例用到的附图进行介绍。The accompanying drawings used in the embodiments of the present application are introduced below.

图1a是本申请实施例提供的一种网络数据存储系统的框架示意图;Fig. 1a is a schematic framework diagram of a network data storage system provided by an embodiment of the present application;

图1b是本申请实施例提供的一种大数据处理平台的示意图;Fig. 1b is a schematic diagram of a big data processing platform provided by an embodiment of the present application;

图2是本申请实施例提供的一种网络数据存储方法的流程示意图;FIG. 2 is a schematic flow diagram of a network data storage method provided by an embodiment of the present application;

图3是本申请实施例提供的又一种网络数据存储方法的流程示意图;Fig. 3 is a schematic flowchart of another network data storage method provided by the embodiment of the present application;

图4是本申请实施例提供的一种子空间分布示意图;FIG. 4 is a schematic diagram of a subspace distribution provided by an embodiment of the present application;

图5是本申请实施例提供的一种栅格融合示意图;Fig. 5 is a schematic diagram of grid fusion provided by the embodiment of the present application;

图6是本申请实施例提供的一种网络数据存储装置的结构示意图;FIG. 6 is a schematic structural diagram of a network data storage device provided by an embodiment of the present application;

图7是本申请实施例提供的另一种网络数据存储装置的结构示意图。FIG. 7 is a schematic structural diagram of another network data storage device provided by an embodiment of the present application.

具体实施方式Detailed ways

下面结合本申请实施例中的附图对本申请实施例进行描述。本申请实施例的实施方式部分使用的术语仅用于对本申请的具体实施例进行解释,而非旨在限定本申请。Embodiments of the present application are described below with reference to the drawings in the embodiments of the present application. The terms used in the implementation of the embodiments of the present application are only used to explain the specific embodiments of the present application, and are not intended to limit the present application.

测量报告(Measurement Report,MR)数据是指信息在业务信道上每480ms(信令信道上470ms)发送一次数据,这些数据可用于网络评估和优化数据。Measurement report (Measurement Report, MR) data means that information is sent every 480 ms on a traffic channel (470 ms on a signaling channel), and these data can be used for network evaluation and optimization data.

栅格:Grid,通过运用固定的正方形格子切分地理空间,将地理空间的网络指标细化至网格中表征。Grid: Grid, by using a fixed square grid to segment the geographic space, refine the network indicators of the geographic space into the grid representation.

多尺度栅格,即运用大小不同的栅格表征地理空间网络指标。Multi-scale grids, which use grids of different sizes to represent geospatial network indicators.

本方案中的价值模式表可以理解为,包含对空间进行剖分后表征各个子空间内包含的栅格编码线段集、对应密度、价值排序等信息的数据表。The value model table in this solution can be understood as a data table that contains information such as raster coded line segment sets, corresponding densities, and value rankings contained in each subspace after the space is divided.

参照图1a所示,为本申请实施例提供的一种网络数据存储系统的框架示意图。该系统可包括基站101和大数据处理平台102。其中,用户终端向基站上报MR数据,基站101收集后传输给大数据处理平台102。大数据处理平台102从基站接收全量MR数据并对各条栅格数据标记价值标签,依据价值标签及栅格编码将等大栅格融合成多尺度栅格,生成多尺度栅格全量MR数据,进而基于该多尺度栅格全量MR数据来存储。Referring to FIG. 1 a , it is a schematic framework diagram of a network data storage system provided by an embodiment of the present application. The system may include a base station 101 and a big data processing platform 102 . Wherein, the user terminal reports the MR data to the base station, and the base station 101 collects and transmits the MR data to the big data processing platform 102 . The big data processing platform 102 receives the full amount of MR data from the base station and marks each piece of grid data with a value label, and fuses the equal-sized grid into a multi-scale grid according to the value label and grid code to generate the full amount of MR data of the multi-scale grid. Then, based on the multi-scale grid full MR data, it is stored.

其中,如图1b所示,大数据处理平台102可包括价值模式提取模块1021、业务指标聚合模块1022和数据存储模块1023。Wherein, as shown in FIG. 1 b , the big data processing platform 102 may include a value model extraction module 1021 , a business indicator aggregation module 1022 and a data storage module 1023 .

价值模式提取模块1021用于运用空间聚类算法将全量等大栅格数据进行聚合,在电信栅格区域内部划分多个子空间,以进行空间维度价值分析;价值模式提取模块1021还用于对多个子空间进行数据价值分析,例如按照一定比例归类高、中、低、无价值子空间,并对原有全量MR单据添加价值等级标签;该价值模式提取模块1021还用于依照业务指标聚合模块更新后的键值,针对不同价值级别区域生成对应区域的线段集,刻画价值模式,以及随着原始等大栅格数据的更新,对其进行时间序列异常检测,对现有价值模式的复用与否提出明确结论。The value model extraction module 1021 is used to use the spatial clustering algorithm to aggregate the full amount of large-scale grid data, and divide multiple subspaces within the telecommunications grid area to analyze the value of the spatial dimension; the value model extraction module 1021 is also used for multiple Perform data value analysis on each subspace, such as classifying high, medium, low, and worthless subspaces according to a certain proportion, and adding value level tags to the original full amount of MR documents; the value pattern extraction module 1021 is also used to aggregate modules according to business indicators The updated key value generates line segment sets corresponding to areas of different value levels, describes the value mode, and performs time series anomaly detection on it as the original large raster data is updated, and reuses the existing value mode whether to draw clear conclusions.

业务指标聚合模块1022用于按照价值模式提取模块1021生成的价值标签,参照预先定义的多尺度栅格使用策略,在不同价值子空间内生成多尺度栅格,更新各个多尺度栅格键值,并对相应业务指标进行聚合。The business index aggregation module 1022 is used to generate multi-scale grids in different value subspaces according to the value tags generated by the value model extraction module 1021, and to update the key values of each multi-scale grid, And aggregate the corresponding business indicators.

数据存储模块1023用于存储全量MR单据,为后续处理提供数据来源。The data storage module 1023 is used to store all MR documents and provide data sources for subsequent processing.

作为一种可选的实现方式,如图1a所示,该系统还包括网规网优平台103。大数据处理平台102将多尺度栅格全量MR数据传输给网规网优平台103,以便网规网优平台103针对接收到的此数据进行进一步数据分析及呈现,以支撑业务。该网规是网络规划的简称,指在建设通信网络之前根据建网目标、用户需求、当地实际情况等对网络建设进行规划。该网优是网络优化的简称,指在现网基础上通过话务数据分析、现场测试数据采集、参数分析、硬件检查等手段,找出影响网络质量的原因,在此基础上进行各种优化。As an optional implementation manner, as shown in FIG. 1 a , the system further includes a network planning and network optimization platform 103 . The big data processing platform 102 transmits the full MR data of multi-scale grids to the network planning and optimization platform 103, so that the network planning and optimization platform 103 can further analyze and present the received data to support services. The network planning is the abbreviation of network planning, which refers to the planning of network construction according to the network construction goals, user needs, and local actual conditions before the construction of communication networks. The network optimization is the abbreviation of network optimization, which means to find out the reasons that affect the network quality through traffic data analysis, field test data collection, parameter analysis, hardware inspection and other means on the basis of the existing network, and to perform various optimizations on this basis .

参照图2所示,为本申请实施例提供的一种网络数据存储方法的流程示意图。如图2所示,该方法包括步骤201-204,具体如下:Referring to FIG. 2 , it is a schematic flowchart of a network data storage method provided by an embodiment of the present application. As shown in Figure 2, the method includes steps 201-204, specifically as follows:

201、获取多个栅格对应的全量MR数据,并根据所述全量MR数据将所述多个栅格划分为P个子空间,P为不小于1的整数;201. Acquire full MR data corresponding to multiple grids, and divide the multiple grids into P subspaces according to the full MR data, where P is an integer not less than 1;

可选的,本申请实施例的执行主体可以是大数据处理平台,例如具体为服务器等。Optionally, the execution subject of this embodiment of the present application may be a big data processing platform, such as specifically a server.

上述多个栅格,可以是多个相同尺寸的栅格,例如可以均为50*50大小。The plurality of grids mentioned above may be multiple grids of the same size, for example, may all have a size of 50*50.

具体地,服务器周期性地获取多个栅格对应的全量MR数据。该周期性例如可以是每天,或者间隔几天等,本方案对此不做具体限定。例如,服务器从基站获取整个电信网格区域内的所有栅格对应的当天的全量MR数据。该数据以等大栅格为基本单位进行存储,各条数据包含某个或多个衡量该栅格网络质量好坏的KPI字段。Specifically, the server periodically acquires the full amount of MR data corresponding to multiple grids. The periodicity may be, for example, every day, or every few days, etc., which is not specifically limited in this solution. For example, the server obtains from the base station the full amount of MR data corresponding to all the grids in the entire telecommunications grid area of the day. The data is stored in equal-sized grids as the basic unit, and each piece of data contains one or more KPI fields to measure the quality of the grid network.

上述子空间,可以理解为,是通过将上述多个栅格进行划分而得到的。例如,每个子空间包含的栅格数量不小于预设值等。The foregoing subspace can be understood as being obtained by dividing the foregoing plurality of grids. For example, the number of grids contained in each subspace is not less than the preset value and so on.

作为一种实现方式,上述P个子空间,可以是根据每个栅格中MR数据的数据量大小来进行划分得到的,例如,根据每个栅格中MR数据的数据量大小得到每个栅格的数据密度,然后将数据密度较大的栅格划分到一个子空间,将数据密度较小的栅格划分到另一个子空间;或者,将数据密度相近的栅格划分到一个子空间等。As an implementation, the above P subspaces can be obtained by dividing according to the data volume of MR data in each grid, for example, according to the data volume of MR data in each grid, each grid can be obtained Then divide the raster with higher data density into one subspace, and divide the raster with lower data density into another subspace; or divide the raster with similar data density into a subspace, etc.

上述仅为一种示例,本方案对此不做具体限定。The foregoing is only an example, and this solution does not specifically limit it.

202、根据所述P个子空间的全量MR数据,确定所述P个子空间的价值级别;202. Determine the value levels of the P subspaces according to the full MR data of the P subspaces;

基于上述划分得到的P个子空间中每个子空间中的MR数据,来确定每个子空间的价值级别。The value level of each subspace is determined based on the MR data in each of the P subspaces obtained from the above division.

例如,基于每个子空间中的MR数据的数据密度等,来确定每个子空间的价值级别。For example, the value level of each subspace is determined based on the data density and the like of MR data in each subspace.

该价值级别,例如可以是高、中、低三个级别,或者,按照从大到小依次为第一级别、第二级别、第三级别等。The value level may be, for example, three levels of high, medium, and low, or, in descending order, the value level may be the first level, the second level, the third level, and so on.

203、根据所述P个子空间的价值级别对所述P个子空间中的栅格进行融合处理,以得到所述每个子空间的融合后的栅格;203. Perform fusion processing on the grids in the P subspaces according to the value levels of the P subspaces, so as to obtain the fused grids of each subspace;

上述通过确定每个子空间的价值级别,进而基于不同价值级别对应不同的栅格融合处理,以此得到每个子空间的融合后的栅格。By determining the value level of each subspace above, and then corresponding to different grid fusion processes based on different value levels, the fused grid of each subspace is obtained.

例如,价值级别高的子空间的融合后的栅格尺寸上限较小,价值级别低的子空间的融合后的栅格尺寸上限较大等。For example, the upper limit of the fused grid size of a subspace with a high value level is smaller, and the upper limit of the fused grid size of a subspace with a lower value level is larger.

204、根据所述融合后的栅格存储所述全量MR数据。204. Store the full amount of MR data according to the fused grid.

其中,融合后的栅格,可以看做是将多个原来的栅格进行了合并,将原来多个栅格包含的MR数据存储在对应一个融合后的栅格中,融合后的栅格由于数量减少,因此在存储全量MR数据时占用的存储空间较少,节省了存储空间。Among them, the fused grid can be regarded as merging multiple original grids, storing the MR data contained in the original multiple grids in a corresponding fused grid, and the fused grid is due to The number is reduced, so less storage space is occupied when storing the full amount of MR data, saving storage space.

本申请实施例,通过将多个栅格划分为P个子空间,并根据所述P个子空间的全量MR数据确定该P个子空间的价值级别,进而根据该P个子空间的价值级别对P个子空间中的栅格进行融合处理,然后根据融合后的栅格存储全量MR数据。采用该手段,基于不同价值级别,将不同子空间的多个原来的栅格分别进行了合并,使得栅格数量减少,在存储全量MR数据时占用的存储空间较少,相较于现有技术中采用数量极多的等大栅格存储MR数据,本发明采用融合后的栅格存储数据,在不丢失数据本身信息、存储全量数据的前提下,栅格总数有所减少,节省了存储空间,降低了数据库中存储开销以及提升了查询性能。In this embodiment of the present application, multiple grids are divided into P subspaces, and the value levels of the P subspaces are determined according to the full MR data of the P subspaces, and then the P subspaces are evaluated according to the value levels of the P subspaces. The raster in is fused, and then the full amount of MR data is stored according to the fused raster. Using this method, multiple original grids of different subspaces are merged based on different value levels, so that the number of grids is reduced, and less storage space is occupied when storing the full amount of MR data. Compared with the existing technology A very large number of equal-sized grids are used to store MR data. The present invention uses fused grids to store data. On the premise of not losing the information of the data itself and storing the full amount of data, the total number of grids is reduced, saving storage space , reducing storage overhead in the database and improving query performance.

另一方面,本方案将全量MR数据由传统的等大栅格存储转换成多尺度栅格存储,还可以解决现有技术中由于数据采样引起的无法完成实时网络分析的缺陷。本方案采用全量MR数据进行分析,无需积累多天数据来满足分析标准,较短时间范围内的数据量即可满足分析要求,通过与历史分析数据做对比,直观体现网络变化的影响,为网规网优调节提供效果验证。On the other hand, this solution converts the full amount of MR data from traditional equal-sized grid storage to multi-scale grid storage, and can also solve the defect that real-time network analysis cannot be completed due to data sampling in the prior art. This solution uses the full amount of MR data for analysis. It does not need to accumulate multiple days of data to meet the analysis standards. The amount of data in a short period of time can meet the analysis requirements. By comparing with historical analysis data, it can intuitively reflect the impact of network changes. Network planning optimization and adjustment provide effect verification.

下面对本申请实施例提供的网络数据存储方法进行详细的介绍。参照图3所示,为本申请实施例提供的一种网络数据存储方法的示意图。如图3所示,该方法包括步骤301-307,该实施例以基于时空聚类算法Zorder-Hclustering,进行子空间划分,进而进行栅格融合,具体如下:The network data storage method provided by the embodiment of the present application will be described in detail below. Referring to FIG. 3 , it is a schematic diagram of a network data storage method provided by an embodiment of the present application. As shown in Figure 3, the method includes steps 301-307. In this embodiment, based on the spatio-temporal clustering algorithm Zorder-Hclustering, subspace division is performed, and then grid fusion is performed, as follows:

301、获取多个栅格对应的全量MR数据;301. Obtain full MR data corresponding to multiple grids;

例如,服务器从基站中获取到全量MR数据。For example, the server acquires the full amount of MR data from the base station.

302、根据所述全量MR数据将所述多个栅格划分为P个子空间,P为不小于1的整数;302. Divide the plurality of grids into P subspaces according to the full amount of MR data, where P is an integer not less than 1;

作为一种具体的实现方式,基于时空聚类算法Zorder-Hclustering,进行子空间划分,相应地,步骤302可包括如下步骤S1-S5,具体如下:As a specific implementation manner, based on the space-time clustering algorithm Zorder-Hclustering, the subspace is divided. Correspondingly, step 302 may include the following steps S1-S5, specifically as follows:

S1、根据每个所述栅格内的MR数据的数据量,确定每个所述栅格的数据密度;S1. Determine the data density of each grid according to the amount of MR data in each grid;

鉴于电信网格内部单个栅格面积相同,因此基于每个栅格对应的尺寸,以及每个栅格对应的MR数据的数据量Q,可确定每个栅格的数据密度Density。Given that the area of a single grid in the telecommunications grid is the same, the data density Density of each grid can be determined based on the corresponding size of each grid and the data volume Q of MR data corresponding to each grid.

例如,当栅格尺寸为50m时,该数据密度可表示为Density=Q/50*50。For example, when the grid size is 50m, the data density can be expressed as Density=Q/50*50.

S2、根据每个所述栅格的数据密度确定所述多个栅格中任意两个栅格之间的不相似度值;该任意两个栅格Grid_1、Grid_2之间的不相似度值D可表示为:S2. Determine the dissimilarity value between any two grids in the plurality of grids according to the data density of each grid; the dissimilarity value D between any two grids Grid_1 and Grid_2 Can be expressed as:

D=|Density(Grid_1)-Density(Grid_2)|;D=|Density(Grid_1)-Density(Grid_2)|;

S3、将所述多个栅格中任意两个栅格之间不相似度值最小的两个栅格记为同组栅格;S3. Record the two grids with the smallest dissimilarity value between any two grids among the plurality of grids as the same group of grids;

例如,栅格Grid_1、Grid_2之间的不相似度值最小,则栅格Grid_1、Grid_2即为同组栅格。For example, if the dissimilarity value between Grid_1 and Grid_2 is the smallest, then Grid_1 and Grid_2 are the same group of grids.

S4、计算包含所述同组栅格的所述多个栅格中任意两个栅格之间的不相似度值;S4. Calculating a dissimilarity value between any two grids in the plurality of grids including the same group of grids;

也就是说,将同组栅格看做是一个栅格,然后再计算该同组栅格与其他任意栅格之间的不相似度值,然后基于前述已经计算得到的除同组栅格外的其他任意两个栅格之间的不相似度值以及该同组栅格与其他任意栅格之间的不相似度值,从中确定出最小不相似度值。That is to say, treat the same group of rasters as one raster, and then calculate the dissimilarity value between the same group of rasters and any other raster, and then based on the previously calculated The dissimilarity value between any other two rasters and the dissimilarity value between the same group of rasters and other arbitrary rasters, and determine the minimum dissimilarity value.

其中,同组栅格的数据密度为该同组栅格中的所有栅格的数据量除以该同组栅格的总面积而得到。Wherein, the data density of the grids in the same group is obtained by dividing the data volume of all the grids in the same group by the total area of the grids in the same group.

S5、重复执行步骤S3-S4,直到所述多个栅格中的每个栅格均为同组栅格中的栅格,得到P个同组栅格,所述P个子空间与P个同组栅格对应,所述P个子空间中每个子空间包含的栅格的数量不小于第一预设值。S5. Steps S3-S4 are repeatedly executed until each of the multiple grids is a grid in the same group of grids, and P grids of the same group are obtained, and the P subspaces are the same as the P subspaces of the same group. Corresponding to a group of grids, the number of grids contained in each of the P subspaces is not less than a first preset value.

也就是说,当每个栅格均有同组栅格,且得到了P个子空间,该每个子空间包含的栅格的数量均不小于第一预设值,则停止迭代。该第一预设值可以理解为,预设的每个子空间内包含的最少栅格数。That is to say, when each grid has the same group of grids, and P subspaces are obtained, and the number of grids contained in each subspace is not less than the first preset value, the iteration is stopped. The first preset value can be understood as the preset minimum number of grids contained in each subspace.

其中,若存在子空间A,其中所述子空间A包含的栅格的数量小于所述第一预设值,则从所述P个子空间中获取子空间B,所述子空间B与所述子空间A的不相似度值差值小于第二预设值,则将所述子空间A合并至所述子空间B中,以更新所述子空间B。Wherein, if there is a subspace A, wherein the number of grids contained in the subspace A is less than the first preset value, the subspace B is obtained from the P subspaces, and the subspace B and the If the dissimilarity value difference of the subspace A is less than a second preset value, the subspace A is merged into the subspace B, so as to update the subspace B.

具体地,可以从与子空间A空间相邻的子空间中确定与所述子空间A的不相似度值差值小于第二预设值的子空间B。Specifically, the subspace B whose dissimilarity value difference with the subspace A is smaller than a second preset value may be determined from the subspaces spatially adjacent to the subspace A.

例如,通过将包含的栅格的数量小于第一预设值的子空间与其不相似度值最接近的子空间进行合并,以便最终得到的P个子空间中每个子空间包含的栅格的数量均不小于第一预设值。该手段可以让生成的子空间的范围满足实际业务分布范围,更贴合业务自身的需要。For example, by merging the subspaces containing the number of grids less than the first preset value and the subspace with the closest dissimilarity value, so that the number of grids contained in each of the finally obtained P subspaces is equal to not less than the first preset value. This method can make the range of the generated subspace meet the actual business distribution range, and better meet the needs of the business itself.

该方案基于空间邻近性,将整块电信网格区域划分多个子空间,以便为后续价值分析、多尺度栅格存储创造空间载体,并提供数据支撑、理论支撑。以子空间为单位,对其价值进行排序,为多尺度栅格存储提供理论基础。Based on spatial proximity, the scheme divides the entire telecommunications grid area into multiple subspaces to create space carriers for subsequent value analysis and multi-scale grid storage, and provide data support and theoretical support. Take the subspace as the unit, sort its value, and provide a theoretical basis for multi-scale raster storage.

作为一种实现方式,针对上述步骤S1-S5,可示例如下:As an implementation manner, for the above steps S1-S5, an example may be as follows:

令每个栅格为一个单独组,根据任意两个栅格之间的不相似度值以及栅格编号构建矩阵M,矩阵M中的值为所有组之间的不相似度值。设共有i个栅格(组),其中,Di-1,i表示栅格Grid_i-1、Grid_i之间的不相似度值。矩阵M可参照表一所示。Let each grid be a separate group, construct a matrix M according to the dissimilarity value between any two grids and the grid number, and the value in the matrix M is the dissimilarity value between all groups. It is assumed that there are i grids (groups) in total, where D i-1, i represent the dissimilarity value between grids Grid_i-1 and Grid_i. Matrix M can be shown in Table 1.

表一Table I

DD. Grid_1Grid_1 Grid_2Grid_2 Grid_3Grid_3 Grid_4Grid_4 Grid_5Grid_5 ……... Grid_i-1Grid_i-1 Grid_iGrid_i Grid_1Grid_1 00 D1,2 D 1,2 D1,3 D 1,3 D1,4 D 1,4 D1,5 D 1,5 ……... D1,i-1 D 1,i-1 D1,i D 1,i Grid_2Grid_2 D2,1 D 2,1 00 D2,3 D 2,3 D2,4 D 2,4 D2,5 D 2,5 ……... D2,i-1 D 2,i-1 D2,i D 2,i Grid_3Grid_3 D3,1 D 3,1 D3,2 D 3,2 00 D3,4 D 3,4 D3,5 D 3,5 D3,i-1 D 3,i-1 D3,i D 3,i Grid_4Grid_4 D4,1 D 4,1 D4,2 D 4,2 D4,3 D 4,3 00 D4,5 D 4,5 D4,i-1 D 4,i-1 D4,i D 4,i Grid_5Grid_5 D5,1 D 5,1 D5,2 D 5,2 D5,3 D 5,3 D5,4 D 5,4 00 D5,i-1 D 5,i-1 D5,i D 5,i ……... ……... ……... 00 ……... ……... Grid_i-1Grid_i-1 Di-1,1 D i-1,1 Di-1,2 D i-1,2 Di-1,3 D i-1,3 Di-1,4 D i-1,4 Di-1,5 D i-1,5 ……... 00 Di-1,i D i-1,i Grid_iGrid_i Di,1 D i,1 Di,2 D i,2 Di,3 D i,3 Di,4 D i,4 Di,5 D i,5 ……... Di,i-1 D i,i-1 00

通过遍历矩阵M,将具有最小不相似度值的两个栅格组看做一个同组栅格,并更新矩阵M。By traversing the matrix M, the two grid groups with the minimum dissimilarity value are regarded as the same group of grids, and the matrix M is updated.

例如:D1,2为矩阵M中的最小值,则合并组Grid_1与组Grid_2为组Grid_1_2,且更新其Density和矩阵M,如表二所示为更新后的矩阵M:For example: D 1,2 is the minimum value in the matrix M, then merge the group Grid_1 and the group Grid_2 into the group Grid_1_2, and update its Density and matrix M, as shown in Table 2 for the updated matrix M:

表二Table II

DD. Grid_1_2Grid_1_2 Grid_3Grid_3 Grid_4Grid_4 Grid_5Grid_5 ……... Grid_i-1Grid_i-1 Grid_iGrid_i Grid_1_2Grid_1_2 00 D1_2,3 D 1_2,3 D1_2,4 D 1_2,4 D1_2,5 D 1_2,5 ……... D1_2,i-1 D 1_2,i-1 D1_2,i D 1_2,i Grid_3Grid_3 D3,1_2 D 3,1_2 00 D3,4 D 3,4 D3,5 D 3,5 D3,i-1 D 3,i-1 D3,i D 3,i Grid_4Grid_4 D4,1_2 D 4,1_2 D4,3 D 4,3 00 D4,5 D 4,5 D4,i-1 D 4,i-1 D4,i D 4,i Grid_5Grid_5 D5,1_2 D 5,1_2 D5,3 D 5,3 D5,4 D 5,4 00 D5,i-1 D 5,i-1 D5,i D 5,i ……... ……... 00 ……... ……... Grid_i-1Grid_i-1 Di-1,1_2 D i-1,1_2 Di-1,3 D i-1,3 Di-1,4 D i-1,4 Di-1,5 D i-1,5 ……... 00 Di-1,i D i-1,i Grid_iGrid_i Di,1_2 D i,1_2 Di,3 D i,3 Di,4 D i,4 Di,5 D i,5 ……... Di,i-1 D i,i-1 00

通过重复合并最小不相似度值的两个组,直到有P个组中的栅格数量均不小于第一预设值。By repeatedly merging the two groups with the minimum dissimilarity value, until the number of grids in P groups is not less than the first preset value.

此时若存在单组内栅格数小于第一预设值的组A,通过确定与组A相邻且不相似度值最接近的组B,将组A合并至组B中。At this time, if there is a group A whose number of grids in a single group is less than the first preset value, group A is merged into group B by determining group B which is adjacent to group A and has the closest dissimilarity value.

例如,若Grid_4内的栅格数小于第一预设值,则比较D3,4与D4,5的大小。For example, if the number of grids in Grid_4 is less than the first preset value, compare the sizes of D 3,4 and D 4,5 .

若D3,4<D4,5,则合并Grid_3与Grid_4为Grid_3_4,并更新其Density和矩阵M,如表三所示。If D 3,4 <D 4,5 , merge Grid_3 and Grid_4 into Grid_3_4, and update its Density and matrix M, as shown in Table 3.

若D3,4>D4,5,则合并Grid_4与Grid_5为Grid_4_5,并更新其Density和矩阵M,如表四所示。If D 3,4 >D 4,5 , merge Grid_4 and Grid_5 into Grid_4_5, and update its Density and matrix M, as shown in Table 4.

表三Table three

DD. Grid_1_2Grid_1_2 Grid_3_4Grid_3_4 Grid_5Grid_5 ……... Grid_i-1Grid_i-1 Grid_iGrid_i Grid_1_2Grid_1_2 00 D1_2,3_4 D 1_2,3_4 D1_2,5 D 1_2,5 ……... D1_2,i-1 D 1_2,i-1 D1_2,i D 1_2,i Grid_3_4Grid_3_4 D3_4,1_2 D 3_4,1_2 00 D3_4,5 D 3_4,5 D3_4,i-1 D 3_4,i-1 D3_4,i D 3_4,i Grid_5Grid_5 D5,1_2 D 5,1_2 D5,3_4 D 5,3_4 00 D5,i-1 D 5,i-1 D5,i D 5,i ……... ……... 00 ……... ……... Grid_i-1Grid_i-1 Di-1,1_2 D i-1,1_2 Di-1,3_4 D i-1,3_4 Di-1,5 D i-1,5 ……... 00 Di-1,i D i-1,i Grid_iGrid_i Di,1_2 D i,1_2 Di,3_4 D i,3_4 Di,5 D i,5 ……... Di,i-1 D i,i-1 00

表四Table four

DD. Grid_1_2Grid_1_2 Grid_3Grid_3 Grid_4_5Grid_4_5 ……... Grid_i-1Grid_i-1 Grid_iGrid_i Grid_1_2Grid_1_2 00 D1_2,3 D 1_2,3 D1_2,4_5 D 1_2,4_5 ……... D1_2,i-1 D 1_2,i-1 D1_2,i D 1_2,i Grid_3Grid_3 D3,1_2 D 3,1_2 00 D3,4_5 D 3,4_5 D3,i-1 D 3,i-1 D3,i D 3,i Grid_4_5Grid_4_5 D4_5,1_2 D 4_5,1_2 D4_5,3 D 4_5,3 00 D4_5,i-1 D 4_5,i-1 D4_5,i D 4_5,i ……... ……... 00 ……... ……... Grid_i-1Grid_i-1 Di-1,1_2 D i-1,1_2 Di-1,3 D i-1,3 Di-1,4_5 D i-1,4_5 ……... 00 Di-1,i D i-1,i Grid_iGrid_i Di,1_2 D i,1_2 Di,3 D i,3 Di,4_5 D i,4_5 ……... Di,i-1 D i,i-1 00

重复上述合并过程,直至没有单独分组栅格数量小于第一预设值时,算法结束。The above merging process is repeated until the number of individual grouped grids is less than the first preset value, and the algorithm ends.

303、根据所述P个子空间中每个子空间所包含的栅格的数据密度和每个子空间包含的栅格的数量确定所述P个子空间中每个子空间的数据密度平均值;303. Determine the average data density of each of the P subspaces according to the data density of the grids contained in each of the P subspaces and the number of grids contained in each subspace;

具体地,根据每个栅格内的MR数据的数据量,确定每个栅格的数据密度,进而将每个子空间中包含的所有栅格的数据密度进行求和,即得到每个子空间所包含的栅格的数据密度。Specifically, according to the amount of MR data in each grid, the data density of each grid is determined, and then the data densities of all grids contained in each subspace are summed to obtain the The data density of the raster.

而每个子空间包含的栅格的数量可以根据该子空间中栅格的编码的数量确定。其中,栅格的编码是根据每个栅格的栅格中心点经纬度得到的。The number of grids contained in each subspace can be determined according to the number of codes of the grids in the subspace. Wherein, the coding of the grid is obtained according to the latitude and longitude of the grid center point of each grid.

例如,依据四进制Z阶曲线算法,通过输入50米栅格经纬度可得出栅格编码。For example, according to the quaternary Z-order curve algorithm, the grid code can be obtained by inputting the latitude and longitude of a 50-meter grid.

该手段仅为一种示例,其还可以是其他方式,本方案对此不做具体限定。This means is only an example, and it may also be in other manners, which are not specifically limited in this solution.

每个子空间的数据密度平均值即为该子空间中栅格的数据密度之和与栅格的数量的比值。The average data density of each subspace is the ratio of the sum of the data densities of the grids in the subspace to the number of grids.

例如,采用MR数据中自带Z序四进制栅格编码可构造键值对,其基本格式为:键Key:值Value=>Z序四进制栅格编码:对应栅格的数据密度。相应地,上述P个子空间可以理解为是P个包含一系列键值对的栅格组。For example, the key-value pair can be constructed by using the built-in Z-order quaternary raster code in MR data, and its basic format is: key Key: value Value => Z-order quaternary raster code: corresponding to the data density of the raster. Correspondingly, the above P subspaces can be understood as P grid groups containing a series of key-value pairs.

根据各子空间包含键值对的键(即栅格编码)的个数,即为每个子空间中包含的栅格的数量,进而可确定P个子空间中每个子空间的数据密度平均值。According to the number of keys (that is, raster codes) containing key-value pairs in each subspace, that is, the number of rasters contained in each subspace, the average data density of each subspace in the P subspaces can be determined.

304、根据所述P个子空间中每个子空间的数据密度平均值得到所述P个子空间的价值级别,其中,所述数据密度平均值越大,所述子空间的价值级别越高;304. Obtain the value level of the P subspaces according to the average data density of each of the P subspaces, wherein the greater the average value of the data density, the higher the value level of the subspace;

将上述数据密度平均值进行排序,进而可得到P个子空间的价值级别。Sorting the average values of the above data densities can then obtain the value levels of the P subspaces.

优选的,可将二维的栅格空间转换为一维的点集、线段集,以便转换之后在给MR数据打标签时,将一条数据的编码看做一个点,一系列连续的编码相当于一条线,用一维的点线代替二维栅格来判断栅格编码的归属速度则更快。Preferably, the two-dimensional grid space can be converted into a one-dimensional point set and line segment set, so that when labeling MR data after conversion, the code of a piece of data is regarded as a point, and a series of continuous codes is equivalent to It is faster to use one-dimensional dotted lines instead of two-dimensional grids to determine the attribution of grid codes.

如图4所示,当P为4时,即存在子空间Group1、Group2、Group3和Group4,若:As shown in Figure 4, when P is 4, there are subspaces Group1, Group2, Group3 and Group4, if:

DensityGroup1>DensityGroup2>DensityGroup3>DensityGroup4,则有:Density Group1 >Density Group2 >Density Group3 >Density Group4 , then:

Group1包括:013,021,023,030-033,102-123…;Group1 includes: 013, 021, 023, 030-033, 102-123…;

Group2包括:133,303,310-313,321-323…;Group2 includes: 133, 303, 310-313, 321-323…;

Group3包括:002-012,020-030,032,100,101,110,111…;Group3 includes: 002-012, 020-030, 032, 100, 101, 110, 111…;

Group4包括:000,001;Group4 includes: 000,001;

其中,Group1、Group2、Group3和Group4分别对应为高、中、低、无价值区域。Among them, Group1, Group2, Group3 and Group4 correspond to high, medium, low and valueless areas respectively.

现有技术中不对地理空间进行区分,对所有地理空间都采取同样的数据存储、处理策略;而本方案引入了“价值”的概念,对地理空间进行区分,如设立“高、中、低、无价值区域”等,可以针对各种区域数据特点进行不同的处理。In the existing technology, no distinction is made between geographical spaces, and the same data storage and processing strategies are adopted for all geographical spaces; however, this solution introduces the concept of "value" to distinguish geographical spaces, such as setting up "high, medium, low, Valueless areas", etc., can be processed differently according to the characteristics of various regional data.

进一步地,在得到上述价值级别后,可构造价值模式表。通过构造表格来存储单个电信网格内部子空间的线段集,如表五所示:Further, after obtaining the above value levels, a value pattern table can be constructed. The line segment set of the internal subspace of a single telecom grid is stored by constructing a table, as shown in Table 5:

表五Table five

该表格包含字段如下:The form contains the following fields:

“GroupID”即为单个电信网格内进一步细分的子空间的价值区域编号;"GroupID" is the value area number of the further subdivided subspace within a single telecom grid;

“Contains”为单个子空间Group所包含的编码点/线段集合;"Contains" is a collection of coded points/line segments contained in a single subspace Group;

“Data”和“Area”为单个Group包含的编码点/段对应栅格的总数据量及总面积;"Data" and "Area" are the total data volume and total area of the grid corresponding to the code point/segment contained in a single Group;

“Density”为单个Group的数据密度;"Density" is the data density of a single Group;

“ValueOrder”为各组间的价值排序。"ValueOrder" is the value order among the groups.

然后,关联MR单据(即存储后得到的结构化MR数据),填充对应的价值排序。生成对应价值模式表后,可依据全量栅格MR数据中的Z序四进制栅格编码(“CODE”),对照价值模式表中的“Contains”字段,对每一条记录追加该栅格在表中对应的价值区域的价值级别“ValueOrder”,为后续生成多尺度栅格提供分区依据,示例性如表六所示更新“ValueOrder”后全量MR单据结构:Then, associate the MR document (that is, the structured MR data obtained after storage), and fill in the corresponding value order. After generating the corresponding value mode table, according to the Z-order quaternary raster code ("CODE") in the full raster MR data, compare the "Contains" field in the value mode table, and append the raster in the value mode table to each record. The value level "ValueOrder" of the corresponding value area in the table provides a partition basis for subsequent generation of multi-scale grids. An example is the full MR document structure after updating "ValueOrder" as shown in Table 6:

表六Table six

通过该表,可清晰直观的了解到每个栅格编码对应的价值级别。Through this table, you can clearly and intuitively understand the value level corresponding to each raster code.

305、根据所述P个子空间的价值级别确定所述P个子空间中每个子空间的栅格尺寸上限;305. Determine the upper limit of the grid size of each of the P subspaces according to the value levels of the P subspaces;

其中,依据不同价值级别制定不同大小的栅格存储策略。其中栅格大小趋势可随“ValueOrder”价值级别降低而变大,随其升高而变小。Among them, grid storage strategies of different sizes are formulated according to different value levels. The grid size trend can become larger as the value level of "ValueOrder" decreases, and becomes smaller as it increases.

多尺度栅格使用策略:以使用大小不一的栅格存储MR数据为基础制定的栅格使用方案,其中多尺度栅格尺寸为50米的2n倍,如50*50、100*100、200*200…Multi - scale grid usage strategy: a grid usage plan based on the use of grids of different sizes to store MR data. 200*200…

具体策略示例如下:Examples of specific strategies are as follows:

策略1:结合价值级别分析结果,对不同级别价值区域使用不同尺寸栅格,如,高价值区域使用50*50栅格,中价值区域使用栅格尺寸上限为100*100,低价值区域使用栅格尺寸上限为200*200,无价值区域使用栅格尺寸上限为400*400等。Strategy 1: Combining with the value level analysis results, use different size grids for different levels of value areas, for example, use 50*50 grids for high-value areas, use grids with an upper limit of 100*100 for medium-value areas, and use grids for low-value areas. The upper limit of grid size is 200*200, the upper limit of grid size for useless areas is 400*400, etc.

策略2:结合价值级别分析结果,对高价值及中价值区域使用50米、100米级别的栅格,而低价值与无价值区域暂不做聚合计算等。Strategy 2: Combined with the value level analysis results, use 50-meter and 100-meter-level grids for high-value and medium-value areas, and do not perform aggregation calculations for low-value and non-value areas.

策略3:默认只计算如覆盖类等应用较多的部分特性,但提供其他非默认计算特性的按需补算机制,例如单用户类、话务类、网络性能类。Strategy 3: By default, only some features with many applications, such as coverage, are calculated, but an on-demand compensation mechanism for other non-default calculation features is provided, such as single-user, traffic, and network performance.

306、根据每个所述栅格的编码,将所述每个子空间中的栅格集合进行融合,以得到所述每个子空间的融合后的栅格,其中,每个子空间中融合后的栅格尺寸不大于该子空间的栅格尺寸上限,所述每个子空间中的栅格集合中的栅格的数量为预设数值,且所述栅格集合中的栅格的编码仅最后一位不相同,所述栅格集合中的栅格为所述每个子空间中的栅格,和/或,所述栅格集合中的栅格为根据所述每个子空间中的栅格进行融合得到的;306. According to the encoding of each grid, fuse the set of grids in each subspace to obtain a fused grid of each subspace, where the fused grid in each subspace The grid size is not greater than the upper limit of the grid size of the subspace, the number of grids in the grid set in each subspace is a preset value, and the coding of the grids in the grid set is only the last bit Not the same, the grids in the grid set are the grids in each subspace, and/or, the grids in the grid set are obtained by fusion according to the grids in each subspace of;

如图5所示,基于四进制Z序编码,在现有50米级别栅格的基础上进行融合,每次栅格级别扩大一倍,则在现有栅格编码的基础上删掉最后一位,并对4个具有相同编号的栅格进行融合。重复此过程直至融合而成的栅格达到策略指定的最大尺度,且无栅格可继续融合。融合过程如表七所示融合尺度上限为200*200:As shown in Figure 5, based on the quaternary Z-sequence coding, fusion is performed on the basis of the existing 50-meter-level grid, and each time the grid level is doubled, the last grid is deleted based on the existing grid coding. One bit, and fuse 4 rasters with the same number. This process is repeated until the fused rasters reach the maximum size specified by the policy and there are no more rasters to continue merging. The fusion process is shown in Table 7. The upper limit of the fusion scale is 200*200:

表七Table seven

307、根据所述融合后的栅格的编码存储所述全量MR数据,所述融合后的栅格的编码为将所述栅格集合中的栅格的编码中不相同的最后一位删除得到的。307. Store the full amount of MR data according to the encoding of the fused grid, the encoding of the fused grid is obtained by deleting the last bit that is different from the encoding of the grids in the grid set of.

多尺度栅格生成之后,则更新最终价值模式,例如:图5中原来的50米级栅格价值模式变换至最大200米级栅格模式后,栅格数量可缩减至40%左右,价值模式表中“Contains”更新为02,030,032,2,300,302,320。基于此更新价值模式表的“Contains”字段及根据“CODE”映射关系整合MR单据。基于该整合,则可以有效节省存储空间。After the multi-scale grid is generated, the final value model is updated. For example, after the original 50-meter-level grid value model in Figure 5 is converted to the maximum 200-meter-level grid model, the number of grids can be reduced to about 40%. The value model "Contains" in the table is updated to 02,030,032, 2,300,302,320. Based on this update the "Contains" field of the value model table and integrate the MR documents according to the "CODE" mapping relationship. Based on this integration, storage space can be effectively saved.

在该实施例的基础上,其中,在同一电信网格下,单个子空间的数据量即“Data”会随着时间变化,因此可应用时间序列异常检测判定是否需要更新现有价值模式。On the basis of this embodiment, where under the same telecommunications grid, the amount of data in a single subspace, namely "Data", will change with time, so time series anomaly detection can be applied to determine whether the existing value model needs to be updated.

通过检测当前价值模式是否适用于新数据,避免重复计算增加资源开销,保障业务响应速度,且同时保证业务精确度。By detecting whether the current value model is applicable to new data, it avoids repeated calculations that increase resource overhead, ensures business response speed, and at the same time ensures business accuracy.

具体地,结合3-sigma原则及滑窗法,对历史数据趋势综合分析,在时域空间上对价值模式的复用性进行检测,若各个子空间未发现异常值则直接复用历史价值级别。例如,若一个值远离平均值的n倍标准差,则判定其为异常值。Specifically, combined with the 3-sigma principle and the sliding window method, the trend of historical data is comprehensively analyzed, and the reusability of the value model is detected in the time domain space. If no abnormal value is found in each subspace, the historical value level is directly reused. . For example, a value is considered an outlier if it is n times the standard deviation away from the mean.

若发现异常情况,对于未处理过的新数据,保持价值模式中“GroupID”,“Contains”,“Area”不变,随“Data”更新“Density”与“ValueOrder”,以确定新的价值排序,该更新的目的是兼容相同区域时间维度上不同的价值级别。If an abnormal situation is found, for new unprocessed data, keep "GroupID", "Contains", and "Area" in the value mode unchanged, and update "Density" and "ValueOrder" with "Data" to determine the new value order , the purpose of this update is to be compatible with different value levels on the time dimension of the same region.

上述异常,可以是任一子空间中的MR数据变化量超出预设变化量,则根据该子空间中变化后的MR数据,更新该子空间的价值级别。The above abnormality may be that the change amount of MR data in any subspace exceeds the preset change amount, and the value level of the subspace is updated according to the changed MR data in the subspace.

采用该手段,基于不同价值级别,将不同子空间的多个原来的栅格分别进行了合并,使得栅格数量减少,在存储全量MR数据时占用的存储空间较少,相较于现有技术中采用数量极多的等大栅格存储MR数据,本发明采用融合后的栅格存储数据,在不丢失数据本身信息、存储全量数据的前提下,栅格总数有所减少,节省了存储空间。Using this method, multiple original grids of different subspaces are merged based on different value levels, so that the number of grids is reduced, and less storage space is occupied when storing the full amount of MR data. Compared with the existing technology A very large number of equal-sized grids are used to store MR data. The present invention uses fused grids to store data. On the premise of not losing the information of the data itself and storing the full amount of data, the total number of grids is reduced, saving storage space .

需要说明的是,本申请实施例以网络数据为MR数据为例进行说明,其他无线网络数据如通话记录报告(call history report,CHR)数据等也可以适用本方案。本方案对此不做具体限定。It should be noted that the embodiment of the present application takes network data as MR data as an example for illustration, and other wireless network data such as call history report (call history report, CHR) data may also be applicable to this solution. This plan does not specifically limit this.

参照图6所示,基于上述网络数据存储方法实施例的描述,本发明实施例还公开了一种网络数据存储装置,参考图6,图6是本发明实施例提供的一种网络数据存储装置的结构示意图,所述网络数据存储装置包括处理模块601、确定模块602、融合模块603和存储模块604;其中:Referring to Figure 6, based on the description of the above-mentioned embodiment of the network data storage method, the embodiment of the present invention also discloses a network data storage device, referring to Figure 6, Figure 6 is a network data storage device provided by the embodiment of the present invention A schematic structural diagram of the network data storage device including a processing module 601, a determination module 602, a fusion module 603 and a storage module 604; wherein:

处理模块601,用于获取多个栅格对应的全量MR数据,并根据所述全量MR数据将所述多个栅格划分为P个子空间,P为不小于1的整数;A processing module 601, configured to acquire full MR data corresponding to multiple grids, and divide the multiple grids into P subspaces according to the full MR data, where P is an integer not less than 1;

确定模块602,用于根据所述P个子空间的全量MR数据,确定所述P个子空间的价值级别;A determining module 602, configured to determine the value levels of the P subspaces according to the full amount of MR data of the P subspaces;

融合模块603,用于根据所述P个子空间的价值级别对所述P个子空间中的栅格进行融合处理,以得到所述每个子空间的融合后的栅格;A fusion module 603, configured to perform fusion processing on the grids in the P subspaces according to the value levels of the P subspaces, so as to obtain the fused grids of each subspace;

存储模块604,用于根据所述融合后的栅格存储所述全量MR数据。A storage module 604, configured to store the full amount of MR data according to the fused grid.

其中,所述处理模块601,用于执行以下步骤:Wherein, the processing module 601 is configured to perform the following steps:

S1、根据每个所述栅格内的MR数据的数据量,确定每个所述栅格的数据密度;S1. Determine the data density of each grid according to the amount of MR data in each grid;

S2、根据每个所述栅格的数据密度确定所述多个栅格中任意两个栅格之间的不相似度值;S2. Determine the dissimilarity value between any two grids in the plurality of grids according to the data density of each grid;

S3、将所述多个栅格中任意两个栅格之间不相似度值最小的两个栅格记为同组栅格;S3. Record the two grids with the smallest dissimilarity value between any two grids among the plurality of grids as the same group of grids;

S4、计算包含所述同组栅格的所述多个栅格中任意两个栅格之间的不相似度值;S4. Calculating a dissimilarity value between any two grids in the plurality of grids including the same group of grids;

S5、重复执行步骤S3-S4,直到所述多个栅格中的每个栅格均为同组栅格中的栅格,得到P个同组栅格,所述P个子空间与P个同组栅格对应,所述P个子空间中每个子空间包含的栅格的数量不小于第一预设值。S5. Steps S3-S4 are repeatedly executed until each of the multiple grids is a grid in the same group of grids, and P grids of the same group are obtained, and the P subspaces are the same as the P subspaces of the same group. Corresponding to a group of grids, the number of grids contained in each of the P subspaces is not less than a first preset value.

进一步,所述处理模块601,还用于:Further, the processing module 601 is also used for:

若存在子空间A,其中所述子空间A包含的栅格的数量小于所述第一预设值,则从所述P个子空间中获取子空间B,所述子空间B与所述子空间A的不相似度值差值小于第二预设值;If there is a subspace A, wherein the number of grids contained in the subspace A is less than the first preset value, obtain a subspace B from the P subspaces, and the subspace B and the subspace The difference of the dissimilarity value of A is smaller than the second preset value;

将所述子空间A合并至所述子空间B中,以更新所述子空间B。The subspace A is merged into the subspace B to update the subspace B.

可选的,所述确定模块602,用于:Optionally, the determining module 602 is configured to:

根据所述P个子空间中每个子空间所包含的栅格的数据密度和每个子空间包含的栅格的数量确定所述P个子空间中每个子空间的数据密度平均值;Determine the average data density of each of the P subspaces according to the data density of the grids contained in each of the P subspaces and the number of grids contained in each subspace;

根据所述P个子空间中每个子空间的数据密度平均值得到所述P个子空间的价值级别,其中,所述数据密度平均值越大,所述子空间的价值级别越高。The value level of the P subspaces is obtained according to the average data density of each of the P subspaces, wherein the greater the average data density, the higher the value level of the subspace.

其中,所述装置还包括编码模块,用于:Wherein, the device also includes an encoding module for:

根据每个所述栅格的栅格中心点经纬度得到每个所述栅格的编码;Obtain the code of each grid according to the latitude and longitude of the grid center point of each grid;

所述融合模块603,用于:The fusion module 603 is configured to:

根据所述P个子空间的价值级别确定所述P个子空间中每个子空间的栅格尺寸上限;determining the upper limit of the grid size of each subspace in the P subspaces according to the value level of the P subspaces;

根据每个所述栅格的编码,将所述每个子空间中的栅格集合进行融合,以得到所述每个子空间的融合后的栅格,其中,每个子空间中融合后的栅格尺寸不大于该子空间的栅格尺寸上限,所述每个子空间中的栅格集合中的栅格的数量为预设数值,且所述栅格集合中的栅格的编码仅最后一位不相同,所述栅格集合中的栅格为所述每个子空间中的栅格,和/或,所述栅格集合中的栅格为根据所述每个子空间中的栅格进行融合得到的;According to the encoding of each grid, the grid set in each subspace is fused to obtain the fused grid of each subspace, wherein the size of the fused grid in each subspace Not greater than the upper limit of the grid size of the subspace, the number of grids in the grid set in each subspace is a preset value, and the codes of the grids in the grid set are only different in the last bit , the grids in the grid set are the grids in each subspace, and/or, the grids in the grid set are fused according to the grids in each subspace;

所述存储模块604,用于:The storage module 604 is configured to:

根据所述融合后的栅格的编码存储所述全量MR数据,所述融合后的栅格的编码为将所述栅格集合中的栅格的编码中不相同的最后一位删除得到的。The full amount of MR data is stored according to the coding of the fused grid, which is obtained by deleting the last bit that is different from the coding of the grids in the grid set.

进一步地,所述装置还包括更新模块,用于:Further, the device also includes an update module, configured to:

当所述P个子空间中的任一子空间C中的MR数据变化量超出预设变化量,根据所述子空间C中变化后的MR数据,更新所述子空间C的价值级别。When the variation of MR data in any subspace C of the P subspaces exceeds a preset variation, the value level of the subspace C is updated according to the changed MR data in the subspace C.

本申请实施例,通过将多个栅格划分为P个子空间,并根据所述P个子空间的全量MR数据确定该P个子空间的价值级别,进而根据该P个子空间的价值级别对P个子空间中的栅格进行融合处理,然后根据融合后的栅格存储全量MR数据。采用该手段,基于不同价值级别,将不同子空间的多个原来的栅格分别进行了合并,使得栅格数量减少,在存储全量MR数据时占用的存储空间较少,相较于现有技术中采用数量极多的等大栅格存储MR数据,本发明采用融合后的栅格存储数据,在不丢失数据本身信息、存储全量数据的前提下,栅格总数有所减少,节省了存储空间,降低了数据库中存储开销以及提升了查询性能。In this embodiment of the present application, multiple grids are divided into P subspaces, and the value levels of the P subspaces are determined according to the full MR data of the P subspaces, and then the P subspaces are evaluated according to the value levels of the P subspaces. The raster in is fused, and then the full amount of MR data is stored according to the fused raster. Using this method, multiple original grids of different subspaces are merged based on different value levels, so that the number of grids is reduced, and less storage space is occupied when storing the full amount of MR data. Compared with the existing technology A very large number of equal-sized grids are used to store MR data. The present invention uses fused grids to store data. On the premise of not losing the information of the data itself and storing the full amount of data, the total number of grids is reduced, saving storage space , reducing storage overhead in the database and improving query performance.

另一方面,本方案将全量MR数据由传统的等大栅格存储转换成多尺度栅格存储,还可以解决现有技术中由于数据采样引起的无法完成实时网络分析的缺陷。本方案采用全量MR数据进行分析,无需积累多天数据来满足分析标准,较短时间范围内的数据量即可满足分析要求,通过与历史分析数据做对比,直观体现网络变化的影响,为网规网优调节提供效果验证。On the other hand, this solution converts the full amount of MR data from traditional equal-sized grid storage to multi-scale grid storage, and can also solve the defect that real-time network analysis cannot be completed due to data sampling in the prior art. This solution uses the full amount of MR data for analysis. It does not need to accumulate multiple days of data to meet the analysis standards. The amount of data in a short period of time can meet the analysis requirements. By comparing with historical analysis data, it can intuitively reflect the impact of network changes. Network planning optimization and adjustment provide effect verification.

值得指出的是,其中,网络数据存储装置的具体功能实现方式可以参见上述网络数据存储方法的描述,这里不再进行赘述。网络数据存储装置中的各个单元或模块可以分别或全部合并为一个或若干个另外的单元或模块来构成,或者其中的某个(些)单元或模块还可以再拆分为功能上更小的多个单元或模块来构成,这可以实现同样的操作,而不影响本发明的实施例的技术效果的实现。上述单元或模块是基于逻辑功能划分的,在实际应用中,一个单元(或模块)的功能也可以由多个单元(或模块)来实现,或者多个单元(或模块)的功能由一个单元(或模块)实现。It is worth pointing out that, for the specific function implementation manner of the network data storage device, reference may be made to the description of the above network data storage method, which will not be repeated here. Each unit or module in the network data storage device can be separately or all combined into one or several other units or modules to form, or one (some) units or modules can be split into functionally smaller ones. Multiple units or modules can achieve the same operation without affecting the realization of the technical effects of the embodiments of the present invention. The above-mentioned units or modules are divided based on logical functions. In practical applications, the functions of one unit (or module) can also be realized by multiple units (or modules), or the functions of multiple units (or modules) can be realized by one unit (or module) implementation.

基于上述方法实施例以及装置实施例的描述,本发明实施例还提供一种网络数据存储装置。Based on the descriptions of the foregoing method embodiments and device embodiments, embodiments of the present invention further provide a network data storage device.

请参见图7,是本发明实施例提供的一种网络数据存储装置的结构示意图。图7所示的网络数据存储装置700(该装置700具体可以是一种计算机设备)包括存储器701、处理器702、通信接口703以及总线704。其中,存储器701、处理器702、通信接口703通过总线704实现彼此之间的通信连接。Please refer to FIG. 7 , which is a schematic structural diagram of a network data storage device provided by an embodiment of the present invention. The network data storage device 700 shown in FIG. 7 (the device 700 may specifically be a computer device) includes a memory 701 , a processor 702 , a communication interface 703 and a bus 704 . Wherein, the memory 701 , the processor 702 , and the communication interface 703 are connected to each other through a bus 704 .

存储器701可以是只读存储器(Read Only Memory,ROM),静态存储设备,动态存储设备或者随机存取存储器(Random Access Memory,RAM)。The memory 701 may be a read only memory (Read Only Memory, ROM), a static storage device, a dynamic storage device or a random access memory (Random Access Memory, RAM).

存储器701可以存储程序,当存储器701中存储的程序被处理器702执行时,处理器702和通信接口703用于执行本申请实施例的网络数据存储方法的各个步骤。The memory 701 may store programs, and when the programs stored in the memory 701 are executed by the processor 702, the processor 702 and the communication interface 703 are used to execute various steps of the network data storage method of the embodiment of the present application.

处理器702可以采用通用的中央处理器(Central Processing Unit,CPU),微处理器,应用专用集成电路(Application Specific Integrated Circuit,ASIC),图形处理器(graphics processing unit,GPU)或者一个或多个集成电路,用于执行相关程序,以实现本申请实施例的网络数据存储装置中的单元所需执行的功能,或者执行本申请方法实施例的网络数据存储方法。The processor 702 may be a general-purpose central processing unit (Central Processing Unit, CPU), a microprocessor, an application specific integrated circuit (Application Specific Integrated Circuit, ASIC), a graphics processing unit (graphics processing unit, GPU) or one or more The integrated circuit is used to execute related programs to realize the functions required by the units in the network data storage device of the embodiment of the present application, or to execute the network data storage method of the method embodiment of the present application.

处理器702还可以是一种集成电路芯片,具有信号的处理能力。在实现过程中,本申请的网络数据存储方法的各个步骤可以通过处理器702中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器702还可以是通用处理器、数字信号处理器(DigitalSignal Processing,DSP)、专用集成电路(ASIC)、现成可编程门阵列(Field ProgrammableGate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。可以实现或者执行本申请实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。结合本申请实施例所公开的方法的步骤可以直接体现为硬件译码处理器执行完成,或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于随机存储器,闪存、只读存储器,可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质中。该存储介质位于存储器701,处理器702读取存储器701中的信息,结合其硬件完成本申请实施例的网络数据存储装置中包括的单元所需执行的功能,或者执行本申请方法实施例的网络数据存储方法。The processor 702 may also be an integrated circuit chip, which has a signal processing capability. During implementation, each step of the network data storage method of the present application may be completed by an integrated logic circuit of hardware in the processor 702 or instructions in the form of software. The above-mentioned processor 702 may also be a general-purpose processor, a digital signal processor (Digital Signal Processing, DSP), an application-specific integrated circuit (ASIC), an off-the-shelf programmable gate array (Field Programmable Gate Array, FPGA) or other programmable logic devices, discrete Gate or transistor logic devices, discrete hardware components. Various methods, steps, and logic block diagrams disclosed in the embodiments of the present application may be implemented or executed. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor, or the like. The steps of the method disclosed in connection with the embodiments of the present application may be directly implemented by a hardware decoding processor, or implemented by a combination of hardware and software modules in the decoding processor. The software module can be located in a mature storage medium in the field such as random access memory, flash memory, read-only memory, programmable read-only memory or electrically erasable programmable memory, register. The storage medium is located in the memory 701, and the processor 702 reads the information in the memory 701, and combines its hardware to complete the functions required by the units included in the network data storage device of the embodiment of the application, or execute the network data storage device of the method embodiment of the application. data storage method.

通信接口703使用例如但不限于收发器一类的收发装置,来实现装置700与其他设备或通信网络之间的通信。例如,可以通过通信接口703获取数据。The communication interface 703 implements communication between the apparatus 700 and other devices or communication networks by using a transceiver device such as but not limited to a transceiver. For example, data can be acquired through the communication interface 703 .

总线704可包括在装置700各个部件(例如,存储器701、处理器702、通信接口703)之间传送信息的通路。The bus 704 may include pathways for transferring information between various components of the device 700 (eg, memory 701 , processor 702 , communication interface 703 ).

应注意,尽管图7所示的装置700仅仅示出了存储器、处理器、通信接口,但是在具体实现过程中,本领域的技术人员应当理解,装置700还包括实现正常运行所必须的其他器件。同时,根据具体需要,本领域的技术人员应当理解,装置700还可包括实现其他附加功能的硬件器件。此外,本领域的技术人员应当理解,装置700也可仅仅包括实现本申请实施例所必须的器件,而不必包括图7中所示的全部器件。It should be noted that although the device 700 shown in FIG. 7 only shows a memory, a processor, and a communication interface, in the specific implementation process, those skilled in the art should understand that the device 700 also includes other devices necessary for normal operation . Meanwhile, according to specific needs, those skilled in the art should understand that the apparatus 700 may also include hardware devices for implementing other additional functions. In addition, those skilled in the art should understand that the device 700 may only include components necessary to realize the embodiment of the present application, and does not necessarily include all the components shown in FIG. 7 .

本申请实施例还提供了一种芯片系统,所述芯片系统应用于电子设备;所述芯片系统包括一个或多个接口电路,以及一个或多个处理器;所述接口电路和所述处理器通过线路互联;所述接口电路用于从所述电子设备的存储器接收信号,并向所述处理器发送所述信号,所述信号包括所述存储器中存储的计算机指令;当所述处理器执行所述计算机指令时,所述电子设备执行所述的网络数据存储方法。The embodiment of the present application also provides a chip system, the chip system is applied to electronic equipment; the chip system includes one or more interface circuits, and one or more processors; the interface circuit and the processor Interconnected by wires; the interface circuit is used to receive signals from the memory of the electronic device and send the signals to the processor, the signals include computer instructions stored in the memory; when the processor executes When the computer instructs, the electronic device executes the network data storage method.

本申请实施例还提供了一种计算机可读存储介质,该计算机可读存储介质中存储有指令,当其在计算机或处理器上运行时,使得计算机或处理器执行上述任一个方法中的一个或多个步骤。The embodiment of the present application also provides a computer-readable storage medium, the computer-readable storage medium stores instructions, and when it is run on a computer or a processor, the computer or the processor executes one of the above-mentioned methods or multiple steps.

本申请实施例还提供了一种包含指令的计算机程序产品。当该计算机程序产品在计算机或处理器上运行时,使得计算机或处理器执行上述任一个方法中的一个或多个步骤。The embodiment of the present application also provides a computer program product including instructions. When the computer program product is run on the computer or the processor, the computer or the processor is made to perform one or more steps in any one of the above methods.

所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装置和单元的具体工作过程,可以参考前述方法实施例中的对应步骤过程的具体描述,在此不再赘述。Those skilled in the art can clearly understand that for the convenience and brevity of the description, the specific working process of the above-described system, device and unit can refer to the specific description of the corresponding steps in the foregoing method embodiments, which will not be repeated here. repeat.

应理解,在本申请的描述中,除非另有说明,“/”表示前后关联的对象是一种“或”的关系,例如,A/B可以表示A或B;其中A,B可以是单数或者复数。并且,在本申请的描述中,除非另有说明,“多个”是指两个或多于两个。“以下至少一项(个)”或其类似表达,是指的这些项中的任意组合,包括单项(个)或复数项(个)的任意组合。例如,a,b,或c中的至少一项(个),可以表示:a,b,c,a-b,a-c,b-c,或a-b-c,其中a,b,c可以是单个,也可以是多个。另外,为了便于清楚描述本申请实施例的技术方案,在本申请的实施例中,采用了“第一”、“第二”等字样对功能和作用基本相同的相同项或相似项进行区分。本领域技术人员可以理解“第一”、“第二”等字样并不对数量和执行次序进行限定,并且“第一”、“第二”等字样也并不限定一定不同。同时,在本申请实施例中,“示例性的”或者“例如”等词用于表示作例子、例证或说明。本申请实施例中被描述为“示例性的”或者“例如”的任何实施例或设计方案不应被解释为比其它实施例或设计方案更优选或更具优势。确切而言,使用“示例性的”或者“例如”等词旨在以具体方式呈现相关概念,便于理解。It should be understood that in the description of this application, unless otherwise specified, "/" means that the objects associated with each other are an "or" relationship, for example, A/B can mean A or B; where A and B can be singular or plural. And, in the description of the present application, unless otherwise specified, "plurality" means two or more than two. "At least one of the following" or similar expressions refer to any combination of these items, including any combination of single or plural items. For example, at least one item (piece) of a, b, or c can represent: a, b, c, a-b, a-c, b-c, or a-b-c, where a, b, c can be single or multiple . In addition, in order to clearly describe the technical solutions of the embodiments of the present application, in the embodiments of the present application, words such as "first" and "second" are used to distinguish the same or similar items with basically the same function and effect. Those skilled in the art can understand that words such as "first" and "second" do not limit the quantity and execution order, and words such as "first" and "second" do not necessarily limit the difference. Meanwhile, in the embodiments of the present application, words such as "exemplary" or "for example" are used as examples, illustrations or illustrations. Any embodiment or design scheme described as "exemplary" or "for example" in the embodiments of the present application shall not be interpreted as being more preferred or more advantageous than other embodiments or design schemes. To be precise, the use of words such as "exemplary" or "such as" is intended to present related concepts in a concrete manner for easy understanding.

在本申请所提供的几个实施例中,应该理解到,所揭露的系统、装置和方法,可以通过其它的方式实现。例如,该单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如,多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。所显示或讨论的相互之间的耦合、或直接耦合、或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed systems, devices and methods may be implemented in other ways. For example, the division of this unit is only a logical function division, and there may be other division methods in actual implementation, for example, multiple units or components can be combined or integrated into another system, or some features can be ignored, or not implement. The mutual coupling, or direct coupling, or communication connection shown or discussed may be through some interfaces, and the indirect coupling or communication connection of devices or units may be in electrical, mechanical or other forms.

作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。A unit described as a separate component may or may not be physically separated, and a component displayed as a unit may or may not be a physical unit, that is, it may be located in one place, or may be distributed to multiple network units. Part or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.

在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。该计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行该计算机程序指令时,全部或部分地产生按照本申请实施例的流程或功能。该计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。该计算机指令可以存储在计算机可读存储介质中,或者通过该计算机可读存储介质进行传输。该计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(digital subscriber line,DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。该计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。该可用介质可以是只读存储器(read-onlymemory,ROM),或随机存取存储器(random access memory,RAM),或磁性介质,例如,软盘、硬盘、磁带、磁碟、或光介质,例如,数字通用光盘(digital versatile disc,DVD)、或者半导体介质,例如,固态硬盘(solid state disk,SSD)等。In the above embodiments, all or part of them may be implemented by software, hardware, firmware or any combination thereof. When implemented using software, it may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on the computer, the processes or functions according to the embodiments of the present application will be generated in whole or in part. The computer can be a general purpose computer, a special purpose computer, a computer network, or other programmable device. The computer instructions may be stored in or transmitted over a computer-readable storage medium. The computer instructions can be sent from one website site, computer, server, or data center to another via wired (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, wireless, microwave, etc.) A website site, computer, server or data center for transmission. The computer-readable storage medium may be any available medium that can be accessed by a computer, or a data storage device such as a server or a data center integrated with one or more available media. The available medium may be a read-only memory (read-only memory, ROM), or a random access memory (random access memory, RAM), or a magnetic medium, for example, a floppy disk, a hard disk, a magnetic tape, a magnetic disk, or an optical medium, for example, A digital versatile disc (digital versatile disc, DVD), or a semiconductor medium, for example, a solid state disk (solid state disk, SSD).

以上所述,仅为本申请实施例的具体实施方式,但本申请实施例的保护范围并不局限于此,任何在本申请实施例揭露的技术范围内的变化或替换,都应涵盖在本申请实施例的保护范围之内。因此,本申请实施例的保护范围应以所述权利要求的保护范围为准。The above is only the specific implementation of the embodiment of the application, but the protection scope of the embodiment of the application is not limited thereto, and any changes or replacements within the technical scope disclosed in the embodiment of the application shall be covered by this application. Within the scope of protection of the application examples. Therefore, the protection scope of the embodiments of the present application should be based on the protection scope of the claims.

Claims (15)

1.一种网络数据存储方法,其特征在于,包括:1. A network data storage method, characterized in that, comprising: 获取多个栅格对应的全量MR数据,并根据所述全量MR数据将所述多个栅格划分为P个子空间,P为不小于1的整数;Acquiring full MR data corresponding to multiple grids, and dividing the multiple grids into P subspaces according to the full MR data, where P is an integer not less than 1; 根据所述P个子空间的全量MR数据,确定所述P个子空间的价值级别;Determine the value level of the P subspaces according to the full amount of MR data in the P subspaces; 根据所述P个子空间的价值级别对所述P个子空间中的栅格进行融合处理,以得到所述每个子空间的融合后的栅格;performing fusion processing on the grids in the P subspaces according to the value levels of the P subspaces, so as to obtain the fused grids of each subspace; 根据所述融合后的栅格存储所述全量MR数据。The full amount of MR data is stored according to the fused grid. 2.根据权利要求1所述的方法,其特征在于,所述根据所述全量MR数据将所述多个栅格划分为P个子空间,包括:2. The method according to claim 1, wherein said dividing said plurality of grids into P subspaces according to said full amount of MR data comprises: S1、根据每个所述栅格内的MR数据的数据量,确定每个所述栅格的数据密度;S1. Determine the data density of each grid according to the amount of MR data in each grid; S2、根据每个所述栅格的数据密度确定所述多个栅格中任意两个栅格之间的不相似度值;S2. Determine the dissimilarity value between any two grids in the plurality of grids according to the data density of each grid; S3、将所述多个栅格中任意两个栅格之间不相似度值最小的两个栅格记为同组栅格;S3. Record the two grids with the smallest dissimilarity value between any two grids among the plurality of grids as the same group of grids; S4、计算包含所述同组栅格的所述多个栅格中任意两个栅格之间的不相似度值;S4. Calculating a dissimilarity value between any two grids in the plurality of grids including the same group of grids; S5、重复执行步骤S3-S4,直到所述多个栅格中的每个栅格均为同组栅格中的栅格,得到P个同组栅格,所述P个子空间与P个同组栅格对应,所述P个子空间中每个子空间包含的栅格的数量不小于第一预设值。S5. Steps S3-S4 are repeatedly executed until each of the multiple grids is a grid in the same group of grids, and P grids of the same group are obtained, and the P subspaces are the same as the P subspaces of the same group. Corresponding to a group of grids, the number of grids contained in each of the P subspaces is not less than a first preset value. 3.根据权利要求2所述的方法,其特征在于,所述方法还包括:3. The method according to claim 2, wherein the method further comprises: 若存在子空间A,其中所述子空间A包含的栅格的数量小于所述第一预设值,则从所述P个子空间中获取子空间B,所述子空间B与所述子空间A的不相似度值差值小于第二预设值;If there is a subspace A, wherein the number of grids contained in the subspace A is less than the first preset value, obtain a subspace B from the P subspaces, and the subspace B and the subspace The difference of the dissimilarity value of A is smaller than the second preset value; 将所述子空间A合并至所述子空间B中,以更新所述子空间B。The subspace A is merged into the subspace B to update the subspace B. 4.根据权利要求2或3所述的方法,其特征在于,所述根据所述P个子空间的全量MR数据,确定所述P个子空间的价值级别,包括:4. The method according to claim 2 or 3, wherein the determination of the value levels of the P subspaces according to the full amount of MR data of the P subspaces includes: 根据所述P个子空间中每个子空间所包含的栅格的数据密度和每个子空间包含的栅格的数量确定所述P个子空间中每个子空间的数据密度平均值;Determine the average data density of each of the P subspaces according to the data density of the grids contained in each of the P subspaces and the number of grids contained in each subspace; 根据所述P个子空间中每个子空间的数据密度平均值得到所述P个子空间的价值级别,其中,所述数据密度平均值越大,所述子空间的价值级别越高。The value level of the P subspaces is obtained according to the average data density of each of the P subspaces, wherein the greater the average data density, the higher the value level of the subspace. 5.根据权利要求4所述的方法,其特征在于,所述方法还包括:5. method according to claim 4, is characterized in that, described method also comprises: 根据每个所述栅格的栅格中心点经纬度得到每个所述栅格的编码;Obtain the code of each grid according to the latitude and longitude of the grid center point of each grid; 所述根据所述P个子空间的价值级别对所述P个子空间中的栅格进行融合处理,以得到所述每个子空间的融合后的栅格,包括:The step of merging the grids in the P subspaces according to the value levels of the P subspaces to obtain the fused grids of each subspace includes: 根据所述P个子空间的价值级别确定所述P个子空间中每个子空间的栅格尺寸上限;determining the upper limit of the grid size of each subspace in the P subspaces according to the value level of the P subspaces; 根据每个所述栅格的编码,将所述每个子空间中的栅格集合进行融合,以得到所述每个子空间的融合后的栅格,其中,每个子空间中融合后的栅格尺寸不大于该子空间的栅格尺寸上限,所述每个子空间中的栅格集合中的栅格的数量为预设数值,且所述栅格集合中的栅格的编码仅最后一位不相同,所述栅格集合中的栅格为所述每个子空间中的栅格,和/或,所述栅格集合中的栅格为根据所述每个子空间中的栅格进行融合得到的;According to the encoding of each grid, the grid set in each subspace is fused to obtain the fused grid of each subspace, wherein the size of the fused grid in each subspace Not greater than the upper limit of the grid size of the subspace, the number of grids in the grid set in each subspace is a preset value, and the codes of the grids in the grid set are only different in the last bit , the grids in the grid set are the grids in each subspace, and/or, the grids in the grid set are fused according to the grids in each subspace; 所述根据所述融合后的栅格存储所述全量MR数据,包括:The storing the full amount of MR data according to the fused grid includes: 根据所述融合后的栅格的编码存储所述全量MR数据,所述融合后的栅格的编码为将所述栅格集合中的栅格的编码中不相同的最后一位删除得到的。The full amount of MR data is stored according to the coding of the fused grid, which is obtained by deleting the last bit that is different from the coding of the grids in the grid set. 6.根据权利要求1至5任一项所述的方法,其特征在于,所述方法还包括:6. The method according to any one of claims 1 to 5, characterized in that the method further comprises: 当所述P个子空间中的任一子空间C中的MR数据变化量超出预设变化量,根据所述子空间C中变化后的MR数据,更新所述子空间C的价值级别。When the variation of MR data in any subspace C of the P subspaces exceeds a preset variation, the value level of the subspace C is updated according to the changed MR data in the subspace C. 7.一种网络数据存储装置,其特征在于,包括:7. A network data storage device, characterized in that, comprising: 处理模块,用于获取多个栅格对应的全量MR数据,并根据所述全量MR数据将所述多个栅格划分为P个子空间,P为不小于1的整数;A processing module, configured to obtain full MR data corresponding to multiple grids, and divide the multiple grids into P subspaces according to the full MR data, where P is an integer not less than 1; 确定模块,用于根据所述P个子空间的全量MR数据,确定所述P个子空间的价值级别;A determining module, configured to determine the value level of the P subspaces according to the full amount of MR data of the P subspaces; 融合模块,用于根据所述P个子空间的价值级别对所述P个子空间中的栅格进行融合处理,以得到所述每个子空间的融合后的栅格;a fusion module, configured to perform fusion processing on the grids in the P subspaces according to the value levels of the P subspaces, so as to obtain the fused grids of each of the subspaces; 存储模块,用于根据所述融合后的栅格存储所述全量MR数据。A storage module, configured to store the full amount of MR data according to the fused grid. 8.根据权利要求7所述的装置,其特征在于,所述处理模块,用于执行以下步骤:8. The device according to claim 7, wherein the processing module is configured to perform the following steps: S1、根据每个所述栅格内的MR数据的数据量,确定每个所述栅格的数据密度;S1. Determine the data density of each grid according to the amount of MR data in each grid; S2、根据每个所述栅格的数据密度确定所述多个栅格中任意两个栅格之间的不相似度值;S2. Determine the dissimilarity value between any two grids in the plurality of grids according to the data density of each grid; S3、将所述多个栅格中任意两个栅格之间不相似度值最小的两个栅格记为同组栅格;S3. Record the two grids with the smallest dissimilarity value between any two grids among the plurality of grids as the same group of grids; S4、计算包含所述同组栅格的所述多个栅格中任意两个栅格之间的不相似度值;S4. Calculating a dissimilarity value between any two grids in the plurality of grids including the same group of grids; S5、重复执行步骤S3-S4,直到所述多个栅格中的每个栅格均为同组栅格中的栅格,得到P个同组栅格,所述P个子空间与P个同组栅格对应,所述P个子空间中每个子空间包含的栅格的数量不小于第一预设值。S5. Steps S3-S4 are repeatedly executed until each of the multiple grids is a grid in the same group of grids, and P grids of the same group are obtained, and the P subspaces are the same as the P subspaces of the same group. Corresponding to a group of grids, the number of grids contained in each of the P subspaces is not less than a first preset value. 9.根据权利要求8所述的装置,其特征在于,所述处理模块,还用于:9. The device according to claim 8, wherein the processing module is further used for: 若存在子空间A,其中所述子空间A包含的栅格的数量小于所述第一预设值,则从所述P个子空间中获取子空间B,所述子空间B与所述子空间A的不相似度值差值小于第二预设值;If there is a subspace A, wherein the number of grids contained in the subspace A is less than the first preset value, obtain a subspace B from the P subspaces, and the subspace B and the subspace The difference of the dissimilarity value of A is smaller than the second preset value; 将所述子空间A合并至所述子空间B中,以更新所述子空间B。The subspace A is merged into the subspace B to update the subspace B. 10.根据权利要求8或9所述的装置,其特征在于,所述确定模块,用于:10. The device according to claim 8 or 9, wherein the determining module is configured to: 根据所述P个子空间中每个子空间所包含的栅格的数据密度和每个子空间包含的栅格的数量确定所述P个子空间中每个子空间的数据密度平均值;Determine the average data density of each of the P subspaces according to the data density of the grids contained in each of the P subspaces and the number of grids contained in each subspace; 根据所述P个子空间中每个子空间的数据密度平均值得到所述P个子空间的价值级别,其中,所述数据密度平均值越大,所述子空间的价值级别越高。The value level of the P subspaces is obtained according to the average data density of each of the P subspaces, wherein the greater the average data density, the higher the value level of the subspace. 11.根据权利要求10所述的装置,其特征在于,所述装置还包括编码模块,用于:11. The device according to claim 10, wherein the device further comprises an encoding module, configured to: 根据每个所述栅格的栅格中心点经纬度得到每个所述栅格的编码;Obtain the code of each grid according to the latitude and longitude of the grid center point of each grid; 所述融合模块,用于:The fusion module is used for: 根据所述P个子空间的价值级别确定所述P个子空间中每个子空间的栅格尺寸上限;determining the upper limit of the grid size of each subspace in the P subspaces according to the value level of the P subspaces; 根据每个所述栅格的编码,将所述每个子空间中的栅格集合进行融合,以得到所述每个子空间的融合后的栅格,其中,每个子空间中融合后的栅格尺寸不大于该子空间的栅格尺寸上限,所述每个子空间中的栅格集合中的栅格的数量为预设数值,且所述栅格集合中的栅格的编码仅最后一位不相同,所述栅格集合中的栅格为所述每个子空间中的栅格,和/或,所述栅格集合中的栅格为根据所述每个子空间中的栅格进行融合得到的;According to the encoding of each grid, the grid set in each subspace is fused to obtain the fused grid of each subspace, wherein the size of the fused grid in each subspace Not greater than the upper limit of the grid size of the subspace, the number of grids in the grid set in each subspace is a preset value, and the codes of the grids in the grid set are only different in the last bit , the grids in the grid set are the grids in each subspace, and/or, the grids in the grid set are fused according to the grids in each subspace; 所述存储模块,用于:The storage module is used for: 根据所述融合后的栅格的编码存储所述全量MR数据,所述融合后的栅格的编码为将所述栅格集合中的栅格的编码中不相同的最后一位删除得到的。The full amount of MR data is stored according to the coding of the fused grid, which is obtained by deleting the last bit that is different from the coding of the grids in the grid set. 12.根据权利要求7至11任一项所述的装置,其特征在于,所述装置还包括更新模块,用于:12. The device according to any one of claims 7 to 11, wherein the device further comprises an update module, configured to: 当所述P个子空间中的任一子空间C中的MR数据变化量超出预设变化量,根据所述子空间C中变化后的MR数据,更新所述子空间C的价值级别。When the variation of MR data in any subspace C of the P subspaces exceeds a preset variation, the value level of the subspace C is updated according to the changed MR data in the subspace C. 13.一种网络数据存储装置,其特征在于,包括处理器和存储器;其中,所述存储器用于存储程序代码,所述处理器用于调用所述程序代码,以执行如权利要求1至6任意一项所述的方法。13. A network data storage device, characterized by comprising a processor and a memory; wherein, the memory is used to store program codes, and the processor is used to call the program codes to execute any of claims 1 to 6 one of the methods described. 14.一种计算机可读存储介质,其特征在于,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行以实现权利要求1至6任意一项所述的方法。14. A computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and the computer program is executed by a processor to implement the method according to any one of claims 1 to 6. 15.一种计算机程序产品,其特征在于,当计算机程序产品在计算机上运行时,使得所述计算机执行如权利要求1至6任意一项所述的方法。15. A computer program product, characterized in that, when the computer program product is run on a computer, the computer is made to execute the method according to any one of claims 1 to 6.
CN202210093050.3A 2022-01-26 2022-01-26 Network data storage method, related system, and storage medium Pending CN116540921A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202210093050.3A CN116540921A (en) 2022-01-26 2022-01-26 Network data storage method, related system, and storage medium
PCT/CN2022/114359 WO2023142466A1 (en) 2022-01-26 2022-08-23 Network data storage method, related system, and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210093050.3A CN116540921A (en) 2022-01-26 2022-01-26 Network data storage method, related system, and storage medium

Publications (1)

Publication Number Publication Date
CN116540921A true CN116540921A (en) 2023-08-04

Family

ID=87449327

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210093050.3A Pending CN116540921A (en) 2022-01-26 2022-01-26 Network data storage method, related system, and storage medium

Country Status (2)

Country Link
CN (1) CN116540921A (en)
WO (1) WO2023142466A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001098925A2 (en) * 2000-06-20 2001-12-27 Globexplorer, Inc. Method, system and computer program product for delivering spatially referenced information in a global computer network
US20110055212A1 (en) * 2009-09-01 2011-03-03 Cheng-Fa Tsai Density-based data clustering method
CN107992555A (en) * 2017-11-28 2018-05-04 鲁东大学 A kind of storage of raster data and read method
CN110677859A (en) * 2018-07-03 2020-01-10 中国电信股份有限公司 Method and device for determining weak coverage area and computer readable storage medium
CN113873541A (en) * 2020-06-30 2021-12-31 中兴通讯股份有限公司 Logic grid obtaining and wireless strategy determining method, device, equipment and medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001098925A2 (en) * 2000-06-20 2001-12-27 Globexplorer, Inc. Method, system and computer program product for delivering spatially referenced information in a global computer network
US20110055212A1 (en) * 2009-09-01 2011-03-03 Cheng-Fa Tsai Density-based data clustering method
CN107992555A (en) * 2017-11-28 2018-05-04 鲁东大学 A kind of storage of raster data and read method
CN110677859A (en) * 2018-07-03 2020-01-10 中国电信股份有限公司 Method and device for determining weak coverage area and computer readable storage medium
CN113873541A (en) * 2020-06-30 2021-12-31 中兴通讯股份有限公司 Logic grid obtaining and wireless strategy determining method, device, equipment and medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
刘明亮, 唐先明, 庄大方, 刘纪远: "基于融合技术的结构性栅格时空数据平台的构建", 地球信息科学, no. 4, 30 December 2003 (2003-12-30), pages 64 - 67 *

Also Published As

Publication number Publication date
WO2023142466A1 (en) 2023-08-03

Similar Documents

Publication Publication Date Title
CN109992633B (en) User position-based geo-fence determination method and device and electronic equipment
CN108574933A (en) User track recovery method and device
US20220101350A1 (en) Information pushing method and apparatus
KR102655903B1 (en) Processing method and processing device for saving energy in a base station
CN110636445B (en) WIFI-based indoor positioning method, device, equipment and medium
WO2020215783A1 (en) Locating method and apparatus, and storage medium
CN106326923B (en) A check-in location data clustering method considering location repetition and density peaks
CN115270003B (en) An information recommendation method and system based on behavior data mining of Internet of Things platform
CN108650684A (en) A kind of correlation rule determines method and device
CN110298687B (en) Regional attraction assessment method and device
CN112867147B (en) Positioning method and positioning device
CN114430530B (en) Spatial division method, apparatus, apparatus, medium and program product
Amirkhanyan et al. Real-time clustering of massive geodata for online maps to improve visual analysis
CN106921978B (en) Position distribution determination method and device
CN111506682A (en) Map data processing method, data query device and electronic equipment
WO2023184732A1 (en) Genome assembly method and apparatus, and device and storage medium
CN115203340A (en) Method, device, equipment and storage medium for determining companion relationship
CN116540921A (en) Network data storage method, related system, and storage medium
CN113076451B (en) Abnormal behavior identification and risk model library establishment method, device and electronic equipment
CN108460424B (en) Spatial motion trajectory identification method based on line point conversion and cluster comparison
CN116303548A (en) Data blood-edge full-link analysis method and system based on graph technology
CN114885348A (en) Quality difference cell classification method and device
CN111814328A (en) A method for modeling and querying spatiotemporal data cubes with flight zones
CN116680468B (en) Fingerprint database generation method and electronic device
CN117319929A (en) Method, device, equipment and storage medium for determining user&#39;s stay location

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination