[go: up one dir, main page]

WO2018165960A1 - Hash value-based e-commerce data processing system and method - Google Patents

Hash value-based e-commerce data processing system and method Download PDF

Info

Publication number
WO2018165960A1
WO2018165960A1 PCT/CN2017/077007 CN2017077007W WO2018165960A1 WO 2018165960 A1 WO2018165960 A1 WO 2018165960A1 CN 2017077007 W CN2017077007 W CN 2017077007W WO 2018165960 A1 WO2018165960 A1 WO 2018165960A1
Authority
WO
WIPO (PCT)
Prior art keywords
hash
commerce
commerce data
hash value
association list
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2017/077007
Other languages
French (fr)
Chinese (zh)
Inventor
李楚斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Xiustyle Brand Communication Co Ltd
Original Assignee
Shenzhen Xiustyle Brand Communication Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Xiustyle Brand Communication Co Ltd filed Critical Shenzhen Xiustyle Brand Communication Co Ltd
Priority to PCT/CN2017/077007 priority Critical patent/WO2018165960A1/en
Publication of WO2018165960A1 publication Critical patent/WO2018165960A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor

Definitions

  • the present invention relates to the field of data processing technologies, and in particular, to a hash value based e-commerce data processing system and method.
  • Data mining is the process of mining effective knowledge from large amounts of data stored in databases, data warehouses, or other information repositories. Data mining extracts implicit, valuable, and understandable information from large amounts of data to guide people's activities. Data mining techniques mainly include association rules, classification rules, cluster analysis and sequence patterns.
  • mobile terminals for example, mobile terminals such as mobile phones, tablet computers, etc.
  • mobile terminals for example, mobile terminals such as mobile phones, tablet computers, etc.
  • the e-commerce data thus resulting in redundancy of e-commerce data on the e-commerce platform e-commerce platform and reducing the access speed of the e-commerce platform website.
  • the main object of the present invention is to provide a hash value based e-commerce data processing system and method, aiming at solving the technical problem of cleaning redundant e-commerce data on an e-commerce platform.
  • the present invention provides a hash value-based e-commerce data processing system, which runs in an e-commerce platform, and the e-commerce platform is communicably connected to a terminal through a network, and the system includes:
  • a calculation module configured to calculate a hash value of each group of e-commerce data on the e-commerce platform
  • a search module configured to find the same hash value from the calculated hash value, and group the same hash value into a group, and the same hash value in each group and the same hash in the group Storage path of e-commerce data corresponding to the Greek value The path is saved in a hash association list;
  • a deleting module configured to select a storage path in each hash association list, and delete the e-commerce data corresponding to the unselected storage path in the hash association list;
  • a modification module configured to modify, in each hash association list, an identifier corresponding to the deleted e-commerce data
  • a receiving module configured to receive a file query instruction sent by the terminal
  • a determining module configured to determine whether a storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list
  • a sending module configured to: when the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list, the e-commerce corresponding to the selected storage path in the corresponding hash association list The data is sent to the terminal.
  • the hash association list includes four fields, and each of the four fields is a data name, a hash value, a storage path, and an identifier.
  • the calculation module calculates a hash value of each group of e-commerce data on the e-commerce platform as follows: traverses e-commerce data on the e-commerce platform, and calculates each group of e-commerce by using a hash algorithm The hash value of the data.
  • the calculation module calculates a hash value of each set of e-commerce data on the e-commerce platform by using a hash algorithm, where the hash algorithm is a SHA256 algorithm.
  • the determining module determines whether the storage path in the file query instruction has a corresponding hash association list as follows: parsing the file query instruction and obtaining a storage path of the e-commerce data to be queried, The storage path of the e-commerce data to be queried is compared with each hash association list in the e-commerce platform. If a hash association list in the e-commerce platform includes a storage path of the e-commerce data to be queried, The hash association list is a hash association list corresponding to the storage path of the e-commerce data to be queried.
  • the present invention further provides a method for processing an e-commerce data based on a hash value, which is applied to an e-commerce platform, wherein the e-commerce platform is communicatively connected to the terminal through a network, and the method includes the following steps:
  • selecting a storage path in each hash association list, and deleting the hash association list is not selected E-commerce data corresponding to the storage path;
  • the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list
  • the e-commerce data corresponding to the selected storage path in the corresponding hash association list is sent to the terminal.
  • the hash association list includes four fields, and each of the four fields is a data name, a hash value, a storage path, and an identifier.
  • the step of calculating a hash value of each group of e-commerce data on the e-commerce platform comprises the steps of: traversing e-commerce data on an e-commerce platform, and calculating each group of e-commerce by using a hash algorithm The hash value of the data.
  • the hash value of each group of e-commerce data on the e-commerce platform is calculated by a hash algorithm, where the hash algorithm is a SHA256 algorithm.
  • the step of determining whether the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list comprises the following steps: parsing the file query instruction and obtaining the electricity to be queried a storage path of the quotient data, comparing the storage path of the e-commerce data to be queried with each hash association list in the e-commerce platform, if there is a hash association list in the e-commerce platform, the s
  • the storage association path of the quotient data, the hash association list is a hash association list corresponding to the storage path of the quotient data to be queried.
  • the present invention adopts the above technical solution, and brings the technical effects as follows:
  • the present invention uses the hash value comparison method to find the same e-commerce data, and deletes repeated e-commerce data, thereby accelerating the access speed of the e-commerce platform. , reducing the redundancy of the e-commerce platform.
  • FIG. 1 is a schematic diagram of an application environment of a hash value-based e-commerce data processing system according to the present invention
  • FIG. 2 is a block diagram showing a preferred embodiment of a hash value based e-commerce data processing system of the present invention
  • 3 is a flow chart of a preferred embodiment of a hash value based e-commerce data processing method of the present invention
  • FIG. 4 is a schematic diagram of a hash association list of the present invention.
  • FIG. 1 is a schematic diagram of an application environment of a hash value-based e-commerce data processing system according to the present invention.
  • the hash value based e-commerce data processing system 30 of the present invention operates on the e-commerce platform 3.
  • the e-commerce platform 3 is communicatively coupled to a plurality of terminals 1 via a network 2.
  • the e-commerce platform 3 stores various types of e-commerce data (e.g., user name, product text information, merchandise type information, courier information, warehousing information, merchandise pictures, shopping records, etc.).
  • the e-commerce data stored by the e-commerce platform 3 is classified into multiple sets of data according to the time division and location, for example, the data generated every second is classified into a group of data.
  • the e-commerce platform 3 is a cluster composed of a plurality of servers.
  • the network 2 may be a wired communication network or a wireless communication network.
  • the network is preferably a wireless communication network, including but not limited to a GSM network, a GPRS network, a CDMA network, a TD-S CDMA network, a WiMAX network, a TD-LTE network, an FDD-LTE network, and the like. .
  • the terminal 1 may be, but not limited to, any other suitable portable touch screen electronic device such as a smart phone, a tablet computer, a personal digital assistant (PDA), a personal computer, an electronic signboard, and the like.
  • the terminal 1 is configured to access the e-commerce platform 3 through the network 2, and obtain e-commerce data on the e-commerce platform 3.
  • the terminal 1 is further configured to generate e-commerce data, and send the generated e-commerce data to the e-commerce platform 3 for storage.
  • FIG. 2 there is shown a block diagram of a preferred embodiment of a hash value based e-commerce data processing system of the present invention.
  • the hash value based e-commerce data processing system 30 is applied to the e-commerce platform 3.
  • the e-commerce platform 3 includes, but is not limited to, a hash value based e-commerce data processing system 30, a storage unit 31, a processing unit 32, and a communication unit 33.
  • the storage unit 31 may be a read only storage unit ROM, an electrically erasable storage unit EEPRO M, flash memory unit FLASH or solid hard disk.
  • the processing unit 32 may be a central processing unit (CPU), a microcontroller (MCU), a data processing chip, or an information processing unit having a data processing function.
  • CPU central processing unit
  • MCU microcontroller
  • data processing chip or an information processing unit having a data processing function.
  • the communication unit 33 is a communication interface with a remote communication function, for example, supports GSM, GPR.
  • the communication interface of wireless communication technologies such as S, WCDMA, CDMA, TD-SCDMA, WiMAX, TD-LTE, FDD-LTE, and the wired communication interface supporting USB and network cable.
  • the data processing system of the electricity supplier based on a hash value 30 include, but are not limited to, the calculation module 312, a searching module 313, a deleting module 314, modification module 315, a receiving module 316, a determining module 317 and a sending module 318.
  • module refers to a series of computer program instruction segments that can be executed by the processing unit 32 of the e-commerce platform 3 and that can perform fixed functions, which are stored in the storage unit of the e-commerce platform 3. 31.
  • the calculation module 312 is configured to calculate a hash value of each group of e-commerce data on the e-commerce platform 3. Specifically, the calculation module 312 traverses the e-commerce data on the e-commerce platform 3 and calculates a hash value of each set of e-commerce data through a hash algorithm.
  • the hash algorithm may be, but is not limited to, the SHA256 algorithm.
  • the searching module 313 is configured to search for the same hash value from the calculated hash values, and group the same hash values into a group, and the same hash value in each group is the same as the same in the group.
  • the storage path of the e-commerce data corresponding to the hash value is stored in a hash association list.
  • the hash association list includes a plurality of identical hash values and a storage path of each hash value corresponding to the e-commerce data. It should be noted that the hash values in the hash association list are the same.
  • the search module 313 obtains multiple sets of identical hash values and generates a hash association list for each set of identical hash values.
  • the hash association list includes at least four fields, and each of the four fields is a name of each group of data (representing the name of the e-commerce data), and a hash value (a representation of the e-commerce data).
  • the value of the storage path (representing the storage path of the e-commerce data on the e-commerce platform 3) and the identification (representing whether the e-commerce data is retained in the e-commerce platform 3, where Y is reserved and N is deleted).
  • the deleting module 314 is configured to select one storage path in each hash association list, and delete the e-commerce data corresponding to the unselected storage paths in the hash association list. In this embodiment, the deleting module 3 14 selects the storage path T1 and deletes the e-commerce data corresponding to the remaining storage paths T2 to T9 in the e-commerce platform 3.
  • the modification module 315 is configured to modify the identifier corresponding to the deleted e-commerce data in each hash association list. Specifically, as shown in FIG. 4, the modification module 315 modifies the identifier corresponding to T2 to T9 in the hash association list from Y to N.
  • the receiving module 316 is configured to receive a file query instruction sent by the terminal 1. Generally, the terminal 1 queries the e-commerce data on the e-commerce platform 3, and the file query instruction includes a storage path of the e-commerce data to be queried.
  • the determining module 317 is configured to determine whether a storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list.
  • the determining module 317 parses the file query instruction and obtains a storage path of the e-commerce data to be queried, and compares the storage path of the e-commerce data to be queried with each hash association list in the e-commerce platform 3. If there is a hash association list in the e-commerce platform 3 that includes the storage path of the e-commerce data to be queried, the hash association list is a hash association list corresponding to the storage path of the e-commerce data to be queried.
  • the sending module 318 is configured to send the e-commerce data corresponding to the selected storage path in the corresponding hash association list to the terminal 1. Specifically, as shown in FIG. 4, if the storage path of the e-commerce data to be queried is T2, the sending module 318 sends the e-commerce data A corresponding to the storage path T1 to the terminal 1.
  • FIG. 3 there is shown a flow chart of a preferred embodiment of the hash value based e-commerce data processing method of the present invention.
  • the hash value-based e-commerce data processing method is applied to the e-commerce platform 3, and the method includes the following steps:
  • Step S12 The calculation module 312 calculates a hash value of each group of e-commerce data on the e-commerce platform 3. Specifically, the calculation module 312 traverses the e-commerce data on the e-commerce platform 3, and calculates a hash value of each set of e-commerce data through a hash algorithm.
  • the hash algorithm may be, but is not limited to, the SHA256 algorithm.
  • Step S13 The searching module 313 searches for the same hash value from the calculated hash values, and groups the same hash values into a group, and sets the same hash value in each group and the group.
  • the storage path of the e-commerce data corresponding to the same hash value is stored in a hash association list.
  • the hash association list includes a plurality of identical hash values and a storage path of each hash value corresponding to the e-commerce data. It should be noted that the hash values in the hash association list are the same.
  • the search module 313 obtains multiple sets of identical hash values and generates a hash association list for each set of identical hash values.
  • the hash association list includes at least four fields, and each of the four fields is a name of each group (representing the name of the e-commerce data).
  • a hash value (representing a hash value of the e-commerce data), a storage path (representing a storage path of the e-commerce data on the e-commerce platform 3), and a logo (indicating whether the e-commerce data is retained in the e-commerce platform 3, wherein Y means reserved, N means delete).
  • Y means reserved, N means delete.
  • the letter B in FIG. 4 merely refers to the hash value, and does not mean that the hash value is B
  • the letters T1 to T9 in FIG. 4 are merely referring to the storage path, and do not indicate T1 to T9 are storage paths.
  • Step S14 The deleting module 314 selects one storage path in each hash association list, and deletes the e-commerce data corresponding to the unselected storage paths in the hash association list.
  • the deletion module 314 selects the storage path T1, and deletes the e-commerce data corresponding to the remaining storage paths ⁇ 2 to ⁇ 9 in the e-commerce platform 3.
  • Step S15 The modification module 315 modifies the identifier corresponding to the deleted e-commerce data in each hash association list. Specifically, as shown in FIG. 4, the modification module 315 modifies the identifier corresponding to ⁇ 2 to ⁇ 9 in the hash association list from ⁇ to ⁇ .
  • Step S16 The receiving module 316 receives the file query instruction sent by the terminal 1. Generally, the terminal 1 queries the e-commerce data on the e-commerce platform 3, and the query instruction includes a storage path of the e-commerce data to be queried.
  • Step S17 The determining module 317 determines whether the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list.
  • the determining module 317 parses the file query instruction and obtains a storage path of the e-commerce data to be queried, and compares the storage path of the e-commerce data to be queried with each hash association list in the e-commerce platform 3. If there is a hash association list in the e-commerce platform 3 that includes the storage path of the e-commerce data to be queried, the hash association list is a hash association list corresponding to the storage path of the e-commerce data to be queried.
  • step S18 If the storage path in the file query instruction has a corresponding hash association list, the process proceeds to step S18. Otherwise, if the storage path in the file query instruction does not have a corresponding hash association list, The e-commerce data in the e-commerce platform 3 is traversed directly through the storage path in the file query command (omitted in FIG. 3), and then the flow is ended.
  • Step S18 The sending module 318 sends the e-commerce data corresponding to the selected storage path in the corresponding hash association list to the terminal 1. Specifically, as shown in FIG. 4, if the storage path of the e-commerce data to be queried is T2, the sending module 318 sends the e-commerce data ⁇ corresponding to the storage path T1 to the terminal 1. Industrial applicability
  • the present invention uses the hash value comparison method to find the same e-commerce data, and deletes the repeated e-commerce data, accelerates the access speed of the e-commerce platform, and reduces the redundancy of the e-commerce platform.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A hash value-based e-commerce data processing system and method, the method comprising: calculating a hash value of each group of e-commerce data by means of a hash algorithm; searching for the same hash value, and storing the same hash value and the storage path of e-commerce data corresponding to the same hash value in a hash association list; modifying an identifier corresponding to deleted e-commerce data in each hash association list; receiving a file inquiry instruction sent by a terminal; and when the storage path of e-commerce data to be queried in the file query instruction has a corresponding hash association list, sending e-commerce data corresponding to the selected storage path in the corresponding hash association list to the terminal. Implementation of the described solution may increase the speed of accessing an e-commerce platform.

Description

基于哈希值的电商数据处理系统及方法 技术领域  E-commerce data processing system and method based on hash value

[0001] 本发明涉及数据处理技术领域, 尤其涉及一种基于哈希值的电商数据处理系统 及方法。  [0001] The present invention relates to the field of data processing technologies, and in particular, to a hash value based e-commerce data processing system and method.

背景技术  Background technique

[0002] 随着电子商务的快速发展, 人们越来越习惯于使用电子商务网站来购买商品。  [0002] With the rapid development of electronic commerce, people are increasingly accustomed to using e-commerce websites to purchase goods.

然而, 卖家在通过网络渠道赚取利润的同吋, 却无法通过现有的互联网信息来 进行预测以后的热销商品, 来达到赚取更多利润的目的。  However, sellers who are making profits through online channels cannot use the existing Internet information to predict the hot products afterwards, in order to achieve more profit.

[0003] 数据挖掘是指从存放在数据库、 数据仓库或其他信息库的大量数据中挖掘有效 知识的过程。 数据挖掘从大量数据中提取出隐含的、 有价值的和可理解的信息 , 以指导人们的活动。 数据挖掘技术主要有关联规则, 分类规则, 聚类分析和 序列模式等等。  [0003] Data mining is the process of mining effective knowledge from large amounts of data stored in databases, data warehouses, or other information repositories. Data mining extracts implicit, valuable, and understandable information from large amounts of data to guide people's activities. Data mining techniques mainly include association rules, classification rules, cluster analysis and sequence patterns.

[0004] 进一步地, 现在越来越多的用户都使用移动终端 (例如, 手机、 平板电脑等触 摸屏的移动终端) 査看电商数据并存储电商数据, 导致电子商务平台保存多个 相同的电商数据, 如此造成电子商务平台电商平台上电商数据的冗余且降低了 电子商务平台网站的访问速度。  [0004] Further, more and more users now use mobile terminals (for example, mobile terminals such as mobile phones, tablet computers, etc.) to view e-commerce data and store e-commerce data, resulting in e-commerce platforms storing multiple identical The e-commerce data, thus resulting in redundancy of e-commerce data on the e-commerce platform e-commerce platform and reducing the access speed of the e-commerce platform website.

技术问题  technical problem

[0005] 本发明的主要目的在于提供一种基于哈希值的电商数据处理系统及方法, 旨在 解决对电商平台上冗余的电商数据进行清理的技术问题。  [0005] The main object of the present invention is to provide a hash value based e-commerce data processing system and method, aiming at solving the technical problem of cleaning redundant e-commerce data on an e-commerce platform.

问题的解决方案  Problem solution

技术解决方案  Technical solution

[0006] 为实现上述目的, 本发明提供了一种基于哈希值的电商数据处理系统, 运行于 电商平台中, 所述电商平台通过网络与终端通信连接, 该系统包括:  [0006] In order to achieve the above object, the present invention provides a hash value-based e-commerce data processing system, which runs in an e-commerce platform, and the e-commerce platform is communicably connected to a terminal through a network, and the system includes:

[0007] 计算模块, 用于计算所述电商平台上每组电商数据的哈希值; [0007] a calculation module, configured to calculate a hash value of each group of e-commerce data on the e-commerce platform;

[0008] 査找模块, 用于从计算的哈希值中査找相同的哈希值, 并将相同的哈希值归为 一组, 将每组中相同的哈希值及该组中相同的哈希值对应的电商数据的存储路 径保存于一个哈希关联列表中; [0008] a search module, configured to find the same hash value from the calculated hash value, and group the same hash value into a group, and the same hash value in each group and the same hash in the group Storage path of e-commerce data corresponding to the Greek value The path is saved in a hash association list;

[0009] 刪除模块, 用于在每一个哈希关联列表中选定一个存储路径, 并刪除该哈希关 联列表中未选定的存储路径所对应的电商数据;  [0009] a deleting module, configured to select a storage path in each hash association list, and delete the e-commerce data corresponding to the unselected storage path in the hash association list;

[0010] 修改模块, 用于在每一个哈希关联列表修改所刪除的电商数据对应的标识; [0011] 接收模块, 用于接收终端发送过来的文件査询指令; [0010] a modification module, configured to modify, in each hash association list, an identifier corresponding to the deleted e-commerce data; [0011] a receiving module, configured to receive a file query instruction sent by the terminal;

[0012] 判断模块, 用于判断文件査询指令中待査询的电商数据的存储路径是否有对应 的哈希关联列表; 及  [0012] a determining module, configured to determine whether a storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list; and

[0013] 发送模块, 用于当文件査询指令中待査询的电商数据的存储路径有对应的哈希 关联列表, 将该对应的哈希关联列表中选定的存储路径对应的电商数据发送给 终端。  [0013] a sending module, configured to: when the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list, the e-commerce corresponding to the selected storage path in the corresponding hash association list The data is sent to the terminal.

[0014] 优选的, 所述哈希关联列表包括四个栏位, 所述四个栏位分别为每组数据名称 、 哈希值、 存储路径及标识。  [0014] Preferably, the hash association list includes four fields, and each of the four fields is a data name, a hash value, a storage path, and an identifier.

[0015] 优选的, 所述计算模块计算所述电商平台上每组电商数据的哈希值的方式如下 : 遍历电商平台上的电商数据, 并通过哈希算法计算每组电商数据的哈希值。  [0015] Preferably, the calculation module calculates a hash value of each group of e-commerce data on the e-commerce platform as follows: traverses e-commerce data on the e-commerce platform, and calculates each group of e-commerce by using a hash algorithm The hash value of the data.

[0016] 优选的, 所述计算模块通过哈希算法计算所述电商平台上每组电商数据的哈希 值, 其中, 所述哈希算法为 SHA256算法。  [0016] Preferably, the calculation module calculates a hash value of each set of e-commerce data on the e-commerce platform by using a hash algorithm, where the hash algorithm is a SHA256 algorithm.

[0017] 优选的, 所述判断模块判断文件査询指令中的存储路径是否有对应的哈希关联 列表的方式如下: 解析文件査询指令并获取待査询的电商数据的存储路径, 将 该待査询的电商数据的存储路径与电商平台中每一个哈希关联列表进行比对, 若电商平台中有一个哈希关联列表包含待査询的电商数据的存储路径, 则该哈 希关联列表为待査询的电商数据的存储路径对应的哈希关联列表。  [0017] Preferably, the determining module determines whether the storage path in the file query instruction has a corresponding hash association list as follows: parsing the file query instruction and obtaining a storage path of the e-commerce data to be queried, The storage path of the e-commerce data to be queried is compared with each hash association list in the e-commerce platform. If a hash association list in the e-commerce platform includes a storage path of the e-commerce data to be queried, The hash association list is a hash association list corresponding to the storage path of the e-commerce data to be queried.

[0018] 另一方面, 本发明还提供一种基于哈希值的电商数据处理方法, 应用于电商平 台中, 所述电商平台通过网络与终端通信连接, 该方法包括如下步骤:  [0018] In another aspect, the present invention further provides a method for processing an e-commerce data based on a hash value, which is applied to an e-commerce platform, wherein the e-commerce platform is communicatively connected to the terminal through a network, and the method includes the following steps:

[0019] 计算所述电商平台上每组电商数据的哈希值;  [0019] calculating a hash value of each group of e-commerce data on the e-commerce platform;

[0020] 从计算的哈希值中査找相同的哈希值, 并将相同的哈希值归为一组, 将每组中 相同的哈希值及该组中相同的哈希值对应的电商数据的存储路径保存于一个哈 希关联列表中;  [0020] searching for the same hash value from the calculated hash value, and grouping the same hash value into a group, and the same hash value in each group and the same hash value in the group corresponding to the electricity The storage path of the quotient data is stored in a hash association list;

[0021] 在每一个哈希关联列表中选定一个存储路径, 并刪除该哈希关联列表中未选定 的存储路径所对应的电商数据; [0021] selecting a storage path in each hash association list, and deleting the hash association list is not selected E-commerce data corresponding to the storage path;

[0022] 在每一个哈希关联列表修改所刪除的电商数据对应的标识; [0022] modifying, in each hash association list, an identifier corresponding to the deleted e-commerce data;

[0023] 接收终端发送过来的文件査询指令;  [0023] receiving a file query instruction sent by the terminal;

[0024] 判断文件査询指令中待査询的电商数据的存储路径是否有对应的哈希关联列表 ; 及  [0024] determining whether a storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list; and

[0025] 当文件査询指令中待査询的电商数据的存储路径有对应的哈希关联列表, 将该 对应的哈希关联列表中选定的存储路径对应的电商数据发送给终端。  [0025] When the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list, the e-commerce data corresponding to the selected storage path in the corresponding hash association list is sent to the terminal.

[0026] 优选的, 所述哈希关联列表包括四个栏位, 所述四个栏位分别为每组数据名称 、 哈希值、 存储路径及标识。  [0026] Preferably, the hash association list includes four fields, and each of the four fields is a data name, a hash value, a storage path, and an identifier.

[0027] 优选的, 所述计算所述电商平台上每组电商数据的哈希值的步骤包括如下步骤 : 遍历电商平台上的电商数据, 并通过哈希算法计算每组电商数据的哈希值。  [0027] Preferably, the step of calculating a hash value of each group of e-commerce data on the e-commerce platform comprises the steps of: traversing e-commerce data on an e-commerce platform, and calculating each group of e-commerce by using a hash algorithm The hash value of the data.

[0028] 优选的, 所述电商平台上每组电商数据的哈希值通过哈希算法计算, 其中, 所 述哈希算法为 SHA256算法。  [0028] Preferably, the hash value of each group of e-commerce data on the e-commerce platform is calculated by a hash algorithm, where the hash algorithm is a SHA256 algorithm.

[0029] 优选的, 所述判断文件査询指令中待査询的电商数据的存储路径是否有对应的 哈希关联列表的步骤包括如下步骤: 解析文件査询指令并获取待査询的电商数 据的存储路径, 将该待査询的电商数据的存储路径与电商平台中每一个哈希关 联列表进行比对, 若电商平台中有一个哈希关联列表包含待査询的电商数据的 存储路径, 则该哈希关联列表为待査询的电商数据的存储路径对应的哈希关联 列表。  [0029] Preferably, the step of determining whether the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list comprises the following steps: parsing the file query instruction and obtaining the electricity to be queried a storage path of the quotient data, comparing the storage path of the e-commerce data to be queried with each hash association list in the e-commerce platform, if there is a hash association list in the e-commerce platform, the s The storage association path of the quotient data, the hash association list is a hash association list corresponding to the storage path of the quotient data to be queried.

发明的有益效果  Advantageous effects of the invention

有益效果  Beneficial effect

[0030] 本发明采用上述技术方案, 带来的技术效果为: 本发明采用哈希值比对的方式 寻找相同的电商数据, 并刪除重复的电商数据, 加快了电商平台的访问速度, 降低了电商平台的冗余。  [0030] The present invention adopts the above technical solution, and brings the technical effects as follows: The present invention uses the hash value comparison method to find the same e-commerce data, and deletes repeated e-commerce data, thereby accelerating the access speed of the e-commerce platform. , reducing the redundancy of the e-commerce platform.

对附图的简要说明  Brief description of the drawing

附图说明  DRAWINGS

[0031] 图 1是本发明基于哈希值的电商数据处理系统的应用环境示意图;  1 is a schematic diagram of an application environment of a hash value-based e-commerce data processing system according to the present invention;

[0032] 图 2是本发明基于哈希值的电商数据处理系统的优选实施例的模块示意图; [0033] 图 3是本发明基于哈希值的电商数据处理方法的优选实施例的流程图; 2 is a block diagram showing a preferred embodiment of a hash value based e-commerce data processing system of the present invention; 3 is a flow chart of a preferred embodiment of a hash value based e-commerce data processing method of the present invention;

[0034] 图 4是本发明哈希关联列表的示意图。 4 is a schematic diagram of a hash association list of the present invention.

实施该发明的最佳实施例  BEST MODE FOR CARRYING OUT THE INVENTION

本发明的最佳实施方式  BEST MODE FOR CARRYING OUT THE INVENTION

[0035] 为更进一步阐述本发明为达成预定发明目的所采取的技术手段及功效, 以下结 合附图及较佳实施例, 对本发明的具体实施方式、 结构、 特征及其功效, 详细 说明如下。 应当理解, 此处所描述的具体实施例仅仅用以解释本发明, 并不用 于限定本发明。 The specific embodiments, structures, features and functions of the present invention are described in detail below with reference to the accompanying drawings and preferred embodiments. It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

[0036] 参照图 1所示, 图 1是本发明基于哈希值的电商数据处理系统的应用环境示意图 。 本发明中的基于哈希值的电商数据处理系统 30运行于电商平台 3上。 所述电商 平台 3通过网络 2与多个终端 1通信连接。 所述电商平台 3上存储有各种类型的电 商数据 (例如, 用户名、 商品文字信息、 商品种类信息、 快递信息、 仓储信息 、 商品图片、 购物记录等等) 。 需要说明的是, 所述电商平台 3存储的电商数据 是按照吋间及地点等分类, 分为多组数据, 例如, 每一秒生成的数据归为一组 数据。 所述电商平台 3为多个服务器组成的集群。  Referring to FIG. 1, FIG. 1 is a schematic diagram of an application environment of a hash value-based e-commerce data processing system according to the present invention. The hash value based e-commerce data processing system 30 of the present invention operates on the e-commerce platform 3. The e-commerce platform 3 is communicatively coupled to a plurality of terminals 1 via a network 2. The e-commerce platform 3 stores various types of e-commerce data (e.g., user name, product text information, merchandise type information, courier information, warehousing information, merchandise pictures, shopping records, etc.). It should be noted that the e-commerce data stored by the e-commerce platform 3 is classified into multiple sets of data according to the time division and location, for example, the data generated every second is classified into a group of data. The e-commerce platform 3 is a cluster composed of a plurality of servers.

[0037] 在本实施例中, 所述网络 2可以是有线通讯网络或无线通讯网络。 所述网络优 选为无线通讯网络, 包括但不限于, GSM网络、 GPRS网络、 CDMA网络、 TD-S CDMA网络、 WiMAX网络、 TD-LTE网络、 FDD-LTE网络等无线传输网络。 。  [0037] In this embodiment, the network 2 may be a wired communication network or a wireless communication network. The network is preferably a wireless communication network, including but not limited to a GSM network, a GPRS network, a CDMA network, a TD-S CDMA network, a WiMAX network, a TD-LTE network, an FDD-LTE network, and the like. .

[0038] 所述终端 1可以是, 但不限于, 智能手机、 平板电脑、 个人数字助理 (Personal Digital Assistant, PDA) 、 个人电脑、 电子看板等其它任意合适的便携式触摸屏 电子设备。 所述终端 1用于通过网络 2访问所述电商平台 3, 并获取电商平台 3上 的电商数据。 所述终端 1还用于产生电商数据, 并将所述产生的电商数据发送至 电商平台 3保存。  [0038] The terminal 1 may be, but not limited to, any other suitable portable touch screen electronic device such as a smart phone, a tablet computer, a personal digital assistant (PDA), a personal computer, an electronic signboard, and the like. The terminal 1 is configured to access the e-commerce platform 3 through the network 2, and obtain e-commerce data on the e-commerce platform 3. The terminal 1 is further configured to generate e-commerce data, and send the generated e-commerce data to the e-commerce platform 3 for storage.

[0039] 参照图 2所示, 是本发明基于哈希值的电商数据处理系统的优选实施例的模块 示意图。 在本实施例中, 所述基于哈希值的电商数据处理系统 30应用于电商平 台 3。 该电商平台 3包括, 但不仅限于, 基于哈希值的电商数据处理系统 30、 存 储单元 31、 处理单元 32、 及通讯单元 33。  Referring to FIG. 2, there is shown a block diagram of a preferred embodiment of a hash value based e-commerce data processing system of the present invention. In the present embodiment, the hash value based e-commerce data processing system 30 is applied to the e-commerce platform 3. The e-commerce platform 3 includes, but is not limited to, a hash value based e-commerce data processing system 30, a storage unit 31, a processing unit 32, and a communication unit 33.

[0040] 所述的存储单元 31可以为一种只读存储单元 ROM, 电可擦写存储单元 EEPRO M、 快闪存储单元 FLASH或固体硬盘等。 [0040] The storage unit 31 may be a read only storage unit ROM, an electrically erasable storage unit EEPRO M, flash memory unit FLASH or solid hard disk.

[0041] 所述的处理单元 32可以为一种中央处理器 (Central Processing Unit, CPU) 、 微控制器 (MCU) 、 数据处理芯片、 或者具有数据处理功能的信息处理单元。 [0041] The processing unit 32 may be a central processing unit (CPU), a microcontroller (MCU), a data processing chip, or an information processing unit having a data processing function.

[0042] 所述的通讯单元 33为一种具有远程通讯功能的通讯接口, 例如支持 GSM、 GPR[0042] The communication unit 33 is a communication interface with a remote communication function, for example, supports GSM, GPR.

S、 WCDMA、 CDMA、 TD-SCDMA、 WiMAX、 TD-LTE、 FDD-LTE等无线通讯 技术的通讯接口, 支持 USB、 网线的有线通讯接口。 The communication interface of wireless communication technologies such as S, WCDMA, CDMA, TD-SCDMA, WiMAX, TD-LTE, FDD-LTE, and the wired communication interface supporting USB and network cable.

[0043] 所述基于哈希值的电商数据处理系统 30包括, 但不局限于, 计算模块 312、 査 找模块 313、 刪除模块 314、 修改模块 315、 接收模块 316、 判断模块 317及发送模 块 318, 本发明所称的模块是指一种能够被所述电商平台 3的处理单元 32执行并 且能够完成固定功能的一系列计算机程序指令段, 其存储在所述电商平台 3的存 储单元 31中。 [0043] The data processing system of the electricity supplier based on a hash value 30 include, but are not limited to, the calculation module 312, a searching module 313, a deleting module 314, modification module 315, a receiving module 316, a determining module 317 and a sending module 318. The term "module" as used in the present invention refers to a series of computer program instruction segments that can be executed by the processing unit 32 of the e-commerce platform 3 and that can perform fixed functions, which are stored in the storage unit of the e-commerce platform 3. 31.

[0044] 所述计算模块 312用于计算电商平台 3上每组电商数据的哈希值。 具体地说, 所 述计算模块 312遍历电商平台 3上的电商数据, 并通过哈希算法计算每组电商数 据的哈希值。 所述哈希算法可以是, 但不限于, SHA256算法。  [0044] The calculation module 312 is configured to calculate a hash value of each group of e-commerce data on the e-commerce platform 3. Specifically, the calculation module 312 traverses the e-commerce data on the e-commerce platform 3 and calculates a hash value of each set of e-commerce data through a hash algorithm. The hash algorithm may be, but is not limited to, the SHA256 algorithm.

[0045] 所述査找模块 313用于从计算的哈希值中査找相同的哈希值, 并将相同的哈希 值归为一组, 将每组中相同的哈希值及该组中相同的哈希值对应的电商数据的 存储路径保存于一个哈希关联列表中。 一般而言, 由于哈希值的唯一性, 若査 找到相同的哈希值, 表明电商平台 3中存在相同的电商数据。 在本实施例中, 所 述哈希关联列表包括多个相同的哈希值及每个哈希值对应电商数据的存储路径 。 需要说明的是, 所述哈希关联列表中的哈希值都是相同的。 所述电商平台 3中 有多组相同的电商数据, 则所述査找模块 313获得多组相同的哈希值并对每组相 同的哈希值生成一个哈希关联列表。 如图四所示, 所述哈希关联列表至少包括 四个栏位, 所述四个栏位分别为每组数据名称 (表示电商数据的名称) 、 哈希 值 (表示电商数据的哈希值) 、 存储路径 (表示电商数据在电商平台 3上的存储 路径) 及标识 (表示电商数据是否保留于电商平台 3中, 其中, Y表示保留, N表 示刪除) 。 进一步地, 为了方便起见, 图 4中的字母 B仅仅是代指哈希值, 并不 表示哈希值为 B, 而图 4中的字母 T1至 T9也仅仅是代指存储路径, 并不表示 T1至 T9为存储路径。 [0046] 所述刪除模块 314用于在每一个哈希关联列表中选定一个存储路径, 并刪除该 哈希关联列表中未选定的存储路径所对应的电商数据。 在本实施例中, 所述刪 除模块 314选定存储路径 T 1, 并在电商平台 3中刪除其余存储路径为 T2至 T9对应 的电商数据。 [0045] The searching module 313 is configured to search for the same hash value from the calculated hash values, and group the same hash values into a group, and the same hash value in each group is the same as the same in the group. The storage path of the e-commerce data corresponding to the hash value is stored in a hash association list. In general, due to the uniqueness of the hash value, if the same hash value is found, it indicates that the same e-commerce data exists in the e-commerce platform 3. In this embodiment, the hash association list includes a plurality of identical hash values and a storage path of each hash value corresponding to the e-commerce data. It should be noted that the hash values in the hash association list are the same. If there are multiple sets of identical e-commerce data in the e-commerce platform 3, the search module 313 obtains multiple sets of identical hash values and generates a hash association list for each set of identical hash values. As shown in FIG. 4, the hash association list includes at least four fields, and each of the four fields is a name of each group of data (representing the name of the e-commerce data), and a hash value (a representation of the e-commerce data). The value of the storage path (representing the storage path of the e-commerce data on the e-commerce platform 3) and the identification (representing whether the e-commerce data is retained in the e-commerce platform 3, where Y is reserved and N is deleted). Further, for the sake of convenience, the letter B in FIG. 4 merely refers to the hash value, and does not mean that the hash value is B, and the letters T1 to T9 in FIG. 4 are merely referring to the storage path, and do not indicate T1 to T9 are storage paths. [0046] The deleting module 314 is configured to select one storage path in each hash association list, and delete the e-commerce data corresponding to the unselected storage paths in the hash association list. In this embodiment, the deleting module 3 14 selects the storage path T1 and deletes the e-commerce data corresponding to the remaining storage paths T2 to T9 in the e-commerce platform 3.

[0047] 所述修改模块 315用于在每一个哈希关联列表修改所刪除的电商数据对应的标 识。 具体地说, 如图 4所示, 所述修改模块 315将哈希关联列表中 T2至 T9对应的 标识从 Y修改为 N。  [0047] The modification module 315 is configured to modify the identifier corresponding to the deleted e-commerce data in each hash association list. Specifically, as shown in FIG. 4, the modification module 315 modifies the identifier corresponding to T2 to T9 in the hash association list from Y to N.

[0048] 所述接收模块 316用于接收终端 1发送过来的文件査询指令。 一般而言, 所述终 端 1査询电商平台 3上的电商数据吋, 文件査询指令中包括待査询的电商数据的 存储路径。  [0048] The receiving module 316 is configured to receive a file query instruction sent by the terminal 1. Generally, the terminal 1 queries the e-commerce data on the e-commerce platform 3, and the file query instruction includes a storage path of the e-commerce data to be queried.

[0049] 所述判断模块 317用于判断文件査询指令中待査询的电商数据的存储路径是否 有对应的哈希关联列表。 所述判断模块 317解析文件査询指令并获取待査询的电 商数据的存储路径, 将该待査询的电商数据的存储路径与电商平台 3中每一个哈 希关联列表进行比对, 若电商平台 3中有一个哈希关联列表包含待査询的电商数 据的存储路径, 则该哈希关联列表为待査询的电商数据的存储路径对应的哈希 关联列表。  [0049] The determining module 317 is configured to determine whether a storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list. The determining module 317 parses the file query instruction and obtains a storage path of the e-commerce data to be queried, and compares the storage path of the e-commerce data to be queried with each hash association list in the e-commerce platform 3. If there is a hash association list in the e-commerce platform 3 that includes the storage path of the e-commerce data to be queried, the hash association list is a hash association list corresponding to the storage path of the e-commerce data to be queried.

[0050] 所述发送模块 318用于将该对应的哈希关联列表中选定的存储路径对应的电商 数据发送给终端 1。 具体而言, 如图 4所示, 若待査询的电商数据的存储路径为 T 2, 则所述发送模块 318将存储路径为 T1对应的电商数据 A发送给终端 1。  [0050] The sending module 318 is configured to send the e-commerce data corresponding to the selected storage path in the corresponding hash association list to the terminal 1. Specifically, as shown in FIG. 4, if the storage path of the e-commerce data to be queried is T2, the sending module 318 sends the e-commerce data A corresponding to the storage path T1 to the terminal 1.

[0051] 参照图 3所示, 是本发明基于哈希值的电商数据处理方法的优选实施例的流程 图。 在本实施例中, 所述的基于哈希值的电商数据处理方法应用于电商平台 3, 该方法包括以下步骤:  Referring to FIG. 3, there is shown a flow chart of a preferred embodiment of the hash value based e-commerce data processing method of the present invention. In this embodiment, the hash value-based e-commerce data processing method is applied to the e-commerce platform 3, and the method includes the following steps:

[0052] 步骤 S12: 所述计算模块 312计算电商平台 3上每组电商数据的哈希值。 具体地 说, 所述计算模块 312遍历电商平台 3上的电商数据, 并通过哈希算法计算每组 电商数据的哈希值。 所述哈希算法可以是, 但不限于, SHA256算法。  [0052] Step S12: The calculation module 312 calculates a hash value of each group of e-commerce data on the e-commerce platform 3. Specifically, the calculation module 312 traverses the e-commerce data on the e-commerce platform 3, and calculates a hash value of each set of e-commerce data through a hash algorithm. The hash algorithm may be, but is not limited to, the SHA256 algorithm.

[0053] 步骤 S13: 所述査找模块 313从计算的哈希值中査找相同的哈希值, 并将相同的 哈希值归为一组, 将每组中相同的哈希值及该组中相同的哈希值对应的电商数 据的存储路径保存于一个哈希关联列表中。 一般而言, 由于哈希值的唯一性, 若査找到相同的哈希值, 表明电商平台 3中存在相同的电商数据。 在本实施例中 , 所述哈希关联列表包括多个相同的哈希值及每个哈希值对应电商数据的存储 路径。 需要说明的是, 所述哈希关联列表中的哈希值都是相同的。 所述电商平 台 3中有多组相同的电商数据, 则所述査找模块 313获得多组相同的哈希值并对 每组相同的哈希值生成一个哈希关联列表。 如图四所示, 所述哈希关联列表至 少包括四个栏位, 所述四个栏位分别为每组数据名称 (表示电商数据的名称)[0053] Step S13: The searching module 313 searches for the same hash value from the calculated hash values, and groups the same hash values into a group, and sets the same hash value in each group and the group. The storage path of the e-commerce data corresponding to the same hash value is stored in a hash association list. In general, due to the uniqueness of the hash value, If the same hash value is found, it indicates that the same e-commerce data exists in the e-commerce platform 3. In this embodiment, the hash association list includes a plurality of identical hash values and a storage path of each hash value corresponding to the e-commerce data. It should be noted that the hash values in the hash association list are the same. If there are multiple sets of identical e-commerce data in the e-commerce platform 3, the search module 313 obtains multiple sets of identical hash values and generates a hash association list for each set of identical hash values. As shown in FIG. 4, the hash association list includes at least four fields, and each of the four fields is a name of each group (representing the name of the e-commerce data).

、 哈希值 (表示电商数据的哈希值) 、 存储路径 (表示电商数据在电商平台 3上 的存储路径) 及标识 (表示电商数据是否保留于电商平台 3中, 其中, Y表示保 留, N表示刪除) 。 进一步地, 为了方便起见, 图 4中的字母 B仅仅是代指哈希值 , 并不表示哈希值为 B, 而图 4中的字母 T1至 T9也仅仅是代指存储路径, 并不表 示 T1至 T9为存储路径。 a hash value (representing a hash value of the e-commerce data), a storage path (representing a storage path of the e-commerce data on the e-commerce platform 3), and a logo (indicating whether the e-commerce data is retained in the e-commerce platform 3, wherein Y means reserved, N means delete). Further, for the sake of convenience, the letter B in FIG. 4 merely refers to the hash value, and does not mean that the hash value is B, and the letters T1 to T9 in FIG. 4 are merely referring to the storage path, and do not indicate T1 to T9 are storage paths.

[0054] 步骤 S14: 所述刪除模块 314在每一个哈希关联列表中选定一个存储路径, 并刪 除该哈希关联列表中未选定的存储路径所对应的电商数据。 在本实施例中, 所 述刪除模块 314选定存储路径 Tl, 并在电商平台 3中刪除其余存储路径为 Τ2至 Τ9 对应的电商数据。  [0054] Step S14: The deleting module 314 selects one storage path in each hash association list, and deletes the e-commerce data corresponding to the unselected storage paths in the hash association list. In this embodiment, the deletion module 314 selects the storage path T1, and deletes the e-commerce data corresponding to the remaining storage paths Τ2 to Τ9 in the e-commerce platform 3.

[0055] 步骤 S15: 所述修改模块 315在每一个哈希关联列表修改所刪除的电商数据对应 的标识。 具体地说, 如图 4所示, 所述修改模块 315将哈希关联列表中 Τ2至 Τ9对 应的标识从 Υ修改为 Ν。  [0055] Step S15: The modification module 315 modifies the identifier corresponding to the deleted e-commerce data in each hash association list. Specifically, as shown in FIG. 4, the modification module 315 modifies the identifier corresponding to Τ2 to Τ9 in the hash association list from Υ to Ν.

[0056] 步骤 S16: 所述接收模块 316接收终端 1发送过来的文件査询指令。 一般而言, 所述终端 1査询电商平台 3上的电商数据吋, 査询指令中包括待査询的电商数据 的存储路径。 [0056] Step S16: The receiving module 316 receives the file query instruction sent by the terminal 1. Generally, the terminal 1 queries the e-commerce data on the e-commerce platform 3, and the query instruction includes a storage path of the e-commerce data to be queried.

[0057] 步骤 S17: 所述判断模块 317判断文件査询指令中待査询的电商数据的存储路径 是否有对应的哈希关联列表。 所述判断模块 317解析文件査询指令并获取待査询 的电商数据的存储路径, 将该待査询的电商数据的存储路径与电商平台 3中每一 个哈希关联列表进行比对, 若电商平台 3中有一个哈希关联列表包含待査询的电 商数据的存储路径, 则该哈希关联列表为待査询的电商数据的存储路径对应的 哈希关联列表。 若文件査询指令中的存储路径有对应的哈希关联列表, 流程进 入步骤 S18, 否则, 若文件査询指令中的存储路径没有对应的哈希关联列表, 则 直接通过文件査询指令中的存储路径遍历电商平台 3中的电商数据 (附图 3中进 行了省略) , 之后结束流程。 [0057] Step S17: The determining module 317 determines whether the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list. The determining module 317 parses the file query instruction and obtains a storage path of the e-commerce data to be queried, and compares the storage path of the e-commerce data to be queried with each hash association list in the e-commerce platform 3. If there is a hash association list in the e-commerce platform 3 that includes the storage path of the e-commerce data to be queried, the hash association list is a hash association list corresponding to the storage path of the e-commerce data to be queried. If the storage path in the file query instruction has a corresponding hash association list, the process proceeds to step S18. Otherwise, if the storage path in the file query instruction does not have a corresponding hash association list, The e-commerce data in the e-commerce platform 3 is traversed directly through the storage path in the file query command (omitted in FIG. 3), and then the flow is ended.

[0058] 步骤 S18: 所述发送模块 318将该对应的哈希关联列表中选定的存储路径对应的 电商数据发送给终端 1。 具体而言, 如图 4所示, 若待査询的电商数据的存储路 径为 T2, 则所述发送模块 318将存储路径为 T1对应的电商数据 Α发送给终端 1。 工业实用性 [0058] Step S18: The sending module 318 sends the e-commerce data corresponding to the selected storage path in the corresponding hash association list to the terminal 1. Specifically, as shown in FIG. 4, if the storage path of the e-commerce data to be queried is T2, the sending module 318 sends the e-commerce data 存储 corresponding to the storage path T1 to the terminal 1. Industrial applicability

[0059] 本发明采用哈希值比对的方式寻找相同的电商数据, 并刪除重复的电商数据, 加快了电商平台的访问速度, 降低了电商平台的冗余。  [0059] The present invention uses the hash value comparison method to find the same e-commerce data, and deletes the repeated e-commerce data, accelerates the access speed of the e-commerce platform, and reduces the redundancy of the e-commerce platform.

Claims

权利要求书 Claim 一种基于哈希值的电商数据处理系统, 运行于电商平台中, 其特征在 于, 所述电商平台通过网络与终端通信连接, 该系统包括: 计算模块, 用于计算所述电商平台上每组电商数据的哈希值; 査找模块, 用于从计算的哈希值中査找相同的哈希值, 并将相同的哈 希值归为一组, 将每组中相同的哈希值及该组中相同的哈希值对应的 电商数据的存储路径保存于一个哈希关联列表中; An e-commerce data processing system based on a hash value, running in an e-commerce platform, wherein the e-commerce platform is connected to a terminal through a network, the system comprising: a calculation module, configured to calculate the e-commerce A hash value of each set of e-commerce data on the platform; a lookup module, configured to find the same hash value from the calculated hash value, and group the same hash values into a group, the same ha in each group The storage path of the e-commerce data corresponding to the hash value and the same hash value in the group is saved in a hash association list; 刪除模块, 用于在每一个哈希关联列表中选定一个存储路径, 并刪除 该哈希关联列表中未选定的存储路径所对应的电商数据; a deleting module, configured to select a storage path in each hash association list, and delete the e-commerce data corresponding to the unselected storage path in the hash association list; 修改模块, 用于在每一个哈希关联列表修改所刪除的电商数据对应的 标识; a modifying module, configured to modify, in each hash association list, an identifier corresponding to the deleted e-commerce data; 接收模块, 用于接收终端发送过来的文件査询指令; a receiving module, configured to receive a file query instruction sent by the terminal; 判断模块, 用于判断文件査询指令中待査询的电商数据的存储路径是 否有对应的哈希关联列表; 及 a judging module, configured to determine whether a storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list; and 发送模块, 用于当文件査询指令中待査询的电商数据的存储路径有对 应的哈希关联列表, 将该对应的哈希关联列表中选定的存储路径对应 的电商数据发送给终端。 a sending module, configured to: when the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list, send the e-commerce data corresponding to the selected storage path in the corresponding hash association list to terminal. 如权利要求 1所述的基于哈希值的电商数据处理系统, 其特征在于, 所述哈希关联列表包括四个栏位, 所述四个栏位分别为每组数据名称 、 哈希值、 存储路径及标识。 The hash value-based e-commerce data processing system according to claim 1, wherein the hash association list includes four fields, and each of the four fields is a data name and a hash value of each group. , storage path and identity. 如权利要求 1所述的基于哈希值的电商数据处理系统, 其特征在于, 所述计算模块计算所述电商平台上每组电商数据的哈希值的方式如下 : 遍历电商平台上的电商数据, 并通过哈希算法计算每组电商数据的 哈希值。 The hash value-based e-commerce data processing system according to claim 1, wherein the calculating module calculates a hash value of each group of e-commerce data on the e-commerce platform as follows: traversing the e-commerce platform The e-commerce data on the above, and the hash value of each group of e-commerce data is calculated by a hash algorithm. 如权利要求 3所述的基于哈希值的电商数据处理系统, 其特征在于, 所述计算模块通过哈希算法计算所述电商平台上每组电商数据的哈希 值, 其中, 所述哈希算法为 SHA256算法。 The hash value-based e-commerce data processing system according to claim 3, wherein the calculation module calculates a hash value of each group of e-commerce data on the e-commerce platform by using a hash algorithm, where The hash algorithm is the SHA256 algorithm. 如权利要求 1所述的基于哈希值的电商数据处理系统, 其特征在于, 所述判断模块判断文件査询指令中待査询的电商数据的存储路径是否 有对应的哈希关联列表的方式如下: 解析文件査询指令并获取待査询 的电商数据的存储路径, 将该待査询的电商数据的存储路径与电商平 台中每一个哈希关联列表进行比对, 若电商平台中有一个哈希关联列 表包含待査询的电商数据的存储路径, 则该哈希关联列表为待査询的 电商数据的存储路径对应的哈希关联列表。 A hash value based e-commerce data processing system according to claim 1, wherein: The determining module determines whether the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list as follows: parsing the file query instruction and obtaining a storage path of the e-commerce data to be queried, Comparing the storage path of the e-commerce data to be queried with each hash association list in the e-commerce platform, if a hash association list in the e-commerce platform includes a storage path of the e-commerce data to be queried, Then, the hash association list is a hash association list corresponding to the storage path of the e-commerce data to be queried. [权利要求 6] —种基于哈希值的电商数据处理方法, 应用于电商平台中, 其特征在 于, 所述电商平台通过网络与终端通信连接, 该方法包括如下步骤: 计算所述电商平台上每组电商数据的哈希值;  [Claim 6] A method for processing an e-commerce data based on a hash value, which is applied to an e-commerce platform, wherein the e-commerce platform is connected to a terminal through a network, the method comprising the following steps: The hash value of each group of e-commerce data on the e-commerce platform; 从计算的哈希值中査找相同的哈希值, 并将相同的哈希值归为一组, 将每组中相同的哈希值及该组中相同的哈希值对应的电商数据的存储 路径保存于一个哈希关联列表中;  Find the same hash value from the calculated hash value, and group the same hash value into a group, and compare the same hash value in each group with the same hash value in the group. The storage path is saved in a hash association list; 在每一个哈希关联列表中选定一个存储路径, 并刪除该哈希关联列表 中未选定的存储路径所对应的电商数据;  Selecting a storage path in each hash association list, and deleting the e-commerce data corresponding to the unselected storage paths in the hash association list; 在每一个哈希关联列表修改所刪除的电商数据对应的标识; 接收终端发送过来的文件査询指令;  Modifying, in each hash association list, an identifier corresponding to the deleted e-commerce data; receiving a file query instruction sent by the terminal; 判断文件査询指令中待査询的电商数据的存储路径是否有对应的哈希 关联列表; 及  Determining whether the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list; and 当文件査询指令中待査询的电商数据的存储路径有对应的哈希关联列 表, 将该对应的哈希关联列表中选定的存储路径对应的电商数据发送 给终端。  When the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list, the e-commerce data corresponding to the selected storage path in the corresponding hash association list is sent to the terminal. [权利要求 7] 如权利要求 6所述的基于哈希值的电商数据处理方法, 其特征在于, 所述哈希关联列表包括四个栏位, 所述四个栏位分别为每组数据名称 、 哈希值、 存储路径及标识。  [Claim 7] The hash value-based e-commerce data processing method according to claim 6, wherein the hash association list includes four fields, and the four fields are respectively for each group of data. Name, hash, storage path, and identity. [权利要求 8] 如权利要求 6所述的基于哈希值的电商数据处理方法, 其特征在于, 所述计算所述电商平台上每组电商数据的哈希值的步骤包括如下步骤 : 遍历电商平台上的电商数据, 并通过哈希算法计算每组电商数据的 哈希值。 [权利要求 9] 如权利要求 8所述的基于哈希值的电商数据处理方法, 其特征在于, 所述电商平台上每组电商数据的哈希值通过哈希算法计算, 其中, 所 述哈希算法为 SHA256算法。 [Claim 8] The hash value-based e-commerce data processing method according to claim 6, wherein the calculating the hash value of each group of e-commerce data on the e-commerce platform comprises the following steps : Traversing the e-commerce data on the e-commerce platform, and calculating the hash value of each set of e-commerce data through a hash algorithm. The hash value-based e-commerce data processing method according to claim 8, wherein the hash value of each set of e-commerce data on the e-commerce platform is calculated by a hash algorithm, where The hash algorithm is the SHA256 algorithm. [权利要求 10] 如权利要求 6所述的基于哈希值的电商数据处理方法, 其特征在于, 所述判断文件査询指令中待査询的电商数据的存储路径是否有对应的 哈希关联列表的步骤包括如下步骤: 解析文件査询指令并获取待査询 的电商数据的存储路径, 将该待査询的电商数据的存储路径与电商平 台中每一个哈希关联列表进行比对, 若电商平台中有一个哈希关联列 表包含待査询的电商数据的存储路径, 则该哈希关联列表为待査询的 电商数据的存储路径对应的哈希关联列表。  [Claim 10] The hash value-based e-commerce data processing method according to claim 6, wherein the determining whether the storage path of the e-commerce data to be queried in the file query instruction has a corresponding Ha The step of associating the list includes the following steps: parsing the file query instruction and obtaining a storage path of the e-commerce data to be queried, and storing the storage path of the e-commerce data to be queried with each hash association list in the e-commerce platform For comparison, if a hash association list in the e-commerce platform includes a storage path of the e-commerce data to be queried, the hash association list is a hash association list corresponding to the storage path of the e-commerce data to be queried. .
PCT/CN2017/077007 2017-03-17 2017-03-17 Hash value-based e-commerce data processing system and method Ceased WO2018165960A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/077007 WO2018165960A1 (en) 2017-03-17 2017-03-17 Hash value-based e-commerce data processing system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/077007 WO2018165960A1 (en) 2017-03-17 2017-03-17 Hash value-based e-commerce data processing system and method

Publications (1)

Publication Number Publication Date
WO2018165960A1 true WO2018165960A1 (en) 2018-09-20

Family

ID=63521739

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/077007 Ceased WO2018165960A1 (en) 2017-03-17 2017-03-17 Hash value-based e-commerce data processing system and method

Country Status (1)

Country Link
WO (1) WO2018165960A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101702179A (en) * 2009-12-01 2010-05-05 百度在线网络技术(北京)有限公司 Method and device for removing duplication from data mining
CN104199863A (en) * 2014-08-15 2014-12-10 小米科技有限责任公司 Method and device for searching for files on storage device and router
CN106599260A (en) * 2016-12-21 2017-04-26 深圳市易特科信息技术有限公司 Medical informatization repeated medical document deleting system and method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101702179A (en) * 2009-12-01 2010-05-05 百度在线网络技术(北京)有限公司 Method and device for removing duplication from data mining
CN104199863A (en) * 2014-08-15 2014-12-10 小米科技有限责任公司 Method and device for searching for files on storage device and router
CN106599260A (en) * 2016-12-21 2017-04-26 深圳市易特科信息技术有限公司 Medical informatization repeated medical document deleting system and method

Similar Documents

Publication Publication Date Title
JP5721627B2 (en) Method and system for saving database storage space
TWI611305B (en) Method and device for identifying feature groups and search method and device
TWI522942B (en) User favorites data processing method and device, user favorite data searching method and device, and user favorite system
US8150813B2 (en) Using relationships in candidate discovery
CN103177075B (en) The detection of Knowledge based engineering entity and disambiguation
CN108804516B (en) Similar user searching device, method and computer readable storage medium
CN111242709B (en) A message push method and its device, equipment and storage medium
US8352496B2 (en) Entity name matching
US9720946B2 (en) Efficient storage of related sparse data in a search index
US9405840B2 (en) Using social signals to rank search results
CN114328632A (en) User data analysis method and device based on bitmap and computer equipment
KR20140139623A (en) Discovering spam merchants using product feed similarity
CN104536984A (en) Verification method and system for space text Top-k query in outsourced database
WO2018165968A1 (en) Webpage search-based merchant evaluation and recommendation system and method
CN103455637B (en) A kind of method and apparatus of license number search
CN111651749A (en) Method, device, computer equipment and storage medium for retrieving account based on password
CN105187439A (en) Phishing website detection method and device
TWI639093B (en) Object set and processing method and device thereof
CN109241360B (en) Matching method and device of combined character strings and electronic equipment
WO2020056976A1 (en) Optimized sequencing method, device, and program for search results, and computer readable storage medium
TWI547888B (en) A method of recording user information and a search method and a server
CN109101630B (en) Method, device and equipment for generating search result of application program
WO2018165960A1 (en) Hash value-based e-commerce data processing system and method
JP2020525949A (en) Media search method and device
WO2018165963A1 (en) E-commerce data redundancy processing system and method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17900618

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17900618

Country of ref document: EP

Kind code of ref document: A1