JP5443236B2

JP5443236B2 - Distributed database system

Info

Publication number: JP5443236B2
Application number: JP2010076536A
Authority: JP
Inventors: 智傑千里
Original assignee: Infodeliver
Current assignee: Infodeliver
Priority date: 2010-03-30
Filing date: 2010-03-30
Publication date: 2014-03-19
Anticipated expiration: 2030-03-30
Also published as: JP2011209974A

Description

The present invention relates to processing technology for a distributed database (hereinafter "DB") that manages data in a distributed manner, and more particularly to a distributed DB processing apparatus, processing method, and processing program with improved security.

Distributed DB systems that manage data by dividing it according to a predetermined criterion and storing it across multiple locations have long been known. Such systems are often used when the amount of data to be managed is enormous.

For example, a company's total data volume may be so large that processing it on a single server imposes an excessive load. In such a case, the data is divided into the portions that each branch or regional office normally needs, and each portion is managed on a server operated by that branch or office. This reduces the processing load on each server. Moreover, because each server normally has no need to access the others, distributed management also reduces communication load and cost.

Meanwhile, the recent demand for personal information protection has increased the need to prevent information leaks from DBs that manage personal information. The distributed DB mechanism described above can be used to address this. For example, Patent Document 1 proposes a recording method in which personal information is divided into basic data items and attribute data items, stored separately, and retrieved by associating the items through a specific code.

Patent Document 1: JP-A-11-272681

However, personal information is not leaked only through intrusion from outside the system. Someone inside the system may also steal information from the company's internal network. Even then, if the personal information is managed in a distributed manner and only part of it leaks, such as only addresses or only ages, little harm results as long as no individual can be identified.

In the invention of Patent Document 1, however, when an authorized user of the system searches for and combines specific personal information that is managed in a distributed manner, the personal information of one person flows over the internal network path almost simultaneously (grouped within a short time). For this reason, its protection against leakage of personal information is limited.

Accordingly, an object of the present invention is to strengthen the prevention of personal information leakage.

To solve the above problem, the distributed database system of the present invention comprises:
a dividing means (for example, the dividing unit 341 in FIG. 1) that divides a plurality of pieces of information (for example, personal information), each composed of data of a plurality of attributes (for example, name, address, and age), by attribute;
a first assigning means (for example, the first assigning unit 342 in FIG. 1) that assigns a unique first key to each piece of data divided by the dividing means, based on a first algorithm;
a first storage unit (for example, the first storage unit 110 in FIG. 1) in which each first key assigned by the first assigning means and its corresponding data are stored as a pair at a random position;
a second assigning means (for example, the second assigning unit 344 in FIG. 1) that assigns a unique second key to the plurality of pieces of information, based on a second algorithm;
an encryption means (for example, the encryption unit 331 in FIG. 1) that encrypts the plurality of pieces of information;
a second storage unit (for example, the second storage unit 120 in FIG. 1) in which the second key assigned by the second assigning means and the data encrypted by the encryption means are stored as a pair; and
a third storage unit (for example, the rule storage unit 320 in FIG. 1) that stores the first and second keys, the first and second algorithms, and the linking information between the pieces of data that make up the information.

According to the present invention, the "plurality of pieces of information" cannot be restored unless the contents of all of the first to third storage units leak. In addition, it typically becomes possible to search, from any one attribute of a piece of data, for the data of the other attributes related to it.

Even if the data stored in the first storage unit, that is, the pre-encryption data making up the information, were to leak, it would be only names, only addresses, or only ages, so the personal information itself could not be identified.

Likewise, even if the data stored in the second storage unit were to leak, it is encrypted and therefore difficult to decrypt. Even if it were decrypted, the result would again be only names, only addresses, or only ages, so the personal information itself could not be identified.

Furthermore, even if the contents of the third storage unit were to leak, they are merely linking information and algorithms, so the personal information itself could not be identified.

The important point here is that the first storage unit holds the data before encryption. Typically, this makes it possible to look up a person's address data and age data immediately from specific name data. This is because referring to the third storage unit on the basis of the name data identifies the first key assigned to that name data, and in turn the address data and age data corresponding to that first key.

The DB processing apparatus of the present invention is further characterized by comprising: a first storage unit that stores divided data in a distributed manner; a second storage unit that stores, in encrypted and distributed form, the data stored in the first storage unit; a request determination unit that determines whether a search request entered via an input unit or a communication network requires the divided data to be combined; a first search unit that searches the first storage unit for data when the request determination unit determines that combining is not required; and a second search unit that searches the second storage unit for data when the request determination unit determines that combining is required.
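The routing performed by the request determination unit can be sketched as follows. This is a minimal illustration, not the patented implementation: the text does not say how the determination is made, so the rule used here (a request that asks for more than one attribute needs combining) and the names `SearchRequest`, `needs_combining`, and `route` are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SearchRequest:
    # Attributes the caller wants returned, e.g. ["name"] or ["name", "address"].
    attributes: List[str]


def needs_combining(request: SearchRequest) -> bool:
    """Assumed rule: asking for more than one attribute means divided data must be combined."""
    return len(set(request.attributes)) > 1


def route(request: SearchRequest) -> str:
    """Pick the storage unit to search, mirroring the first and second search units."""
    if needs_combining(request):
        return "second_storage"  # encrypted data, searched by the second search unit
    return "first_storage"       # unencrypted divided data, searched by the first search unit
```

Under this assumed rule, a request for names alone is served from the first storage unit, while a request for names together with addresses is served from the encrypted second storage unit.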

A distributed DB processing method corresponding to the operation of this DB processing apparatus, and a processing program that causes a computer to execute that method, are also included in the present invention.

Another aspect is characterized by having a combination information storage unit that stores combination information (linking information) for combining the divided data.
In this aspect, because the combination information is managed separately from the data, even if multiple pieces of combinable data leak, they cannot be combined.

Another aspect is characterized by having a lookup information storage unit that stores lookup information for locating, on the basis of data stored in the first storage unit, the corresponding data stored in the second storage unit.

In this aspect, because the lookup information is stored separately from the data, a party that obtains only the data cannot locate data in the second storage unit from data in the first storage unit, nor combine the data stored in the second storage unit.

As described above, the present invention can provide a distributed DB processing apparatus, a distributed DB processing method, and a processing program that prevent information from leaking in a form that allows the divided data to be combined, while making it easy to search the data in its divided state.

When data stored in the distributed database system is read out, it is preferable to read the encrypted data. This prevents unencrypted raw data from being stolen within the distributed database system. Even if a hacker or the like analyzes the data read logs, what the analysis reveals is the encrypted data, not the raw data.

A functional block diagram showing an embodiment of the distributed DB processing apparatus of the present invention.
A flowchart showing the flow of data registration processing in the embodiment of FIG. 1.
A flowchart showing an example of data search processing in the embodiment of FIG. 1.
An explanatory diagram showing an example of data stored in a distributed manner in the first storage unit in the embodiment of FIG. 1.
An explanatory diagram showing an example of locating data stored in the second storage unit from data stored in the first storage unit in the embodiment of FIG. 1.
An explanatory diagram showing an example of searching for personal information from the data stored in the second storage unit in the embodiment of FIG. 1.

1 ... Distributed DB processing apparatus
100 ... Database (DB)
110 ... First storage unit
120 ... Second storage unit
200 ... Association information storage unit
210 ... Lookup information storage unit
220 ... Combination information storage unit
300 ... Database management system (DBMS)
301 ... Authentication unit
302 ... Authentication information storage unit
303 ... Determination unit
310 ... Association information generation unit
311 ... Lookup information generation unit
312 ... Combination information generation unit
320 ... Rule storage unit
321 ... Division rule storage unit
322 ... Lookup rule storage unit
323 ... Combination rule storage unit
330 ... Encryption processing unit
331 ... Encryption unit
332 ... Decryption unit
340 ... Registration processing unit
341 ... Dividing unit
342 ... First assigning unit
343 ... Duplication unit
344 ... Second assigning unit
345 ... Storing unit
350 ... Search processing unit
351 ... Request determination unit
352 ... First search unit
353 ... Lookup unit
354 ... Second search unit
355 ... Combining unit
400 ... Input unit
500 ... Output unit

Embodiments of the present invention are described below with reference to the drawings.

[1. Configuration of the Embodiment]
[1-1. Overall Configuration]

FIG. 1 is a functional block diagram in which the functions of this embodiment are represented as virtual blocks. The functions described below can be implemented, for example, by a computer or electronic circuit operating under a predetermined program. A method for implementing these functions, a program for doing so, and a recording medium on which the program is recorded are also aspects of the present invention.

As shown in FIG. 1, the distributed DB processing apparatus 1 of this embodiment is broadly divided into a database (hereinafter "DB") 100, an association information storage unit 200, a database management system (hereinafter "DBMS") 300, an input unit 400, and an output unit 500, each described below.

[1-2. DB]

The DB 100 is the component that stores the data handled by this embodiment. The data handled in the DB 100 has been divided according to a predetermined criterion (division rule), and the DB 100 stores the divided data in a distributed manner as follows.

As an example, consider the case where the data is personal information containing text data for name, address, and age. The personal information of multiple people is divided into names, addresses, and ages, which are stored in a distributed manner in storage units A, B, and C making up the first storage unit 110. The data stored in the first storage unit 110 is not encrypted. The data to be stored is not limited to text data.

FIG. 4 shows an example of how data is stored in the first storage unit 110 of FIG. 1. FIG. 4(a) shows the names of multiple people stored in storage unit A, each paired with the unique first identification information (key1) assigned to it. For example, the name "Taro Suzuki" is paired with key1 "010". The first identification information is assigned based on a predetermined algorithm.

In other words, the distributed database system of this embodiment first divides a plurality of pieces of personal information, each composed of data of multiple attributes such as name, address, and age, by attribute, and assigns a unique key1 (first key) to each resulting piece of data based on a predetermined algorithm.

The address and age belonging to the name "Taro Suzuki" may be assigned the same key1 as the one assigned to that name, "010". That is, the key1 "010" may be assigned to the name "Taro Suzuki", to Taro Suzuki's address, and to Taro Suzuki's age alike.

Alternatively, the address and age belonging to the name "Taro Suzuki" may be assigned key1 values different from the key1 "010" assigned to the name. In that case, however, the correspondence among the key1 values must be stored. It can be stored, for example, in the association information storage unit 200, which is external to the DBMS 300 and the DB 100.

Similarly, as shown in FIGS. 4(b) and 4(c), storage unit B stores addresses with their corresponding key1 values, and storage unit C stores ages with their corresponding key1 values. In this embodiment, each piece of data is stored at a random position in the first storage unit 110. In FIG. 4, for example, the name "Taro Suzuki" is stored in the first row of storage unit A, while Taro Suzuki's address and age are stored in arbitrary rows of storage units B and C.
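The division and random placement described above can be sketched roughly as follows. This is an illustrative sketch only: it uses the variant in which one person's name, address, and age share the same key1, the zero-padded counter used as the key1 scheme is an assumption (the text only says "a predetermined algorithm"), and the function name `register_plain` and the sample records are hypothetical.

```python
import random
from typing import Dict, List, Tuple


def register_plain(
    records: List[Dict[str, str]],
    attributes: List[str],
    rng: random.Random,
) -> Dict[str, List[Tuple[str, str]]]:
    """Split records by attribute and store (key1, value) pairs in shuffled row order."""
    stores: Dict[str, List[Tuple[str, str]]] = {attr: [] for attr in attributes}
    for index, record in enumerate(records):
        key1 = f"{index + 1:03d}"  # assumed key scheme: zero-padded counter
        for attr in attributes:
            stores[attr].append((key1, record[attr]))
    for rows in stores.values():
        # Shuffle each store independently so row position does not link attributes.
        rng.shuffle(rows)
    return stores


# Hypothetical sample data; the addresses and ages are made up for the example.
people = [
    {"name": "Taro Suzuki", "address": "Tokyo", "age": "30"},
    {"name": "Hanako Sato", "address": "Osaka", "age": "41"},
]
stores = register_plain(people, ["name", "address", "age"], random.Random(0))
```

Each entry of `stores` plays the role of one of storage units A, B, and C. A real deployment would presumably draw randomness from a stronger source than a seeded `random.Random`, which is used here only to keep the example repeatable.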

FIG. 5 shows the relationship between the data stored in storage unit A of FIG. 1 and the data stored in storage unit A'. FIG. 5(a) is the same as FIG. 4(a). FIG. 5(b) shows the lookup information that links the storage positions of the data in FIG. 5(a) with those of the data in FIG. 5(c). Column 1 of the lookup information holds the key1 values for storage unit A, and column 2 holds the key1 values for storage unit A'.

FIG. 5(c) shows the data of FIG. 5(a) after it has been encrypted and its storage position determined according to the lookup information of FIG. 5(b), stored in storage unit A' together with the key1 for storage unit A' and a key2 separately assigned by a predetermined algorithm.

In this embodiment, the position of each piece of data in FIG. 5(a) differs from its position in FIG. 5(c). For example, the data "Taro Suzuki" appears in the first row in FIG. 5(a) but in the second row in FIG. 5(c). This is only an example, however, and the positions in FIG. 5(a) and FIG. 5(c) may be the same.

In other words, in the first storage unit 110 the set of data making up one person's personal information (name, address, and age) must be stored at random positions, whereas in the second storage unit 120 it suffices to encrypt the data of the first storage unit 110, and the position of each piece of data may be changed or left unchanged. Moreover, because the data stored in the second storage unit 120 is encrypted, it need not be encrypted in the same units in which it is stored in the first storage unit 110. For example, the entire body of personal information before encryption may be encrypted as one unit, the set of name, address, and age making up one person's personal information may be encrypted as one unit, or the name and address may be encrypted together as one unit. In that case, each set of data may, for example, be encrypted as a single group in terms of storage position, which has the advantage of allowing the data to be read out quickly.

Next, the encryption procedure is described. To encrypt the name "Taro Suzuki", for example, the key1 assigned to that name is read out. This key1 is "010", so column 1 of the lookup information in FIG. 5(b) is consulted with this value and the corresponding key in column 2 is read out. The key read from the lookup information is "015", which becomes the key1 for storage unit A'.

A unique key2 is then assigned to the key "015" by a predetermined algorithm. Here, key2 is "009". The text data "Taro Suzuki" itself is also encrypted. The key1 "015" for storage unit A', the corresponding key2 "009", and the encrypted name "Taro Suzuki" are then stored in storage unit A' as one set.
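The registration step for storage unit A' can be sketched as below, using the key values from the example ("010" maps to "015", which is assigned key2 "009"). This is a hedged illustration: the text leaves the cipher and the key2 algorithm open, so `toy_encrypt` is a deliberately insecure XOR placeholder standing in for whatever cipher is actually used, and the key2 assignment is modeled as a plain dictionary. All function names are hypothetical.

```python
import hashlib
from typing import Dict, Tuple


def _keystream(secret: bytes) -> bytes:
    # Fixed 32-byte stream derived from the secret; reused cyclically (insecure by design).
    return hashlib.sha256(secret).digest()


def toy_encrypt(text: str, secret: bytes) -> bytes:
    """Placeholder XOR cipher, NOT secure; it only marks where a real cipher would go."""
    stream = _keystream(secret)
    data = text.encode("utf-8")
    return bytes(b ^ stream[i % len(stream)] for i, b in enumerate(data))


def toy_decrypt(blob: bytes, secret: bytes) -> str:
    """Inverse of toy_encrypt (XOR with the same stream)."""
    stream = _keystream(secret)
    return bytes(b ^ stream[i % len(stream)] for i, b in enumerate(blob)).decode("utf-8")


def register_encrypted(
    key1_a: str,
    value: str,
    lookup: Dict[str, str],   # key1 of A -> key1 of A' (lookup information, FIG. 5(b))
    key2_of: Dict[str, str],  # key1 of A' -> key2 (stand-in for the key2 algorithm)
    secret: bytes,
) -> Tuple[str, str, bytes]:
    """Build the (key1 of A', key2, ciphertext) set that is stored in storage unit A'."""
    key1_a_prime = lookup[key1_a]
    key2 = key2_of[key1_a_prime]
    return key1_a_prime, key2, toy_encrypt(value, secret)


row = register_encrypted("010", "Taro Suzuki", {"010": "015"}, {"015": "009"}, b"demo-secret")
```

The resulting `row` corresponds to one line of FIG. 5(c). In practice the placeholder cipher would be replaced by an established encryption scheme.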

Similarly, the data stored in storage units B and C is processed according to predetermined lookup information, encrypted, and then stored in storage units B' and C', respectively.

As stated above, key1 and key2 are assigned by predetermined algorithms. In this embodiment, these algorithms and the lookup information are kept in the association information storage unit 200, which is external to the DBMS 300 and the DB 100.

FIG. 6 shows the relationships among the data stored in storage units A', B', and C'. FIGS. 6(a), 6(b), and 6(c) show examples of data stored in storage units A', B', and C', respectively. FIG. 6(d) shows an example of the lookup information that links the key2 values of storage units A', B', and C' to one another.

As shown in FIG. 6(d), column 1 of the lookup information holds the key2 values for storage unit A', column 2 holds the key2 values for storage unit B', and column 3 holds the key2 values for storage unit C'. The key2 values stored in the same row are those assigned to the pieces of personal information of the same person.

For example, as indicated by the arrows in FIG. 6, row 4 of the lookup information holds "009" in column 1, "012" in column 2, and "005" in column 3. Referring to FIG. 6(a), the key2 "009" of column 1 is assigned to the name "Taro Suzuki". The key2 "012" of column 2 is assigned to Taro Suzuki's address, and the key2 "005" of column 3 is assigned to Taro Suzuki's age.

Therefore, to search for the personal information of "Taro Suzuki", storage unit A shown in FIG. 5(a) is first consulted to determine that the key1 of storage unit A for "Taro Suzuki" is "010". Next, using this key1 "010", the lookup information shown in FIG. 5(b) is consulted to determine that the corresponding key1 of storage unit A' is "015".

Next, using the key1 "015" of storage unit A', storage unit A' shown in FIG. 6(a) is consulted to identify the corresponding key2 "009". Then, using this key2 "009", the lookup information shown in FIG. 6(d) is consulted to read out the corresponding key2 values for storage units B' and C'. By consulting storage units B' and C' with these key2 values, the address and age of "Taro Suzuki" can be read out.
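The chain of lookups just described (name, key1 of A, key1 of A', key2, then the key2 values for B' and C') can be traced in a few lines. The sketch below is only an illustration under stated assumptions: the key values come from the example, but the address "Tokyo" and age "30" are made up, the tables are modeled as plain dictionaries, and ROT13 is used as a stand-in that merely marks where real encryption and decryption would occur.

```python
import codecs
from typing import Tuple


def stand_in_cipher(text: str) -> str:
    """ROT13 is its own inverse; it is a placeholder, not real encryption."""
    return codecs.encode(text, "rot_13")


STORE_A = {"Taro Suzuki": "010"}       # storage unit A: name -> key1 (FIG. 5(a))
LOOKUP_A = {"010": "015"}              # key1 of A -> key1 of A' (FIG. 5(b))
STORE_A_PRIME = {"015": "009"}         # storage unit A': key1 -> key2 (FIG. 6(a))
KEY2_ROWS = [("009", "012", "005")]    # key2 of A', B', C' for one person (FIG. 6(d))
STORE_B_PRIME = {"012": stand_in_cipher("Tokyo")}  # hypothetical address, "encrypted"
STORE_C_PRIME = {"005": stand_in_cipher("30")}     # hypothetical age, "encrypted"


def find_address_and_age(name: str) -> Tuple[str, str]:
    """Follow the key chain from a name to that person's address and age."""
    key1_a = STORE_A[name]
    key1_a_prime = LOOKUP_A[key1_a]
    key2_a = STORE_A_PRIME[key1_a_prime]
    # Find the row of linked key2 values whose first column matches A'.
    _, key2_b, key2_c = next(row for row in KEY2_ROWS if row[0] == key2_a)
    address = stand_in_cipher(STORE_B_PRIME[key2_b])  # "decrypt"
    age = stand_in_cipher(STORE_C_PRIME[key2_c])      # "decrypt"
    return address, age
```

Calling `find_address_and_age("Taro Suzuki")` walks the same four steps as the prose. Without `LOOKUP_A` and `KEY2_ROWS`, which the embodiment keeps outside the DB, the stores alone do not reveal which rows belong together.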

Although this example reads out the address and age of "Taro Suzuki" starting from the key1 "010" of storage unit A, the other personal information can likewise be read out starting from Taro Suzuki's address or age.

FIGS. 5 and 6 show an example using two keys, but the number of keys is only illustrative. The identification information, lookup information, and lookup rules are likewise not limited to the above example and may take any form, as long as they can associate the data in the first storage unit 110 with the data in the second storage unit 120. What matters is that, in this embodiment, the keys and algorithms are kept in the association information storage unit 200, external to the DBMS 300 and the DB 100, as already stated.

How the distributed data in the second storage unit 120 is to be combined is determined in advance by the second identification information assigned to each piece of data in the second storage unit 120, together with the combination information and combination rules used to combine the corresponding pieces of information on the basis of that second identification information.

[1-3. Association Information Storage Unit]

The association information storage unit 200 includes a lookup information storage unit 210 and a combination information storage unit 220. The lookup information storage unit 210 is the component that stores the lookup information associating the data in the first storage unit 110 with the data in the second storage unit 120, as described above. The combination information storage unit 220 is the component that stores the combination information associating the pieces of data distributed in the second storage unit 120, as described above. In the above example, random number tables are used as the lookup information and the combination information.

[1-4. DBMS]

The DBMS 300 is the component that centrally manages access to the DB 100. With the components described below, the DBMS 300 of this embodiment performs processing such as user authentication and the division, storage, search, lookup, and combining of data. To this end, the DBMS 300 includes an authentication unit 301, an association information generation unit 310, a rule storage unit 320, an encryption processing unit 330, a registration processing unit 340, a search processing unit 350, and so on.

The authentication unit 301 is the component that authenticates legitimate users of the DB 100 based on authentication information entered from the input unit 400. The authentication unit 301 includes an authentication information storage unit 302, a determination unit 303, and so on. The authentication information storage unit 302 stores the authentication information of legitimate users in advance. The determination unit 303 determines whether the authentication information entered from the input unit 400 matches the authentication information stored in the authentication information storage unit 302. IDs and passwords are typical authentication information, but any authentication information usable now or in the future is included.
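The match check performed by the determination unit 303 can be sketched as follows for the ID and password case. This is a sketch under assumptions rather than the described implementation: the text only requires that entered and stored authentication information be compared, so storing a salted PBKDF2 hash instead of the raw password and comparing in constant time are choices added here, and the class and method names are hypothetical.

```python
import hashlib
import hmac
from typing import Dict, Tuple


def hash_password(password: str, salt: bytes) -> bytes:
    """Derive a password hash with PBKDF2-HMAC-SHA256 (an assumed choice, not from the text)."""
    return hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, 100_000)


class AuthStore:
    """Stands in for the authentication information storage unit 302."""

    def __init__(self) -> None:
        self._users: Dict[str, Tuple[bytes, bytes]] = {}

    def register(self, user_id: str, password: str, salt: bytes) -> None:
        self._users[user_id] = (salt, hash_password(password, salt))

    def is_valid(self, user_id: str, password: str) -> bool:
        """Stands in for the determination unit 303: does the entered information match?"""
        entry = self._users.get(user_id)
        if entry is None:
            return False
        salt, expected = entry
        # Constant-time comparison avoids leaking how many bytes matched.
        return hmac.compare_digest(expected, hash_password(password, salt))
```

A caller would register users once and then call `is_valid` with the ID and password received from the input unit before allowing any access to the DB.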

The association information generation unit 310 includes a lookup information generation unit 311 and a combination information generation unit 312. The lookup information generation unit 311 is the component that generates the lookup information. The combination information generation unit 312 is the component that generates the combination information. In the above example, random number tables are used as the lookup information and the combination information. Generating a random number table is a well-known technique, and any method applicable now or in the future can be used.
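One way to picture a random number table used as lookup information is a random one-to-one mapping from the key1 values of storage unit A to fresh key1 values for storage unit A'. The sketch below assumes that reading; the three-digit zero-padded key format follows the examples in the text, while the function name `make_lookup_table` and the choice of sampling without replacement are assumptions made for illustration.

```python
import random
from typing import Dict, List


def make_lookup_table(keys: List[str], rng: random.Random) -> Dict[str, str]:
    """Map each key1 of storage unit A to a distinct, randomly chosen key1 for A'."""
    # Sampling without replacement guarantees the target keys are unique.
    targets = [f"{n:03d}" for n in rng.sample(range(1, 1000), len(keys))]
    return dict(zip(keys, targets))


table = make_lookup_table(["010", "011", "012"], random.Random(1))
```

A seeded `random.Random` keeps the example repeatable. For actual use, an unpredictable source such as `random.SystemRandom()` would presumably be the better fit, since the table is what keeps the two storage units unlinkable.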

ルール記憶部３２０は、分割ルール記憶部３２１、探索ルール記憶部３２２、結合ルール記憶部３２３等を有している。分割ルール記憶部３２１は、データを分割するためのルールを記憶した構成部である。分割ルールとしては、上記の個人情報の例では、氏名、住所、年齢といった、それぞれが意味を持つデータに分割するルールが考えられる。但し、本発明は、これに限定されない。 The rule storage unit 320 includes a division rule storage unit 321, a search rule storage unit 322, a combination rule storage unit 323, and the like. The division rule storage unit 321 is a component that stores rules for dividing data. In the personal information example above, a conceivable division rule is one that divides the data into items that are each meaningful on their own, such as name, address, and age. However, the present invention is not limited to this.

探索ルール記憶部３２２は、上記のように、第１の記憶部１１０のデータと、第２の記憶部１２０のデータとを関連付けるための探索ルールを記憶した構成部である。結合ルール記憶部３２３は、上記のように、第２の記憶部１２０において分散されたデータを、関連付けるための結合ルールを記憶した構成部である。 As described above, the search rule storage unit 322 is a configuration unit that stores a search rule for associating data in the first storage unit 110 with data in the second storage unit 120. As described above, the combination rule storage unit 323 is a configuration unit that stores a combination rule for associating data distributed in the second storage unit 120.

暗号処理部３３０は、暗号化部３３１、復号化部３３２等を有している。暗号化部３３１は、第２の記憶部１２０に記憶されるデータを暗号化する構成部である。復号化部３３２は、第２の記憶部１２０に記憶されたデータの復号化を行う構成部である。暗号化、復号化については、現在又は将来において適用可能なあらゆる手法を用いることができる。 The encryption processing unit 330 includes an encryption unit 331, a decryption unit 332, and the like. The encryption unit 331 is a component that encrypts data to be stored in the second storage unit 120. The decryption unit 332 is a component that decrypts the data stored in the second storage unit 120. For encryption and decryption, any method applicable now or in the future can be used.

登録処理部３４０は、分割部３４１、第１の付与部３４２、複製部３４３、第２の付与部３４４、格納部３４５等を有している。分割部３４１は、入力されたデータを、分割ルールに基づいて、分割する構成部である。第１の付与部３４２は、分割された各データに、第１の識別情報を付与する構成部である。第１の識別情報の付与の例は、上記の通りであるが、本発明はこれには限定されない。 The registration processing unit 340 includes a dividing unit 341, a first assigning unit 342, a duplication unit 343, a second assigning unit 344, a storage unit 345, and the like. The dividing unit 341 is a component that divides input data based on the division rule. The first assigning unit 342 is a component that assigns first identification information to each piece of divided data. An example of assigning the first identification information is as described above, but the present invention is not limited to this.

複製部３４３は、分割されたデータを複製する構成部である。第２の付与部３４４は、複製された各データに、第２の識別情報を付与する構成部である。第２の識別情報の付与の例は、上記の通りであるが、本発明はこれには限定されない。格納部３４５は、分割されて第１の識別情報が付与されたデータを、第１の記憶部１１０に分散して格納し、複製されて第２の識別情報が付与されたデータを、第２の記憶部１２０に分散して格納する構成部である。 The duplication unit 343 is a component that duplicates the divided data. The second assigning unit 344 is a component that assigns second identification information to each piece of duplicated data. An example of assigning the second identification information is as described above, but the present invention is not limited to this. The storage unit 345 is a component that stores the divided data to which the first identification information has been assigned in the first storage unit 110 in a distributed manner, and stores the duplicated data to which the second identification information has been assigned in the second storage unit 120 in a distributed manner.

検索処理部３５０は、要求判定部３５１、第１の検索部３５２、探索部３５３、第２の検索部３５４、結合部３５５等を有している。要求判定部３５１は、入力された検索要求が、分割されたデータの結合を必要とするものか否かを判定する構成部である。たとえば、特定の地域に居住している人の人数を求める検索要求の場合には、住所のデータのみを検索すればよく、データの結合は必要がない。一方、ある氏名の人の住所、年齢を求める検索要求の場合には、氏名、住所、年齢のデータを結合させる必要がある。 The search processing unit 350 includes a request determination unit 351, a first search unit 352, a search unit 353, a second search unit 354, a combining unit 355, and the like. The request determination unit 351 is a configuration unit that determines whether or not an input search request requires a combination of divided data. For example, in the case of a search request for the number of people living in a specific area, only the address data need be searched, and no data combination is required. On the other hand, in the case of a search request for obtaining the address and age of a person with a name, it is necessary to combine the data of name, address, and age.
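
The decision made by the request determination unit 351 can be illustrated with a small sketch. Representing a search request as two sets of attribute names (those used in the condition and those requested in the result) is an assumption for illustration; the description does not specify how a request is encoded.

```python
def needs_join(condition_attrs: set[str], result_attrs: set[str]) -> bool:
    """Return True when the request touches more than one divided attribute."""
    # A request confined to a single attribute can be answered from one
    # storage unit alone; anything wider requires combining divided data.
    return len(condition_attrs | result_attrs) > 1
```

Counting residents of an area touches only the address, so `needs_join({"address"}, {"address"})` is false, while looking up the address and age for a given name gives `needs_join({"name"}, {"address", "age"})` as true.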

第１の検索部３５２は、要求判定部３５１による判定結果に従い、検索要求に含まれる検索条件に応じて、第１の記憶部１１０に格納されたデータを検索する構成部である。探索部３５３は、要求判定部３５１による判定結果に従い、探索情報及び探索ルールに基づいて、第１の記憶部１１０に格納されたデータに対応する第２の記憶部１２０に格納されたデータを探索する構成部である。さらに、第２の検索部３５４は、探索部３５３によって探索されたデータに対応するデータを、結合情報及び結合ルールに基づいて、第２の記憶部１２０から検索する構成部である。結合部３５５は、第２の検索部３５４によって検索されたデータを結合する構成部である。 The first search unit 352 is a component that, in accordance with the determination result of the request determination unit 351, searches the data stored in the first storage unit 110 according to the search condition included in the search request. The search unit 353 is a component that, in accordance with the determination result of the request determination unit 351, searches for the data stored in the second storage unit 120 that corresponds to the data stored in the first storage unit 110, based on the search information and the search rule. Furthermore, the second search unit 354 is a component that retrieves, from the second storage unit 120, the data corresponding to the data found by the search unit 353, based on the combination information and the combination rule. The combining unit 355 is a component that combines the data retrieved by the second search unit 354.

なお、ＤＢＭＳ３００は、一般的なＤＢＭＳと同様に、入力部４００から入力された要求に従って、ＤＢ１００のデータの更新、削除等の処理を行うことができる。その他、データの一貫性、同時実行制御等、一般的なＤＢＭＳが有する機能については、周知技術であるため、説明を省略する。 Note that the DBMS 300 can perform processing such as update and deletion of data in the DB 100 in accordance with a request input from the input unit 400, as in a general DBMS. Other functions such as data consistency and concurrent execution control that are included in a general DBMS are well-known techniques and will not be described.

［１−４．入力部］ [1-4. Input section]

入力部４００は、利用者（プログラマ、管理者、一般ユーザ等、システムを利用する者を広く含む）が、分散型ＤＢ処理装置１に、種々の情報を入力して操作するための構成部である。この入力部４００としては、キーボード、マウス、タッチパネル等、現在又は将来において利用可能なあらゆる入力装置を用いることができる。この入力部４００によって、利用者は、データ、検索要求（検索条件等を含む）等を入力することができ、これに応じて、ＤＢＭＳ３００が処理を行う。 The input unit 400 is a component that allows users (broadly including anyone who uses the system, such as programmers, administrators, and general users) to input various kinds of information into the distributed DB processing apparatus 1 and operate it. As the input unit 400, any input device that can be used now or in the future, such as a keyboard, a mouse, or a touch panel, can be used. Through the input unit 400, the user can input data, search requests (including search conditions and the like), and so on, and the DBMS 300 performs processing in response.

［１−５．出力部］ [1-5. Output section]

出力部５００は、分散型ＤＢ処理装置１におけるＤＢ１００の入力画面、検索結果等を、種々の態様で出力することにより、ユーザにより視認可能とする手段である。この出力部５００としては、ディスプレイ、プリンタ等、現在又は将来において利用可能なあらゆる出力装置を用いることができる。 The output unit 500 is a means that allows the user to visually recognize the input screen, search results, and the like of the DB 100 in the distributed DB processing apparatus 1 in various ways. As the output unit 500, any output device that can be used now or in the future, such as a display and a printer, can be used.

なお、上記のＤＢ１００、関連付情報記憶部２００、認証情報記憶部３０２、ルール記憶部３２０等の記憶部としては、コンピュータの各種メモリ、ハードディスク等、現在又は将来において利用可能なあらゆる記憶媒体が利用可能である。上記の説明では、各記憶部及びＤＢを概念的に区別したものであり、その一部若しくは全部を共通の記憶媒体において実現してもよいし、通信経路（バス、通信ネットワーク等を含む）を介して接続された別個の記憶媒体によって実現してもよい。これは、記憶部Ａ〜Ｃ、Ａ’〜Ｃ’についても同様である。 As the storage units such as the DB 100, the associated information storage unit 200, the authentication information storage unit 302, and the rule storage unit 320, any storage medium that can be used now or in the future, such as various computer memories and hard disks, can be used. In the above description, the storage units and the DB are distinguished conceptually; some or all of them may be realized on a common storage medium, or they may be realized by separate storage media connected via a communication path (including a bus, a communication network, and the like). The same applies to the storage units A to C and A' to C'.

また、ＤＢＭＳ３００は、プログラムによって動作するＣＰＵ、メモリその他の周辺回路により構成される制御部によって実現されるものである。この制御部は、入力部４００及び出力部５００との間での情報の入出力機能、演算機能等、一般的なコンピュータが備える機能を有している。 The DBMS 300 is realized by a control unit configured by a CPU that operates according to a program, a memory, and other peripheral circuits. This control unit has functions provided in a general computer such as an input / output function of information and an arithmetic function between the input unit 400 and the output unit 500.

また、ＤＢＭＳ３００は、一般的には、オペレーティングシステム及びアプリケーションプログラム間で動作するミドルウェアの一種として機能する。但し、上記の各部は、オペレーションシステム及びアプリケーションプログラムの機能の一部を含んでいるか、その機能と連携して動作する機能の場合もあるが、厳密に区別して示すことは困難であるため、説明は省略する。 The DBMS 300 generally functions as a kind of middleware that operates between an operating system and application programs. However, each of the above units may include part of the functions of the operating system and application programs, or may be a function that operates in cooperation with those functions; because it is difficult to draw a strict distinction, a detailed description is omitted.

さらに、入力部４００、出力部５００に関しても、遠隔に配置され、通信ネットワークを介して接続された構成としてもよい。 Further, the input unit 400 and the output unit 500 may be configured to be remotely arranged and connected via a communication network.

［２．実施形態の作用］ [2. Operation of the embodiment]

以上のような本実施形態による処理の流れを、図２、図３を参照して説明する。 The processing flow according to the present embodiment as described above will be described with reference to FIGS. 2 and 3.

［２−１．データ登録処理］ [2-1. Data registration process]

まず、データを分散して記憶するデータ登録処理について、図２のフローチャート、図４、図６の説明図に従って説明する。すなわち、利用者は、入力部４００を用いて、認証情報を入力する（ステップ０１）。判定部３０３は、入力された認証情報が、認証情報記憶部３０２に記憶された認証情報と一致するか否かを判定する（ステップ０２）。判定部３０３によって、認証情報が一致しないと判定された場合には、正当な利用者ではないとして、それ以降の処理に進むことはできない（ステップ０２のＮＯ）。 First, the data registration process for storing data in a distributed manner will be described with reference to the flowchart of FIG. 2 and the explanatory diagrams of FIGS. 4 and 6. The user inputs authentication information using the input unit 400 (step 01). The determination unit 303 determines whether or not the input authentication information matches the authentication information stored in the authentication information storage unit 302 (step 02). If the determination unit 303 determines that the authentication information does not match, the user is regarded as not being a legitimate user and cannot proceed to the subsequent processing (NO in step 02).

認証情報が一致すると判定された場合には、正当な利用者として認証される（ステップ０２のＹＥＳ）。この利用者が、入力部４００を用いて、個人情報を入力する（ステップ０３）。分割部３４１は、入力された個人情報を、分割ルールに従って分割する（ステップ０４）。上記の例では、個人情報が、氏名、住所、年齢に分割される。 If it is determined that the authentication information matches, the user is authenticated as a valid user (YES in step 02). This user inputs personal information using the input unit 400 (step 03). The dividing unit 341 divides the input personal information according to the division rule (step 04). In the above example, personal information is divided into name, address, and age.

また、複製部３４３は、分割された各データを複製する（ステップ０５）。このとき、暗号化部３３１が、分割された各データを暗号化する（ステップ０６）。そして、第１の付与部３４２が、探索情報及び結合ルールに従って、分割された各データと複製された各データに、第１の識別情報を付与する（ステップ０７）。上記の例では、乱数表に基づいて、それぞれにキー（ｋｅｙ１）が付与される（図４参照）。なお、探索情報は、あらかじめ探索情報生成部３１１によって生成され、探索情報記憶部２１０に記憶されていてもよいし、第１の識別情報の付与の際に、探索情報生成部３１１によって生成して、その後、探索情報記憶部２１０に記憶しておいてもよい。 The duplication unit 343 then duplicates each piece of divided data (step 05). At this time, the encryption unit 331 encrypts each piece of divided data (step 06). The first assigning unit 342 then assigns first identification information to each piece of divided data and each piece of duplicated data in accordance with the search information and the combination rule (step 07). In the above example, a key (key1) is assigned to each based on the random number table (see FIG. 4). The search information may be generated in advance by the search information generation unit 311 and stored in the search information storage unit 210, or it may be generated by the search information generation unit 311 when the first identification information is assigned and then stored in the search information storage unit 210.

第２の付与部３４４は、探索情報及び探索ルールに従って、複製され、暗号化された各データに、第２の識別情報を付与する（ステップ０８）。上記の例では、氏名、住所、年齢が複製、暗号化され、乱数表に基づいて、それぞれにキー（ｋｅｙ２）が付与される（図６参照）。なお、結合情報は、あらかじめ結合情報生成部３１２によって生成され、結合情報記憶部２２０に記憶されていてもよいし、第２の識別情報の付与の際に、結合情報生成部３１２によって生成して、その後、結合情報記憶部２２０に記憶しておいてもよい。 The second assigning unit 344 assigns second identification information to each piece of duplicated and encrypted data in accordance with the search information and the search rule (step 08). In the above example, the name, address, and age are duplicated and encrypted, and a key (key2) is assigned to each based on the random number table (see FIG. 6). The combined information may be generated in advance by the combined information generation unit 312 and stored in the combined information storage unit 220, or it may be generated by the combined information generation unit 312 when the second identification information is assigned and then stored in the combined information storage unit 220.

格納部３４５は、分割部３４１によって分割され、第１の識別情報が付与された各データを、第１の記憶部１１０に分散して格納する（ステップ０９）。上記の例では、記憶部Ａ、Ｂ、Ｃに、それぞれ氏名、住所、年齢が格納される。さらに、格納部３４５は、暗号化され、第２の識別情報が付与された各データを、第２の記憶部１２０に分散して格納する（ステップ１０）。上記の例では、記憶部Ａ’、Ｂ’、Ｃ’に、それぞれ暗号化された氏名、住所、年齢が格納される。 The storage unit 345 stores each piece of data that has been divided by the dividing unit 341 and assigned first identification information in the first storage unit 110 in a distributed manner (step 09). In the above example, names, addresses, and ages are stored in the storage units A, B, and C, respectively. Further, the storage unit 345 stores each piece of encrypted data to which second identification information has been assigned in the second storage unit 120 in a distributed manner (step 10). In the above example, the encrypted names, addresses, and ages are stored in the storage units A', B', and C', respectively.
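
The registration steps 04 to 10 above can be sketched roughly as follows. Everything here is an illustrative assumption rather than the patented implementation: the dictionary layout of `new_db`, the helper names, the single shared pool of three-digit keys, and especially `toy_crypt`, which is a trivial XOR stand-in for the encryption unit 331 and offers no real security.

```python
import secrets
from itertools import cycle

RULE = ("name", "address", "age")  # division rule from the personal-information example


def toy_crypt(data: bytes, secret: bytes) -> bytes:
    """XOR stand-in for encryption and decryption; NOT secure. `secret` must be non-empty."""
    return bytes(b ^ k for b, k in zip(data, cycle(secret)))


def new_db(secret: bytes) -> dict:
    """Build empty stand-ins for storage units A-C, A'-C' and the association tables."""
    return {
        "secret": secret,
        "used": set(),
        "first": {attr: {} for attr in RULE},   # plaintext pieces, keyed by key1
        "second": {attr: {} for attr in RULE},  # encrypted copies, keyed by key2
        "search_info": {},                      # key1 -> key2
        "join_info": {},                        # key2 -> key2 of the sibling pieces
    }


def new_key(used: set) -> str:
    """Draw a random three-digit key that has not been issued before."""
    while True:
        key = f"{secrets.randbelow(1000):03d}"
        if key not in used:
            used.add(key)
            return key


def register(db: dict, record: dict) -> None:
    second_keys = {}
    for attr in RULE:                                   # step 04: divide by attribute
        value = str(record[attr])
        key1 = new_key(db["used"])                      # step 07: first identification
        key2 = new_key(db["used"])                      # step 08: second identification
        db["first"][attr][key1] = value                 # step 09: store plaintext piece
        copy = toy_crypt(value.encode("utf-8"), db["secret"])  # steps 05-06
        db["second"][attr][key2] = copy                 # step 10: store encrypted copy
        db["search_info"][key1] = key2
        second_keys[attr] = key2
    for attr, key2 in second_keys.items():
        db["join_info"][key2] = {a: k for a, k in second_keys.items() if a != attr}
```

Because the plaintext pieces carry only `key1` and the link between pieces exists only in `join_info` over `key2`, the first storage unit on its own does not reveal which name belongs to which address.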

［２−２．データ検索処理］ [2-2. Data search processing]

次に、データを検索する処理について、図３のフローチャート、図５、図６の説明図に従って説明する。まず、上記と同様に、利用者の認証を行う（ステップ１１、１２）。次に、利用者は、入力部４００を用いて、検索要求を入力する（ステップ１３）。要求判定部３５１は、検索要求が、分割された各データのみを対象とするか、分割された各データの結合を必要とするかを判定する（ステップ１４）。 Next, the process of searching for data will be described with reference to the flowchart of FIG. 3 and the explanatory diagrams of FIGS. 5 and 6. First, the user is authenticated in the same manner as described above (steps 11 and 12). Next, the user inputs a search request using the input unit 400 (step 13). The request determination unit 351 determines whether the search request targets only individual divided data or requires the divided data to be combined (step 14).

検索要求は、分割された各データのみを対象とする検索条件を含む場合には、要求判定部３５１は、結合不要と判定する（ステップ１５のＮＯ）。たとえば、検索要求が、氏名だけ、住所だけ、年齢だけといった場合がこれに該当する。すると、第１の検索部３５２が、第１の記憶部１１０（記憶部Ａ、Ｂ、Ｃ）のいずれかから、そのデータのみを検索する（ステップ２１）。 When the search request includes search conditions that target only individual divided data, the request determination unit 351 determines that combination is unnecessary (NO in step 15). This applies, for example, when the search request concerns only the name, only the address, or only the age. The first search unit 352 then searches only the relevant data in one of the storage units of the first storage unit 110 (storage units A, B, and C) (step 21).

この場合には、他データとの結合の必要はないので、検索結果は、そのまま出力部５００に出力される（ステップ２０）。なお、同姓の人の人数、同じ区に居住する人の人数といったように、第１の記憶部１１０をそれぞれに単独で検索すれば済むような検索条件の場合にも、同様に対応できる。 In this case, since it is not necessary to combine with other data, the search result is output to the output unit 500 as it is (step 20). It should be noted that the same can be applied to a search condition in which the first storage unit 110 only needs to be searched independently, such as the number of people with the same surname and the number of people living in the same ward.
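
A search of this kind, confined to one storage unit, might look like the following sketch. The keys and ward names in `address_store` are invented sample data, and `count_by_value` is a hypothetical helper, not part of the described system.

```python
from collections import Counter

# Stand-in for storage unit B: first keys mapped to plaintext addresses.
# The keys and wards below are invented sample data.
address_store = {
    "003": "Minato-ku",
    "017": "Shibuya-ku",
    "021": "Minato-ku",
}


def count_by_value(store: dict[str, str]) -> Counter:
    """Count how many stored entries share each value, without touching other units."""
    return Counter(store.values())
```

Since only the address pieces are read, no decryption or combination is involved, and the result says nothing about whose addresses were counted.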

一方、検索要求が、分割された各データの結合を必要とする場合には、要求判定部３５１は、結合必要と判定する（ステップ１５のＹＥＳ）。たとえば、特定の氏名とその人の住所といったように、分散されたデータを結合して、個人を特定できるようなものとする要求である場合がこれに該当する。 On the other hand, if the search request requires combining of the divided data, the request determination unit 351 determines that combining is necessary (YES in step 15). For example, this is the case where the request is such that a specific name and the address of the person can be used to identify the individual by combining the distributed data.

かかる場合には、探索部３５３が、入力された検索条件と、探索情報及び探索ルールに従って、第１の記憶部１１０のデータに対応する第２の記憶部１２０のデータを探索する（ステップ１６）。たとえば、氏名から住所を検索する場合には、第１の記憶部１１０の氏名に付与された対応する第２の記憶部１２０の氏名を探索する。上記の例では、乱数表から、氏名（鈴木太郎）に付与されたキー（０１０）に対応するキー（０１５）を判定し、これに対応するデータを探索する（図５参照）。 In such a case, the search unit 353 searches for the data in the second storage unit 120 that corresponds to the data in the first storage unit 110, in accordance with the input search condition, the search information, and the search rule (step 16). For example, when searching for an address from a name, the search unit 353 searches for the name in the second storage unit 120 that corresponds to the name in the first storage unit 110. In the above example, the key (015) corresponding to the key (010) assigned to the name (Taro Suzuki) is determined from the random number table, and the data corresponding to it is searched for (see FIG. 5).

次に、第２の検索部３５４が、結合情報及び結合ルールに従って、探索されたデータに結合すべきデータを、第２の記憶部１２０から検索する（ステップ１７）。上記の例では、乱数表から、暗号化された氏名（鈴木太郎）に付与されたキー（００９）に対応するキー（０１２，００５）を判定し、これに対応するデータを検索する（図６参照）。 Next, the second search unit 354 retrieves, from the second storage unit 120, the data to be combined with the data that was found, in accordance with the combination information and the combination rule (step 17). In the above example, the keys (012, 005) corresponding to the key (009) assigned to the encrypted name (Taro Suzuki) are determined from the random number table, and the data corresponding to them is retrieved (see FIG. 6).

そして、復号化部３３２が、検索された各データを復号化する（ステップ１８）。さらに、結合部３５５が、復号化されたデータを結合する（ステップ１９）。結合された検索結果は、出力部５００に出力される（ステップ２０）。 Then, the decryption unit 332 decrypts each retrieved data (step 18). Further, the combining unit 355 combines the decrypted data (step 19). The combined search result is output to the output unit 500 (step 20).
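
Steps 16 to 19 can be sketched together as below. The table contents are invented sample data with illustrative keys, the dictionary layout is an assumption, and `toy_crypt` is a trivial XOR stand-in for the encryption and decryption units that offers no real security.

```python
from itertools import cycle

SECRET = b"demo-secret"  # hypothetical key material


def toy_crypt(data: bytes) -> bytes:
    """XOR stand-in used for both encryption and decryption; NOT secure."""
    return bytes(b ^ k for b, k in zip(data, cycle(SECRET)))


# Invented sample data: storage unit A (plaintext names) and A', B', C' (encrypted).
first_name = {"010": "Taro Suzuki"}
second = {
    "name": {"015": toy_crypt("Taro Suzuki".encode("utf-8"))},
    "address": {"012": toy_crypt("Minato-ku, Tokyo".encode("utf-8"))},
    "age": {"005": toy_crypt(b"35")},
}
search_info = {"010": "015"}                           # key1 -> key2
join_info = {"015": {"address": "012", "age": "005"}}  # key2 -> sibling key2s


def search_with_join(name: str, wanted: list[str]) -> list[dict]:
    results = []
    for key1, value in first_name.items():      # condition checked on plaintext names
        if value != name:
            continue
        key2 = search_info[key1]                 # step 16: find the encrypted counterpart
        linked = join_info[key2]                 # step 17: keys of the pieces to combine
        row = {"name": toy_crypt(second["name"][key2]).decode("utf-8")}  # step 18
        for attr in wanted:
            row[attr] = toy_crypt(second[attr][linked[attr]]).decode("utf-8")
        results.append(row)                      # step 19: combined record
    return results
```

Only the encrypted copies are fetched for the combination, so the pieces that travel together toward the combining step remain unreadable until they are decrypted there.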

［３．実施形態の効果］ [3. Effects of the embodiment]

以上のような本実施形態（本発明）の効果は、次の通りである。すなわち、本実施形態によれば、分割された各データを、単独で検索した場合には、暗号化等がなされていないために、処理負担が軽く、高速に検索することができる。この場合に、もしデータが内部の経路（たとえば、図１のＸ）から漏洩しても、個人を特定できないので、問題は少ない。 The effects of the present embodiment (the present invention) as described above are as follows. In other words, according to the present embodiment, when each divided data is searched independently, the processing load is light and the search can be performed at high speed because encryption is not performed. In this case, even if the data leaks from the internal route (for example, X in FIG. 1), the individual cannot be specified, so there are few problems.

一方、分割された各データを結合させて検索する場合には、暗号化されたデータのみが経路を流れる。このため、同時間に内部の経路（たとえば、図１のＹ）を流通する複数のデータが漏洩したとしても、内容を秘匿することができる。 On the other hand, when searching by combining the divided data, only the encrypted data flows through the path. For this reason, even if a plurality of data flowing through the internal route (for example, Y in FIG. 1) leaks at the same time, the contents can be concealed.

［４．他の実施形態］ [4. Other Embodiments]

本発明は、上記のような実施形態に限定されるものではない。たとえば、本発明は、特定の１台のコンピュータによって実現されるものには限定されない。複数のサーバ装置により構成したり、各機能ブロックに応じて複数台のコンピュータで処理を分散したりすることにより、複数台が連繋して処理を行うシステムとして構成することもできる。 The present invention is not limited to the embodiment as described above. For example, the present invention is not limited to that realized by a specific single computer. By configuring with a plurality of server devices or distributing the processing with a plurality of computers according to each functional block, it is also possible to configure a system in which a plurality of devices are connected to perform processing.

上記の実施形態では、第１の記憶部及び第２の記憶部において、データをそれぞれ３つに分散させていたが、この分散数についても、２つ以上であれば、どのような数でもよい。第１の記憶部と第２の記憶部で分散数が異なっていてもよい。 In the above embodiment, the data is distributed across three storage units in each of the first storage unit and the second storage unit. However, the number of distributions may be any number of two or more. The number of distributions may also differ between the first storage unit and the second storage unit.

本発明が取り扱うデータについても、特定のものには限定されない。上記で例示したデータはその一例に過ぎず、分散管理されるデータに広く適用可能である。個人情報として、どのようなデータを含めるかも自由であり、分割の単位、数についても自由である。本発明は、個人情報の漏洩防止に有効であるが、使用するデータは、個人情報には限定されない。 The data handled by the present invention is not limited to specific data. The data illustrated above is only one example, and can be widely applied to data that is distributed and managed. What kind of data can be included as personal information is free, and the division unit and number are also free. The present invention is effective for preventing leakage of personal information, but the data used is not limited to personal information.

Claims

Dividing means for dividing a plurality of pieces of information composed of data of a plurality of attributes for each of the attributes;
First assigning means for assigning a unique first key to each data divided by the dividing means based on a first algorithm;
A first storage unit in which each first key assigned by the first assigning means and data corresponding to each of the first keys are stored in a random position;
Second assigning means for assigning a unique second key to the plurality of information based on a second algorithm;
Encryption means for encrypting the plurality of information;
A second storage unit in which the second key assigned by the second assigning unit and the data encrypted by the encrypting unit are stored in a pair at random positions;
A third storage unit in which the first and second keys, the first and second algorithms, and association information between data constituting the information are stored;
A distributed database system comprising:

The distributed database system according to claim 1, further comprising means for restoring the information based on the storage contents of the third storage unit.

The distributed database system according to claim 1, wherein the second assigning unit assigns a unique second key to each data stored in the first storage unit based on the second algorithm.

The distributed database system according to claim 1, wherein the encryption unit encrypts each data stored in the first storage unit.

The distributed database system according to claim 1, further comprising a reading unit that reads out the encrypted data stored in the second storage unit at the time of reading the information data.