JPH04181461A

JPH04181461A - How to manage distributed files

Info

Publication number: JPH04181461A
Application number: JP2308900A
Authority: JP
Inventors: Kazuhisa Eguchi; 江口　和久; Kenji Sato; 健二佐藤; Masayuki Orimo; 織茂　昌之; Shigeki Hirasawa; 茂樹平澤; Kinji Mori; 森　欣司; Hitoshi Suzuki; 仁鈴木; Ichitaro Ogo; 小合　一太郎
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1990-11-16
Filing date: 1990-11-16
Publication date: 1992-06-29

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】〔産業上の利用分野〕この発明は分散ファイルの管理方法、とくに複数コンピ
ュータに多重化された分散ファイルのオンライン回復時
に好適な管理方法に関する。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a distributed file management method, and particularly to a management method suitable for online recovery of distributed files multiplexed on multiple computers.

[Conventional technology]

従来の技術には、ファイルにアクセスしようとした時に
、該ファイルが回復中でアクセスできなかった場合には
、アクセス依頼元コンピュータは、直ちにアクセス依頼
を再送するというものがある。Conventional techniques include a technique in which when an attempt is made to access a file and the file cannot be accessed because it is being recovered, the access requesting computer immediately resends the access request.

なお、この種の技術に関連するものには、例えば、特開
昭６２−１０２３４２号公報等が挙げられる。Incidentally, examples related to this type of technology include, for example, Japanese Unexamined Patent Publication No. 102342/1983.

[Problem to be solved by the invention]

上記従来技術では、回復までにかかる時間を考慮してい
ないため、アクセス依頼元コンピュータは、ファイルへ
のアクセス依頼を、アクセスできるまで何度も送信をし
なければならず、無駄な送受信が多いという問題点があ
る。The above conventional technology does not take into account the time required for recovery, so the access requesting computer has to send the file access request many times until it can access the file, resulting in a lot of wasteful sending and receiving. There is a problem.

また、ファイルのアクセス状況を考慮していないので１
分割されたファイルの回復中の部分にアクセスしてしま
う可能性が大きく、ファイルへのアクセス依頼を、アク
セスできるまで何度も送信をしなければならず、無駄な
送受信が多いという問題点がある。Also, since the access status of the file is not taken into consideration, 1
There is a high possibility that the part of the divided file that is being recovered will be accessed, and requests to access the file must be sent many times until the file can be accessed, resulting in a lot of wasteful sending and receiving. .

本発明の目的は、ファイルの回復中に何度もアクセス依
頼、アクセス結果のメツセージを送受信する無駄を防止
することができる分散ファイルの管理方法を提供するこ
とにある。An object of the present invention is to provide a distributed file management method that can prevent the waste of sending and receiving access requests and access result messages many times during file recovery.

[Means to solve the problem]

上記目的を達成するために、アクセス出来なかった場合
には、ファイル全体の長さと回復済み部分の長さ等から
、ファイルの回復にかかる時間を算出し、算出結果に基
づいてアクセス依頼を伝送路に再送出する。In order to achieve the above purpose, if access is not possible, the time required to recover the file is calculated from the length of the entire file and the length of the recovered part, and an access request is sent to the transmission line based on the calculation result. resend to.

またファイルの回復は分割されたファイルの部分毎に行
なわれる場合、ファイルのアクセス状況に応じてファイ
ルの回復単位の大きさを変える。Furthermore, when file recovery is performed for each divided file, the size of the file recovery unit is changed depending on the file access status.

[Effect]

ファイル全体の長さと回復済み部分の長さ等から、ファ
イルの回復にかがる時間を算出し、算出結果に基づいて
アクセス依頼を伝送路に再送出するので、何度も無駄な
送受信をしなくて済む。The time required to recover the file is calculated from the length of the entire file and the length of the recovered portion, etc., and the access request is re-sent to the transmission path based on the calculation results, thereby avoiding unnecessary transmission and reception. You don't have to do it.

また、ファイルを分割して部分毎に回復し、回復中の部
分以外には、アクセスが可能な、分散ファイルシステム
において、ファイルのアクセス状況に応じてファイルの
回復単位の大きさを変えるので、ファイルにアクセスで
きない可能性が減り、もし回復中の部分にアクセスしよ
うとした場合には、回復中の部分の大きさにより、ファ
イルの回復にかかる時間を算出し、算出結果に基づいて
アクセス依頼を伝送路に再送出するので何度も無駄な送
受信をしなくて済む。In addition, in a distributed file system where a file is divided into parts and recovered part by part, and parts other than the part being recovered can be accessed, the size of the file recovery unit is changed depending on the access status of the file. The possibility of not being able to access the file is reduced, and if you try to access the part that is being recovered, the time required to recover the file is calculated based on the size of the part that is being recovered, and an access request is sent based on the calculation result. Since the data is retransmitted to the network, there is no need to repeatedly send and receive data.

〔Example〕

以下、本発明の一実施例を図面により詳細に説明する。 Hereinafter, one embodiment of the present invention will be described in detail with reference to the drawings.

第４図は、マルチコンピュータシステムの一例を示すも
ので、複数のコンピュータ１，２゜３・・・が伝送路７
を介して接続され、相互にデータを蓄積するための外部
記憶装置！４．，５．６・・・には。FIG. 4 shows an example of a multi-computer system, in which a plurality of computers 1, 2, 3, etc. are connected to a transmission line 7.
External storage devices connected to each other to accumulate data! 4. , 5.6...

それぞれ、ファイルＡ８，１０．ファイルＢ９゜ファイ
ルＣ１ｌ、１２．ファイルＤ１３・・・が定義されてお
り、多重化されたファイルが存在する（ファイルＢ、フ
ァイルＤは図に示されていない、伝送路７を介して接続
された、他のコンピュータの外部記憶装置上に存在する
ものとする）。Files A8, 10. File B9゜File C1l, 12. File D13... is defined, and multiplexed files exist (File B and File D are not shown in the figure, and are connected to the external storage device of another computer via transmission path 7. ).

第１図は、本発明の一実施例のコンピュータの構成の一
例である。コンピュータ１，２．３・・は伝送制御装置
１０１と情報処理装置１０４とから成る。FIG. 1 shows an example of the configuration of a computer according to an embodiment of the present invention. The computers 1, 2, 3, . . . consist of a transmission control device 101 and an information processing device 104.

伝送制御装置１０１は、選択受信管理ユニット１０２と
、選択受信テーブル１０３とを有しており、選択受信テ
ーブル１０３には、外部記憶装置上にあるファイルの論
理ファイル名が登録されている。The transmission control device 101 has a selective reception management unit 102 and a selective reception table 103, and the logical file names of files on the external storage device are registered in the selective reception table 103.

情報処理装Ｗ１０４は、受信バッファ１０５゜１０７、
送信バッファ１０６．１０８、クライアント管理ユニッ
ト１１０、サーバ管理ユニット１１４、分散ファイル管
理テーブル１１９、及び外部記憶管理ユニット１２０を
有している。The information processing device W104 includes reception buffers 105, 107,
It has transmission buffers 106 and 108, a client management unit 110, a server management unit 114, a distributed file management table 119, and an external storage management unit 120.

ユーザプログラムとのインタフェースをとるクライアン
ト管理ユニット１１０は、アクセス依頼送信部１１１、
アクセス結果受信部１１２、タイマ設定部１１３より成
る。The client management unit 110 that interfaces with the user program includes an access request transmission section 111;
It consists of an access result receiving section 112 and a timer setting section 113.

また、実際に分散ファイルにアクセスを行なうサーバ管
理ユニット１１４は、アクセス依頼受信部１１５、アク
セス結果送信部１１６、ファイル回復データ送信部１１
７、ファイル回復データ受信部１１８より成る。Further, the server management unit 114 that actually accesses the distributed file includes an access request receiving section 115, an access result transmitting section 116, and a file recovery data transmitting section 11.
7. It consists of a file recovery data receiving section 118.

分散ファイル管理テーブル１１９は、第２図に示すよう
に、分散ファイルのファイル名により区分けされたバッ
ファエリアを有している。各バッファエリアは、論理フ
ァイル名２１、物理ファイル名２２、回復中フラグ２３
、回復済みレコード数２４から成る。As shown in FIG. 2, the distributed file management table 119 has buffer areas divided by file names of distributed files. Each buffer area has a logical file name 21, a physical file name 22, and a recovery flag 23.
, the number of recovered records is 24.

第３図は、伝送路７に流れるメツセージ３０のフォーマ
ットを示すもので、３１はメッセージ長、３２は論理フ
ァイル名、３３はメツセージの発信元のコンピュータの
アドレス、３４はメツセージ種別、３５はメツセージ通
番、３６はインフォメーションである。FIG. 3 shows the format of the message 30 flowing through the transmission path 7, where 31 is the message length, 32 is the logical file name, 33 is the address of the message source computer, 34 is the message type, and 35 is the message serial number. , 36 is information.

メツセージ種別３４は、メツセージ３０が、アクセス依
頼、アクセス結果、ファイル回復データのいずれかの種
別を表す。The message type 34 indicates whether the message 30 is an access request, an access result, or file recovery data.

インフォメーション３６は、メツセージ種別３４がアク
セス依頼、アクセス結果の場合は、ファイルクライアン
トリスト３７と、データ３８とから成る。ファイル回復
データの場合は、データ３８から成る。ファイルクライ
アントリスト３７には、レコード長、ブロック長、レコ
ード数、アクセスステータス等、ファイルアクセスに必
要な情報が含まれている。The information 36 consists of a file client list 37 and data 38 when the message type 34 is an access request or an access result. In the case of file recovery data, it consists of data 38. The file client list 37 includes information necessary for file access, such as record length, block length, number of records, and access status.

コンピュータ１のファイルＡに障害が起きた場合（内容
が食い違った場合）、分散ファイル管理テーブル１１９
の当該ファイルの回復中フラグ２３を＋１″にして、コ
ンピュータ２のファイル回復データ送信部１１７はメツ
セージ３０を分散ファイルのレコード単位に送信バッフ
ァ１０８に設定し、選択管理ユニット１０２を通じ伝送
路７へ送信する。If a failure occurs in file A on computer 1 (if the contents are inconsistent), the distributed file management table 119
The file recovery data sending unit 117 of the computer 2 sets the message 30 in the sending buffer 108 in units of records of the distributed file, and sends the message 30 to the transmission line 7 through the selection management unit 102. do.

伝送路７上を流れる当該メツセージ３０の論理ファイル
名３２が選択受信テーブル１０３の論理ファイル名に登
録されているので、コンピュータ１の選択受信管理ユニ
ット１０２は当該メツセージ３０を、メツセージ種別３
４がファイル回復データであるので、受信バッファ１０
７へ受信する。Since the logical file name 32 of the message 30 flowing on the transmission path 7 is registered in the logical file name of the selective reception table 103, the selective reception management unit 102 of the computer 1 classifies the message 30 as message type 3.
4 is the file recovery data, so the receive buffer 10
Receive to 7.

コンピュータ１のサーバ管理ユニット１１４は、受信バ
ッファ１０７にメツセージ３０を受は取ると、そのメツ
セージ種別３４により処理の振り分けを行う。When the server management unit 114 of the computer 1 receives the message 30 in the reception buffer 107, it distributes processing according to the message type 34.

メツセージ種別３４がファイル回復データであるので、
ファイル回復データ受信部１１８において、メツセージ
の論理ファイル名３２で分散ファイル管理テーブル１１
９を検索し、その回復中フラグ２３をＩＩ　Ｉ　ＩＩに
して、インフォメーション３７の情報からファイルコピ
ーを行う。ファイルコピーはレコード単位に行い、分散
ファイル管理テーブル１１９の回復済レコード数２４に
カウントしておく。ファイルコピーが完了すると回復中
フラグ２３を”Ｏ′″にする。Since message type 34 is file recovery data,
In the file recovery data receiving unit 118, the distributed file management table 11 is
9 is searched, its recovery flag 23 is set to II II II, and file copy is performed based on the information in the information 37. File copying is performed record by record and counted in the number of recovered records 24 in the distributed file management table 119. When the file copy is completed, the recovery flag 23 is set to "O'".

コンピュータ３が回復中ではないファイルＢをアクセス
する場合、アクセス依頼送信部１１１は、ユーザプログ
ラムのアクセス依頼に対し、メツセージを送信バッファ
１０６に設定し、選択受信管理ユニットを１０２を通じ
、伝送路７へ送信する。When the computer 3 accesses the file B that is not being recovered, the access request sending unit 111 sets a message in the sending buffer 106 in response to the access request from the user program, and sends the selected reception management unit to the transmission line 7 through the selected reception management unit 102. Send.

伝送路７上を流れる当該メツセージ３０の論理ファイル
名３２が選択受信テーブル１０３の論理ファイル名に登
録されているので、コンピュータ１の選択受信管理ユニ
ット１０２は当該メツセージを受信する。Since the logical file name 32 of the message 30 flowing on the transmission path 7 is registered in the logical file name of the selective reception table 103, the selective reception management unit 102 of the computer 1 receives the message.

選択受信管理ユニット１０２はメツセージ種別３４をみ
て、受は取るバッファを切り換える。アクセス依頼なの
で受信バッファ１０７へ受信する。The selective reception management unit 102 looks at the message type 34 and switches the buffer to be received. Since it is an access request, it is received in the reception buffer 107.

受信バッファ１０７に受は取ったメツセージ３０のメツ
セージ種別３４がアクセス依頼なので、ファイルアクセ
ス依頼受信部１１５において、メツセージ３０の論理フ
ァイル名３２で分散ファイル管理テーブル１１９を検索
し、その回復中フラグ２３が○”であるので、ファイル
クライアントリスト３７の情報をもとにファイルアクセ
スを行い、ファイルアクセス結果送信部１１６において
アクセス結果のメツセージ３０を作成しアクセス依頼元
コンピュータ３へ送出する。Since the message type 34 of the message 30 received in the reception buffer 107 is an access request, the file access request receiving unit 115 searches the distributed file management table 119 using the logical file name 32 of the message 30 and checks that the recovery flag 23 is ○'', the file is accessed based on the information in the file client list 37, and the file access result transmitter 116 creates a message 30 representing the access result and sends it to the access requesting computer 3.

コンピュータ３のクライアント管理ユニット１１０では
、アクセス結果受信部１１２で、メツセージ種別３４が
アクセス結果のメツセージ３０を受信バッファ１０５に
受は取る。In the client management unit 110 of the computer 3, the access result receiving unit 112 receives the message 30 whose message type 34 is the access result into the receiving buffer 105.

コンピュータ３が回復中のファイルＡをアクセスする場
合、アクセス依頼送信部１１１は、ユーザプログラムの
アクセス依頼に対し、メツセージを送信バッファ１０６
に設定し、選択受信管理ユニットを１０２を通じ、伝送
路１１へ送信する。When the computer 3 accesses the file A that is being recovered, the access request sending unit 111 sends a message to the sending buffer 106 in response to the access request from the user program.
, and transmits it to the transmission path 11 through the selective reception management unit 102.

伝送路７上を流れるメツセージ３０の論理ファイル名３
２が選択受信テーブル１０３の論理ファイル名に登録さ
れているので、コンピュータ１の選択受信管理ユニット
１０２は該メツセージを受信する。Logical file name 3 of message 30 flowing on transmission path 7
2 is registered in the logical file name of the selective reception table 103, the selective reception management unit 102 of the computer 1 receives the message.

受信バッファ１０７に受は取ったメツセージ３０のメツ
セージ種別３４がアクセス依頼なので、ファイルアクセ
ス依頼受信部１１５において、メツセージ３０の論理フ
ァイル名３２で分散ファイル管理テーブル１１９を検索
し、その回復中フラグ２３が”１″であるので、ファイ
ルクライアントリスト３７のアクセスステータスにファ
イル回復中のステータス及び回復済みレコード数を設定
して、アクセス依頼元コンピュータ３へ送出する。Since the message type 34 of the message 30 received in the reception buffer 107 is an access request, the file access request receiving unit 115 searches the distributed file management table 119 using the logical file name 32 of the message 30 and checks that the recovery flag 23 is Since it is "1", the file recovery status and the number of recovered records are set in the access status of the file client list 37 and sent to the access request source computer 3.

コンピュータ３のクライアント管理ユニット１１０では
、アクセス結果受信部１１２で、メツセージ種別３４が
アクセス結果のメツセージ３ｏを受信バッファ３５に受
は取る。ファイルクライアントリスト３７のアクセスス
テータスがファイル回復中のステータスであるので、タ
イマ設定部ｌ１３で、そのファイルの総レコード数、回
復済みレコード数、およびレコード長からファイル未回
復部分の長さをを求め、ある一定の長さのデータをディ
スクに出力する時間は性能実測値として得られるので、
それらから、未回復部分の回復処理に要する時間を計算
し、アクセスを回復に必要な時間の経過後再び行なうよ
うに設定する。In the client management unit 110 of the computer 3, the access result receiving unit 112 receives the message 3o whose message type 34 is the access result into the receiving buffer 35. Since the access status of the file client list 37 is the file recovery status, the timer setting unit 113 calculates the length of the unrecovered portion of the file from the total number of records, the number of recovered records, and the record length of the file. The time to output a certain length of data to disk can be obtained as an actual performance value, so
From these, the time required to recover the unrecovered portion is calculated, and the access is set to be performed again after the time required for recovery has elapsed.

なお、ここで言う回復処理に要する時間は、理論値であ
る。ＣＰＵや回線の負荷、ＣＰＵの処理時間、サーバ管
理ユニットでアクセス結果を送信してからのアクセス結
果を受信するまでの間かかる時間等を考慮していないの
で、回復処理に要する時間の経過後再度アクセス依頼を
しても、まだ、回復中で、再度タイマを設定しなければ
ならない場合や、既に回復が終了している場合がありえ
る。Note that the time required for the recovery process referred to here is a theoretical value. This does not take into account the load on the CPU or line, the processing time of the CPU, the time it takes from the time the server management unit sends the access result to the time it receives the access result, so try again after the time required for recovery processing has elapsed. Even if you make an access request, there are cases where the recovery is still in progress and the timer needs to be set again, or where the recovery has already finished.

そこで、前回迄に回復処理に要する時間の経過後再度ア
クセス依頼をしてから待たなければならなかった時間、
待つ必要の無かった時間をもとに、補正値を設定して、
理論値の回復処理に要する時間を増減する。Therefore, the time you had to wait after making an access request again after the time required for recovery processing had elapsed,
Set the correction value based on the time you did not have to wait,
Increase or decrease the time required to recover the theoretical value.

タイマにより回復処理に要する時間が経過すると、アク
セス依頼送信部１１１において、再度アクセス依頼メツ
セージを送出する。When the time required for the recovery process has elapsed according to the timer, the access request sending unit 111 sends the access request message again.

なお、再度アクセス依頼メツセージを送出した結果、ま
だ回復中であった場合には、再度未回復部分の回復処理
に要する時間を計算し、アクセスを回復に必要な時間の
経過後再び行なうように設定する。If the result of sending the access request message again is that it is still being recovered, the time required to recover the unrecovered portion is calculated again, and the access is set to be performed again after the time required for recovery has elapsed. do.

さらに、分散ファイル管理テーブル１１９を第５図の様
にして、ファイルの回復単位の大きさを可変にする場合
の例について説明する。分散ファイル管理テーブル１１
９は、論理ファイル名２１、物理ファイル名２２、回復
中フラグ２３、回復中先頭レコード番号２５、回復中最
終レコード番号２６、アクセスユーザ数２７から成る。Furthermore, an example in which the distributed file management table 119 is configured as shown in FIG. 5 and the size of the file recovery unit is made variable will be described. Distributed file management table 11
9 consists of a logical file name 21, a physical file name 22, a recovery flag 23, a recovery first record number 25, a recovery last record number 26, and the number of access users 27.

コンピュータ３のユーザプログラムからの分散ファイル
Ａへのアクセス依頼は、クライアント管理ユニット１１
０がアクセス依頼送信部１１１において、メツセージ種
別３４がアクセス依頼のメツセージ３０をサーバ管理ユ
ニット１１４に送信することにより行なう。この時、メ
ツセージ３゜のメツセージ通番３４にはユーザプログラ
ム毎に○″から始まり、１”　１１２　ＩＩ　、　１１
３　Ｐ＋・・と値を設定する。ユーザプログラムが当該
分散ファイルへのアクセスを終了した場合には、メツセ
ージ通番３４に７１１１１を設定してメツセージを送る
。A request for access to the distributed file A from the user program of the computer 3 is sent to the client management unit 11.
This is done by transmitting a message 30 whose message type 34 is an access request to the server management unit 114, where 0 is the access request transmitter 111. At this time, the message serial number 34 of message 3° starts with ○'' for each user program, and is 1'' 112 II, 11
3 Set the value as P+... When the user program finishes accessing the distributed file, it sets the message serial number 34 to 71111 and sends the message.

コンピュータ２のサーバ管理ユニット１１４が受信バッ
ファ１０７にメツセージ種別３４がアクセス依頼のメツ
セージ３０を受は取ると、ファイルアクセス依頼受信部
１１５において、メツセージ３０の論理ファイル名３２
で分散ファイル管理テーブル１１９を検索する。メツセ
ージ通番３５がｎｏ”であれば、ユーザプログラムから
の最初のアクセス依頼であることを意味するので、分散
ファイル管理テーブル１１９の該当するアクセスユーザ
数２７に１をプラスする。また、メツセージ通番３５が
Ｐ＋１１″であれば、ユーザプログラムからのアクセス
依頼が終了したことを意味するので。When the server management unit 114 of the computer 2 receives a message 30 whose message type 34 is an access request in the reception buffer 107, the file access request receiving unit 115 stores the logical file name 32 of the message 30.
to search the distributed file management table 119. If the message serial number 35 is "no", it means that this is the first access request from the user program, so 1 is added to the corresponding access user number 27 in the distributed file management table 119. If it is P+11'', it means that the access request from the user program has ended.

分散ファイル管理テーブル１１９の該当するアクセスユ
ーザ数２７から１を引く。以上の処理によリ、分散ファ
イル管理テーブル１１９を参照することにより、当該分
散ファイルＡにアクセス中のユーザ数がわかる。Subtract 1 from the corresponding number of access users 27 in the distributed file management table 119. Through the above processing and by referring to the distributed file management table 119, the number of users accessing the distributed file A can be determined.

コンピュータ１のファイルＡの回復のためにユーザがフ
ァイル回復要求を行なうと、コンピュータ２のサーバ管
理ユニット１１４は、ファイル回復データ送信部１１７
において、分散ファイル管理テーブル１１９を検索し、
そのアクセスユーザ数２７に応して、その回復単位の大
きさを決定する。アクセスユーザ数２７がＯならば、フ
ァイル回復を早く完了させるために大きな単位で行ない
、Ｏでなければ、アクセス依頼をできるだけ待たせない
ように小さな単位でファイル回復を行なう。When a user makes a file recovery request to recover file A on computer 1, server management unit 114 on computer 2 sends file recovery data transmitter 117 to
, searches the distributed file management table 119,
The size of the recovery unit is determined according to the number of access users 27. If the number of access users 27 is O, file recovery is performed in large units in order to complete it quickly, and if it is not O, file recovery is performed in small units so that access requests are not made to wait as much as possible.

実際に１回で送信できるメツセージの大きさは送信バッ
ファのサイズまでなので、アクセスユーザ数２７が０な
らば送信バッファ１０８の大きさでファイルを分割し、
アクセスユーザ数２７が１増える毎に例えば２分の１の
大きさで分割を行なう。The size of a message that can actually be sent at one time is up to the size of the sending buffer, so if the number of access users 27 is 0, the file is divided by the size of the sending buffer 108,
Each time the number of access users 27 increases by 1, division is performed, for example, by half the size.

これは、コンピュータシステムの特性等で、最適な数値
にするのが望ましい。ただし、レコード長以下の分割は
行なわない。分割したファイルの最初の分割部分の先頭
のレコード番号、最終のレコード番号を、分散ファイル
管理テーブル１１９に回復中先頭レコード番号２５、回
復中最終レコード番号２６として設定してから回復デー
タを送信する。そして、送信した部分が回復されたら、
前回送信した回復中最終レコード番号２６の次のレコー
ド番号を回復中先頭レコード番号２５とし、前回送信し
たレコード数と同じになるように回復中最終レコード番
号２６を設定してから回復データを送信する。It is desirable to set this to an optimal value depending on the characteristics of the computer system. However, division below the record length is not performed. The first record number and the last record number of the first divided part of the divided file are set in the distributed file management table 119 as the first record number 25 during recovery and the last record number 26 during recovery, and then the recovery data is transmitted. And once the part you sent is recovered,
Set the record number next to the last record number 26 during recovery that was sent last time as the first record number 25 during recovery, set the last record number 26 during recovery so that it is the same as the number of records sent last time, and then send the recovery data. .

回復中のファイルに対して、コンピュータ２のサーバ管
理ユニット１１４がアクセス依頼のメツセージ３０を受
は取ると、分散ファイル管理テーブル１１９を検索し、
そのアクセス依頼が回復中先頭レコード番号２５と回復
中最終レコード番号２６の間のレコードに関するもので
あれば、ファイルクライアントリスト３７のアクセスス
テータスにファイル回復中のステータス及び、回復中先
頭レコード番号２５と回復中最終レコード番号２６を設
定して、アクセス依頼元のクライアント管理ユニット１
１０に返す。それをアクセス結果受信部１１２が受信し
、タイマ設定部１１３が回復中先頭レコード番号２５と
回復中最終レコード番号２６等を基に回復にかかる時間
を計算し、回復に必要な時間の経過後、再度アクセス依
頼を行なうのは前述の場合と同様である。When the server management unit 114 of the computer 2 receives the message 30 requesting access to the file being recovered, it searches the distributed file management table 119,
If the access request relates to a record between the first record number under recovery 25 and the last record number under recovery 26, the access status of the file client list 37 will include the status ``file recovery'' and the first record number 25 under recovery and the record number 26. Set the intermediate and final record number 26, and the client management unit 1 of the access request source
Return to 10. The access result receiving unit 112 receives it, and the timer setting unit 113 calculates the time required for recovery based on the first record number 25 during recovery, the last record number 26 during recovery, etc., and after the time necessary for recovery has elapsed, The access request is made again in the same way as in the case described above.

アクセス依頼が回復中先頭レコード番号２５と回復中最
終レコード番号２６の間のレコードに関するものでなけ
ればアクセスを行なう。アクセス出来るのは、ファイル
のうち回復の終了した部分だけではなく、回復が終了し
ていないが、回復中でないところも含む。回復の終了し
ていない部分のデータが更新されても、その部分が回復
されれば同じになるので問題はない。If the access request does not relate to a record between the top record number 25 during recovery and the last record number 26 during recovery, access is performed. What can be accessed is not only the portion of the file that has been recovered, but also includes portions of the file that have not yet been recovered but are not currently being recovered. There is no problem even if the data in the unrecovered part is updated, as it will be the same once that part is recovered.

本実施例では１回復処理開始時点でのアクセスユーザ数
に応じて、回復単位の大きさを変えているが１回復中の
アクセスユーザ数に応じて可変にしても良い。In this embodiment, the size of the recovery unit is changed depending on the number of accessing users at the start of one recovery process, but it may be made variable depending on the number of accessing users during one recovery.

本実施例によれば、ファイルが分割数されているので、
ファイルの回復中の部分以外のところにはアクセスでき
るので、回復中の分散ファイルにアクセスするユーザプ
ログラムの回復処理終了待ち時間を短縮できる。According to this embodiment, since the file is divided into several parts,
Since parts of the file other than the part that is being recovered can be accessed, the waiting time for a user program to complete the recovery process when accessing the distributed file that is being recovered can be shortened.

〔Effect of the invention〕

本発明によれば、ファイルの回復に必要な時間の経過後
、再度アクセス依頼をすることにより、ファイルの回復
中に何度もアクセス依頼、アクセス結果のメツセージを
送受信する無駄が防止できるという効果がある。According to the present invention, by making an access request again after the time required for file recovery has elapsed, it is possible to prevent the waste of sending and receiving access requests and access result messages many times during file recovery. be.

また、アクセスユーザ数に応じて、回復単位の大きさを
変えることにより、ファイルの回復中の部分にアクセス
してしまう確率を減らしながらも、出来るだけ早く回復
できるという効果がある。Furthermore, by changing the size of the recovery unit depending on the number of accessing users, it is possible to recover as quickly as possible while reducing the probability of accessing the portion of the file that is being recovered.

[Brief explanation of drawings]

第１図は、本発明の一実施例のコンピュータの。構成図、第２図は、分散ファイル管理テーブルのフォー
マットを示す図、第３図は、メツセージのフォーマット
を示す図、第４図は、本発明の一実施例の多重化ファイ
ルを持つマルチコンピュータシステムの構成図、第５図
は、分散ファイル管理テーブルの別のフォーマットを示
す図である。１．２．３・・・コンピュータ、４．５．６・・・外部記憶装置、７・・・伝送路、８．１ｏ・・・ファイルＡ、９・・ファイルＢ、１１．１２・・・ファイルＣ１１３・・・ファイルＤ、２１・・・論理ファイル名。２２・・・物理ファイル名。２３・・・回復中フラグ、２４・・・回復済レコード数、２５・・・回復中先頭レコード番号、２６・・・回復中最終レコード番号、２７・・・アクセスユーザ数、３０・・・メツセージ、３１・・・メッセージ長、３２・・論理ファイル名、３３・・・発信元のコンピュータのアドレス、３４・・
・メツセージ種別、３５・メツセージ通番、３６・・・インフォメーション、３７・・・ファイルクライアントリスト。３８・・・データ、１０１・・・伝送制御装置、１０２・・・選択受信管理ユニット、１０３・・・選択受信テーブル、１０４・・・情報処理装置、１０５．１０７・・・受信バッファ、１０６．１０８・送信バッファ、１１０・・・クライアント管理ユニット、１１１・・・
アクセス依頼送信部、１１２・・・アクセス結果受信部、１１３・・・タイマ設定部、１１４・・・サーバ管理ユニット、１１５・・・アクセス依頼受信部、１１６・・・アクセス結果送信部、１１７・・・ファイル回復データ送信部、１１８・・・
ファイル回復データ受信部、１１９・・分散ファイル管
理テーブル、１２０・・・外部記憶管理ユニット。FIG. 1 shows a computer according to an embodiment of the present invention. 2 is a diagram showing the format of a distributed file management table, FIG. 3 is a diagram showing a message format, and FIG. 4 is a multicomputer system with multiplexed files according to an embodiment of the present invention. FIG. 5 is a diagram showing another format of the distributed file management table. 1.2.3...Computer, 4.5.6...External storage device, 7...Transmission line, 8.1o...File A, 9...File B, 11.12... File C1 13...File D, 21...Logical file name. 22...Physical file name. 23...Recovery flag, 24...Recovered record number, 25...Recovery first record number, 26...Recovery last record number, 27...Number of access users, 30...Message , 31... Message length, 32... Logical file name, 33... Source computer address, 34...
- Message type, 35 - Message serial number, 36... Information, 37... File client list. 38... Data, 101... Transmission control device, 102... Selected reception management unit, 103... Selected reception table, 104... Information processing device, 105.107... Reception buffer, 106. 108・Transmission buffer, 110... Client management unit, 111...
Access request transmitting section, 112... Access result receiving section, 113... Timer setting section, 114... Server management unit, 115... Access request receiving section, 116... Access result transmitting section, 117. ...File recovery data transmission section, 118...
File recovery data receiving unit, 119... Distributed file management table, 120... External storage management unit.

Claims

[Claims] 1. In a distributed file system that is composed of multiple computers and in which multiple files storing the same data exist distributed among the multiple computers, the access requesting computer attempts to access the file system. 1. A distributed file management method, comprising: requesting file access again to the file after a time period required for recovery of the file has elapsed in response to the fact that the file is being recovered. 2. A distributed file system consisting of multiple computers, where multiple files containing the same data are distributed across multiple computers, and file recovery is performed for each subdivided file part. A distributed file management method characterized in that the size of a file recovery unit is changed depending on the file access status. 3. The access request source computer is characterized in that, in response to the fact that the file to be accessed is being recovered, the access requesting computer issues another file access request for the file after the time required to recover the file has elapsed. The distributed file management method according to claim 2.