JPH07183891A

JPH07183891A - Computer system

Info

Publication number: JPH07183891A
Application number: JP5328162A
Authority: JP
Inventors: Hoze Uemura; ホゼウエムラ; Takashi Sakakura; 隆史坂倉
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1993-12-24
Filing date: 1993-12-24
Publication date: 1995-07-21
Anticipated expiration: 2018-09-16
Also published as: JP3447347B2

Abstract

(57)【要約】【目的】計算機システムにおいて、より少ないハード
ウェア開発で稼働率の向上、ならびに、信頼性の向上を
目的とする。【構成】１つ、あるいは、複数のネットワークに接続
された複数の計算機から構成される計算機システムで、
被検プログラム１１２，１１３，１１４は計算機１０３
の負荷状況により検出信号の送出間隔を変えながら検出
信号を障害検出プログラム１０８，１０７または１０６
へ送出する。一方、障害検出プログラムも計算機１０３
の負荷状況を調べ、送られてきた検出信号がタイムアウ
トか否かチェックし障害発生の有無を判断する。 (57) [Summary] [Purpose] In computer systems, the objective is to improve operating rates and reliability by developing less hardware. [Structure] A computer system composed of one or a plurality of computers connected to a plurality of networks,
The programs 112, 113 and 114 to be inspected are the computer 103
Fault detection program 108, 107 or 106 while changing the transmission interval of the detection signal according to the load condition of
Send to. On the other hand, the fault detection program is also the computer 103
The load condition is checked to see if the sent detection signal has timed out to determine whether a failure has occurred.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、耐故障性を備えた計算
機システムに関し、特に、ネットワークに接続された、
複数台の計算機から構成される計算機システムに関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a fault tolerant computer system, and more particularly to a computer system connected to a network.
The present invention relates to a computer system including a plurality of computers.

【０００２】[0002]

【従来の技術】近年、計算機システムの信頼性に対する
関心が高まってきている。その背景として、従来の計算
機システムにおいて、ダウンサイジングとよばれるよう
に汎用機上にあった機能を端末側に移行する、また、セ
ンタ機能自体をマイクロプロセッサベースの計算機上で
実現することによる低コスト化を図る試みが盛んになさ
れている。ところが、汎用機システムがハードウェアの
故障、誤動作、また、これらを要因とするソフトウェア
障害、ソフトウェア自体の障害に対してその対処手段を
持ち、より高い稼働率や信頼性を実現するのに対し、マ
イクロプロセッサベースの計算機システムでは、障害対
処手段が貧弱であり、高い稼働率や、信頼性を要求され
るアプリケーションを稼働させるのは難しかった。2. Description of the Related Art In recent years, interest in the reliability of computer systems has increased. As the background, in the conventional computer system, the function that was on the general-purpose computer is moved to the terminal side, which is called downsizing, and the low cost by realizing the center function itself on the microprocessor-based computer. There are many attempts to achieve this. However, while a general-purpose machine system has a means for coping with hardware failures, malfunctions, software failures caused by these, and failures of the software itself, while achieving higher operating rates and reliability, In a microprocessor-based computer system, failure coping measures are poor, and it has been difficult to run an application that requires high availability and reliability.

【０００３】ところで、ダウンサイジングにあたって、
特徴的なこととして、計算機の接続形態の変化がある。
従来の汎用機を中心としたシステムが、その端末群と、
電話回線や、シリアルラインによってクラスタ接続され
ていたのに対し、主にほぼ等価なマイクロプロセッサベ
ースの計算機で構成されるシステムでは個々の計算機
は、高速なローカルエリアネットワークに接続されるこ
とが多くなった。By the way, in downsizing,
Characteristically, there is a change in the connection form of computers.
A system centered on conventional general-purpose machines, with its terminal group,
In contrast to systems that were cluster-connected by telephone lines or serial lines, individual computers are often connected to high-speed local area networks in systems that consist mainly of microprocessor-based computers that are almost equivalent. It was

【０００４】本発明の課題である計算機システムの高い
稼働率、高信頼性を実現するために、過去に商用化され
たものとして、タンデム社、ストラタス社、セコイアシ
ステム社などによるものがある。これらのシステムは、
ハードウェアのコンポーネントを冗長化し、シングルポ
イントフェイル、つまり、ある一つのコンポーネントが
故障したためにシステム全体が停止しないようにできて
いる。しかし、ハードウェアの冗長化のため、これらシ
ステムは高価なものとなっている。In order to realize a high operating rate and high reliability of a computer system, which is the subject of the present invention, there have been commercialized products in the past by Tandem, Stratus, Sequoia Systems and the like. These systems
The hardware components are made redundant so that a single point of failure, that is, the entire system does not stop due to the failure of one component. However, hardware redundancy makes these systems expensive.

【０００５】近年のハードウェアの信頼性の向上、ま
た、要求される稼働率や信頼性のレベルにより、必ずし
も全てのコンポーネントの冗長化は、コスト面も考える
とそれを要求できない場合も多い。本発明は、かかるシ
ステムに適用され、稼働率、信頼性の向上を図るもの
で、ハードウェアの機構を最小限にとどめ、主にソフト
ウェアによる実現を図る。ハードウェアの冗長度が低い
計算機群で、これらの課題を達成するにあたって、高速
なローカルエリアネットワークが鍵となる。つまり、ハ
ードウェアの冗長度に欠けるところを、ネットワークを
通して、複数台の計算機で補い合うことでカバーする。
このようなアプローチによる製品も幾つか市場に既に出
ていて、ＤＥＣ社のＶＡＸクラスタ、ＩＢＭ社のＨＡＮ
ＦＳなどがある。ＤＥＣ社のＶＡＸクラスタと呼ばれる
製品は、デュアルポートディスクを共有する、ネットワ
ークで接続された複数台の計算機で構成され、主系の計
算機に障害が発生した時は、従系の計算機が業務を代行
する。ＩＢＭ社のＨＡＮＦＳは、同様にディスク装置を
共有するネットワークに接続された複数の計算機から構
成される計算機上で実行されており、複数の計算機間で
ファイル共有サービスを行うプログラムに障害が発生し
た時は、該サービスプログラムは、ディスク装置を共有
している他の計算機上で起動される。In recent years, due to the improvement in reliability of hardware and the required operation rate and reliability level, redundancy of all components cannot be required in many cases in view of cost. The present invention is applied to such a system to improve the operating rate and reliability, and minimizes the hardware mechanism, and is mainly implemented by software. A high-speed local area network is the key to achieving these tasks in a computer group with low hardware redundancy. In other words, the lack of hardware redundancy can be covered by complementing multiple computers through the network.
Some products based on this approach are already on the market, such as DEC's VAX cluster and IBM's HAN.
There is FS etc. The product called VAX cluster of DEC is composed of multiple computers that share a dual port disk and are connected by a network. When a failure occurs in the primary computer, the secondary computer takes over the work. To do. IBM's HANFS is running on a computer that is also composed of multiple computers connected to a network that shares disk devices, and when a program that performs a file sharing service between multiple computers fails. The service program is started on another computer sharing the disk device.

【０００６】特開平４−２３０５３８には、一定時間間
隔内に応答が受信されるか否かを障害発生の判断の１つ
とした障害ソフトウェアコンポーネントの検出方法、特
開平４−３４０６４９には、検知信号の発信によらない
ソフトウェア障害の検出方法が述べられている。特開平
１−６１８５５には、マルチプロセッサシステムにおけ
るバックアッププロセッサの起動方法が述べられてい
る。Japanese Unexamined Patent Publication No. 4-230538 discloses a method of detecting a faulty software component in which whether or not a response is received within a certain time interval is used as one of the judgments of the fault occurrence, and Japanese Unexamined Patent Publication No. 4-340649 discloses a detection signal. A method of detecting a software failure that does not depend on the transmission of a message is described. JP-A-1-61855 describes a method of activating a backup processor in a multiprocessor system.

【０００７】また、障害発生時の主記憶上のデータのダ
ンプの高速化を図るものの先行事例としては、特開平３
−２１１６３８に、コアデータを圧縮した上で２次記憶
上に退避する方法が述べられている。また、ディスクデ
ータの一貫性を図るために考えられた先行特許として、
特開平１−２７７３７２では、エラー発生すると書き込
んだデータ内容を、ホストシステムに返送する方式が述
べられている。Further, as a precedent example for speeding up the dumping of data on the main memory when a failure occurs, Japanese Patent Laid-Open No. Hei 3 (1998)
-2111638 describes a method of compressing core data and then saving it in secondary storage. Also, as a prior patent that was conceived to ensure consistency of disk data,
Japanese Patent Application Laid-Open No. 1-277372 describes a method of returning the written data contents to the host system when an error occurs.

【０００８】[0008]

【発明が解決しようとする課題】本発明の課題は、より
少ないハードウェアの投資で、高い稼働率や、信頼性を
提供する計算機システムを構築するための、基本的要素
を提供するものである。SUMMARY OF THE INVENTION An object of the present invention is to provide a basic element for constructing a computer system that provides a high operating rate and reliability with less investment of hardware. .

【０００９】従来のシステムにおいて、ソフトウェア障
害検知機構がタイムアウトを、その障害発生状態か否か
を判断する基準にしている場合、計算機の負荷状態によ
っては送信のために決められた時間よりも長い時間がか
かってしまい、タイムアウトとなり、障害発生と判断さ
れ、正確な判断が下せないという問題点があった。In the conventional system, when the software failure detection mechanism uses the time-out as a criterion for judging whether or not the failure has occurred, depending on the load state of the computer, the time longer than the time determined for transmission may be exceeded. However, there was a problem that it was timed out, it was judged that a failure occurred, and an accurate judgment could not be made.

【００１０】また、障害検知しようとするプログラムが
幾つかのプログラムのサービスを利用して成り立ってい
るとき、あるいは、相互にサービスを利用しあって成り
立っているとき、目的とするプログラムの障害検知を行
うだけでは不十分で、正確な判断ができないという問題
点があった。Further, when a program for which a failure is to be detected is made up of services of several programs or mutually made up of services, it is possible to detect a failure of a target program. There was a problem that it was not enough to do it, and an accurate judgment could not be made.

【００１１】そして、従来の障害検知方式では、検知機
構自体のシングルポイントフェイルに対応できないとい
う問題点があった。The conventional failure detection method has a problem that it cannot cope with a single point failure of the detection mechanism itself.

【００１２】更に、従来の障害検知、復旧方式では、復
旧手段が一律的であり、障害に対して、必要以上の処置
をとらざるを得ない場合が多いという問題点があった。Further, in the conventional fault detection and restoration method, the restoration means is uniform, and there is a problem that in many cases there is no choice but to take unnecessary measures against the fault.

【００１３】また、ソフトウェア障害が発生し、その復
旧にあたって、ネットワーク内の他の計算機上に移行す
る必要が生じたときに、固定的に移行先の計算機を決め
たのでは、移行先の負荷状況、資源状況によって、必ず
しも移行先として望ましいものとはならない場合がある
という問題点があった。Further, when a software failure occurs and it is necessary to transfer the software to another computer in the network for recovery, if the transfer destination computer is fixedly determined, the load condition of the transfer destination may be fixed. However, there is a problem that it may not always be desirable as a migration destination depending on the resource situation.

【００１４】より高稼働率、高信頼性システムを構築す
る上で、障害発生時になくてはならない、主記憶データ
の採取を高速に行うことも課題である。稼働率は、故障
修理期間を短くすることによって向上できる。従って、
システム再立ち上げを行う場合に、主記憶データのダン
プ時間を短くすることは、稼働率向上に寄与する。In constructing a system having a higher operating rate and a higher reliability, it is also a problem to collect main memory data at high speed, which is indispensable when a failure occurs. The operating rate can be improved by shortening the repair period. Therefore,
When the system is restarted, shortening the dump time of the main memory data contributes to improving the operating rate.

【００１５】また、主記憶上にキャッシュされたディス
クデータの一貫性を維持することを課題としており、こ
れによりシステム再立ち上げの場合、ディスク上に構築
されたファイルシステムの一貫性回復のために要する時
間を最小に押さえることができる。It is another object to maintain the consistency of the disk data cached in the main memory, so that when the system is restarted, the consistency of the file system constructed on the disk can be recovered. The time required can be kept to a minimum.

【００１６】また、データベースプログラムなどの保証
すべき信頼性が特に高いアプリケーションに利用される
べき機能で、全てのディスクに対する書き込みオペレー
ションに対し、成功か不成功かの場合に、更新後、更新
前の状態を必ず保証することにより、従来アプリケーシ
ョンプログラムの中で行っていたデータ一貫性保持操作
を簡略化し、かつ、ハードウェアレベルでデータの一貫
性を保証し、システムの高速化を図ることも課題であ
る。A function to be used for a particularly reliable application to be guaranteed, such as a database program. When a write operation to all disks is successful or unsuccessful, after updating and before updating. By guaranteeing the state without fail, the task of maintaining data consistency, which was conventionally performed in application programs, can be simplified, and data consistency can be guaranteed at the hardware level to speed up the system. is there.

【００１７】本発明は、上記のような問題点を解決し、
課題を達成するためになされたもので、より少ないハー
ドウェア開発で、より高い計算機システムの稼働率、信
頼性を実現することを目的とする。また、障害復旧時の
計算機立ち上げ時間を短縮することにより、稼働率の向
上を図ることを目的とする。また、ディスクデータの一
貫性を保証して、アプリケーションプログラムの中で行
っていた、データ一貫性保持操作を簡略化することによ
り、システムの高速化を図ることが目的である。The present invention solves the above problems,
It was made in order to achieve the task, and aims to realize higher computer system operation rate and reliability with less hardware development. It also aims to improve the operating rate by shortening the computer startup time during failure recovery. It is also an object of the present invention to speed up the system by guaranteeing the consistency of the disk data and simplifying the data consistency maintaining operation performed in the application program.

【００１８】[0018]

【課題を解決するための手段】第１の発明に係る計算機
システムは、（ａ）所定の処理を実行するとともに、そ
の処理の実行中に障害の発生の有無を検出し検出信号と
して送信する被検ソフトウェア、（ｂ）上記被検ソフト
ウェアからの検出信号を受信して、被検ソフトウェアの
障害を検出する障害検出ソフトウェアを有する計算機シ
ステムにおいて、上記被検ソフトウェア及び障害検出ソ
フトウェアは、それぞれシステムの負荷を検出する負荷
検出部と、システムの負荷に応じて検出信号の送受信間
隔を調整するための頻度表を持つことを特徴とする。According to a first aspect of the present invention, there is provided a computer system which: (a) executes a predetermined process, detects the presence or absence of a failure during the process, and transmits the detected signal as a detection signal. (B) In a computer system having failure detection software for receiving a detection signal from the software to be tested and detecting a failure in the software to be tested, the software to be tested and the software for failure detection each have a load on the system. It is characterized by having a load detection unit for detecting the frequency and a frequency table for adjusting the transmission / reception interval of the detection signal according to the load of the system.

【００１９】第２の発明に係る計算機システムは、
（ａ）所定の処理を実行するとともに、その処理の実行
中に障害の発生の有無を検出し検出信号として送信する
被検ソフトウェア、（ｂ）上記被検ソフトウェアからの
検出信号を受信して、被検ソフトウェアの障害を検出す
る障害検出ソフトウェアを有する計算機システムにおい
て、上記被検ソフトウェアは、検出信号の送出間隔の情
報を検出信号にふくませ上記障害検出ソフトウェアに通
知し、障害検出ソフトウェアは通知された検出信号の送
出間隔に基づいて被検ソフトウェアの障害を検出するこ
とを特徴とする。A computer system according to the second invention is
(A) A software under test that executes a predetermined process and detects whether or not a failure has occurred during the execution of the process and transmits it as a detection signal; (b) receives a detection signal from the software under test, In a computer system having a failure detection software for detecting a failure of the software to be tested, the software to be tested includes the information of the transmission interval of the detection signal in the detection signal and notifies the failure detection software, and the failure detection software is notified. The failure of the software under test is detected based on the transmission interval of the detected signal.

【００２０】第３の発明に係る計算機システムは、上記
第２の発明に係る計算機システムにおいて、上記被検ソ
フトウェアにシステムの負荷を検出する負荷検出部を備
え、負荷検出部により検出したシステムの負荷に応じて
検出信号の送出間隔を調整するための頻度表を持つこと
を特徴とする。A computer system according to a third aspect of the present invention is the computer system according to the second aspect of the present invention, wherein the software to be tested includes a load detecting section for detecting the load of the system, and the load of the system detected by the load detecting section. It is characterized by having a frequency table for adjusting the transmission interval of the detection signal according to.

【００２１】第４の発明に係る計算機システムにおい
て、上記被検ソフトウェアは、複数のプログラムを備
え、各プログラムが検出信号を送信するとともに、上記
障害検出ソフトウェアは、上記複数のプログラムの中の
関連するプログラムを示す管理情報を記憶するととも
に、管理情報に基づき関連する複数のプログラムからの
検出信号を受信して関連する複数のプログラムの障害を
検出することを特徴とする。In the computer system according to a fourth aspect of the present invention, the software to be tested includes a plurality of programs, each program sends a detection signal, and the fault detection software is related to the plurality of programs. It is characterized in that management information indicating a program is stored, and a detection signal from a plurality of related programs is received based on the management information to detect a failure of the plurality of related programs.

【００２２】第５の発明に係る計算機システムにおい
て、上記障害検出ソフトウェアは、第１の障害検出ソフ
トウェアと第２の障害検出ソフトウェアを備えており、
一方の障害検出ソフトウェアが他方の障害検出ソフトウ
ェアの障害を検出することを特徴とする。In the computer system according to the fifth aspect of the present invention, the fault detection software comprises first fault detection software and second fault detection software,
It is characterized in that one fault detection software detects a fault of the other fault detection software.

【００２３】第６の発明に係る計算機システムにおい
て、上記被検ソフトウェアは、障害検出ソフトウェアに
対する動作手順を指示するメッセージを検出信号中に含
ませるとともに、上記障害検出ソフトウェアは、メッセ
ージに対応する手順を記載した手順情報を持ち、メッセ
ージと手順情報により指示された手順を実行することを
特徴とする。In the computer system according to the sixth aspect of the present invention, the software under test includes a message instructing an operation procedure for the fault detection software in a detection signal, and the fault detection software includes a procedure corresponding to the message. It is characterized by having the written procedure information and executing the procedure instructed by the message and the procedure information.

【００２４】第７の発明に係る計算機システムは、複数
の計算機を備えており、各計算機は各計算機の状態を調
査して上記障害検出ソフトウェアに出力する調査プログ
ラムを備え、上記障害検出ソフトウェアは調査プログラ
ムからの調査結果と被検ソフトウェアの実行条件を考慮
して、障害が検出された被検ソフトウェアを他の計算機
により実行させることを特徴とする。A computer system according to a seventh aspect of the present invention comprises a plurality of computers, each computer comprising an investigation program for investigating the status of each computer and outputting it to the fault detection software. It is characterized in that the software under test in which the failure is detected is executed by another computer in consideration of the investigation result from the program and the execution condition of the software under test.

【００２５】第８の発明に係る計算機システムは、主記
憶装置および２次記憶装置を持つ複数の計算機が接続さ
れた計算機システムにおいて、主記憶装置の内容を分割
して複数の計算機の２次記憶装置に退避する分割退避手
段を備えたことを特徴とする。A computer system according to an eighth aspect of the present invention is a computer system in which a plurality of computers having a main storage device and a secondary storage device are connected to each other, and the contents of the main storage device are divided into secondary storages of the plurality of computers. It is characterized in that the apparatus is provided with a split retraction means.

【００２６】第９の発明に係る計算機システムは、上記
第８の発明に係る計算機システムにおいて、あらかじ
め、分割退避先を設定した管理表を備えたことを特徴と
する。A computer system according to a ninth invention is characterized in that, in the computer system according to the eighth invention, a management table in which a division save destination is set in advance is provided.

【００２７】第１０の発明に係る計算機システムは、上
記第８の発明に係る計算機システムにおいて、さらに、
ネットワークに２次記憶装置を持つ複数の計算機が接続
され、上記分割退避手段は、ネットワークを用いて複数
の２次記憶装置にデータを退避することを特徴とする。A computer system according to a tenth invention is the computer system according to the eighth invention, further comprising:
A plurality of computers having a secondary storage device are connected to the network, and the division saving means saves data to the plurality of secondary storage devices using the network.

【００２８】第１１の発明に係る計算機システムは、以
下の要素を有することを特徴とする。（ａ）データを記
憶する２次記憶装置、（ｂ）上記データを一時的に記憶
する主記憶装置、（ｃ）上記主記憶装置に記憶されたデ
ータの２次記憶装置への書き戻しの有無と、そのデータ
の正当性を判断するデータを管理情報として記憶する記
憶手段、（ｄ）障害発生時に、上記記憶手段により記憶
された管理情報により上記主記憶装置に記憶されたデー
タのうち、２次記憶装置に書き戻されていないデータを
判定し、その正当性をチェックしてデータを書き戻す書
き戻し手段。A computer system according to the eleventh invention is characterized by having the following elements. (A) Secondary storage device for storing data, (b) Main storage device for temporarily storing the data, (c) Presence / absence of write-back of data stored in the main storage device to the secondary storage device And storage means for storing data for judging the validity of the data as management information, (d) of the data stored in the main storage device by the management information stored by the storage means when a failure occurs, 2 A write-back means for determining data that has not been written back to the next storage device, checking its validity, and writing back the data.

【００２９】第１２の発明に係る計算機システムは、上
記第１１の発明に係る計算機システムにおいて、複数の
２次記憶装置を備えた多重系システムであり、上記書き
戻し手段は、複数の２次記憶装置に対してデータを書き
戻すことを特徴とする。A computer system according to a twelfth aspect of the present invention is the computer system according to the eleventh aspect of the present invention, which is a multiple system including a plurality of secondary storage devices, wherein the write-back means is a plurality of secondary storage devices. The feature is that the data is written back to the device.

【００３０】第１３の発明に係る計算機システムは、以
下の要素を有することを特徴とする。（ａ）データを記
憶する２次記憶装置、（ｂ）上記２次記憶装置に対して
データを書込む書込み手段、（ｃ）上記書込み手段によ
り更新しようとする時、元のデータを不揮発性記憶媒体
に退避し、更新開始を示すマークと、更新完了後に更新
完了のマークを退避データにマークする退避手段、
（ｄ）障害発生時に、上記退避手段により退避したデー
タを調べ、更新開始を示すマークがあり、更新完了のマ
ークがないデータをデータの更新中にエラーが発生した
とみなし、退避データを用いて上記２次記憶装置のデー
タを復旧する復旧手段、（ｅ）更新成功時には退避デー
タを開放する開放手段。A computer system according to a thirteenth invention is characterized by having the following elements. (A) a secondary storage device for storing data, (b) a writing means for writing data to the secondary storage device, and (c) a non-volatile storage of the original data when updating by the writing means. Saving means for saving to the medium and marking the start of updating and the mark of completion of updating in the saved data after completion of updating,
(D) When a failure occurs, the data saved by the saving means is examined, and it is considered that an error has occurred during the update of data that has a mark indicating the start of update and no mark indicating the completion of update, and the saved data is used. Restoring means for restoring the data in the secondary storage device, and (e) releasing means for releasing the saved data when the update is successful.

【００３１】[0031]

【作用】第１の発明における計算機システムは、被検ソ
フトウェアが所定の処理実行中に障害の発生を検出し、
検出信号として送信し、障害検出ソフトウェアが検出信
号を受信して被検ソフトウェアの障害を検出する。この
ように２つの独立したプログラムとすることにより、検
出信号が所定の間隔で到着しない場合、障害検出ソフト
ウェア側は被検ソフトウェア側で検出信号を送信できな
い障害が発生したとみなして、タイムアウトを障害発生
か否かの判断の１つにすることができる。このとき、検
出信号の到着に要する時刻は、システムの負荷によって
遅れることがあるため、双方のソフトウェアにシステム
の負荷を検出する負荷検出部を持たせ、負荷に応じて検
出信号の送受信間隔を調整する。The computer system according to the first aspect of the present invention detects the occurrence of a failure during execution of predetermined processing by the software under test,
It transmits as a detection signal, and the failure detection software receives the detection signal and detects the failure of the software under test. By using two independent programs in this way, if the detection signal does not arrive at a predetermined interval, the fault detection software side considers that a fault in which the detection signal cannot be transmitted on the software side to be detected has occurred and the time-out fault occurs. It can be one of the judgments as to whether or not it has occurred. At this time, the time required for the detection signal to arrive may be delayed depending on the system load. Therefore, both software should have a load detection unit that detects the system load, and the transmission / reception interval of the detection signal should be adjusted according to the load. To do.

【００３２】第２の発明における計算機システムは、被
検ソフトウェアが検出信号の送出間隔を障害検出ソフト
ウェアに通知するため、障害検出ソフトウェア側で送出
間隔を調べる手順を省くことができる。In the computer system according to the second aspect of the present invention, since the software under test notifies the fault detection software of the transmission interval of the detection signal, the fault detection software can omit the procedure for checking the transmission interval.

【００３３】第３の発明における計算機システムは、被
検ソフトウェア側に負荷検出部と頻度表を持たせ、負荷
により検出信号の送出間隔の調整を行う。また、その送
出間隔の情報も検出信号中に含ませ障害検出ソフトウェ
ア側に通知するので、障害検出ソフトウェア側では、負
荷検出部や頻度表による処理を行う必要がない。In the computer system according to the third aspect of the present invention, the software to be tested has a load detecting section and a frequency table, and the sending interval of the detection signal is adjusted according to the load. Further, since the information of the transmission interval is also included in the detection signal and notified to the fault detection software side, it is not necessary for the fault detection software side to perform processing by the load detection unit or the frequency table.

【００３４】第４の発明における計算機システムでは、
上記被検ソフトウェアは、複数のプログラムを備え、各
プログラムが検出信号を送信する。上記障害検出ソフト
ウェアは、上記複数のプログラムの中の関連するプログ
ラムを示す管理情報を記憶し、管理情報に基づき関連す
る複数のプログラムからの検出信号を受信して、関連す
る複数のプログラムの障害を検出することができる。In the computer system of the fourth invention,
The software to be tested includes a plurality of programs, and each program sends a detection signal. The failure detection software stores management information indicating a related program among the plurality of programs, receives a detection signal from the plurality of related programs based on the management information, and detects a failure of the plurality of related programs. Can be detected.

【００３５】第５の発明における計算機システムは、障
害検出ソフトウェアの障害を検出するソフトウェアを備
えることにより、障害検知機構自体のシングルポイント
フェイルを回避できる。The computer system according to the fifth aspect of the present invention can avoid the single point fail of the fault detection mechanism itself by providing the fault detection software for detecting the fault.

【００３６】第６の発明における計算機システムは、上
記被検ソフトウェアが、障害検出ソフトウェアに対する
動作手順を指示するメッセージを検出信号中に含ませ
る。上記障害検出ソフトウェアは、メッセージに対応す
る手順を記した手順情報を持ち、手順情報によりメッセ
ージに対応する手順を実行するため、無駄な処理を省く
ことができる。In the computer system according to the sixth aspect of the present invention, the software under test includes a message instructing an operation procedure for the fault detection software in the detection signal. The fault detection software has procedure information describing the procedure corresponding to the message, and executes the procedure corresponding to the message according to the procedure information, so that wasteful processing can be omitted.

【００３７】第７の発明における計算機システムでは、
上記計算機システムは、複数の計算機を備えており、各
計算機は各計算機の状態を調査して上記障害検出ソフト
ウェアに出力する調査プログラムを備える。上記障害検
出プログラムは被検ソフトウェアの実行条件を考慮して
調査プログラムから得られた負荷状況、資源状況に基づ
き、障害が検出され、かつ他の計算機への移行が必要と
なった被検ソフトウェアの移行に最適な計算機を探し、
移行して実行する。In the computer system of the seventh invention,
The computer system includes a plurality of computers, and each computer includes an investigation program that investigates the state of each computer and outputs the investigation status to the failure detection software. The failure detection program is based on the load status and resource status obtained from the investigation program in consideration of the execution condition of the software to be tested, and the failure is detected, and the software to be tested needs to be migrated to another computer. Find the best calculator for the transition,
Migrate and execute.

【００３８】第８の発明における計算機システムは、障
害発生時に主記憶装置の内容を分割して、接続されてい
る複数の他の計算機の２次記憶装置に退避することがで
きる。In the computer system according to the eighth aspect, when a failure occurs, the contents of the main storage device can be divided and saved in the secondary storage devices of a plurality of other connected computers.

【００３９】第９の発明における計算機システムは、分
割退避先を設定した管理表を備えているので、主記憶装
置のデータの採取先をすみやかに決めることができる。Since the computer system in the ninth aspect of the present invention has the management table in which the division save destination is set, the data collection destination of the main storage device can be promptly determined.

【００４０】第１０の発明における計算機システムは、
ネットワークを介して分割されたデータを退避する作業
を他の計算機に分割することができる。A computer system according to the tenth invention is
The work of saving the divided data via the network can be divided into other computers.

【００４１】第１１の発明における計算機システムは、
主記憶装置上にキャッシュされたデータの２次記憶装置
への書き戻しの有無と、そのデータの正当性を判断する
データを管理情報として持つ。これにより、障害発生時
にこの管理情報に基づいて主記憶装置に記憶されたデー
タのうち、２次記憶装置に書き戻されていないデータを
判断し、またそのデータに障害の影響があるかどうか正
当性を再度チェックし、データを書き戻す。A computer system according to the eleventh invention is
It has, as management information, whether or not the data cached in the main storage device is written back to the secondary storage device and the data for judging the validity of the data. Thus, when a failure occurs, the data stored in the main storage device, which has not been written back to the secondary storage device, is determined based on this management information, and whether the data is affected by the failure is valid. Check the sex again and write the data back.

【００４２】第１２の発明における計算機システムは、
多重系システムにおいても上記第１１の発明と同様に、
障害発生時の主記憶装置上に読み出され、変更されたデ
ータとディスクデータの一貫性を保つことができる。A computer system according to the twelfth invention is
Also in the multiple system, similar to the eleventh invention,
It is possible to keep the consistency between the data read and read on the main memory and the disk data when a failure occurs.

【００４３】第１３の発明における計算機システムは、
ディスク装置への書き込みの際に、更新される前のデー
タに更新開始のマークをしてそのデータを退避した後、
ディスク装置への書き込みを開始し、更新終了後、退避
したデータに更新完了のマークをする。ディスクへの書
き込み命令が異常終了した場合に、この退避したデータ
の中で更新完了のマークがない退避データを書き戻すこ
とにより、書き込み命令発行以前のデータ状態を保証す
ることができる。また、更新成功時には退避データを開
放する。A computer system according to the thirteenth invention is
At the time of writing to the disk device, after marking the start of update to the data before being updated and saving that data,
Writing to the disk device is started, and after the update is completed, the saved data is marked as completed. When the write command to the disk ends abnormally, the saved data without the update completion mark is written back in the saved data, so that the data state before the write command is issued can be guaranteed. When the update is successful, the saved data is released.

【００４４】[0044]

【Example】

実施例１．本実施例における計算機システムの例を図１
を用いて説明する。１０３および１０４は独立した計算
機で、ネットワーク１０１，１０２にそれぞれ接続され
ている。図中には計算機は２台しか描かれていないが、
台数に制限はない。主／副形態で、機能の冗長性を実現
する場合は、主／副それぞれの計算機からアクセスでき
るディスク装置１１０により、二つの計算機間でのコン
システントなデータの引渡しを可能にする。プライマリ
ーサーバ１１１にあるプログラム群１１２，１１３，１
１４はそれぞれ依存関係を持つアプリケーションプログ
ラム、つまり、被検プログラムに相当する。１０６，１
０７，１０８は障害検出プログラムで、被検プログラム
が実行されている同一計算機上で、また、ネットワーク
内の違う計算機上で実行される。１０９は、これら障害
検知機構のセクションの開始／終了などのサービスを行
うチェックエージェントプログラムである。Example 1. FIG. 1 shows an example of a computer system in this embodiment.
Will be explained. Independent computers 103 and 104 are connected to the networks 101 and 102, respectively. Only two computers are shown in the figure,
There is no limit to the number of units. When the function redundancy is realized in the main / sub mode, the disk device 110 that can be accessed from each of the main and sub computers enables the transfer of consistent data between the two computers. Program group 112, 113, 1 in the primary server 111
Reference numerals 14 correspond to application programs having respective dependency relationships, that is, test programs. 106,1
Reference numerals 07 and 108 denote failure detection programs, which are executed on the same computer on which the test program is executed and on different computers in the network. Reference numeral 109 is a check agent program that performs services such as start / end of the section of the failure detection mechanism.

【００４５】障害検知機構は、アプリケーションプログ
ラム（被検プログラム）の中で処理される部分と障害検
出プログラムの中で処理される部分に分かれて存在して
いる。図２は本実施例による障害検知機構のソフトウェ
ア構成を示している。被検プログラム２０１は、計算機
の負荷状況を調査する負荷検出部２０２と、図３にその
内容を示す負荷に対しての障害検知頻度を示す頻度表２
０３と、障害検出信号を障害検出プログラムに送出する
送信部２０４を含んでいる。一方、障害検出プログラム
は、２０２に等価な負荷検出部２０６，２０３に等しい
頻度表２０７、被検プログラムからの障害検出信号を受
信する受信部２０８を含んでいる。被検プログラム２０
１は、一定期間中に何回かの割合で送信部２０４から”
私は正常である”旨のメッセージを障害検出プログラム
２０５に送信している。障害検出プログラム２０５の受
信部２０８はそのメッセージを受け取り、その内容また
はメッセージが到着するか否かで被検プログラムが正常
であるか否かを判断している。The failure detection mechanism is divided into a part processed in the application program (test program) and a part processed in the failure detection program. FIG. 2 shows the software configuration of the failure detection mechanism according to this embodiment. The test program 201 includes a load detection unit 202 that investigates the load status of a computer and a frequency table 2 that indicates the failure detection frequency for the load whose contents are shown in FIG.
03, and a transmitter 204 for sending a failure detection signal to the failure detection program. On the other hand, the fault detection program includes a frequency table 207 equal to the load detection units 206 and 203 equivalent to 202, and a reception unit 208 that receives a fault detection signal from the test program. Test program 20
1 is transmitted from the transmission unit 204 at a rate of several times during a certain period.
I have sent a message "I am normal" to the failure detection program 205. The reception unit 208 of the failure detection program 205 receives the message, and the program under test is normal depending on the content or whether the message arrives. It is determined whether or not

【００４６】そのため、ソフトウェア障害検知機構がタ
イムアウト（次のメッセージが到着すべき時刻に到着し
ない場合に異常であると判断するまでの時間）を障害発
生状態か否かを判断する基準にしているので、計算機の
負荷状態によっては、一律な判断基準では正確な判断が
下せないという問題点があった。そこで、双方のプログ
ラムは、計算機の負荷状態を定期的に採取し、その値か
ら、障害検知頻度を頻度表に従って設定する。例えば、
頻度表の内容が図３のよう場合、負荷が０のときは頻度
は１０であるから、双方のプログラムは一定期間中に１
０回障害検知のためのメッセージのやり取りを行うこと
になる。なお、負荷は計算機の稼動率とジョブキューの
長さで決まる。Therefore, the software failure detection mechanism uses the timeout (the time until it is judged to be abnormal when the next message does not arrive at the time when it should arrive) as a criterion for judging whether or not a failure has occurred. However, depending on the load condition of the computer, there is a problem that an accurate judgment cannot be made with a uniform judgment standard. Therefore, both programs periodically collect the load state of the computer and set the failure detection frequency according to the frequency table from the value. For example,
In the case where the content of the frequency table is as shown in FIG. 3, the frequency is 10 when the load is 0, so that both programs have 1 during a certain period.
Messages will be exchanged for 0 times of failure detection. The load is determined by the operating rate of the computer and the length of the job queue.

【００４７】また、なぜ双方のプログラムで、負荷検出
部と頻度表を持つかを説明する。被検プログラムで、計
算機の負荷状態により送信頻度を変えているため、障害
検出プログラム側のタイムアウト値も変える必要があ
る。そのため、障害検出プログラム側でも被検プログラ
ムと同様の処理を行い、新しいタイムアウト値を設定す
る。The reason why both programs have the load detecting section and the frequency table will be described. Since the program to be tested changes the transmission frequency depending on the load state of the computer, it is necessary to change the timeout value on the fault detection program side. Therefore, the failure detection program side also performs the same processing as that of the test program, and sets a new timeout value.

【００４８】被検プログラムと障害検出プログラムの中
の負荷検出部は、オペレーティングシステムの問い合わ
せ手段を用い、ジョブキューの長さと計算機の稼動率に
より得る。ただし計算機の負荷状況は急峻に変化するこ
とがあるので、双方の負荷検出値に差異が生じることを
防ぐために、負荷検出部には十分長いサンプリング期間
を持たせる。負荷状況はたえず変化するものなので、負
荷は一瞬一瞬の細かい値の検出ではなく、その時間帯の
大まかな傾向値とした方がより実際的である。The load detection section in the test program and the failure detection program uses the inquiry means of the operating system to obtain the load according to the length of the job queue and the operating rate of the computer. However, since the load condition of the computer may change abruptly, the load detection unit should have a sufficiently long sampling period in order to prevent a difference between the load detection values of the two. Since the load status is constantly changing, it is more practical to set the load as a rough trend value in that time zone rather than detecting a fine value for each moment.

【００４９】また、被検プログラムと障害検出プログラ
ムに分けているのはプログラムの作り勝手によるため
と、独立したプログラムにすることにより、被検プログ
ラム側で検出信号を送信できない状態に陥った場合、障
害検出プログラム側ではタイムアウトを障害発生か否か
の判断の１つにするためである。Further, the reason why the program to be inspected and the failure detection program are divided is that it depends on the creativity of the program, and if the program becomes independent and the detection signal cannot be transmitted, This is because the failure detection program side uses the timeout as one of the judgments as to whether or not a failure has occurred.

【００５０】以上のようにこの実施例では、該システム
上で走行するソフトウェアに発生する障害を検知し、障
害状態から回復せしめることを特徴とする計算機システ
ムであって、該障害検知に用いる、被検ソフトウェア自
身の送出する、検出信号の送出頻度を、該システムの負
荷状況により調整する。As described above, in this embodiment, a computer system characterized by detecting a fault occurring in software running on the system and recovering from the fault state, which is used for detecting the fault, is used. The frequency of transmission of the detection signal transmitted by the inspection software itself is adjusted according to the load status of the system.

【００５１】そのために計算機システムの負荷状況検出
手段、および、計算機システムの負荷状況と、障害検知
頻度の頻度表を、検知信号送信側、つまり、被検プログ
ラムと検知機構にそれぞれ持つことによって、計算機シ
ステムの負荷状況によって、障害検知頻度の調整を行う
ようにしている。For this purpose, the load condition detecting means of the computer system, the load condition of the computer system, and the frequency table of the fault detection frequency are provided on the detection signal transmitting side, that is, the test program and the detection mechanism, respectively. The failure detection frequency is adjusted according to the system load status.

【００５２】実施例２．この実施例では、被検プログラ
ム側の障害検知信号中に、障害検知信号の送信間隔情報
（インターバル情報ともいう）を送り、それをもとに、
障害検出プログラムがタイムアウト時間を設定する例に
ついて述べる。Example 2. In this embodiment, transmission interval information (also referred to as interval information) of the failure detection signal is sent in the failure detection signal on the side of the program to be tested, and based on that,
An example in which the failure detection program sets the timeout time will be described.

【００５３】図４はこの実施例の障害検知機構のソフト
ウェア構成を示している。被検プログラム４０１は、計
算機の負荷状況を調査する負荷検出部４０２と、図５に
その内容を示す負荷に対しての障害検知の頻度と、障害
検出プログラムへ送信する障害検知のインターバル情報
を示す頻度表４０３と、障害検出信号を障害検出プログ
ラムに送出する送信部４０５を含んでいる。一方、障害
検出プログラム４０６は、被検プログラムからの障害検
出信号を受信する受信部４０７を含んでいる。被検プロ
グラムは、計算機の負荷状態を定期的に採取し、その値
から頻度表に従って障害検知頻度を設定し、また、頻度
表から障害検知信号の一部として送るべき送出情報を設
定する。例えば、図５の５０１ならば、負荷が０の時は
一定期間に１０回の頻度で、被検プログラムは送出情報
１を含む障害検知信号を、障害検出プログラムに送信す
る。障害検出プログラムは、タイムアウト値を送出情報
に合わせて設定して、障害か否かの判断を行う。FIG. 4 shows the software configuration of the fault detection mechanism of this embodiment. The test program 401 shows a load detection unit 402 for investigating the load status of the computer, the frequency of fault detection for the load whose contents are shown in FIG. 5, and fault detection interval information transmitted to the fault detection program. It includes a frequency table 403 and a transmission unit 405 that sends a failure detection signal to the failure detection program. On the other hand, the failure detection program 406 includes a reception unit 407 that receives a failure detection signal from the program under test. The test program periodically collects the load state of the computer, sets the failure detection frequency from the value according to the frequency table, and sets the transmission information to be sent as a part of the failure detection signal from the frequency table. For example, in 501 of FIG. 5, when the load is 0, the test program transmits a failure detection signal including the transmission information 1 to the failure detection program at a frequency of 10 times in a certain period. The failure detection program sets a timeout value in accordance with the transmission information and determines whether or not there is a failure.

【００５４】これは被検プログラムが正常か否かを障害
検出プログラムにおいて、ある一定期間内に次のメッセ
ージが到着するか否かでも判断しているためである。ま
た、被検プログラム側で負荷の状況により、頻度表を参
照して検出信号の送信間隔を変えているので、その値を
送出情報として障害検出プログラムに知らせる。これに
より障害検出プログラム側では送出情報により被検プロ
グラム側での変化に合わせてタイムアウト値の変更を行
うことができる。This is because whether or not the test program is normal is also determined by the failure detection program by whether or not the next message arrives within a certain period. Also, since the program to be detected changes the transmission interval of the detection signal by referring to the frequency table depending on the load condition, the value is notified to the failure detection program as transmission information. As a result, the failure detection program side can change the timeout value according to the change on the tested program side by the transmission information.

【００５５】以上のようにこの実施例では、障害検出信
号に検知頻度調整のための情報を付加することを特徴と
している。As described above, this embodiment is characterized in that the information for adjusting the detection frequency is added to the fault detection signal.

【００５６】実施例３．障害検知しようとするプログラ
ムが幾つかのプログラムのサービスを利用して成り立っ
ているとき、あるいは、相互にサービスを利用しあって
成り立っているとき、目的とするプログラムの障害検知
を行うだけでは不十分で正確な判断ができない。そのた
めこの実施例では、監視すべきプログラム、および、監
視すべきプログラムがそのサービスを利用しているプロ
グラムとのプログラム間の依存関係を示す表を、障害検
出機構に持つことにより、上記依存関係を持つプログラ
ムの監視を可能にする例について述べる。Example 3. When the program that is trying to detect a failure consists of using the services of several programs, or when the programs are using each other's services, it is not enough to detect the failure of the target program. I can't make an accurate decision. Therefore, in this embodiment, by providing the failure detection mechanism with a table indicating the program to be monitored and the dependency relationship between the program to be monitored and the program using the service, the above dependency can be realized. This section describes an example that enables monitoring of the programs that it has.

【００５７】被検プログラム、障害検出プログラムは、
実施例１または、実施例２の機能を持つ。図６は計算機
システム上で、ある瞬間のプログラムの実行状況を示し
た図である。アプリケーションプログラムＡ（６０
２）、アプリケーションプログラムＢ（６０３）、アプ
リケーションプログラムＣ（６０１）、障害検出プログ
ラム６０４が実行されている。障害検出プログラム６０
４は、図７に詳細を示すアプリケーションプログラム間
の依存関係表６０５、各アプリケーションからの障害検
出信号を受信する受信部６０６を含んでいる。The test program and the fault detection program are
It has the functions of the first or second embodiment. FIG. 6 is a diagram showing the execution status of a program at a certain moment on the computer system. Application program A (60
2), the application program B (603), the application program C (601), and the failure detection program 604 are executed. Fault detection program 60
4 includes a dependency relationship table 605 between application programs, the details of which are shown in FIG. 7, and a reception unit 606 that receives a failure detection signal from each application.

【００５８】この依存関係表６０５は、つぎのようにし
て設定する。例えば、プログラムＡを作る時、依存する
プログラムはプログラムＢとＣであると判る。プログラ
ムＢは、プログラムＡに依存しており、またプログラム
ＡをとおしてプログラムＣに依存している。プログラム
Ｃは、どのプログラムにも依存していない。このよう
に、各プログラム間の依存関係がわかるので、障害検出
プログラムを作成するときにこれを依存関係表６０５と
して持たせる。The dependency table 605 is set as follows. For example, when creating the program A, the dependent programs are known to be the programs B and C. Program B depends on Program A and also depends on Program C through Program A. Program C does not depend on any program. In this way, since the dependency relationship between each program is known, this is provided as the dependency relationship table 605 when the failure detection program is created.

【００５９】図７は、アプリケーションプログラム間の
依存関係を表す依存関係表で、例えば、ＡはＣとＢに依
存しており（７０１）、ＢはＡ、ＡはさらにＣに依存し
ており（７０２）、Ｃはサービスは提供するがいずれの
プログラムにも依存していない（７０３）ことを示すも
のである。障害検出プログラム６０４は、この依存関係
表を参照し、例えばＡのプログラムをモニタする場合に
は、ＣおよびＢのプログラムの障害検知も行う。FIG. 7 is a dependency relationship table showing the dependency relationship between application programs. For example, A depends on C and B (701), B depends on A, and A depends on C (( 702) and C indicate that the service is provided but does not depend on any program (703). The failure detection program 604 also refers to this dependency table, and when monitoring the program A, for example, also detects failures in the programs C and B.

【００６０】このようにアプリケーションプログラム間
の依存関係表を持つことにより監視すべきプログラムお
よびこのプログラムが利用するプログラムを総合的に監
視することが可能になる。As described above, by having the dependency relationship table between the application programs, it becomes possible to comprehensively monitor the program to be monitored and the program used by this program.

【００６１】以上のように、この実施例では、該システ
ム上で互助動作する複数のソフトウェアに発生する障害
を検知し、障害状態から回復することを目的とし、該複
数ソフトウェアの管理情報を持ち、該管理情報中に記述
される全てのソフトウェアについて、障害検出、およ
び、障害回復を行うことを特徴とする計算機システムに
ついて述べた。As described above, this embodiment has the management information of a plurality of software for the purpose of detecting a failure occurring in a plurality of software which mutually operate on the system and recovering from the failure state. The computer system characterized by performing fault detection and fault recovery for all software described in the management information has been described.

【００６２】実施例４．今までの障害検知方式では、検
知機構自体のシングルポイントフェイルに対応できなか
った。この実施例では障害検知機構を２重化することに
より、障害検知機構自体の障害による、システム障害を
回避する例を説明する。Example 4. Until now, failure detection methods have not been able to deal with single point failures of the detection mechanism itself. In this embodiment, an example of avoiding a system failure due to a failure of the failure detection mechanism itself by duplicating the failure detection mechanism will be described.

【００６３】図８は被検プログラムと障害検出プログラ
ムのソフトウェア構成を示した図である。被検プログラ
ム８０１は、障害検知信号を主障害検出プログラム８０
２、および、副障害検出プログラム８０３に送信する。
図９に副障害検出プログラムの動作を示す。もし障害が
検出されたならば（９０１）、副障害検出プログラムは
主障害検出プログラムの状態をチェックし（９０２）、
健全ならば何もしない。もし健全でなければ、障害検出
プログラム復旧（９０３）を行う。障害検出プログラム
復旧とは、副障害検出プログラムが主障害検出プログラ
ムを停止させ、副障害検出プログラムが主障害検出プロ
グラムの代わりに被検プログラムの障害に対処する。ま
た、この時副障害検出プログラムは、自分の複製を作
り、以後これに自分を監視させる。なお、当実施例にお
いて主障害検出プログラムは、副障害検出プログラムの
存在を意識しない。FIG. 8 is a diagram showing the software configuration of the test program and the failure detection program. The test program 801 sends the failure detection signal to the main failure detection program 80.
2 and the secondary fault detection program 803.
FIG. 9 shows the operation of the sub-fault detection program. If a failure is detected (901), the secondary failure detection program checks the status of the primary failure detection program (902),
If sound, do nothing. If not healthy, the fault detection program is restored (903). The failure detection program recovery means that the sub failure detection program stops the main failure detection program and the sub failure detection program deals with the failure of the test program instead of the main failure detection program. Also, at this time, the secondary failure detection program makes a copy of itself and makes it monitor itself thereafter. In the present embodiment, the main failure detection program is unaware of the existence of the sub failure detection program.

【００６４】この実施例では、該システム上で走行する
ソフトウェアに発生する障害を検知し、障害状態から回
復せしめることを特徴とする計算機システムであって、
障害検知機構を２重化することにより、障害検知機構自
体の障害による、システム障害を回避することを特徴と
する計算機システムについて述べた。In this embodiment, a computer system characterized by detecting a fault occurring in software running on the system and recovering from the fault state,
The computer system characterized by avoiding the system failure due to the failure of the failure detection mechanism itself has been described by duplicating the failure detection mechanism.

【００６５】実施例５．上記実施例は１例に過ぎず、副
障害検出プログラムが、主障害検出プログラムのみを監
視する方式もある。図１０はこの実施例の被検プログラ
ムと障害検出プログラムの関係を示した図である。Example 5. The above embodiment is only one example, and there is also a method in which the sub failure detection program monitors only the main failure detection program. FIG. 10 is a diagram showing the relationship between the test program and the failure detection program of this embodiment.

【００６６】実施例６．従来の障害検知、復旧方式で
は、復旧手段が一律的であり、障害に対して、必要以上
の処置をとらざるを得ない場合が多かった。この実施例
では、被検ソフトウェアの送出する検出信号によって障
害種類を類別する障害検出機構を有し、障害種類によっ
て障害回復手順を記述した手順情報を持つこと、また、
手順情報の設定手段を持つことによって障害種類に応じ
た障害復旧手段を提供する例について述べる。Example 6. In the conventional fault detection and restoration methods, the restoration means are uniform, and in many cases, the fault must be taken more than necessary. In this embodiment, it has a failure detection mechanism that classifies failure types by detection signals sent by the software under test, and has procedure information describing failure recovery procedures according to failure types, and
An example of providing a failure recovery means according to the failure type by having a procedure information setting means will be described.

【００６７】この実施例は実施例１から５にある障害検
知機構に適用されるもので、障害検知信号に応じて、障
害復旧、もしくは、サービスを行う。図１１に、障害検
知信号と、障害検出プログラムが起動するサービスの手
順の対応を示す対応表を示す。障害検出プログラムは、
そのプログラム内にこの対応表を含み、障害検知信号を
受けとったならば、対応する手順を実行する。図１１に
ついて説明する。正常信号を受けとっている限り、障害
検出プログラムは何もしない（１００１）。停止信号を
受けとった時は、障害検出プログラムはタイムアウトを
延期する（１００２）。開始信号を受けとった時は、障
害検出プログラムは、該被検プログラムの監視を開始す
る（１００３）。終了信号を受けとった時は、被検プロ
グラムの監視を終了または、終了処理を行う（１００
４）。障害１信号を受けとった時は、同じ処理を３回リ
トライする（１００５）。障害２信号を受けとった時
は、ディスクデータを修復する（１００６）。障害３信
号を受けとった時は、他計算機で再実行する（１００
７）。This embodiment is applied to the failure detection mechanism in the first to fifth embodiments, and the failure recovery or the service is performed according to the failure detection signal. FIG. 11 shows a correspondence table showing correspondence between the failure detection signal and the service procedure started by the failure detection program. The fault detection program
If this correspondence table is included in the program and a failure detection signal is received, the corresponding procedure is executed. FIG. 11 will be described. As long as it receives the normal signal, the failure detection program does nothing (1001). When the stop signal is received, the failure detection program delays the timeout (1002). When receiving the start signal, the failure detection program starts monitoring the test program (1003). When the end signal is received, the monitoring of the test program is ended or the end processing is performed (100
4). When the fault 1 signal is received, the same processing is retried three times (1005). When the fault 2 signal is received, the disk data is restored (1006). When the fault 3 signal is received, it is re-executed by another computer (100
7).

【００６８】以上のようにこの実施例では、該システム
上で走行するソフトウェアに発生する障害を検知し、障
害状態から回復せしめることを特徴とする計算機システ
ムであって、被検ソフトウェアの送出する検出信号によ
って障害種類を類別する障害検出機構を有し、障害種類
によって障害回復手順を記述した手順情報を持つことを
特徴とする計算機システムについて説明した。As described above, in this embodiment, the computer system is characterized by detecting a fault occurring in software running on the system and recovering from the fault state. We have described a computer system that has a failure detection mechanism that classifies failure types by signals and that has procedure information that describes a failure recovery procedure depending on the failure type.

【００６９】実施例７．ソフトウェア障害が発生し、そ
の復旧にあたって、ネットワーク内の他の計算機上に移
行する必要が生じた時に、固定的に移行先を決めたので
は移行先の負荷状況、資源状況によって、必ずしも移行
先として望ましいものとはならない場合があった。そこ
で、この実施例ではあるサービスが実行されていた計算
機に障害が起きた時に、ネットワーク内のどの計算機で
サービスを継続するかを決定するシステムについて述べ
る。すなわち、ネットワーク内の各計算機の負荷状況、
資源状況を表す表と、その更新手段と、起動すべきプロ
グラムと、負荷状況、および、資源状況との対応を示す
表と、負荷、資源状況の比較結果により、指定されたプ
ログラムの起動を行うことによって達成される。Example 7. When a software failure occurs and it is necessary to migrate to another computer in the network when recovering from it, the fixed migration destination may not always be the migration destination depending on the load status and resource status of the migration destination. In some cases it was not desirable. Therefore, in this embodiment, a system for deciding which computer in the network will continue the service when a computer on which a certain service is executed fails. That is, the load status of each computer in the network,
A table indicating the resource status, its updating means, a program to be started, a load status, and a table showing the correspondence with the resource status, and the comparison result of the load and the resource status are used to start the specified program. To be achieved.

【００７０】この実施例が適用される計算機システムの
例を図１を用いて説明する。１０３および１０４は独立
した計算機で、ネットワーク１０１，１０２にそれぞれ
接続されている。図中には計算機は２台しか描かれてい
ないが、台数に制限はない。主／副形態で、機能の冗長
性を実現する場合は、主／副それぞれの計算機からアク
セスできるディスク装置１１０により、二つの計算機間
でのコンシステントなデータの引渡しを可能にする。プ
ライマリーサーバ１１１にあるプログラム群１１２，１
１３，１１４はそれぞれ依存関係を持つアプリケーショ
ンプログラム、つまり、前述した実施例で述べてきた被
検プログラムに相当する。１０６，１０７，１０８は障
害検出プログラムで、被検プログラムが実行されている
同一計算機上で、また、ネットワーク内の違う計算機上
で実行される。１０９は、これら障害検知機構のセッシ
ョンの開始／終了などのサービスを行うチェックエージ
ェントプログラムである。An example of a computer system to which this embodiment is applied will be described with reference to FIG. Independent computers 103 and 104 are connected to the networks 101 and 102, respectively. Although only two computers are shown in the figure, the number of computers is not limited. When the function redundancy is realized in the main / sub mode, the disk device 110 that can be accessed from each of the main and sub computers enables the transfer of consistent data between the two computers. Program group 112, 1 in the primary server 111
Reference numerals 13 and 114 respectively correspond to the application programs having a dependency relationship, that is, the test programs described in the above-described embodiments. Reference numerals 106, 107, and 108 are failure detection programs, which are executed on the same computer on which the test program is executed and on different computers in the network. A check agent program 109 performs services such as starting / ending a session of these failure detection mechanisms.

【００７１】もし、被検プログラムに障害が発生したと
き、さらに、計算機１０３自体が稼働不能に陥ったとき
は、スタンドバイサーバ１０５のアプリケーションプロ
グラム群は、ディスク装置１１０、または、ネットワー
クを通してディスク装置１１５のデータが複写されてい
るディスク装置１１６からデータを引き継ぎ、起動され
る。この時、これらの引き継ぎ処理を行うのは、計算機
１０４上の障害検出プログラム１０６である。If a failure occurs in the program to be tested, or if the computer 103 itself becomes inoperable, the application program group of the standby server 105 is either the disk device 110 or the disk device 115 through the network. The data is taken over from the disk device 116 to which the data is copied and is activated. At this time, it is the fault detection program 106 on the computer 104 that performs these takeover processes.

【００７２】この実施例は、あるアプリケーションプロ
グラムが実行されている計算機が稼働不能に陥ったとき
に、いずれかの計算機で再実行される時に適用されるも
ので、図１２にそのソフトウェア構成を示す。障害検出
プログラム１２０３は、ある計算機上で実行されている
被検プログラム１２０１から障害検知信号を受けとり、
また、ネットワークに接続された、各計算機の負荷、資
源状況を調査するプログラム１２０２からの状況報告を
定期的に受ける。障害検出プログラム１２０３は、被検
プログラム１２０１の実行条件データをそのプログラム
に含む。This embodiment is applied when a computer on which a certain application program is executed becomes inoperable and is re-executed by one of the computers, and FIG. 12 shows its software configuration. . The fault detection program 1203 receives a fault detection signal from the test program 1201 running on a certain computer,
In addition, a status report is regularly received from the program 1202 that investigates the load and resource status of each computer connected to the network. The fault detection program 1203 includes the execution condition data of the test program 1201 in the program.

【００７３】各計算機の負荷、資源状況を調査するプロ
グラム１２０２は、オペレーティングシステムの問い合
わせ手段を用い、各計算機の負荷、資源状況を調査す
る。計算機の負荷や資源状況は、報告を受けた時点では
変化していることもあるので、大まかな傾向がわかれば
よいと考え、十分長いサンプリング期間を持たせる。The program 1202 for investigating the load and resource status of each computer uses the inquiry means of the operating system to investigate the load and resource status of each computer. The load and resource status of the computer may have changed at the time of receiving the report, so we think that it is sufficient if we can understand the general trend, and provide a sufficiently long sampling period.

【００７４】実行条件データの例を図１３に示す。プロ
グラムＡは、計算機の負荷が１以下で、Ｉ／Ｏ頻度が１
００以下、主記憶残が２以上というのがその実行の条件
である（１１０１）。プログラムＢは、計算機の負荷が
４以下で、Ｉ／Ｏ頻度が１０００以下、主記憶残が０．
１以上というのが、その実行の条件である（１１０
２）。一方、障害検出プログラム１２０３は、負荷、資
源状況を調査するプログラム１２０２から、図１４に示
すような情報を定期的に受ける。計算機Ａは負荷が０．
１でＩ／Ｏ頻度が１０、主記憶残が１００である（１３
０１）。計算機Ｂは負荷が２で、Ｉ／Ｏ頻度が５０、主
記憶残が１０である（１３０２）。計算機Ｃは負荷が１
で、Ｉ／Ｏ頻度が１０００、主記憶残が５０である（１
３０３）。これらの情報を照らし合わせた上で、障害検
出プログラムは、障害が発生した被検プログラムをどの
計算機上で再起動するかを決定する。例えば、図１３、
および、図１４のデータで、プログラムＡに障害が発生
したとすると、プログラムＡは計算機Ａ上で再起動され
る。FIG. 13 shows an example of the execution condition data. Program A has a computer load of 1 or less and an I / O frequency of 1
The condition for the execution is that the main memory remaining is 00 or less and 2 or more (1101). Program B has a computer load of 4 or less, an I / O frequency of 1000 or less, and a main memory remaining of 0.
One or more is a condition for the execution (110
2). On the other hand, the failure detection program 1203 periodically receives the information as shown in FIG. 14 from the program 1202 which investigates the load and resource status. The load on computer A is 0.
1, the I / O frequency is 10 and the remaining main memory is 100 (13
01). The computer B has a load of 2, an I / O frequency of 50, and a main memory remaining of 10 (1302). Computer C has a load of 1
The I / O frequency is 1000 and the main memory remaining is 50 (1
303). After comparing these pieces of information, the fault detection program determines on which computer the faulty test program is to be restarted. For example, in FIG.
If the program A has a failure in the data shown in FIG. 14, the program A is restarted on the computer A.

【００７５】以上のように、この実施例では、１つ、あ
るいは、複数のネットワークに、複数台接続された計算
機によって構成され、互助動作する計算機システムにお
いて、該システム上で走行するソフトウェアは、障害検
知手段により監視され、該ソフトウェアが障害状態であ
り、かつ、走行中の計算機自体に障害があった時に、ネ
ットワークリンク内の健全な他の計算機上で、該ソフト
ウェアの再起動を行うものである。そのとき、他の計算
機上で該ソフトウェアを再起動すべきとき、ネットワー
クリンク内のいずれの計算機上で起動すべきかに関する
情報をもつこと、また、該情報を生成する手段を有する
ことを特徴とする。As described above, in this embodiment, in a computer system which is composed of a plurality of computers connected to one or a plurality of networks and mutually cooperates, the software running on the system causes a failure. The software is restarted on another healthy computer in the network link, which is monitored by the detection means, and when the software is in a failure state and there is a failure in the running computer itself. . At that time, when the software is to be restarted on another computer, it has information on which computer in the network link should be started, and also has means for generating the information. .

【００７６】実施例８．この実施例は、障害発生時の主
記憶上のデータのダンプの高速化を図る例である。主記
憶を分割し、複数のネットワークリンクを通して同時
に、複数の計算機の２次記憶上にダンプすることによ
り、処理の高速化を図る。これは、分割された主記憶領
域に対して、それぞれ、ネットワークリンク、ダンプ先
計算機、その計算機上のデータダンプ用の２次記憶領域
を登録しておくことにより、達成される。Example 8. This embodiment is an example for speeding up the dumping of data on the main memory when a failure occurs. The main memory is divided and dumped onto the secondary memory of a plurality of computers at the same time through a plurality of network links to speed up the processing. This is achieved by registering a network link, a dump-destination computer, and a secondary storage area for data dump on that computer in the divided main storage areas.

【００７７】本実施例は、図１５にあるような複数のネ
ットワークで接続された複数計算機上で適用されるので
あるが、高速主記憶データダンプは以下のように実現さ
れる。各計算機は図１６に示すような主記憶の管理表を
それぞれ持っている。図１６は、計算機０用の管理表を
示している。０から１１までの主記憶領域は３つに分け
て管理され、０から３の領域の主記憶データは計算機０
のディスク装置０に（１４０１）、４から７の領域の主
記憶データはネットワーク０を通して、計算機１のディ
スク装置０に（１４０２）、８から１１の主記憶領域の
データはネットワーク１を通して、計算機２のディスク
装置０に（１４０３）対応づけられている。計算機に障
害が発生したときは、リセット後の計算機立ち上げ時
に、該管理表に従って主記憶データのダンプが行われ
る。また、システム機構は、図１５に示したが、図１７
に示すシステムでもよい。This embodiment is applied on a plurality of computers connected by a plurality of networks as shown in FIG. 15, and the high speed main memory data dump is realized as follows. Each computer has a main memory management table as shown in FIG. FIG. 16 shows a management table for the computer 0. The main storage areas from 0 to 11 are managed separately in three areas, and the main storage data in the areas from 0 to 3 are stored in the computer 0.
To the disk device 0 of (1401) and the areas 4 to 7 of the main storage data through the network 0, and to the disk device 0 of the computer 1 (1402) and data of the areas 8 to 11 of the main storage area through the network 1 of the computer 2 Is associated with the disk device 0 of (1403). When a failure occurs in the computer, the main storage data is dumped according to the management table when the computer is started up after the reset. The system mechanism is shown in FIG.
The system shown in FIG.

【００７８】このようにして、障害発生時になくてはな
らない、主記憶データの採取を高速に行うことができ
る。また、稼働率は、故障修理期間を短くすることによ
って向上できる。従って、システム再立ち上げを行う場
合に、主記憶データのダンプ時間を短くすることは、稼
働率向上に寄与する。In this way, main memory data, which is indispensable when a failure occurs, can be collected at high speed. In addition, the operating rate can be improved by shortening the failure repair period. Therefore, when the system is restarted, shortening the dump time of the main memory data contributes to improving the operating rate.

【００７９】以上のように、この実施例では、複数のネ
ットワークリンクに複数台接続された計算機によって構
成され、互助動作する計算機システムにおいて、該シス
テム内のある計算機に障害が発生した時は、該計算機の
主記憶内容を、あらかじめ情報設定手段によって設定さ
れたコアダンプ情報によって分割し、定められたネット
ワークリンクを通して自身を含めた該ネットワークリン
ク内の計算機に送出することにより、退避することを特
徴とする計算機システムについて説明した。As described above, in this embodiment, in a computer system which is composed of a plurality of computers connected to a plurality of network links and mutually operates, when a failure occurs in a computer in the system, The main storage contents of the computer are divided by the core dump information set in advance by the information setting means, and sent out to the computers in the network link including itself through the predetermined network link to save. The computer system was explained.

【００８０】実施例９．この実施例は、主記憶上のバッ
ファにキャッシュされたディスクデータの一貫性を維持
することを課題としており、これはシステム再立ち上げ
の場合、ディスク上に構築されたファイルシステムの一
貫性回復のために要する時間を最小に押さえることを可
能にする。すなわち、チェックサムによる主記憶上のデ
ィスクデータを検証し、自計算機上のディスク装置、お
よび、複写されたデータを持つ他計算機のディスク装置
に、ネットワークを通して書き込みを行うことにより実
現される。Example 9. This example is aimed at maintaining the consistency of the disk data cached in the buffer on the main memory, and in the case of a system restart, it is necessary to recover the consistency of the file system built on the disk. This makes it possible to minimize the time required for doing so. That is, it is realized by verifying the disk data on the main memory by the checksum, and writing to the disk device on the own computer and the disk device of another computer having the copied data through the network.

【００８１】この実施例は、図１に示すような計算機シ
ステムに適用され、１１５と１１６の関係にあるディス
ク装置が本実施例の対象である。すなわち、ディスク装
置１１５のデータは、ディスク装置１１５に対して書き
込みがあるたびにディスク装置１１６に複写される。計
算機１０３上に読み出された、ディスク装置１１５のデ
ータに対して、更新が加えられるとブロック毎にチェッ
クサムデータがとられる。This embodiment is applied to the computer system as shown in FIG. 1, and the disk device having the relationship of 115 and 116 is the object of this embodiment. That is, the data in the disk device 115 is copied to the disk device 116 every time writing is performed in the disk device 115. When the data of the disk device 115 read on the computer 103 is updated, checksum data is obtained for each block.

【００８２】図１８は本実施例による、計算機上のディ
スクデータの管理情報を示したものである。ドライブー
セクタとなっている項目は、正／副ディスク装置の計算
機とドライブ番号を表しており、フラグは、各バッファ
のステートを示し、チェックサムデータには、各バッフ
ァデータ更新の度にチェックサムデータが格納される。
また、図１９はステートの種類を示す図である。図１８
では、１５０１において管理されるディスクデータは計
算機Ａのディスク装置０で、データの複写先は、計算機
Ｂのディスク装置０で、第０セクタのデータが格納され
ていることを意味する。このバッファのステートはＢＵ
ＳＹでＣＰＵがデータ参照更新、もしくは、チェックサ
ムデータ計算、書き込み中である。１５０２のバッファ
は未使用である。１５０３のバッファは計算機Ａのディ
スク装置０で、データの複写先は、計算機Ｂのディスク
装置０で、第２セクタのデータが格納されている。この
バッファのステートはＤＩＲＴＹで、ディスク装置に書
き戻す必要のあるデータである。１５０４のバッファは
未使用である。FIG. 18 shows management information of disk data on a computer according to this embodiment. The items in the drive sector indicate the computer and drive number of the primary / secondary disk device, the flag indicates the state of each buffer, and the checksum data contains the checksum for each buffer data update. Data is stored.
FIG. 19 is a diagram showing types of states. FIG.
Then, it means that the disk data managed in 1501 is the disk device 0 of the computer A, the copy destination of the data is the disk device 0 of the computer B, and the data of the 0th sector is stored. The state of this buffer is BU
In SY, the CPU is in the process of updating data reference, calculating checksum data, or writing. The buffer at 1502 is unused. The buffer of 1503 is the disk device 0 of the computer A, the copy destination of the data is the disk device 0 of the computer B, and the data of the second sector is stored. The state of this buffer is DIRTY, which is data that needs to be written back to the disk device. The buffer at 1504 is unused.

【００８３】ひとたび、この状態で、計算機１０３に相
当する計算機が障害を起こし使用不能になった時は、該
計算機をリセット後、再立ち上げの際に、該管理表に基
づき、ＢＵＳＹ、または、ＤＩＲＴＹであるバッファの
内容に対しチェックサム計算を行い、バッファの内容か
ら得られたチェックサムと管理表中のチェックサムデー
タと一致したならば、バッファ内容が正しいものとして
正／副両方のディスクにバッファの内容の書き出しを行
う。このようにして、障害発生時の主記憶上のバッファ
に読み出され変更されたデータもチェックサムにより、
データが破壊されていないことがわかれば、ディスクに
書き戻し、ディスクデータの一貫性を保つことができ
る。In this state, once the computer corresponding to the computer 103 has failed and becomes unusable, after resetting the computer and restarting it, BUSY, or Checksum calculation is performed on the contents of the buffer that is DIRTY, and if the checksum obtained from the contents of the buffer and the checksum data in the management table match, the contents of the buffer are regarded as correct and both the primary and secondary disks are confirmed. Write the contents of the buffer. In this way, the data read and modified in the buffer on the main memory at the time of failure is also checked by the checksum.
Once you know that the data has not been corrupted, you can write it back to disk to keep the disk data consistent.

【００８４】以上のように、この実施例は、１つ、ある
いは、複数のネットワークリンクに複数台接続された計
算機によって構成され、互助動作する計算機システムで
ある。該計算機システムにおいて、ディスク記憶装置上
のデータは異なる計算機上のディスク記憶装置に多重に
格納されており、データとしてファイルシステム等が構
築されており、該ディスクデータ利用時は該データは主
記憶上のバッファに展開され、主記憶上バッファのデー
タは更新時にチェックサムを実行する。そのとき、該デ
ータを利用中の計算機が障害状態となり、該計算機の停
止後、再起動時に主記憶上のバッファは展開されていた
ディスクデータで、更新済みで、かつ、チェックサムデ
ータによりデータの正当性が認められ、さらに、ディス
ク装置への書き戻しが行われていないデータを、該計算
機のディスク装置、および、データの多重化の行われて
いる他計算機のディスク装置に書き戻すことを特徴とす
るものである。As described above, this embodiment is a computer system which is composed of computers connected to one or a plurality of network links and mutually operates. In the computer system, the data in the disk storage device is stored in the disk storage devices in different computers in a multiplex manner, and a file system or the like is constructed as the data. When the disk data is used, the data is stored in the main storage. The data in the main memory buffer is subjected to checksum at the time of updating. At that time, the computer that is using the data is in a failure state, and when the computer is stopped and restarted, the buffer in the main memory is the disk data that has been expanded and has been updated. Characteristic is that data that has not been written back to the disk device and is justified is written back to the disk device of the computer and the disk device of another computer where the data is multiplexed. It is what

【００８５】実施例１０．本実施例はディスク装置への
書き込みの際にもしエラーが発生しても、書き込み前の
データ状態を保証するものである。エラーの要因として
は、電源断、ディスクヘッドの損傷などが有り得る。本
実施例の適用されたディスク装置は、データ書き込みを
図２０のフローチャートに示す手順で行う。書き込み要
求があれば、まず書き込み先のセクタのデータを退避す
る（１７０１）。退避先としてディスク領域を利用する
ことも、他の不揮発性記憶を利用することも可能であ
る。セクタデータは図２１に示すような形で退避され
る。すなわち、書き込み先のセクタ番号１６０１、書き
込み操作が始まったことを示すＢＥＧＩＮマーク１６０
２、そして、データ１６０３が退避される。このように
データ退避が終了したならば、データの書き込みを始
め、（１７０２）、データの更新に成功すると、１６０
４のＣＯＭＭＩＴマークを書き入れる（１７０３）。仮
に、データの退避中に障害がおきても、ディスクデータ
は保持される。また、データ更新中にエラーが発生した
場合、退避データを調べればＣＯＭＭＩＴマークのない
データはデータ更新中であるとわかるので、この退避し
たデータにより元のデータの復旧が可能になる。また、
退避データは不要になると開放する。Example 10. This embodiment guarantees the data state before writing even if an error occurs during writing to the disk device. The cause of the error may be a power failure or damage to the disk head. The disk device to which this embodiment is applied writes data according to the procedure shown in the flowchart of FIG. If there is a write request, the data in the write destination sector is first saved (1701). It is possible to use a disk area as a save destination or another non-volatile storage. The sector data is saved as shown in FIG. That is, the write destination sector number 1601 and the BEGIN mark 160 indicating that the write operation has started
2, and the data 1603 is saved. When the data saving is completed in this way, data writing is started (1702), and if the data updating is successful, 160
Enter the COMMIT mark of 4 (1703). Even if a failure occurs during the data saving, the disk data is retained. Further, when an error occurs during data updating, it is possible to find that the data without the COMMIT mark is being updated by examining the saved data, and thus the saved data makes it possible to restore the original data. Also,
The saved data is released when it is no longer needed.

【００８６】以上で説明した機能は、データベースプロ
グラムなどの、保証すべき信頼性が特に高いアプリケー
ションに利用されるべき機能で、全てのディスクに対す
る書き込みオペレーションに対し成功か不成功かの場合
に、更新後、更新前の状態を必ず保証するディスク装置
である。これにより、従来、アプリケーションプログラ
ムの中で行っていたデータ一貫性保持操作が簡略化さ
れ、かつハードウェアレベルでデータ一貫性が保証され
るため、システムの高速化も期待できる。The function described above is a function that should be used for a particularly reliable application that should be guaranteed, such as a database program, and is updated when a write operation to all disks is successful or unsuccessful. This is a disk device that guarantees the state before and after the update. This simplifies the data consistency maintaining operation conventionally performed in the application program and guarantees the data consistency at the hardware level, so that it is expected to speed up the system.

【００８７】以上のように、この実施例では、データベ
ースプログラムなどの、厳密にデータの一貫性が要求さ
れるプログラムに利用されるディスク記憶装置におい
て、データ更新に当たって、２フェーズコミットメント
を行うことにより、データ更新時に、該ディスク記憶装
置に障害が発生しても、該更新操作前のデータを保証す
ることを特徴とする例について述べた。As described above, in this embodiment, in a disk storage device used for a program such as a database program that strictly requires data consistency, two-phase commitment is performed when updating data. The example has been described in which the data before the update operation is guaranteed even if a failure occurs in the disk storage device when updating the data.

【００８８】[0088]

【発明の効果】第１の発明によれば、被検ソフトウェア
および障害検出ソフトウェアは、それぞれシステムの負
荷を検出し、その負荷に応じて検出信号の送受信間隔を
調整するため、タイムアウトを使った障害発生の判断が
より正確になる。また、双方のソフトウェアでそれぞれ
負荷検出を行うことにより被検ソフトウェアで送信間隔
を調整することに関する障害が発生した場合、障害検出
ソフトウェア側で検出することができる効果がある。ま
た、ハードウェアではなく、ソフトウェアで障害検知に
関する機能を実現するため障害検知に要するコストを低
減することができる。According to the first aspect of the present invention, the software to be detected and the fault detection software detect the load on the system and adjust the transmission / reception interval of the detection signal according to the load. More accurate judgment of occurrence. In addition, when a failure occurs in the software under test for adjusting the transmission interval by performing load detection by both softwares, the failure detection software can detect the failure. Further, since the function relating to the fault detection is realized by software instead of hardware, the cost required for fault detection can be reduced.

【００８９】第２の発明によれば、被検ソフトウェアが
検出信号の送出間隔を障害検出ソフトウェアに通知す
る。そのため、障害検出ソフトウェア側で送出間隔を調
べるための処理を省くことができる。それにより、シス
テムの負荷を減らすことができる。According to the second invention, the software under test notifies the fault detection software of the transmission interval of the detection signal. Therefore, it is possible to omit the processing for checking the transmission interval on the fault detection software side. Thereby, the load on the system can be reduced.

【００９０】第３の発明によれば、被検ソフトウェア側
に負荷検出部を持たせ、負荷により検出信号の送出間隔
の調整を行う。また、その送出間隔の情報も検出信号中
に含ませ障害検出ソフトウェア側に通知するので、障害
検出ソフトウェア側では、負荷検出部を持つ必要がな
く、また、その処理を行う必要がない。そのため、障害
検出ソフトウェアのプログラムステップ数を減らすこと
ができ、また、その処理を省くことができるため、シス
テムの負荷を減らすことができる。According to the third aspect of the invention, the software to be tested is provided with a load detecting section, and the transmission interval of the detection signal is adjusted by the load. Further, since the information on the transmission interval is also included in the detection signal and notified to the fault detection software side, the fault detection software side does not need to have a load detection unit and does not need to perform the processing. Therefore, the number of program steps of the fault detection software can be reduced and the processing can be omitted, so that the load on the system can be reduced.

【００９１】第４の発明によれば、被検ソフトウェアが
いくつかのプログラムのサービスを利用して成り立って
いる時、あるいは、相互にサービスを利用しあって成り
立っている時、プログラム間の関係を示す管理情報を持
つことによって、関連するすべてのプログラムを監視す
ることができ、より正確な障害検出ができるという効果
がある。According to the fourth aspect of the present invention, when the software under test is made up of the services of several programs, or when the software is made up of mutually using services, the relationship between the programs is By having the management information to be shown, it is possible to monitor all related programs, and it is possible to detect a fault more accurately.

【００９２】第５の発明によれば、障害検知機構を２重
化することにより、障害検知機構自体の障害による、シ
ステム障害を回避することができる。According to the fifth invention, by duplicating the failure detection mechanism, it is possible to avoid a system failure due to a failure of the failure detection mechanism itself.

【００９３】第６の発明によれば、検出信号中に障害検
出ソフトウェアで実行すべき動作手順を指示するメッセ
ージを含ませ、障害検出ソフトウェア側でそのメッセー
ジに対する手順情報を持つ。このため、障害検出ソフト
ウェア側で不必要な処理を省くことができ、的確な処理
を行うことができる。According to the sixth aspect of the invention, the detection signal includes a message instructing an operation procedure to be executed by the fault detection software, and the fault detection software has procedure information for the message. For this reason, unnecessary processing can be omitted on the fault detection software side, and accurate processing can be performed.

【００９４】第７の発明によれば、各計算機の状態を調
査する調査プログラムを持つ。これにより、被検ソフト
ウェアに障害が検出され、かつ他の計算機への移行が必
要となった場合、各計算機の負荷状況と資源状況を考慮
して最適な移行先を選ぶことができる。According to the seventh aspect of the present invention, it has a check program for checking the state of each computer. As a result, when a failure is detected in the software to be inspected and it is necessary to transfer to another computer, the optimum transfer destination can be selected in consideration of the load status and resource status of each computer.

【００９５】第８の発明によれば、障害発生時に主記憶
装置の内容を分割して、複数の他の計算機の２次記憶装
置に退避することができる。そのため、主記憶装置のデ
ータの採取を高速に行うことができ、主記憶装置のデー
タのダンプ時間を短くすることができる。このため、故
障修理期間を短くすることができるので、稼働率向上に
寄与する。According to the eighth invention, when a failure occurs, the contents of the main storage device can be divided and saved in the secondary storage devices of a plurality of other computers. Therefore, the data in the main memory can be collected at high speed, and the dump time of the data in the main memory can be shortened. Therefore, the repair period can be shortened, which contributes to the improvement of the operating rate.

【００９６】第９の発明によれば、分割退避先を設定し
た管理表を備えているので、主記憶装置のデータの採取
先をすみやかに決めることができ、主記憶装置のデータ
のダンプの高速化を図ることになり、稼働率向上に寄与
するという効果がある。According to the ninth invention, since the management table in which the divided save destination is set is provided, it is possible to promptly determine the data collection destination of the main memory device, and to dump the data of the main memory device at high speed. There is an effect that it contributes to the improvement of operating rate.

【００９７】第１０の発明によれば、ネットワークを介
して分割されたデータを退避する作業を他の計算機に分
割することができるため、主記憶装置のデータのダンプ
の高速化を図ることができ、稼働率向上に寄与すること
ができる。According to the tenth aspect of the present invention, the work of saving the divided data via the network can be divided into other computers, so that the speed of dumping the data in the main storage device can be increased. It can contribute to the improvement of operating rate.

【００９８】第１１の発明によれば、主記憶装置上に読
み出され、変更されたデータも、障害発生後の回復時
に、データの正当性を調べ２次記憶装置に書き戻す事が
できる。そのため、ディスクデータの一貫性を保つこと
ができ、システム再立ち上げの時、ディスク上に構築さ
れたファイルシステムの一貫性回復のために要する時間
を最小に押さえることができるので、稼働率向上に寄与
する。According to the eleventh aspect, even the data read and modified in the main storage device can be checked for correctness and written back to the secondary storage device at the time of recovery after a failure occurs. Therefore, the consistency of the disk data can be maintained, and the time required to recover the consistency of the file system built on the disk can be minimized when the system is restarted, improving the operating rate. Contribute.

【００９９】第１２の発明によれば、多重系システムに
おいても上記第１１の発明と同様の効果が得られる。According to the twelfth invention, the same effect as that of the eleventh invention can be obtained even in the multiplex system.

【０１００】第１３の発明において、ディスク装置への
書き込みの際に、更新される前のデータを退避しておく
ためディスクへの書き込み命令が異常終了した場合に
も、書き込み命令の発行以前のデータ状態を必ず保証す
ることができる。このように、本発明は、データベース
プログラムなどの、保証すべき信頼性が特に高いアプリ
ケーションに利用されるべき機能で、全てのディスクに
対する書き込みオペレーションに対し、成功か不成功か
の場合に、更新後、更新前の状態を必ず保証するディス
ク装置である。これにより、従来、アプリケーションプ
ログラムの中で行っていた、データ一貫性保持操作が簡
略化され、かつ、ハードウェアレベルでデータ一貫が保
証されるため、システムの高速化も期待できる。In the thirteenth aspect of the present invention, when the write command to the disk is abnormally terminated because the data before the update is saved when the data is written to the disk device, the data before the write command is issued You can always guarantee the condition. As described above, the present invention is a function that should be used for a particularly reliable application that should be guaranteed, such as a database program, and after a write operation to all disks succeeds or fails and is updated. It is a disk device that guarantees the state before update. This simplifies the operation of maintaining data consistency conventionally performed in an application program and guarantees data consistency at the hardware level, so that the speed of the system can be expected.

[Brief description of drawings]

【図１】本発明を適用する計算機システムの例を示す
図。FIG. 1 is a diagram showing an example of a computer system to which the present invention is applied.

【図２】本発明を適用したソフトウェア構成例を示す
図。FIG. 2 is a diagram showing a software configuration example to which the present invention is applied.

【図３】本発明における頻度表例を示す図。FIG. 3 is a diagram showing an example of a frequency table according to the present invention.

【図４】本発明を適用したソフトウェア構成例を示す
図。FIG. 4 is a diagram showing a software configuration example to which the present invention is applied.

【図５】本発明における頻度表例を示す図。FIG. 5 is a diagram showing an example of a frequency table according to the present invention.

【図６】本発明を適用したソフトウェア構成例を示す
図。FIG. 6 is a diagram showing a software configuration example to which the present invention is applied.

【図７】本発明における依存関係表例を示す図。FIG. 7 is a diagram showing an example of a dependency relationship table according to the present invention.

【図８】本発明を適用したソフトウェア構成例を示す
図。FIG. 8 is a diagram showing a software configuration example to which the present invention is applied.

【図９】本発明における副障害検出プログラムの動作ア
ルゴリズムを示す図。FIG. 9 is a diagram showing an operation algorithm of a sub-fault detection program according to the present invention.

【図１０】本発明を適用したソフトウェア構成例を示す
図。FIG. 10 is a diagram showing a software configuration example to which the present invention is applied.

【図１１】本発明における障害に対する手順についての
対応表の例を示す図。FIG. 11 is a diagram showing an example of a correspondence table regarding a procedure for a failure in the present invention.

【図１２】本発明におけるソフトウェア構成例を示す
図。FIG. 12 is a diagram showing a software configuration example according to the present invention.

【図１３】本発明における被検プログラムの実行条件デ
ータの例を表す図。FIG. 13 is a diagram showing an example of execution condition data of a test program according to the present invention.

【図１４】本発明における計算機の負荷、資源状況の調
査結果の例を示す図。FIG. 14 is a diagram showing an example of the result of a survey on the load and resource status of a computer according to the present invention.

【図１５】本発明を適用する計算機システムの例を示す
図。FIG. 15 is a diagram showing an example of a computer system to which the present invention is applied.

【図１６】本発明における計算機０用の主記憶ダンプ先
の管理表を示す図。FIG. 16 is a diagram showing a management table of a main memory dump destination for the computer 0 according to the present invention.

【図１７】本発明を適用する計算機システムの例を示す
図。FIG. 17 is a diagram showing an example of a computer system to which the present invention is applied.

【図１８】本発明における計算機上のディスクデータの
管理情報を表す図。FIG. 18 is a diagram showing management information of disk data on a computer according to the present invention.

【図１９】本発明におけるバッファのステートの種類を
示す図。FIG. 19 is a diagram showing types of buffer states in the present invention.

【図２０】本発明におけるディスクデータ書き込みアル
ゴリズムを示す図。FIG. 20 is a diagram showing a disc data writing algorithm according to the present invention.

【図２１】本発明における退避データ形態例を表す図。FIG. 21 is a diagram showing an example of a saved data form in the present invention.

[Explanation of symbols]

１０３計算機（主）１０４計算機（副）１０５スタンドバイサーバ１０６障害検出プログラム１０７障害検出プログラム（副）１０８障害検出プログラム（主）１０９チェックエージェントプログラム１１０，１１５，１１６ディスク装置１１１プライマリーサーバ１１２，１１３，１１４被検プログラム２０２，２０６負荷検出部２０３，２０７頻度表２０４送信部２０８受信部 103 computer (main) 104 computer (secondary) 105 standby server 106 failure detection program 107 failure detection program (secondary) 108 failure detection program (main) 109 check agent program 110, 115, 116 disk device 111 primary server 112, 113, 114 test program 202, 206 load detection unit 203, 207 frequency table 204 transmission unit 208 reception unit

Claims

[Claims]

1. (a) performing predetermined processing,
Software under test that detects whether or not a failure has occurred during the execution of the processing and transmits it as a detection signal, (b) failure detection software that receives a detection signal from the software under test and detects a failure in the software under test In the computer system having, the test software and the failure detection software each include a load detection unit that detects the load of the system, and adjust the transmission / reception interval of the detection signal according to the load of the system detected by the load detection unit. A computer system characterized by.

2. (a) performing predetermined processing,
Software under test that detects whether or not a failure has occurred during the execution of the processing and transmits it as a detection signal, (b) failure detection software that receives a detection signal from the software under test and detects a failure in the software under test In the computer system having the: A computer system characterized by.

3. The software under test includes a load detection unit for detecting a load of the system, and adjusts a detection signal transmission interval according to the load of the system detected by the load detection unit. The computer system described in 2.

4. The software under test comprises a plurality of programs, each program transmitting a detection signal, and the fault detection software stores management information indicating a related program among the plurality of programs. 5. The computer system according to claim 1, further comprising: receiving a detection signal from a plurality of related programs based on the management information to detect a failure of the plurality of related programs.

5. The fault detection software comprises first fault detection software and second fault detection software, and one fault detection software detects a fault of the other fault detection software. The computer system according to claim 1, 2, or 3.

6. The detection software includes a message instructing an operation procedure for the fault detection software in a detection signal, and the fault detection software has procedure information, and the procedure instructed by the message and the procedure information. 5. The computer system according to claim 1, wherein the computer system executes.

7. The computer system comprises a plurality of computers, each computer is provided with an investigation program for investigating the status of each computer and outputting it to the failure detection software, and the failure detection software is provided from the investigation program. 4. The computer system according to claim 1, wherein the software under test in which the failure is detected is executed by another computer based on the investigation result.

8. In a computer system to which a plurality of computers having a main storage device and a secondary storage device are connected, a division saving means for dividing the contents of the main storage device and saving the contents in the secondary storage devices of the plurality of computers is provided. A computer system characterized by having.

9. The computer system according to claim 8, wherein the computer system is provided with a management table in which a division save destination is set in advance.

10. The computer system is further connected to a plurality of computers having a secondary storage device on a network, and the partition saving means saves data to the plurality of secondary storage devices using the network. The computer system according to claim 8, which is characterized in that:

11. A computer system having the following elements: (a) a secondary storage device for storing data, (b) a main storage device for temporarily storing the data, and (c) a main storage device. Storage means for storing, as management information, data for judging whether the data is written back to the secondary storage device and the validity of the data, and (d) the management information stored by the storage means when a failure occurs. A write-back unit that determines, out of the data stored in the main storage device, data that has not been written back to the secondary storage device, checks its validity, and writes back the data.

12. The computer system is a multiple system including a plurality of secondary storage devices, and the write-back means writes back data to the plurality of secondary storage devices. A computer system according to item 11.

13. A computer system having the following elements: (a) a secondary storage device for storing data, (b) a writing means for writing data to the secondary storage device, and (c) an update by the writing means. When trying to save the original data to a non-volatile storage medium, a mark indicating the start of update,
Save means for marking the save data with a mark indicating update completion after update completion, (d) when a failure occurs, the data saved by the save means is checked, and there is a mark indicating update start,
Resuming means for recovering the data in the secondary storage device by using the saved data by regarding the data without the update completion mark as an error during the data update, and (e) releasing the saved data when the update is successful. means.