CN102521066A - On-board computer space environment event fault tolerance method - Google Patents
On-board computer space environment event fault tolerance method Download PDFInfo
- Publication number
- CN102521066A CN102521066A CN2011103619895A CN201110361989A CN102521066A CN 102521066 A CN102521066 A CN 102521066A CN 2011103619895 A CN2011103619895 A CN 2011103619895A CN 201110361989 A CN201110361989 A CN 201110361989A CN 102521066 A CN102521066 A CN 102521066A
- Authority
- CN
- China
- Prior art keywords
- board computer
- software
- fault tolerance
- computer
- space
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 20
- 230000002159 abnormal effect Effects 0.000 claims description 10
- 238000012937 correction Methods 0.000 claims description 4
- LMDZBCPBFSXMTL-UHFFFAOYSA-N 1-ethyl-3-(3-dimethylaminopropyl)carbodiimide Chemical compound CCN=C=NCCCN(C)C LMDZBCPBFSXMTL-UHFFFAOYSA-N 0.000 abstract description 20
- 230000005855 radiation Effects 0.000 abstract description 14
- 238000001514 detection method Methods 0.000 abstract description 5
- 238000012545 processing Methods 0.000 abstract description 5
- 230000008859 change Effects 0.000 abstract description 4
- 238000007689 inspection Methods 0.000 abstract 1
- 230000000737 periodic effect Effects 0.000 abstract 1
- 230000006870 function Effects 0.000 description 6
- 230000007613 environmental effect Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000012795 verification Methods 0.000 description 3
- 238000013461 design Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000007488 abnormal function Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Images
Landscapes
- Techniques For Improving Reliability Of Storages (AREA)
- Hardware Redundancy (AREA)
Abstract
星载计算机空间环境事件容错方法,主要包括存储器单粒子翻转的处理、空间辐照引起的芯片内部寄存器变化容错、空间辐照引起的部分电路失效容错。对于存储器的单粒子翻转,星载计算机通过对存储区加EDAC校验,定期对存储区读写进行容错。对于空间辐照引起的芯片内部寄存器变化,星载计算机对未用中断进行保护;对工作模式寄存器,采用定期循检,若不为期望值则重新初始化;对与总线发送消息相关的寄存器,每次发送消息前重新对存储器进行赋值。对于空间辐照引起的部分电路失效,采用故障RAM存储器芯片的替换、总线接口芯片故障检测与切换、CPU芯片故障检测与切换进行容错。本发明方法可以有效提高星载计算机发射和在轨运行的可靠性。
The space environment event fault tolerance method of the spaceborne computer mainly includes the processing of memory single event flipping, the fault tolerance of the internal register change of the chip caused by space radiation, and the fault tolerance of some circuit failures caused by space radiation. For the single event flip of the memory, the on-board computer periodically reads and writes the memory area for fault tolerance by adding EDAC checks to the memory area. For the change of the chip's internal registers caused by space radiation, the onboard computer protects unused interrupts; for the working mode registers, it adopts periodic inspections, and if it is not the expected value, it is re-initialized; for the registers related to the bus sending messages, each time Reassign the memory before sending the message. For some circuit failures caused by space radiation, the replacement of faulty RAM memory chips, bus interface chip fault detection and switching, and CPU chip fault detection and switching are used for fault tolerance. The method of the invention can effectively improve the reliability of launch and on-orbit operation of the spaceborne computer.
Description
技术领域 technical field
本发明涉及一种星载计算机容错方法。The invention relates to a fault-tolerant method for an on-board computer.
背景技术 Background technique
航天器在整个发射过程和运行过程中,由于空间环境、航天器特性等各种各样的原因会出现各种空间环境事件,不加以处理会引起卫星系统功能的失效甚至崩溃,因此应当采取措施应对这些异常状况,使卫星能够继续正确、稳定的运行,从而保障整个卫星系统的稳定的运行和服务。During the entire launch process and operation of the spacecraft, various space environmental events will occur due to various reasons such as the space environment and the characteristics of the spacecraft. Failure to deal with them will cause the failure or even collapse of the satellite system functions, so measures should be taken To deal with these abnormal conditions, the satellite can continue to operate correctly and stably, thereby ensuring the stable operation and service of the entire satellite system.
空间环境事件主要包括:存储器单粒子翻转;空间辐照引起的芯片内部寄存器变化;空间辐照引起的部分电路失效等。存储器单粒子翻转会导致星上软件或FPGA运行结果错误,甚至软件的跑飞跑死。空间辐照引起的芯片内部寄存器变化,会导致航天器某些芯片的功能异常,进而影响功能的实现。空间辐照引起的部分电路失效,主要是指单粒子闩锁后引起的部分电路失效。Space environmental events mainly include: memory single event flipping; changes in chip internal registers caused by space radiation; partial circuit failures caused by space radiation, etc. A memory single-event flip will cause errors in the running results of the on-board software or FPGA, or even cause the software to run away and die. Changes in the chip's internal registers caused by space radiation will cause abnormal functions of some chips in the spacecraft, which will affect the realization of functions. The partial circuit failure caused by space radiation mainly refers to the partial circuit failure caused by single event latch-up.
目前,星载计算机空间环境异常事件的容错方法尚未得到系统的研究。At present, the fault-tolerant methods for space-borne computer space environment abnormal events have not been systematically studied.
发明内容 Contents of the invention
本发明的技术解决问题是:克服现有技术的不足,提供了一种星载计算机空间环境事件的容错方法,以此建立一种适用于星载计算机设计的空间环境事件容错策略,提高星载计算机发射和在轨运行的可靠性。The technical problem of the present invention is: to overcome the deficiencies of the prior art, to provide a fault-tolerant method for the space environment event of the space-borne computer, to establish a space-environment event fault-tolerant strategy suitable for the design of the space-borne computer, and to improve the Reliability of computer launches and in-orbit operations.
本发明的技术解决方案是:星载计算机空间环境事件容错方法,步骤如下:The technical solution of the present invention is: a space-borne computer space environment event fault-tolerant method, the steps are as follows:
(1)星载计算机初始上电运行后,首先检测星载计算机软件是否可以正常启动;若星载计算机软件可以启动,则由星载计算机软件以固定周期喂软件看门狗,星载计算机软件正常运行;若星载计算机软件无法启动或者星载计算机软件以固定周期喂软件看门狗失败,则复位电路向星载计算机提供复位信号,星载计算机重新开始运行;若星载计算机连续三次无法正常启动,则切换至备份星载计算机;(1) After the on-board computer is initially powered on and running, first check whether the on-board computer software can be started normally; Normal operation; if the on-board computer software cannot be started or the on-board computer software fails to feed the software watchdog in a fixed period, the reset circuit will provide a reset signal to the on-board computer, and the on-board computer will restart; if the on-board computer fails three times in a row If it starts normally, switch to the backup onboard computer;
(2)星载计算机软件正常运行后,向所有RAM发送读写信号;若有RAM区读写不正常,则星载计算机通过软件配置,使用备份RAM替换读写不正常的RAM;(2) After the software of the on-board computer is running normally, it sends read and write signals to all RAMs; if there is abnormal reading and writing in the RAM area, the on-board computer uses the backup RAM to replace the abnormal RAM through software configuration;
(3)星载计算机软件正常运行时,周期性的向各总线终端发送轮询总线消息,当所有总线终端都不通时,星载计算机软件向星载计算机发送切机信号,星载计算机切换至备份机;(3) When the on-board computer software is running normally, it periodically sends polling bus messages to each bus terminal. backup machine;
(4)星载计算机软件正常运行时,对所有实际使用的中断源允许,同时屏蔽其它中断源;当星载计算机响应中断时,首先对中断源进行确认,当中断不是来自实际使用的中断之一时,重新对中断屏蔽寄存器进行初始化;(4) When the on-board computer software is running normally, all interrupt sources actually used are allowed, and other interrupt sources are blocked at the same time; when the on-board computer responds to an interrupt, the interrupt source is first confirmed. For a while, re-initialize the interrupt mask register;
(5)星载计算机软件正常运行时,对总线驱动芯片中处于工作状态的寄存器数值是否发生变化进行定期检查,若有寄存器的数值发生变化,则星载计算机重新初始化该寄存器及相关寄存器;同时,对于仅在部分时间有效的寄存器状态,在每次到达有效时间时对这些寄存器重新赋值;(5) When the on-board computer software is running normally, check regularly whether the value of the register in the bus driver chip in the working state changes. If the value of any register changes, the on-board computer re-initializes the register and related registers; at the same time , for register states that are only valid for part of the time, these registers are reassigned each time the valid time is reached;
(6)星载计算机软件正常运行时,利用汉明码对每个内存地址的数据计算校验和,并将校验和进行存储;星载计算机周期对每个内存地址的数据进行检查,当发现校验单bit错误时,进行纠错;当发现两bit或以上错误时,对星载计算机进行复位,重新启动。(6) When the on-board computer software is running normally, use the Hamming code to calculate the checksum of the data of each memory address, and store the checksum; the on-board computer periodically checks the data of each memory address, and when found When checking a single bit error, perform error correction; when two or more bit errors are found, reset the on-board computer and restart it.
本发明与现有技术相比的优点在于:The advantage of the present invention compared with prior art is:
(1)本发明星载计算机空间环境事件的容错方法主要针对空间环境引起的特殊事件,分不同的方法来进行容错,可以有效提高星载计算机在轨运行的可靠性;(1) The fault-tolerant method of the space environment event of the space-borne computer of the present invention is mainly aimed at the special event caused by the space environment, and divides different methods to carry out fault-tolerance, which can effectively improve the reliability of the on-orbit operation of the space-borne computer;
(2)采用星载计算机软件实现本发明星载计算机空间环境事件的容错,可以提高卫星的自主管理能力;(2) adopting the spaceborne computer software to realize the fault tolerance of the space environment event of the spaceborne computer of the present invention can improve the self-management capability of the satellite;
(3)本发明星载计算机空间环境事件的容错方法在硬件支持下,可主要采用软件完成星载计算机的检错、容错,原理简单、实现容易,可维护性强,适用于绝大多数卫星,可推广性强。(3) The fault-tolerant method of the space environment event of the space-borne computer of the present invention can mainly adopt software to complete the error detection and fault-tolerance of the space-borne computer under the support of hardware, the principle is simple, the realization is easy, the maintainability is strong, and it is suitable for most satellites , and is highly scalable.
附图说明 Description of drawings
图1为本发明方法的流程框图;Fig. 1 is a block flow diagram of the inventive method;
图2为本发明方法的具体容错内容组成图;Fig. 2 is the specific fault-tolerant content composition figure of the inventive method;
图3为本发明实施例中星载计算机硬件配置图。Fig. 3 is a hardware configuration diagram of the on-board computer in the embodiment of the present invention.
具体实施方式 Detailed ways
本发明星载计算机空间环境异常事件的容错是利用星载计算机的软硬件资源,根据空间环境事件的不同类型,进行不同的处理;同时又能满足星载计算机的重量、功耗受限的要求。The fault tolerance of the space environment abnormal event of the space-borne computer of the present invention utilizes the software and hardware resources of the space-borne computer to perform different processing according to different types of space environment events; at the same time, it can meet the requirements of the weight and power consumption of the space-borne computer. .
如图1所示,本发明方法针对空间环境引起的单粒子翻转、单粒子闩锁等事件进行不同类型的容错,适用于大多数航天器的应用,可以提高卫星设备的在轨自主能力和可靠性。主要包括存储器单粒子翻转的处理、空间辐照引起的芯片内部寄存器变化容错、空间辐照引起的部分电路失效容错三个方面,如图2所示。As shown in Figure 1, the method of the present invention performs different types of fault tolerance for events such as single event flipping and single event latching caused by the space environment, and is suitable for the application of most spacecraft, which can improve the on-orbit autonomy and reliability of satellite equipment. sex. It mainly includes three aspects: the processing of memory single event flipping, the fault tolerance of chip internal register changes caused by space radiation, and the fault tolerance of some circuit failures caused by space radiation, as shown in Figure 2.
(1)存储器单粒子翻转的处理(1) Processing of memory single event upset
对于存储器的单粒子翻转,星载计算机通过对存储区加EDAC校验,通过定期读写来对存储区进行校验。由于EDAC校验码的特点是“检一纠二”,即当发生单bit错时可以纠错,当发生双bit或多bit错时无法纠错,仅能报错。因此星载计算机硬件设计有存储器的EDAC校验电路,每当EDAC校验未通过时,软件会产生一个中断,软件在中断中通过读取EDAC校验状态,判断是单bit错还是多bit错,如果是单bit错则通过读取数据的重写来纠正存储器中的单bit错,如果是双bit错则通过软件自主复位来消除双bit错的影响。For the single event reversal of the memory, the on-board computer checks the storage area by adding EDAC to the storage area, and verifies the storage area by reading and writing regularly. The EDAC check code is characterized by "detect one and correct two", that is, it can correct errors when a single-bit error occurs, but cannot correct errors when a double-bit or multi-bit error occurs, and can only report an error. Therefore, the hardware of the on-board computer is designed with an EDAC verification circuit of the memory. Whenever the EDAC verification fails, the software will generate an interrupt. The software will read the EDAC verification status during the interrupt to determine whether it is a single-bit error or a multi-bit error. , if it is a single-bit error, the single-bit error in the memory is corrected by rewriting the read data, and if it is a double-bit error, the influence of the double-bit error is eliminated by software autonomous reset.
(2)空间辐照引起的芯片内部寄存器变化(2) Changes in chip internal registers caused by space radiation
芯片内部寄存器是芯片在设计之初,为方便使用而留给用户的接口,寄存器的不同数值会引起芯片工作模式、主要功能的变化。空间辐照引起的芯片内部寄存器变化,会引起星载计算机正常功能执行不正确。星载计算机针对芯片内部寄存器的变化主要采用以下几种手段:对未用中断进行保护,防止中断相关寄存器变化引起的不确定中断;对工作模式寄存器,采用定期循检,若不为期望值则重新初始化;对与总线发送消息相关的寄存器,每次发送消息前重新对存储器进行赋值。The internal register of the chip is the interface left to the user for the convenience of use at the beginning of the chip design. Different values of the register will cause changes in the working mode and main functions of the chip. Changes in the chip's internal registers caused by space radiation will cause incorrect execution of the normal functions of the on-board computer. The on-board computer mainly adopts the following methods for the change of the internal registers of the chip: protect unused interrupts to prevent uncertain interrupts caused by changes in interrupt-related registers; Initialization; for registers related to sending messages on the bus, re-assign the memory before sending messages each time.
(3)空间辐照引起的部分电路失效(3) Partial circuit failure caused by space radiation
空间辐照引起的部分电路失效,主要是指单粒子闩锁后引起的部分电路失效。星载计算机针对部分电路采用了故障隔离与系统重构机制,消除部分电路单粒子闩锁的影响。主要有故障RAM存储器芯片的替换、总线接口芯片故障检测与切换、CPU芯片故障检测与切换。星载计算机采用备份冗余策略,当某块RAM芯片无法正常读写时,切换为备份RAM;当总线接口芯片或CPU芯片异常后,自主切换为备份机。The partial circuit failure caused by space radiation mainly refers to the partial circuit failure caused by single event latch-up. The on-board computer adopts a fault isolation and system reconfiguration mechanism for some circuits to eliminate the influence of single event latch-up in some circuits. Mainly include replacement of faulty RAM memory chips, fault detection and switching of bus interface chips, fault detection and switching of CPU chips. The on-board computer adopts a backup redundancy strategy. When a RAM chip cannot be read and written normally, it will switch to the backup RAM; when the bus interface chip or CPU chip is abnormal, it will automatically switch to the backup machine.
本发明方法的主要步骤如下:The main steps of the inventive method are as follows:
(1)星载计算机初始上电运行;(1) Initial power-on operation of the on-board computer;
(2)检测星载计算机软件是否可以正常启动,若启动,则由软件固定周期喂狗,软件正常运行;否则,软件无法喂狗,复位电路向星载计算机提供复位信号,星载计算机重新开始运行;若连续3次无法正常启动,则切向备份星载计算机。(2) Check whether the software of the onboard computer can be started normally. If it is started, the software will feed the dog at a fixed period, and the software will run normally; otherwise, the software cannot feed the dog, and the reset circuit will provide a reset signal to the onboard computer, and the onboard computer will restart running; if it fails to start normally for 3 consecutive times, switch to back up the on-board computer.
(3)星载计算机软件运行后,向所有RAM发送读写信号,若有RAM区读写不正常,则说明RAM由于不明原因被破坏。此时星载计算机软件则通过配置,使用备份RAM。(3) After the on-board computer software is running, it sends read and write signals to all RAMs. If there is abnormal reading and writing in the RAM area, it means that the RAM is damaged due to unknown reasons. At this time, the onboard computer software is configured to use the backup RAM.
(4)星载计算机运行后,周期性向各总线终端发送轮询总线消息,当所有总线终端都不通时,证明总线驱动芯片由于某种原因损坏。此时星载计算机软件向星载计算机发送切机信号,星载计算机切备份机,使用另一片总线驱动芯片。(4) After the on-board computer is running, it periodically sends polling bus messages to each bus terminal. When all bus terminals fail, it proves that the bus driver chip is damaged for some reason. At this time, the software of the on-board computer sends a switch-off signal to the on-board computer, and the on-board computer switches off the backup machine and uses another bus driver chip.
(5)星载计算机运行时,对所有实际使用的中断源允许,屏蔽其它中断源。当星载计算机响应中断时,首先对中断源进行确认,当中断不是来自实际使用的中断之一时,说明中断屏蔽寄存器发生单粒子翻转,重新对中断屏蔽寄存器进行初始化。(5) When the on-board computer is running, all interrupt sources actually used are allowed, and other interrupt sources are shielded. When the onboard computer responds to an interrupt, it first confirms the source of the interrupt. If the interrupt is not from one of the actually used interrupts, it means that a single event flip occurs in the interrupt mask register, and the interrupt mask register is re-initialized.
(6)星载计算机运行时,对总线驱动芯片中处于工作状态的寄存器数值是否发生变化进行定期检查,当发生变化时,说明该寄存器受到单粒子的影响。此时,星载计算机重新初始化该寄存器及相关寄存器。(6) When the on-board computer is running, regularly check whether the value of the register in the bus driver chip in the working state changes. When there is a change, it means that the register is affected by a single event. At this point, the onboard computer reinitializes this register and related registers.
(7)星载计算机运行时,有一些寄存器的状态仅在部分时间有效,在每次需要使用这些寄存器时对它们重新赋值,消除在此之前这些寄存器可能受到的单粒子影响。(7) When the on-board computer is running, the status of some registers is only valid for part of the time, and these registers are reassigned every time these registers are needed to eliminate the possible single event impact on these registers before that.
(8)星载计算机运行时,利用汉明码对每个内存地址的数据计算校验和,并将校验和存储起来。星载计算机周期对每个内存地址的数据进行检查,当发现校验单bit错误时(即单粒子翻转),进行纠错;当发现多bit错时,对计算机进行复位,重新加载程序。(8) When the on-board computer is running, the Hamming code is used to calculate the checksum of the data of each memory address and store the checksum. The on-board computer periodically checks the data of each memory address, and when a single-bit error is found (that is, a single event flip), error correction is performed; when multiple-bit errors are found, the computer is reset and the program is reloaded.
(9)重复运行步骤(4)~(8),对空间环境事件进行容错处理。(9) Repeat steps (4) to (8) to perform fault-tolerant processing on space environment events.
实施例Example
下面以某卫星为例,介绍星载计算机的空间环境事件容错策略:Taking a certain satellite as an example, the space environment event fault-tolerant strategy of the space-borne computer is introduced below:
如图3所示,某卫星的星载计算机采用TSC695f作为cpu,自带EDAC电路,同时具有冗余RAM的替换电路;星载计算机使用61580作为总线的接口芯片;星载计算机具有128K的PROM和8M的RAM,RAM芯片由4片容量为2M的9Q512K32组成,系统同时备份有1片2M的RAM;同时具有遥测接口和遥控接口。应用软件在操作系统之上完成各项应用层的功能。星载计算机具有双机冷备份,每个单机的组成完全相同。As shown in Figure 3, the onboard computer of a certain satellite uses TSC695f as the cpu, comes with EDAC circuit, and has a redundant RAM replacement circuit; the onboard computer uses 61580 as the interface chip of the bus; the onboard computer has 128K PROM and 8M RAM, RAM chip is composed of 4 pieces of 9Q512K32 with a capacity of 2M, and the system is backed up with a piece of 2M RAM at the same time; it also has a telemetry interface and a remote control interface. Application software completes the functions of various application layers on top of the operating system. The on-board computer has a dual-machine cold backup, and the composition of each single machine is exactly the same.
在星载计算机启动过程中,操作系统首先对4片RAM进行自检,若某片RAM读写不正常,则采用备份RAM进行替代,若替换后仍不正常,则复位。During the start-up process of the on-board computer, the operating system first performs a self-test on the 4 pieces of RAM. If the reading and writing of a certain piece of RAM is not normal, the backup RAM is used to replace it. If it is still not normal after replacement, it is reset.
TSC695f自带EDAC电路,当存储区的EDAC校验不过时,会产生相应的中断,并纪录此时为单bit错还是双bit错,并记录此时读取到的数值。应用软件初始化过程中,将此中断挂接;当产生此中断后,软件首先判断是否为单bit错,若为单bit错则将EDAC纠错后的数值回写到RAM区内,消除单bit影响;若为双bit错,则立即复位。TSC695f has its own EDAC circuit. When the EDAC in the storage area fails to check, it will generate a corresponding interrupt, and record whether it is a single-bit error or a double-bit error at this time, and record the value read at this time. During the initialization process of the application software, hook up this interrupt; when this interrupt occurs, the software first judges whether it is a single-bit error, and if it is a single-bit error, it will write back the value after EDAC error correction to the RAM area to eliminate the single-bit error. Influence; if it is a double bit error, it will be reset immediately.
星载计算机应用软件每0.5秒对61580芯片的工作模式及695f芯片的工作模式进行一次检查,若不为设定的值则重新赋值;应用软件在每次发送总线消息时,对61580芯片的有关消息发送的寄存器进行重新赋值;操作系统软件对未用中断进行保护,当未用中断异常发生时,清除中断状态寄存器的相应位,并退出中断响应程序。The application software of the on-board computer checks the working mode of the 61580 chip and the working mode of the 695f chip every 0.5 seconds, and re-assigns the value if it is not the set value; The register for message sending is reassigned; the operating system software protects unused interrupts, and when an unused interrupt exception occurs, clears the corresponding bit of the interrupt status register and exits the interrupt response program.
星载计算机具有自主切机功能,当应用软件检测到所有总线终端不通时,认为61580芯片出现故障,立即切机;当CPU芯片出现异常,通过看门狗进行复位,若复位3次仍不能恢复,则立即切机,从而隔离空间环境事件引起故障的部位。The on-board computer has the function of self-shutdown. When the application software detects that all bus terminals are disconnected, it considers that the 61580 chip is faulty and immediately shuts down the machine. When the CPU chip is abnormal, reset it through the watchdog. If it is reset for 3 times, it still cannot be recovered. , then shut down the machine immediately, thereby isolating the parts that cause the failure due to space environmental events.
本发明说明书中未作详细描述的内容属本领域技术人员的公知技术。The content that is not described in detail in the description of the present invention belongs to the well-known technology of those skilled in the art.
Claims (1)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN2011103619895A CN102521066A (en) | 2011-11-15 | 2011-11-15 | On-board computer space environment event fault tolerance method |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN2011103619895A CN102521066A (en) | 2011-11-15 | 2011-11-15 | On-board computer space environment event fault tolerance method |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN102521066A true CN102521066A (en) | 2012-06-27 |
Family
ID=46292001
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN2011103619895A Pending CN102521066A (en) | 2011-11-15 | 2011-11-15 | On-board computer space environment event fault tolerance method |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN102521066A (en) |
Cited By (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103076779A (en) * | 2012-12-28 | 2013-05-01 | 中国人民解放军国防科学技术大学 | Independent control method and device of satellite-borne equipment on microsatellite |
| CN103984630A (en) * | 2014-05-27 | 2014-08-13 | 中国科学院空间科学与应用研究中心 | Single event upset fault processing method based on AT697 processor |
| CN104035828A (en) * | 2014-05-19 | 2014-09-10 | 上海微小卫星工程中心 | FPGA space irradiation comprehensive protection method and device |
| CN104246629A (en) * | 2012-10-02 | 2014-12-24 | 富士电机株式会社 | Redundant computation processing system |
| CN106354579A (en) * | 2016-10-14 | 2017-01-25 | 上海微小卫星工程中心 | Spaceborne computer |
| CN107273240A (en) * | 2017-05-18 | 2017-10-20 | 北京空间飞行器总体设计部 | A kind of spaceborne phased array TR components single-particle inversion means of defence |
| CN108021473A (en) * | 2017-11-29 | 2018-05-11 | 山东航天电子技术研究所 | The aerospace computer system and safe starting method that a kind of more backups start |
| CN109001778A (en) * | 2018-05-21 | 2018-12-14 | 北京空间飞行器总体设计部 | A kind of processing method based on satellite-based navigation satellite receiving system single event |
| CN109491290A (en) * | 2018-11-16 | 2019-03-19 | 西安空间无线电技术研究所 | A kind of cold standby bus complexing circuit suitable for digital processing system |
| CN109739697A (en) * | 2018-12-13 | 2019-05-10 | 北京计算机技术及应用研究所 | A kind of hard real-time two-shipper synchronous fault-tolerant system based on high-speed data exchange |
| CN111708695A (en) * | 2020-06-12 | 2020-09-25 | 上海航天计算机技术研究所 | The verification method of cache anti-single event flip effect based on AT697 |
| CN112860467A (en) * | 2021-01-20 | 2021-05-28 | 北京国电高科科技有限公司 | On-orbit fault smooth repairing device and method for satellite-borne computer |
| CN113744787A (en) * | 2021-07-27 | 2021-12-03 | 北京空间飞行器总体设计部 | A single event flip fault injection method for SRAM type FPGA user register |
| CN114090327A (en) * | 2022-01-20 | 2022-02-25 | 浙江吉利控股集团有限公司 | Single event error handling method, system and device |
| CN118377645A (en) * | 2024-06-24 | 2024-07-23 | 之江实验室 | An abnormality handling system, method and detection device for space environment |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7805245B2 (en) * | 2007-04-18 | 2010-09-28 | Honeywell International Inc. | Inertial measurement unit fault detection isolation reconfiguration using parity logic |
| CN101907888A (en) * | 2010-07-29 | 2010-12-08 | 航天东方红卫星有限公司 | A non-disruptive switching method for dual-computer cold standby in the small satellite satellite service system |
-
2011
- 2011-11-15 CN CN2011103619895A patent/CN102521066A/en active Pending
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7805245B2 (en) * | 2007-04-18 | 2010-09-28 | Honeywell International Inc. | Inertial measurement unit fault detection isolation reconfiguration using parity logic |
| CN101907888A (en) * | 2010-07-29 | 2010-12-08 | 航天东方红卫星有限公司 | A non-disruptive switching method for dual-computer cold standby in the small satellite satellite service system |
Non-Patent Citations (2)
| Title |
|---|
| 段星辉等: "一种提高星载软件可靠性的开发方法", 《计算机工程》 * |
| 贾文涛等: "一种高可靠双机温备星载计算机的设计与实现", 《计算机研究与发展》 * |
Cited By (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104246629A (en) * | 2012-10-02 | 2014-12-24 | 富士电机株式会社 | Redundant computation processing system |
| CN104246629B (en) * | 2012-10-02 | 2016-10-12 | 富士电机株式会社 | redundant operation processing system |
| CN103076779A (en) * | 2012-12-28 | 2013-05-01 | 中国人民解放军国防科学技术大学 | Independent control method and device of satellite-borne equipment on microsatellite |
| CN104035828A (en) * | 2014-05-19 | 2014-09-10 | 上海微小卫星工程中心 | FPGA space irradiation comprehensive protection method and device |
| CN103984630A (en) * | 2014-05-27 | 2014-08-13 | 中国科学院空间科学与应用研究中心 | Single event upset fault processing method based on AT697 processor |
| CN103984630B (en) * | 2014-05-27 | 2017-02-01 | 中国科学院空间科学与应用研究中心 | Single event upset fault processing method based on AT697 processor |
| CN106354579B (en) * | 2016-10-14 | 2019-07-19 | 上海微小卫星工程中心 | Onboard computer |
| CN106354579A (en) * | 2016-10-14 | 2017-01-25 | 上海微小卫星工程中心 | Spaceborne computer |
| CN107273240A (en) * | 2017-05-18 | 2017-10-20 | 北京空间飞行器总体设计部 | A kind of spaceborne phased array TR components single-particle inversion means of defence |
| CN107273240B (en) * | 2017-05-18 | 2020-04-28 | 北京空间飞行器总体设计部 | Single event upset protection method for satellite-borne phased array TR (transmitter-receiver) assembly |
| CN108021473A (en) * | 2017-11-29 | 2018-05-11 | 山东航天电子技术研究所 | The aerospace computer system and safe starting method that a kind of more backups start |
| CN109001778A (en) * | 2018-05-21 | 2018-12-14 | 北京空间飞行器总体设计部 | A kind of processing method based on satellite-based navigation satellite receiving system single event |
| CN109491290A (en) * | 2018-11-16 | 2019-03-19 | 西安空间无线电技术研究所 | A kind of cold standby bus complexing circuit suitable for digital processing system |
| CN109739697A (en) * | 2018-12-13 | 2019-05-10 | 北京计算机技术及应用研究所 | A kind of hard real-time two-shipper synchronous fault-tolerant system based on high-speed data exchange |
| CN111708695A (en) * | 2020-06-12 | 2020-09-25 | 上海航天计算机技术研究所 | The verification method of cache anti-single event flip effect based on AT697 |
| CN112860467A (en) * | 2021-01-20 | 2021-05-28 | 北京国电高科科技有限公司 | On-orbit fault smooth repairing device and method for satellite-borne computer |
| CN113744787A (en) * | 2021-07-27 | 2021-12-03 | 北京空间飞行器总体设计部 | A single event flip fault injection method for SRAM type FPGA user register |
| CN113744787B (en) * | 2021-07-27 | 2023-09-08 | 北京空间飞行器总体设计部 | A single-event flip fault injection method for SRAM-type FPGA user registers |
| CN114090327A (en) * | 2022-01-20 | 2022-02-25 | 浙江吉利控股集团有限公司 | Single event error handling method, system and device |
| CN118377645A (en) * | 2024-06-24 | 2024-07-23 | 之江实验室 | An abnormality handling system, method and detection device for space environment |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN102521066A (en) | On-board computer space environment event fault tolerance method | |
| JP7351933B2 (en) | Error recovery method and device | |
| US10078565B1 (en) | Error recovery for redundant processing circuits | |
| CN104731670B (en) | A kind of rotation formula spaceborne computer tolerant system towards satellite | |
| US20030208654A1 (en) | Computer system architecture with hot pluggable main memory boards | |
| EP2770507B1 (en) | Memory circuits, method for accessing a memory and method for repairing a memory | |
| CN104932954B (en) | Microsatellite FPGA critical data guard methods | |
| JPH05346866A (en) | System and method for establishing writing data maintenance in redundant array data storage system | |
| KR20010005956A (en) | Fault tolerant computer system | |
| US10691565B2 (en) | Storage control device and storage control method | |
| US20110043323A1 (en) | Fault monitoring circuit, semiconductor integrated circuit, and faulty part locating method | |
| CN108958987B (en) | A low-orbit small satellite fault tolerance system and method | |
| US7366948B2 (en) | System and method for maintaining in a multi-processor system a spare processor that is in lockstep for use in recovering from loss of lockstep for another processor | |
| KR20140079285A (en) | Salvaging event trace information in power loss interruption scenarios | |
| CN103744754B (en) | A kind of radiation hardening walks abreast board computer system and using method thereof | |
| US20140181435A1 (en) | Multiple computer system processing write data outside of checkpointing | |
| US12169436B2 (en) | Serial attached non-volatile memory | |
| CN106021169A (en) | Aerospace computer and refreshing and upgrading method for nonvolatile storage apparatus thereof | |
| CN119248578B (en) | Microprocessor fault recovery device and method and chip | |
| US7502958B2 (en) | System and method for providing firmware recoverable lockstep protection | |
| CN110727544A (en) | Microsatellite satellite-borne computer system based on industrial devices | |
| WO2015068285A1 (en) | Programmable device and electronic system device using same | |
| CN105068969A (en) | Single event effect protection system and method for digital signal processing platform architecture | |
| JP3068009B2 (en) | Error correction mechanism for redundant memory | |
| CN106980557B (en) | Storage partition-based satellite-borne software heterogeneous backup method |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
| WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20120627 |