[go: up one dir, main page]

CN105094696B - Based on RAID1 and RAID4 mixed structures transfer process data reliability ensuring method and device - Google Patents

Based on RAID1 and RAID4 mixed structures transfer process data reliability ensuring method and device Download PDF

Info

Publication number
CN105094696B
CN105094696B CN201510390976.9A CN201510390976A CN105094696B CN 105094696 B CN105094696 B CN 105094696B CN 201510390976 A CN201510390976 A CN 201510390976A CN 105094696 B CN105094696 B CN 105094696B
Authority
CN
China
Prior art keywords
raid4
subregion
data
area
raid1
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201510390976.9A
Other languages
Chinese (zh)
Other versions
CN105094696A (en
Inventor
贾思懿
许鲁
董欢庆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin Zhongke Bluewhale Information Technology Co ltd
Institute of Computing Technology of CAS
Original Assignee
Tianjin Zhongke Bluewhale Information Technology Co ltd
Institute of Computing Technology of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin Zhongke Bluewhale Information Technology Co ltd, Institute of Computing Technology of CAS filed Critical Tianjin Zhongke Bluewhale Information Technology Co ltd
Priority to CN201510390976.9A priority Critical patent/CN105094696B/en
Publication of CN105094696A publication Critical patent/CN105094696A/en
Application granted granted Critical
Publication of CN105094696B publication Critical patent/CN105094696B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Debugging And Monitoring (AREA)

Abstract

本发明涉及计算机存储领域,本发明提出基于RAID1和RAID4混合结构转换过程数据可靠性保证方法及装置,该方法包括:获取校验节点中BRD的占用空间值,当所述占用空间值达到设定阈值时,将所述BRD所在的区域加入suspend_area列表,将所述区域转换为对应的多个数据节点的子区域,并向所述子区域发送挂起操作请求,所述数据节点收到所述挂起操作请求后,挂起所述子区域的写请求;对所述子区域的数据进行RAID4校验计算,生成的校验数据写入逻辑卷设备,并向所述数据节点发送重映射请求,所述数据节点将所述子区域的所述BRD重映射到ARP,以完成数据转换。本发明降低RAID1向RAID4转换的过程与I/O操作在短时间内互相影响的概率,确保转换流程与写操作的异步化程度。

The present invention relates to the field of computer storage. The present invention proposes a data reliability guarantee method and device based on RAID1 and RAID4 hybrid structure conversion process. When the threshold is reached, the area where the BRD is located is added to the suspend_area list, the area is converted into a sub-area corresponding to multiple data nodes, and a suspend operation request is sent to the sub-area, and the data node receives the After suspending the operation request, suspend the write request of the sub-area; perform RAID4 verification calculation on the data of the sub-area, write the generated verification data into the logical volume device, and send a remapping request to the data node , the data node remaps the BRD of the sub-area to ARP to complete data conversion. The invention reduces the probability that the conversion process from RAID1 to RAID4 and the I/O operation interact with each other in a short time, and ensures the asynchronous degree of conversion process and writing operation.

Description

基于RAID1和RAID4混合结构转换过程数据可靠性保证方法及 装置Data reliability guarantee method and method for conversion process based on RAID1 and RAID4 hybrid structure device

技术领域technical field

本发明涉及计算机存储领域,尤其涉及基于RAID1和RAID4混合结构转换过程数据可靠性保证装置。The invention relates to the field of computer storage, in particular to a data reliability guarantee device for conversion process based on RAID1 and RAID4 hybrid structures.

背景技术Background technique

RAID(独立冗余磁盘阵列)技术是一种提供增强冗余、容量和存储性能的存储方法,有着较强的可管理性、可靠性和可用性。通过冗余计算,RAID技术能够在满足可靠性要求的前提下降低系统的容量开销,将RAID技术应用于网络存储系统的研究经历了从集中式控制到分布式控制,从单一RAID模式到混合RAID模式的发展过程,RAID1技术具有高可靠性,RAID1镜像使得写操作能有两份数据,即使其中一个节点故障,副本也能保证有一份正确的数据,RAID4冗余技术占用较少的冗余空间存储校验信息,能够容忍任意单节点故障,并可以通过其他节点的数据和校验信息恢复丢失的数据,系统可靠性比较好。RAID (Redundant Array of Independent Disks) technology is a storage method that provides enhanced redundancy, capacity and storage performance, and has strong manageability, reliability and availability. Through redundant computing, RAID technology can reduce system capacity overhead on the premise of meeting reliability requirements. The research on applying RAID technology to network storage systems has experienced from centralized control to distributed control, from single RAID mode to hybrid RAID During the development of the model, RAID1 technology has high reliability. RAID1 mirroring enables two copies of data in write operations. Even if one of the nodes fails, the copy can also guarantee a correct data. RAID4 redundancy technology occupies less redundant space Store verification information, can tolerate any single node failure, and can restore lost data through data and verification information of other nodes, and the system reliability is relatively good.

在大规模网络存储系统中需要考虑复杂网络环境对性能的影响,所以一些存储系统采用混合RAID模式。AutoRAID将最近使用的数据放在以RAID1存储的高性能的硬盘中,将不太常用的数据放在以RAID5存储的经济高效的硬盘中。DPGADR使用了与AutoRAID相似的结构,但更适用于分布式网络存储环境,采用复制和延迟校验块生成的方法提高系统性能。专利号为201410033455.3的中国专利——《RAID1和RAID4混合结构网络存储系统及方法》提供了在后端的冗余管理节点对数据节点进行集中冗余管理的方法,将冗余管理节点从数据通道的前端移到存储设备的后端,使得读写请求可以直接访问存储节点,减少了冗余计算对系统写性能的影响。In a large-scale network storage system, it is necessary to consider the impact of a complex network environment on performance, so some storage systems use a hybrid RAID mode. AutoRAID puts recently used data on high-performance hard disks stored in RAID1, and less frequently used data on cost-effective hard disks stored in RAID5. DPGADR uses a structure similar to AutoRAID, but is more suitable for distributed network storage environments, and uses replication and delayed check block generation to improve system performance. The Chinese patent No. 201410033455.3 - "RAID1 and RAID4 Mixed Structure Network Storage System and Method" provides a method for centralized redundant management of data nodes at the back-end redundant management node, and the redundant management node is separated from the data channel The front end is moved to the back end of the storage device, so that read and write requests can directly access the storage nodes, reducing the impact of redundant computing on system write performance.

AutoRAID中RAID1与RAID5数据是分离存储的,在网络环境下将大批数据从RAID1迁移到RAID5上,会带来巨大的网络带宽和磁盘带宽消耗,不适合在分布式环境下使用。而DPGADR中RAID5部分是降级方式存储,冷数据的读写性能较差。专利号为201410033455.3的中国专利——《RAID1和RAID4混合结构网络存储系统及方法》提出的后端冗余集中管理方式解决了集中式冗余管理方式性能瓶颈的问题,但数据同步过程和写I/O操作会相互影响,无法保证RAID1数据转换成RAID4数据过程中数据的一致性。In AutoRAID, RAID1 and RAID5 data are stored separately. Migrating a large amount of data from RAID1 to RAID5 in a network environment will bring huge consumption of network bandwidth and disk bandwidth, which is not suitable for use in a distributed environment. However, the RAID5 part of DPGADR is stored in a degraded mode, and the read and write performance of cold data is poor. The Chinese patent No. 201410033455.3 - "RAID1 and RAID4 mixed structure network storage system and method" proposes a back-end redundant centralized management method to solve the performance bottleneck of the centralized redundant management method, but the data synchronization process and write I The /O operations will affect each other, and the data consistency during the process of converting RAID1 data to RAID4 data cannot be guaranteed.

基于上述分析混合RAID结构网络存储技术在目前还存在以下局限:数据转换过程影响写I/O操作,无法保证数据转换的一致性。Based on the above analysis, the hybrid RAID structure network storage technology still has the following limitations: the data conversion process affects the write I/O operation, and the consistency of data conversion cannot be guaranteed.

针对以上考虑,本发明在中国专利《RAID1和RAID4混合结构网络存储系统及方法》基础上(系统结构如图示1和图示2),提出一种基于RAID1和RAID4混合结构的转换过程数据一致性保证方法。In view of the above considerations, on the basis of the Chinese patent "RAID1 and RAID4 Mixed Structure Network Storage System and Method" (system structure as shown in Figure 1 and Figure 2), the present invention proposes a conversion process data consistency based on RAID1 and RAID4 mixed structure security method.

发明内容Contents of the invention

针对现有技术的不足,本发明提出一种基于RAID1和RAID4混合结构转换过程数据可靠性保证装置。Aiming at the deficiencies of the prior art, the present invention proposes a device for ensuring data reliability during the conversion process based on the mixed structure of RAID1 and RAID4.

本发明提出一种基于RAID1和RAID4混合结构转换过程数据可靠性保证方法,包括:The present invention proposes a data reliability guarantee method based on RAID1 and RAID4 hybrid structure conversion process, including:

挂起步骤,获取校验节点中BRD的占用空间值,当所述占用空间值达到设定阈值时,将所述BRD所在的区域加入suspend_area列表,将所述区域转换为对应的多个数据节点的子区域,并向所述子区域发送挂起操作请求,所述数据节点收到所述挂起操作请求后,挂起所述子区域的写请求;The suspending step is to obtain the occupied space value of the BRD in the check node, and when the occupied space value reaches the set threshold, add the area where the BRD is located to the suspend_area list, and convert the area into corresponding multiple data nodes and send a suspend operation request to the subarea, and after receiving the suspend operation request, the data node suspends the write request of the subarea;

转换步骤,对所述子区域的数据进行RAID4校验计算,生成的校验数据写入逻辑卷设备,并向所述数据节点发送重映射请求,所述数据节点将所述子区域的所述BRD重映射到ARP,以完成数据转换。The conversion step is to perform RAID4 verification calculation on the data of the sub-area, write the generated verification data into a logical volume device, and send a remapping request to the data node, and the data node will transfer the data of the sub-area BRD is remapped to ARP to complete the data conversion.

所述的基于RAID1和RAID4混合结构转换过程数据可靠性保证方法,所述转换步骤包括:对成功执行挂起操作的子区域进行RAID4校验计算,并向所述成功执行挂起操作的子区域所属的数据节点发送重映射请求。In the method for ensuring data reliability during the conversion process based on the mixed structure of RAID1 and RAID4, the conversion step includes: performing RAID4 verification calculation on the sub-area that successfully executes the pending operation, and sending the data to the sub-area that successfully executes the pending operation The data node to which it belongs sends a remapping request.

所述的基于RAID1和RAID4混合结构转换过程数据可靠性保证方法,所述挂起步骤还包括:通过多任务并发向对应的数据节点发送子区域的挂起操作。In the method for ensuring data reliability during the conversion process based on the mixed structure of RAID1 and RAID4, the suspending step further includes: sending the suspending operation of the sub-area to the corresponding data node concurrently through multiple tasks.

所述的基于RAID1和RAID4混合结构转换过程数据可靠性保证方法,所述挂起步骤还包括:若此时所述子区域存在写请求正在执行,则等待所有写请求结束后,进入所述转换步骤。In the method for ensuring data reliability in the conversion process based on the mixed structure of RAID1 and RAID4, the suspending step further includes: if there is a write request being executed in the sub-area at this time, wait for all write requests to end, and then enter the conversion process. step.

所述的基于RAID1和RAID4混合结构转换过程数据可靠性保证方法,还包括复位步骤:所述数据节点完成重映射操作后,将所述BRD所在的区域从所述suspend_area链表中移除;将所述数据节点中被挂起的所述子区域的写请求进行下发。The described data reliability guarantee method based on the RAID1 and RAID4 hybrid structure conversion process also includes a reset step: after the data node completes the remapping operation, the area where the BRD is located is removed from the suspend_area linked list; Send the write request of the suspended sub-area in the data node.

本发明还提出一种基于RAID1和RAID4混合结构转换过程数据可靠性保证装置,包括:The present invention also proposes a device for ensuring data reliability during conversion based on the mixed structure of RAID1 and RAID4, including:

挂起模块,用于获取校验节点中BRD的占用空间值,当所述占用空间值达到设定阈值时,将所述BRD所在的区域加入suspend_area列表,将所述区域转换为对应的多个数据节点的子区域,并向所述子区域发送挂起操作请求,所述数据节点收到所述挂起操作请求后,挂起所述子区域的写请求;The suspend module is used to obtain the occupied space value of the BRD in the check node. When the occupied space value reaches the set threshold, the area where the BRD is located is added to the suspend_area list, and the area is converted into a corresponding plurality of a sub-area of the data node, and send a suspend operation request to the sub-area, and after the data node receives the suspend operation request, suspend the write request of the sub-area;

转换模块,用于对所述子区域的数据进行RAID4校验计算,生成的校验数据写入逻辑卷设备,并向所述数据节点发送重映射请求,所述数据节点将所述子区域的所述BRD重映射到ARP,以完成数据转换。A conversion module, configured to perform RAID4 verification calculation on the data in the sub-area, write the generated verification data into a logical volume device, and send a remapping request to the data node, and the data node will convert the data in the sub-area The BRD is remapped to ARP to complete data conversion.

所述的基于RAID1和RAID4混合结构转换过程数据可靠性保证装置,所述挂起模块还包括:通过多任务并发向对应的数据节点发送子区域的挂起操作。In the device for ensuring data reliability during conversion based on the hybrid structure of RAID1 and RAID4, the suspend module further includes: a suspend operation of sending sub-areas to corresponding data nodes concurrently through multiple tasks.

所述的基于RAID1和RAID4混合结构转换过程数据可靠性保证装置,所述转换模块包括:对成功执行挂起操作的子区域进行RAID4校验计算,并向所述成功执行挂起操作的子区域所属的数据节点发送重映射请求。In the data reliability guarantee device for conversion process based on RAID1 and RAID4 hybrid structure, the conversion module includes: performing RAID4 verification calculation on the sub-area that successfully executes the pending operation, and sends the data to the sub-area that successfully executes the pending operation The data node to which it belongs sends a remapping request.

所述的基于RAID1和RAID4混合结构转换过程数据可靠性保证装置,所述挂起模块还包括:若此时所述子区域存在写请求正在执行,则等待所有写请求结束后,执行所述转换模块。In the device for ensuring data reliability during conversion based on the mixed structure of RAID1 and RAID4, the suspending module further includes: if there is a write request being executed in the sub-area at this time, wait for all write requests to be completed before performing the conversion module.

所述的基于RAID1和RAID4混合结构转换过程数据可靠性保证装置,还包括复位模块,用于所述数据节点完成重映射操作后,将所述BRD所在的区域从所述suspend_area链表中移除;将所述数据节点中被挂起的所述子区域的写请求进行下发。The device for ensuring data reliability during the conversion process based on the mixed structure of RAID1 and RAID4 further includes a reset module, which is used to remove the area where the BRD is located from the suspend_area linked list after the data node completes the remapping operation; Sending the write request of the suspended sub-area in the data node.

由以上发明可知,本发明的优点在于:As can be seen from the above invention, the present invention has the advantages of:

本发明降低RAID1向RAID4转换的过程与I/O操作在短时间内互相影响的概率,确保转换流程与写I/O操作的异步化程度;The present invention reduces the probability that the conversion process from RAID1 to RAID4 and the I/O operation interact with each other in a short time, and ensures the asynchronous degree of the conversion process and the write I/O operation;

在不影响写I/O性能的前提下,保证了R1到R4转换的数据一致性。On the premise of not affecting the write I/O performance, the data consistency of the conversion from R1 to R4 is guaranteed.

附图说明Description of drawings

图1为RAID1和RAID4混合结构的冗余组示意图;Fig. 1 is a schematic diagram of a redundancy group of a mixed structure of RAID1 and RAID4;

图2为冗余组内混合RAID的系统结构图;Fig. 2 is a system structural diagram of hybrid RAID in a redundancy group;

图3为数据节点suspend操作流程示意图;Figure 3 is a schematic diagram of the data node suspend operation process;

图4为数据节点resume操作流程示意图;Figure 4 is a schematic diagram of the data node resume operation process;

图5为校验节点发送suspend和resume操作流程示意图;Figure 5 is a schematic diagram of the check node sending suspend and resume operations;

图6为数据节点写请求下发查询流程示意图;FIG. 6 is a schematic diagram of a data node write request sending query process;

图7为转换流程suspend阶段流程示意图;Fig. 7 is a flow diagram of the suspend stage of the conversion process;

图8为转换流程转换阶段流程示意图;Fig. 8 is a schematic diagram of the process flow of the conversion stage of the conversion process;

图9为转换流程resume阶段流程示意图。FIG. 9 is a schematic flow diagram of the resume stage of the conversion process.

具体实施方式Detailed ways

以下为本发明方法的具体步骤,如下所示:The following are the concrete steps of the inventive method, as follows:

挂起步骤,获取校验节点中BRD的占用空间值,当所述占用空间值达到设定阈值时,将所述BRD所在的区域加入suspend_area列表,将所述区域转换为对应的多个数据节点的子区域,并向所述子区域发送挂起操作请求,所述数据节点收到所述挂起操作请求后,挂起所述子区域的写请求;The suspending step is to obtain the occupied space value of the BRD in the check node, and when the occupied space value reaches the set threshold, add the area where the BRD is located to the suspend_area list, and convert the area into corresponding multiple data nodes and send a suspend operation request to the subarea, and after receiving the suspend operation request, the data node suspends the write request of the subarea;

转换步骤,对所述子区域的数据进行RAID4校验计算,生成的校验数据写入逻辑卷设备,并向所述数据节点发送重映射请求,所述数据节点将所述子区域的所述BRD重映射到ARP,以完成数据转换。The conversion step is to perform RAID4 verification calculation on the data of the sub-area, write the generated verification data into a logical volume device, and send a remapping request to the data node, and the data node will transfer the data of the sub-area BRD is remapped to ARP to complete the data conversion.

所述转换步骤包括:对成功执行挂起操作的子区域进行RAID4校验计算,并向所述成功执行挂起操作的子区域所属的数据节点发送重映射请求。The conversion step includes: performing RAID4 check calculation on the sub-area that successfully executes the pending operation, and sending a remapping request to the data node to which the sub-area that successfully executes the pending operation belongs.

所述挂起步骤还包括:通过多任务并发向对应的数据节点发送子区域的挂起操作。The suspending step further includes: concurrently sending the suspending operation of the sub-area to the corresponding data node through multitasking.

所述挂起步骤还包括:若此时所述子区域存在写请求正在执行,则等待所有写请求结束后,进入所述转换步骤。The suspending step further includes: if there are write requests being executed in the sub-area at this time, waiting for all write requests to be completed before entering the conversion step.

还包括复位步骤:所述数据节点完成重映射操作后,将所述BRD所在的区域从所述suspend_area链表中移除;将所述数据节点中被挂起的所述子区域的写请求进行下发。It also includes a reset step: after the data node completes the remapping operation, the area where the BRD is located is removed from the suspend_area linked list; the write request of the suspended sub-area in the data node is downloaded send.

本发明装置包括以下模块,如下所示:The device of the present invention comprises the following modules, as follows:

挂起模块,用于获取校验节点中BRD的占用空间值,当所述占用空间值达到设定阈值时,将所述BRD所在的区域加入suspend_area列表,将所述区域转换为对应的多个数据节点的子区域,并向所述子区域发送挂起操作请求,所述数据节点收到所述挂起操作请求后,挂起所述子区域的写请求;The suspend module is used to obtain the occupied space value of the BRD in the check node. When the occupied space value reaches the set threshold, the area where the BRD is located is added to the suspend_area list, and the area is converted into a corresponding plurality of a sub-area of the data node, and send a suspend operation request to the sub-area, and after the data node receives the suspend operation request, suspend the write request of the sub-area;

转换模块,用于对所述子区域的数据进行RAID4校验计算,生成的校验数据写入逻辑卷设备,并向所述数据节点发送重映射请求,所述数据节点将所述子区域的所述BRD重映射到ARP,以完成数据转换。A conversion module, configured to perform RAID4 verification calculation on the data in the sub-area, write the generated verification data into a logical volume device, and send a remapping request to the data node, and the data node will convert the data in the sub-area The BRD is remapped to ARP to complete data conversion.

所述挂起模块还包括:通过多任务并发向对应的数据节点发送子区域的挂起操作。The suspend module further includes: a suspend operation of concurrently sending sub-areas to corresponding data nodes through multi-task.

所述转换模块包括:对成功执行挂起操作的子区域进行RAID4校验计算,并向所述成功执行挂起操作的子区域所属的数据节点发送重映射请求。The conversion module includes: performing RAID4 verification calculation on the sub-area that successfully executes the pending operation, and sending a remapping request to the data node to which the sub-area that successfully executes the pending operation belongs.

所述挂起模块还包括:若此时所述子区域存在写请求正在执行,则等待所有写请求结束后,进入所述转换步骤。The suspending module further includes: if there is a write request being executed in the sub-area at this time, wait until all write requests are completed, and then enter the conversion step.

复位模块,用于所述数据节点完成重映射操作后,将所述BRD所在的区域从所述suspend_area链表中移除;将所述数据节点中被挂起的所述子区域的写请求进行下发。The reset module is used for removing the area where the BRD is located from the suspend_area linked list after the data node completes the remapping operation; and downloading the write request of the suspended sub-area in the data node send.

下面给出本发明的具体实施例,并结合附图对本发明做出了详细描述。本发明包括系统转换相关结构和转换流程两个部分。Specific embodiments of the present invention are given below, and the present invention is described in detail in conjunction with the accompanying drawings. The present invention includes two parts: system conversion related structure and conversion process.

系统转换相关结构System Conversion Related Structure

首先描述RAID1和RAID4混合结构的组成。如图示1的中间层的虚线框所示,本发明的RAID1和RAID4混合结构的单位是由一组数据结构节点和一个校验节点组成的冗余组。图示2为一个冗余组内的系统结构,冗余组内的数据节点由数据收发程序、缓存单元、映射单元以及实际存储数据的逻辑卷组成;冗余组内的校验节点由数据收发程序、缓存单元、映射单元、Cache模块(高速缓存模块)、Suspend模块、负责组内RAID计算的冗余计算单元和存储校验信息的逻辑卷组成。First, the composition of the RAID1 and RAID4 hybrid structure is described. As shown in the dotted line box of the middle layer in Figure 1, the unit of the RAID1 and RAID4 hybrid structure of the present invention is a redundancy group composed of a group of data structure nodes and a check node. Figure 2 shows the system structure in a redundancy group. The data nodes in the redundancy group are composed of data sending and receiving programs, cache units, mapping units, and logical volumes that actually store data; the check nodes in the redundancy group are composed of data sending and receiving programs. It is composed of program, cache unit, mapping unit, Cache module (high-speed cache module), Suspend module, redundant calculation unit responsible for RAID calculation in the group, and logical volume for storing verification information.

现对系统中主要的组成部分进行解释和说明:Suspend模块(挂起模块):此模块处于校验节点中,记录一个suspend_area列表(挂起区域列表);若该模块将一个区域加入suspend_area列表,则Suspend模块会通过MP(Multi Path,多路径)通知对应区域所在的数据节点挂起此区域的写请求;直到Suspend模块将该区域移除列表,写请求才正常下发(数据节点写请求下发查询流程如图示6)。Now explain and explain the main components of the system: Suspend module (suspend module): This module is in the check node, and records a suspend_area list (suspend area list); if the module adds an area to the suspend_area list, Then the Suspend module will notify the data node where the corresponding area is located through MP (Multi Path, multi-path) to suspend the write request for this area; until the Suspend module removes the area from the list, the write request will be sent normally (data node write request The query process is shown in Figure 6).

映射单元:映射单元包括BRD和ARD设备,映射的数据分别为Mirror(镜像)和R4数据。Mapping unit: The mapping unit includes BRD and ARD devices, and the mapped data are Mirror (mirror) and R4 data respectively.

BRD:Before Raid Device,镜像设备。承载着当前数据,同时也是即将进行RAID转化数据的Prev-RAID一致数据的设备;即BRD维护的映射关系的逻辑卷中对应着还未转成R4的R1数据。BRD: Before Raid Device, mirroring device. The device that carries the current data and is also the Prev-RAID consistent data that is about to undergo RAID conversion; that is, the logical volume of the mapping relationship maintained by BRD corresponds to the R1 data that has not been converted into R4.

ARD:After Raid Device,校验设备。承载着已完成RAID转化的数据的Post-RAID一致数据的设备;即ARD维护的映射关系的逻辑卷中对应着已转成R4的数据。ARD: After Raid Device, verify the device. The device that carries the Post-RAID consistent data of the data that has completed RAID conversion; that is, the logical volume of the mapping relationship maintained by ARD corresponds to the data that has been converted to R4.

以下为本发明的转换流程,如下所示:The conversion process of the present invention is as follows:

转换流程可划分为三个阶段,分别为:Suspend阶段、转换阶段和Resume阶段。The conversion process can be divided into three phases: Suspend phase, conversion phase, and Resume phase.

Suspend阶段(挂起阶段)(如图示7):Suspend phase (suspend phase) (as shown in Figure 7):

校验节点中的Cache模块发现校验节点的BRD占用空间达到一设定阈值(该阈值为综合考虑系统逻辑卷大小、缓存大小等因素决定)以上时,Cache模块通知Suspend模块将该区域加入suspend_area列表,进行suspend操作;When the Cache module in the check node finds that the space occupied by the BRD of the check node exceeds a set threshold (the threshold is determined by comprehensively considering the size of the system logical volume, cache size, etc.), the Cache module notifies the Suspend module to add the area to the suspend_area List, perform suspend operation;

Suspend模块收到suspend操作后,将该区域转换为对应的多个数据节点的子区域,通过MP(Multi Path,多路径,指从suspend模块多路径发至各节点)并发地向对应的数据节点发送子区域的suspend操作;After the Suspend module receives the suspend operation, it converts the area into a sub-area of multiple corresponding data nodes, and sends data to the corresponding data nodes concurrently through MP (Multi Path, multi-path, which refers to multi-path sending from the suspend module to each node) Send the suspend operation of the sub-area;

数据节点收到suspend操作后,挂起后续对应区域(所述子区域)的写请求,若此时该区域有写请求正在执行,则等待所有写请求结束后,进入转换阶段;After the data node receives the suspend operation, it suspends the write request of the subsequent corresponding area (the sub-area). If there is a write request in the area at this time, it will wait for all the write requests to end and enter the conversion stage;

转换阶段(如图示8):Conversion stage (as shown in Figure 8):

Cache模块对执行suspend成功的区域的数据进行RAID4校验计算;The Cache module performs RAID4 verification calculation on the data in the area where the suspend is successfully executed;

校验计算完毕后,生成的校验数据写入逻辑卷设备,并且通过WR(Write Remap,将BRD中数据重映射至ARD中)通知每一个数据节点对参与计算的数据区域进行重映射操作;After the verification calculation is completed, the generated verification data is written to the logical volume device, and each data node is notified to remap the data area participating in the calculation through WR (Write Remap, which remaps the data in the BRD to the ARD);

数据节点收到WR发来的重映射操作后,将对应区域的数据从BRP重映射到ARP,所有数据节点完成重映射操作后,转换阶段结束;After the data node receives the remapping operation sent by WR, it remaps the data in the corresponding area from BRP to ARP. After all data nodes complete the remapping operation, the conversion phase ends;

Resume阶段(复位阶段)(如图示9):Resume phase (reset phase) (as shown in Figure 9):

所有数据节点完成重映射操作后,Cache模块通知Suspend模块将该区域从suspend_area链表中移除;After all data nodes complete the remapping operation, the Cache module notifies the Suspend module to remove the area from the suspend_area linked list;

Suspend模块通过MP通知各个数据节点对原对应的suspend_area进行resume操作;The Suspend module notifies each data node to resume the original corresponding suspend_area through MP;

完成resume后,将数据节点中挂起的对应区域的写请求下发,转换流程结束。After the resume is completed, the write request for the corresponding area suspended in the data node is issued, and the conversion process ends.

Claims (10)

1. one kind is based on RAID1 and RAID4 mixed structure transfer process data reliability ensuring methods, it is characterised in that including:
Step is hung up, the space-consuming value of mirroring device BRD in check-node is obtained, when the space-consuming value reaches setting threshold During value, the region where the mirroring device BRD is added and hangs up zone list suspend_area, the region is converted to The subregion of corresponding multiple back end, and pending operation request is sent to the subregion, the back end receives institute After stating pending operation request, the write request of the subregion is hung up;
Switch process, RAID4 verification calculating is carried out to the data of the subregion, the verification data write-in logical volume of generation is set It is standby, and sent to the back end and remap request, the back end is heavy by the mirroring device BRD of the subregion Calibration equipment ARP is mapped to, to complete data conversion.
2. RAID1 and RAID4 mixed structure transfer process data reliability ensuring methods are based on as claimed in claim 1, its It is characterised by, the switch process includes:RAID4 verifications are carried out to the subregion of successful execution pending operation to calculate, and to institute State the transmission of the back end belonging to the subregion of successful execution pending operation and remap request.
3. RAID1 and RAID4 mixed structure transfer process data reliability ensuring methods are based on as claimed in claim 1, its It is characterised by, the hang-up step also includes:The hang-up that subregion is sent to corresponding back end by Multi-task Concurrency is grasped Make.
4. RAID1 and RAID4 mixed structure transfer process data reliability ensuring methods are based on as claimed in claim 1, its It is characterised by, the hang-up step also includes:If now the subregion has write request and is carrying out, wait all write please After asking end, into the switch process.
5. RAID1 and RAID4 mixed structure transfer process data reliability ensuring methods are based on as claimed in claim 1, its It is characterised by, in addition to reset process:After the back end completes remapping operation, by where the mirroring device BRD Region removes from the suspend_area lists;The write request for the subregion being suspended in the back end is entered Row issues.
6. one kind ensures device based on RAID1 and RAID4 mixed structure transfer processes data reliability, it is characterised in that including:
Suspension module, for obtaining the space-consuming value of mirroring device BRD in check-node, set when the space-consuming value reaches When determining threshold value, the region where the mirroring device BRD is added and hangs up zone list suspend_area, the region is turned The subregion of multiple back end corresponding to being changed to, and pending operation request is sent to the subregion, the data section is checked and accepted After being asked to the pending operation, the write request of the subregion is hung up;
Modular converter, calculated for carrying out RAID4 verifications to the data of the subregion, the verification data write-in logical volume of generation Equipment, and sent to the back end and remap request, the back end is by the mirroring device BRD of the subregion Calibration equipment ARP is remapped to, to complete data conversion.
7. device is ensured based on RAID1 and RAID4 mixed structure transfer processes data reliability as claimed in claim 6, its It is characterised by, the suspension module also includes:The hang-up that subregion is sent to corresponding back end by Multi-task Concurrency is grasped Make.
8. device is ensured based on RAID1 and RAID4 mixed structure transfer processes data reliability as claimed in claim 6, its It is characterised by, the modular converter includes:RAID4 verifications are carried out to the subregion of successful execution pending operation to calculate, and to institute State the transmission of the back end belonging to the subregion of successful execution pending operation and remap request.
9. device is ensured based on RAID1 and RAID4 mixed structure transfer processes data reliability as claimed in claim 6, its It is characterised by, the suspension module also includes:If now the subregion has write request and is carrying out, wait all write please After asking end, the modular converter is performed.
10. device is ensured based on RAID1 and RAID4 mixed structure transfer processes data reliability as claimed in claim 6, its It is characterised by, in addition to reseting module, after completing remapping operation for the back end, by the mirroring device BRD institutes Region removed from the suspend_area lists;Please by writing for the subregion being suspended in the back end Ask and issued.
CN201510390976.9A 2015-07-06 2015-07-06 Based on RAID1 and RAID4 mixed structures transfer process data reliability ensuring method and device Expired - Fee Related CN105094696B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510390976.9A CN105094696B (en) 2015-07-06 2015-07-06 Based on RAID1 and RAID4 mixed structures transfer process data reliability ensuring method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510390976.9A CN105094696B (en) 2015-07-06 2015-07-06 Based on RAID1 and RAID4 mixed structures transfer process data reliability ensuring method and device

Publications (2)

Publication Number Publication Date
CN105094696A CN105094696A (en) 2015-11-25
CN105094696B true CN105094696B (en) 2018-02-06

Family

ID=54575256

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510390976.9A Expired - Fee Related CN105094696B (en) 2015-07-06 2015-07-06 Based on RAID1 and RAID4 mixed structures transfer process data reliability ensuring method and device

Country Status (1)

Country Link
CN (1) CN105094696B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105843560B (en) * 2016-04-08 2018-08-10 中国科学院计算技术研究所 A kind of method and system for safeguarding the soft Raid6 consistency of master-slave mode dual control
CN106027638B (en) * 2016-05-18 2019-04-12 华中科技大学 A kind of hadoop data distributing method based on hybrid coding
CN109032513B (en) * 2018-07-16 2020-08-25 山东大学 RAID (redundant array of independent disks) architecture based on SSD (solid State disk) and HDD (hard disk drive) and backup and reconstruction methods thereof
US10691353B1 (en) 2018-11-30 2020-06-23 International Business Machines Corporation Checking of data difference for writes performed via a bus interface to a dual-server storage controller
CN113420341B (en) * 2021-06-11 2023-08-25 联芸科技(杭州)股份有限公司 Data protection method, data protection equipment and computer system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103488432A (en) * 2013-09-16 2014-01-01 哈尔滨工程大学 Hybrid disk array, deferred write verification method for hybrid disk array, and data recovery method for hybrid disk array
CN103761058A (en) * 2014-01-23 2014-04-30 天津中科蓝鲸信息技术有限公司 RAID1 and RAID4 hybrid structure network storage system and method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9032146B2 (en) * 2010-11-30 2015-05-12 Lenovo Enterprise Solutions (Singapore) Pte. Ltd. Dynamic use of raid levels responsive to workload requirements

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103488432A (en) * 2013-09-16 2014-01-01 哈尔滨工程大学 Hybrid disk array, deferred write verification method for hybrid disk array, and data recovery method for hybrid disk array
CN103761058A (en) * 2014-01-23 2014-04-30 天津中科蓝鲸信息技术有限公司 RAID1 and RAID4 hybrid structure network storage system and method

Also Published As

Publication number Publication date
CN105094696A (en) 2015-11-25

Similar Documents

Publication Publication Date Title
CN105094696B (en) Based on RAID1 and RAID4 mixed structures transfer process data reliability ensuring method and device
CN101622606B (en) Apparatus, system and method for solid-state memory as cache memory for high-capacity, non-volatile memory
CN101689131B (en) Apparatus, system, and method for a shared, front-end, distributed raid
US10042869B1 (en) Method for data transfer between compute clusters and file system
US20220137835A1 (en) Systems and methods for parity-based failure protection for storage devices
US9916241B2 (en) Storage controller caching using symmetric storage class memory devices
US20220004320A1 (en) Sharing memory resources between asynchronous replication workloads
CN104407933B (en) A kind of backup method and device of data
CN104049918B (en) A kind of buffer memory management method of dual control storage server
US20210064234A1 (en) Systems, devices, and methods for implementing in-memory computing
US8650471B2 (en) System and method for look-aside parity based raid
JP2020161103A (en) Storage system and data transfer method
CN106066890B (en) Distributed high-performance database all-in-one machine system
CN104503923B (en) A kind of asymmetric disk array cache dispatching method
US10310995B1 (en) Arbitration control system and method for storage systems
CN105940386A (en) Move data between storage
JP6652647B2 (en) Storage system
US20170315725A1 (en) Changing Storage Volume Ownership Using Cache Memory
CN119088618A (en) Method and system for recovery when a first server is inaccessible
US10176103B1 (en) Systems, devices and methods using a solid state device as a caching medium with a cache replacement algorithm
CN105468297A (en) Quick synchronization method for master and slave device data in cloud storage system
CN100527744C (en) Intelligent network disc storage system and its realizing method
US10210060B2 (en) Online NVM format upgrade in a data storage system operating with active and standby memory controllers
US20050203974A1 (en) Checkpoint methods and systems utilizing non-disk persistent memory
US9348525B2 (en) Systems and methods for a storage array-managed initiator cache

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20180206