[go: up one dir, main page]

CN100347655C - Data storage system and data storage control device - Google Patents

Data storage system and data storage control device Download PDF

Info

Publication number
CN100347655C
CN100347655C CNB2005100796426A CN200510079642A CN100347655C CN 100347655 C CN100347655 C CN 100347655C CN B2005100796426 A CNB2005100796426 A CN B2005100796426A CN 200510079642 A CN200510079642 A CN 200510079642A CN 100347655 C CN100347655 C CN 100347655C
Authority
CN
China
Prior art keywords
data
unit
control
interface unit
disk
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB2005100796426A
Other languages
Chinese (zh)
Other versions
CN1782978A (en
Inventor
小原成介
增山和则
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Publication of CN1782978A publication Critical patent/CN1782978A/en
Application granted granted Critical
Publication of CN100347655C publication Critical patent/CN100347655C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Memory System Of A Hierarchy Structure (AREA)

Abstract

数据存储系统和数据存储控制装置。存储系统具有用于控制多个存储装置的多个控制模块,这使得安装更加容易,同时保持了低等待时间响应,即使控制模块的数量增加了也是如此。多个存储装置使用后端路由器与各个控制模块的第二接口相连接,从而保持了所有控制模块对所有存储装置进行存取的冗余度。而且控制模块与第一切换单元是通过串行总线进行连接的,这种总线具有小的信号数量,通过使用背板构成了接口。由此,在印刷电路板上进行安装成为了可能。

Figure 200510079642

Data storage system and data storage control device. Storage systems having multiple control modules for controlling multiple storage devices make installation easier while maintaining low latency response even as the number of control modules increases. Multiple storage devices are connected to the second interfaces of each control module by using a back-end router, thereby maintaining redundancy for all control modules to access all storage devices. Moreover, the control module and the first switching unit are connected through a serial bus, which has a small number of signals, and an interface is formed by using a backplane. Thus, mounting on a printed circuit board becomes possible.

Figure 200510079642

Description

数据存储系统和数据存储控制装置Data storage system and data storage control device

技术领域technical field

本发明涉及一种数据存储系统的构造和一种数据存储控制装置,它们用于计算机的外部存储装置,并且更加具体地讲,涉及一种数据存储系统和一种数据存储控制装置,具有能够以高性能和高灵活性构筑连接多个盘装置的数据存储系统的单元的组合和连接。The present invention relates to a configuration of a data storage system and a data storage control device, which are used in an external storage device of a computer, and more particularly, to a data storage system and a data storage control device, having the ability to Combination and connection of units constructing a data storage system connecting multiple disk devices with high performance and high flexibility.

背景技术Background technique

近来,随着各种数据得到了计算机化并且在计算机上进行处理,独立于执行数据处理的主计算机的、能够有效存储大量具有高可靠性的用于处理的数据的数据存储装置(外部存储装置)变得越来越重要了。Recently, as various data are computerized and processed on computers, data storage devices (external storage devices) capable of efficiently storing a large amount of data for processing with high reliability independently of a host computer performing data processing ) becomes more and more important.

这种数据存储装置使用了具有很多盘装置(例如,磁盘和光盘)的盘阵列装置和用于控制这很多个盘装置的盘控制器。这种盘阵列装置能够同时接收来自多个主计算机的盘存取请求,并且能够控制很多盘。Such a data storage device uses a disk array device having many disk devices such as magnetic disks and optical disks and a disk controller for controlling the many disk devices. Such a disk array device can simultaneously receive disk access requests from a plurality of host computers, and can control many disks.

最近,给出了能够控制具有数千乃至更多盘装置的盘装置组的盘阵列装置,这些盘装置自身具有数百吉字节(terabyte)。Recently, a disk array device capable of controlling a group of disk devices having several thousand or more disk devices each having hundreds of terabytes in itself has been proposed.

这样的盘阵列装置封装着存储器,该存储器起到了盘的缓冲的一部分的作用。由此,能够减少接收到来自主计算机的读取请求或写入请求时的数据存取时间,并且能够实现更高的性能。Such a disk array device encapsulates a memory that functions as a part of the disk buffer. Accordingly, it is possible to reduce the data access time when receiving a read request or a write request from the host computer, and achieve higher performance.

一般来说,盘阵列装置由以下多个主要单元组成,即:通道适配器,是与主计算机的连接部分;盘适配器,是与盘驱动器的连接部分;缓冲存储器;缓存控制单元,管理缓冲存储器;和很多盘驱动器。Generally speaking, a disk array device is composed of the following main units, namely: a channel adapter, which is the connection part with the host computer; a disk adapter, which is the connection part with the disk drive; a buffer memory; a cache control unit, which manages the buffer memory; and many disk drives.

附图11是示出了第一种现有技术的示意图。附图11中所示的盘阵列装置102具有两个缓存管理器(缓冲存储器和缓存控制单元)10,并且通道适配器11和盘适配器13与各个缓存管理器10相连接。Figure 11 is a schematic diagram showing the first prior art. The disk array device 102 shown in FIG.

这两个缓存管理器10经总线10c直接连接起来,从而使通信成为可能。这两个缓存管理器10和10、缓存管理器10与通道适配器11以及缓存管理器10与盘适配器13分别经PCI总线相连接,这是因为需要低的等待时间。These two cache managers 10 are directly connected via a bus 10c, thereby enabling communication. The two cache managers 10 and 10, the cache manager 10 and the channel adapter 11, and the cache manager 10 and the disk adapter 13 are respectively connected via a PCI bus because low latency is required.

通道适配器11借助例如光纤通道(Fibre Channel)或以太网(Ethernet)与主计算机(未示出)相连接,而盘适配器13借助例如光纤通道的缆线与盘封装组12的各个驱动器相连接。The channel adapter 11 is connected with a host computer (not shown) by such as Fiber Channel (Fibre Channel) or Ethernet (Ethernet ) ), and the disk adapter 13 is connected with each drive of the disk package group 12 by cables such as Fiber Channel. connect.

盘封装组12具有两个端口(例如,光纤通道端口),并且这两个端口与不同的盘适配器13相连接。这提供了冗余度,这种冗余度增加了抗故障能力。The disk packaging group 12 has two ports (eg, Fiber Channel ports), and the two ports are connected to different disk adapters 13 . This provides redundancy which increases failure resistance.

附图12是示出了根据第二种现有技术的盘阵列装置100的框图。如附图12所示,传统的盘阵列装置100具有:缓存管理器(图中由CM表示)10,该缓存管理器10由缓冲存储器和作为主要单元的缓存控制单元构成;通道适配器(图中由CA表示)11,是与主计算机(未示出)的接口;盘封装组12,由多个盘驱动器构成;和盘适配器(图中由DA表示)13,是与该盘驱动器12的接口。FIG. 12 is a block diagram showing a disk array device 100 according to a second prior art. As shown in accompanying drawing 12, traditional disk array device 100 has: cache manager (represented by CM in the figure) 10, and this cache manager 10 is made up of cache memory and the cache control unit as main unit; Denoted by CA) 11, which is an interface with a host computer (not shown); a disk package group 12, which is composed of a plurality of disk drives; and a disk adapter (denoted by DA in the figure) 13, which is an interface with this disk drive 12 .

盘阵列装置此外还具有:路由器(图中由RT表示)14,用于相互连接缓冲存储器10;通道适配器11和盘适配器13,用于进行这些主要单元之间的数据传送和通信。The disk array device also has: a router (represented by RT in the figure) 14 for interconnecting the buffer memories 10; a channel adapter 11 and a disk adapter 13 for data transfer and communication between these main units.

该盘阵列装置100包括四个缓存管理器10和与四个与这些缓存管理器10相对应的路由器14。这些缓存管理器10与路由器14一对一相互连接,因此多个缓存管理器10之间的连接是冗余的,并且可存取性得到了提高(例如,日本专利申请特开No.2001-256003)。This disk array device 100 includes four cache managers 10 and four routers 14 corresponding to these cache managers 10 . These cache managers 10 and routers 14 are connected to each other one-to-one, so the connection between a plurality of cache managers 10 is redundant, and accessibility is improved (for example, Japanese Patent Application Laid-Open No. 2001- 256003).

换句话说,即使一个路由器14发生了故障,也能够借助另一个路由器14确保多个缓存管理器10之间的连接,并且即使在这种情况下,盘阵列装置100也能够继续进行正常的操作。In other words, even if one router 14 fails, the connection between the plurality of cache managers 10 can be ensured by means of another router 14, and even in this case, the disk array device 100 can continue normal operation .

在这种盘阵列装置100中,各个路由器14与两个通道适配器11和两个盘适配器13相连接,并且盘阵列装置100包括总数为八的通道适配器和总数为八的盘适配器13。In this disk array device 100 , each router 14 is connected to two channel adapters 11 and two disk adapters 13 , and the disk array device 100 includes a total of eight channel adapters and a total of eight disk adapters 13 .

通过缓存管理器10和路由器14的相互连接,这些通道适配器11和盘适配器13能够与所有的缓存管理器10进行通信。These channel adapters 11 and disk adapters 13 are able to communicate with all cache managers 10 through the interconnection of cache managers 10 and routers 14 .

通道适配器11借助光纤通道(Fibre Channel)或以太网(Ethernet)与处理数据的主计算机相连接,并且盘适配器13通过(例如)光纤通道与盘封装组12(具体来说是盘驱动器)相连接。The channel adapter 11 is connected with the host computer processing data by means of Fiber Channel (Fibre Channel) or Ethernet ( Ethernet ), and the disk adapter 13 is connected with the disk packaging group 12 (specifically disk drive) through (for example) Fiber Channel connected.

此外,在通道适配器11与缓存管理器10之间以及盘适配器13与缓存管理器10之间,不仅要交换来自主计算机的用户数据,而且要交换用来保持盘阵列装置100的内部操作(例如,多个缓冲存储器之间数据的镜像处理)的一致性的各种信息。In addition, between the channel adapter 11 and the cache manager 10 and between the disk adapter 13 and the cache manager 10, not only user data from the host computer but also internal operations for maintaining the disk array device 100 (such as , various information about the consistency of data mirroring between multiple buffer memories).

缓存管理器10、通道适配器11和盘适配器13通过接口与路由器14相连接,能够实现比盘阵列装置100与主计算机或盘阵列装置100与盘驱动器之间的通信更低的等待时间。例如,缓存管理器10、通道适配器11和盘适配器13通过设计为用于连接LSI(大规模集成电路)与印刷电路板的总线(比如PCI(外部部件互连)总线)与路由器14相连接。Cache manager 10, channel adapter 11, and disk adapter 13 are connected to router 14 through interfaces, which can achieve lower latency than communication between disk array device 100 and host computer or disk array device 100 and disk drives. For example, cache manager 10, channel adapter 11, and disk adapter 13 are connected to router 14 through a bus designed to connect LSI (Large Scale Integration) and printed circuit boards, such as a PCI (Peripheral Component Interconnect) bus.

用于装纳盘驱动器的盘封装组12具有两个光纤通道端口,这两个端口分别与属于不同路由器14的盘适配器13连接。由此,即使在盘适配器13或路由器14发生故障时,也能够防止与缓存管理器10的连接断开。A disk package group 12 for housing disk drives has two fiber channel ports, and these two ports are respectively connected to disk adapters 13 belonging to different routers 14 . Thus, even when a failure occurs in the disk adapter 13 or the router 14 , it is possible to prevent disconnection of the connection with the cache manager 10 .

由于近来计算机化的发展,存在对更大容量和更快速度的数据存储系统的需求。在上面提到的第一种现有技术的盘阵列装置的情况下,如果对缓存管理器10、通道适配器11和盘适配器13进行扩充来增大容量和速度,则必须增加盘封装组12的端口数量并且必须增加盘适配器13与盘封装组12之间的连接缆线数量。Due to recent advances in computerization, there is a need for greater capacity and faster data storage systems. In the case of the disk array device of the first kind of prior art mentioned above, if the buffer manager 10, the channel adapter 11 and the disk adapter 13 are expanded to increase capacity and speed, the number of disk packaging groups 12 must be increased. The number of ports and the number of connecting cables between the disk adapter 13 and the disk packaging group 12 must be increased.

增加盘封装组12的端口数量会与要连接到一个盘封装组上的盘适配器的数量相应地增加缆线的数量,这增加了安装空间。这意味着装置的大小会增加。由于对于一个盘封装组来说,只有存在两个路径系统,才能够实现足够的冗余结构,因此增加端口的数量也不是个好主意。而且要连接的盘适配器的数量也不是恒定的,而是会依照用户的需求而改变,所以如果扩充了很多端口,而使用少量的盘适配器,就会造成浪费,但是如果扩充了少量端口,又不能支持很多盘适配器。换句话说就是,丧失了灵活性。Increasing the number of ports of the disc pack 12 increases the number of cables corresponding to the number of disc adapters to be connected to one disc pack, which increases the installation space. This means that the size of the device will increase. It is also not a good idea to increase the number of ports since there is only two path systems for a disk package group to achieve sufficient redundancy. Moreover, the number of disk adapters to be connected is not constant, but will change according to user needs, so if a lot of ports are expanded and a small number of disk adapters are used, it will cause waste, but if a small number of ports are expanded, it will be wasteful. Cannot support many disk adapters. In other words, flexibility is lost.

另一方面,在第二种现有技术的盘阵列装置的情况下,扩充缓存管理器10、通道适配器11和盘适配器13是可行的,但是所有的通信都是通过路由器14进行的,所以通信数据都会汇集到路由器14中,它变成了流量瓶颈,因此不能指望会有高的流量。而且在盘阵列装置100的情况下,如果构造具有很多主要单元的大规模盘阵列装置,则缓存管理器10与路由器14之间的连线数量会急剧增加,这会造成连接关系复杂并且安装在物理上会变得很困难。On the other hand, in the case of the disk array device of the second prior art, it is feasible to expand the cache manager 10, the channel adapter 11, and the disk adapter 13, but all communication is performed through the router 14, so the communication Data is all funneled into router 14, which becomes a traffic bottleneck, so high traffic cannot be expected. And in the case of the disk array device 100, if a large-scale disk array device with many main units is constructed, the number of connections between the cache manager 10 and the router 14 will increase sharply, which will cause complicated connection relationships and make it difficult to install Physically it can become difficult.

例如,在附图12所示的结构的情况下,四个(四片)缓存管理器10与四个路由器14通过背板15连接起来,如附图13所示。在这种情况下,如附图12所示,信号的数量是(4×4×(每路径信号线的数量))。例如,如果一条路径是借助64位PCI(并行路径)连接的,则包括控制线,背板15上的信号线数量就是100×16=1600。要形成这些信号线,背板15上的印刷电路板需要六个信号层。For example, in the case of the structure shown in FIG. 12 , four (four slices) cache managers 10 and four routers 14 are connected through a backplane 15 , as shown in FIG. 13 . In this case, as shown in FIG. 12, the number of signals is (4×4×(number of signal lines per path)). For example, if one path is connected by means of 64-bit PCI (parallel path), the number of signal lines on the backplane 15 is 100×16=1600 including control lines. To form these signal lines, the printed circuit board on the backplane 15 requires six signal layers.

在大规模构造的情况下,比如通过背板15连接八个(四片)缓存管理器10和八个(四片)路由器14的情况下的构造,所需的信号线数量大约为100×8×8=6400。因此背板15的印刷电路板需要24层,这是上述情况的四倍,实现这样的构造是很困难的。In the case of a large-scale configuration, such as a configuration where eight (four-chip) cache managers 10 and eight (four-chip) routers 14 are connected through the backplane 15, the required number of signal lines is about 100×8 ×8=6400. Therefore, the printed circuit board of the backplane 15 needs 24 layers, which is four times that of the above case, and it is very difficult to realize such a structure.

如果使用信号线少于64位PCI总线的四路PCI-Express总线来进行连接,则信号线的数量是16×8×8=1024。不过,PCI总线以66MHz的频率工作,而PCI-Express总线是2.5Gbps的高速总线,为了维持高速总线的信号质量,必须使用昂贵的基质材料。If four PCI-Express buses with fewer signal lines than the 64-bit PCI bus are used for connection, the number of signal lines is 16×8×8=1024. However, the PCI bus operates at a frequency of 66MHz, while the PCI-Express bus is a 2.5Gbps high-speed bus. In order to maintain the signal quality of the high-speed bus, expensive substrate materials must be used.

如果使用低速总线,可以通过使用通孔来代替接线层,但是在高速总线的情况下,应当避免使用通孔,因为这会降低信号质量。因此在高速总线的情况下,需要布局成使得所有的信号线都不交叉,所以与具有相同数量信号线的低速总线相比,需要大约两倍的信号层。例如,电路板需要12个信号层,并且这些层必须使用昂贵的材料来构造,因此这实现起来也是很困难的。If a low-speed bus is used, the wiring layer can be replaced by using vias, but in the case of a high-speed bus, vias should be avoided because they degrade the signal quality. Therefore in the case of a high-speed bus, it needs to be laid out so that all signal lines do not cross, so about twice as many signal layers are required as compared to a low-speed bus with the same number of signal lines. For example, circuit boards require 12 signal layers, and these layers must be constructed using expensive materials, so this is also difficult to achieve.

而且在第二种现有技术的盘阵列装置100的情况下,如果路由器14之一发生故障,则与这个路由器14相连的通道适配器11和盘适配器13在这个路由器14发生故障的同时也是不能使用的。And in the case of the disk array device 100 of the second prior art, if one of the routers 14 breaks down, the channel adapter 11 and the disk adapter 13 connected with this router 14 also cannot be used when this router 14 breaks down. of.

发明内容Contents of the invention

鉴于前述问题,本发明的目的是提供一种数据存储系统和数据存储控制装置,它们用于实现各单元之间的高流量数据传递,并且能够很容易地实现小规模到大规模的结构,而不会引发安装问题。In view of the aforementioned problems, an object of the present invention is to provide a data storage system and a data storage control device, which are used to realize high-flow data transfer between units, and can easily realize small-scale to large-scale structures, while Does not cause installation problems.

本发明的另一个目的是提供一种数据存储系统和数据存储控制装置,它们具有以相同单元的组合轻松实现小规模到大规模结构的灵活性,同时保持了即使在一个单元故障的情况下也能够操作的冗余度。Another object of the present invention is to provide a data storage system and a data storage control device which have the flexibility to easily realize a small-scale to large-scale structure in combination of the same units while maintaining operational redundancy.

本发明的再一个目的是提供一种数据存储系统和数据存储控制装置,它们用于轻松实现小规模到大规模的结构,而不会引发安装问题,同时保持了高流量和冗余度。Still another object of the present invention is to provide a data storage system and a data storage control device for easily implementing small-scale to large-scale structures without causing installation problems while maintaining high throughput and redundancy.

为了实现这些目的,本发明的数据存储系统具有:多个具有一个或更多个存取端口的存储装置,用于存储数据;和多个控制模块,用于按照来自于主机的存取指令进行所述存储装置的存取控制。并且所述控制模块还包括:缓冲存储器,用于存储一部分存储在所述存储装置中的数据;缓存控制单元,用于控制所述缓冲存储器;第一接口单元,所述第一接口单元与所述缓存控制单元连接,用于控制与所述主机的连接;第二接口单元,所述第二接口单元与所述缓存控制单元连接,用于控制与所述多个存储装置的连接;和多个第一切换单元,用于在所述多个控制模块与所述多个存储装置之间进行连接,并用于选择性地开关各个控制模块的所述第二接口单元和所述多个存储装置,各所述控制模块与所述多个第一切换单元相连,各所述存储装置与一个或更多个不同的所述第一切换单元相连接。并且所述多个控制模块的各所述第二接口单元与所述多个第一切换单元是使用背板通过串行总线相连接的。In order to achieve these objects, the data storage system of the present invention has: a plurality of storage devices with one or more access ports for storing data; and a plurality of control modules for performing Access control of the storage device. And the control module further includes: a buffer memory for storing a part of data stored in the storage device; a buffer control unit for controlling the buffer memory; a first interface unit for communicating with the first interface unit The cache control unit is connected to control the connection with the host; the second interface unit is connected to the cache control unit and is used to control the connection with the plurality of storage devices; and a first switching unit, configured to connect between the plurality of control modules and the plurality of storage devices, and used to selectively switch the second interface unit of each control module and the plurality of storage devices Each of the control modules is connected to the plurality of first switching units, and each of the storage devices is connected to one or more different first switching units. And each of the second interface units of the plurality of control modules is connected to the plurality of first switching units through a serial bus using a backplane.

本发明的数据存储控制装置具有:缓冲存储器,用于存储一部分存储在所述存储装置中的数据;缓存控制单元,用于控制所述缓冲存储器;多个控制模块,具有第一接口单元,所述第一接口单元与所述缓存控制单元连接,用于控制与所述主机的连接,和第二接口单元,所述第一接口单元与所述缓存控制单元连接,用于控制与所述多个存储装置的连接;和多个第一切换单元,设置在所述多个控制模块与所述多个存储装置之间,并用于选择性地开关各个控制模块的所述第二接口单元和所述多个存储装置,各所述控制模块与所述多个第一切换单元相连,各所述存储装置与一个或更多个不同的所述第一切换单元相连接。并且所述多个控制模块的各所述第二接口单元与所述多个第一切换单元是使用背板通过串行总线相连接的。The data storage control device of the present invention has: a buffer memory for storing a part of data stored in the storage device; a buffer control unit for controlling the buffer memory; a plurality of control modules with a first interface unit, the The first interface unit is connected to the cache control unit for controlling the connection with the host, and the second interface unit is connected to the cache control unit for controlling the connection with the multiple the connection of a plurality of storage devices; and a plurality of first switching units, which are arranged between the plurality of control modules and the plurality of storage devices, and are used to selectively switch the second interface unit of each control module and the plurality of storage devices. The plurality of storage devices, each of the control modules is connected to the plurality of first switching units, and each of the storage devices is connected to one or more different first switching units. And each of the second interface units of the plurality of control modules is connected to the plurality of first switching units through a serial bus using a backplane.

在本发明中,优选地,所述缓存控制单元和所述第二接口单元通过具有低等待时间的高速串行总线相连接,并且所述第二接口单元与所述多个第一切换单元使用所述背板通过串行总线相连接。In the present invention, preferably, the cache control unit and the second interface unit are connected through a high-speed serial bus with low latency, and the second interface unit and the plurality of first switching units use The backplanes are connected through a serial bus.

在本发明中,优选地,所述控制模块还包括用于与另外一个所述控制模块进行通信的通信单元,并且还包括第二切换单元,用于选择性地连接各所述控制模块中的通信单元。In the present invention, preferably, the control module further includes a communication unit for communicating with another control module, and also includes a second switching unit for selectively connecting one of the control modules communication unit.

在本发明中,优选地,各个控制模块的通信单元和第二切换单元是使用背板进行连接。In the present invention, preferably, the communication unit and the second switching unit of each control module are connected by using a backplane.

在本发明中,优选地,所述第一切换单元和所述多个存储装置是通过缆线进行连接的。In the present invention, preferably, the first switching unit and the plurality of storage devices are connected through cables.

在本发明中,优选地,所述存储装置还包括多个存取端口,并且其中多个不同的第一切换单元与所述多个存取端口相连接。In the present invention, preferably, the storage device further includes a plurality of access ports, and a plurality of different first switching units are connected to the plurality of access ports.

在本发明中,优选地,所述缓存控制单元和所述第二接口单元是通过多路高速串行总线相连接的,并且所述第二接口单元和所述多个第一切换单元是使用背板通过串行总线相连接的。In the present invention, preferably, the cache control unit and the second interface unit are connected through multiple high-speed serial buses, and the second interface unit and the plurality of first switching units use The backplanes are connected via a serial bus.

在本发明中,优选地,所述高速串行总线是PCI-Express总线。In the present invention, preferably, the high-speed serial bus is a PCI-Express bus.

在本发明中,优选地,所述串行总线是光纤通道。In the present invention, preferably, the serial bus is a fiber channel.

在本发明中,优选地,所述缓存控制单元和所述第一接口单元是通过具有低等待时间的高速串行总线相连接的。In the present invention, preferably, the cache control unit and the first interface unit are connected through a high-speed serial bus with low latency.

在本发明中,各个控制模块的第二接口与多个第一切换单元相连接,所以所有的控制模块都可以维持对所有存储装置进行存取的冗余度,并且即使控制模块的数量增加,控制模块与第一切换单元也使用背板通过串行总线(具有构成接口的少量信号)相连接,所以在印刷电路板上进行安装是可行的。In the present invention, the second interface of each control module is connected to a plurality of first switching units, so all control modules can maintain the redundancy of accessing all storage devices, and even if the number of control modules increases, The control module and the first switching unit are also connected via a serial bus (with a small number of signals constituting the interface) using the backplane, so mounting on a printed circuit board is feasible.

附图说明Description of drawings

附图1是示出了依据本发明的实施例的数据存储系统的框图;Accompanying drawing 1 is a block diagram showing a data storage system according to an embodiment of the present invention;

附图2是示出了附图1中的控制模块的框图;Accompanying drawing 2 is a block diagram showing the control module in accompanying drawing 1;

附图3是示出了附图1和附图2中的后端路由器和盘封装组的框图;Accompanying drawing 3 is a block diagram showing the backend router and disk encapsulation group in accompanying drawing 1 and accompanying drawing 2;

附图4是示出了附图1和附图3中的盘封装组的框图;Accompanying drawing 4 is the block diagram showing the disc packing group in accompanying drawing 1 and accompanying drawing 3;

附图5是示出了附图1和附图2中的结构的读取处理的示意图;Accompanying drawing 5 is the schematic diagram showing the reading processing of the structure in accompanying drawing 1 and accompanying drawing 2;

附图6是示出了附图1和附图2中的结构的写入处理的示意图;Accompanying drawing 6 is the schematic diagram showing the writing process of the structure in accompanying drawing 1 and accompanying drawing 2;

附图7是示出了依据本发明的实施例的控制模块的安装结构的示意图;Accompanying drawing 7 is the schematic diagram showing the installation structure of the control module according to the embodiment of the present invention;

附图8是示出了依据本发明的实施例的数据存储系统的安装结构示例的示意图;Accompanying drawing 8 is the schematic diagram showing the installation structure example of the data storage system according to the embodiment of the present invention;

附图9是示出了依据本发明的实施例的大规模存储系统的框图;Accompanying drawing 9 is a block diagram showing a large-scale storage system according to an embodiment of the present invention;

附图10是示出了依据本发明的另一实施例的中等规模存储系统的框图;Accompanying drawing 10 is a block diagram showing a medium-scale storage system according to another embodiment of the present invention;

附图11是示出了依据第一种现有技术的存储系统的框图;Accompanying drawing 11 is a block diagram showing a storage system according to the first prior art;

附图12是示出了依据第二种现有技术的存储系统的框图;Accompanying drawing 12 is a block diagram showing a storage system according to a second prior art;

附图13是示出了依据附图12中的第二种现有技术的存储系统的安装结构的示意图。Fig. 13 is a schematic diagram showing the installation structure of the storage system according to the second prior art in Fig. 12 .

具体实施方式Detailed ways

现在将按照数据存储系统、读/写处理、安装结构和其它实施例的顺序介绍本发明的实施例。Embodiments of the present invention will now be described in the order of data storage system, read/write processing, mounting structure and other embodiments.

数据存储系统data storage system

附图1是示出了依据本发明的实施例的数据存储系统的框图,附图2是示出了附图1中的控制模块的框图,附图3是示出了附图1中的后端路由器和盘封装组的框图,而附图4是示出了附图1和附图3中的盘封装组的框图。Accompanying drawing 1 is a block diagram showing a data storage system according to an embodiment of the present invention, accompanying drawing 2 is a block diagram showing a control module in accompanying drawing 1, and accompanying drawing 3 is showing a rear view in accompanying drawing 1 Figure 4 is a block diagram showing the disk encapsulation group in Figures 1 and 3.

附图1示出了作为示例具有八个控制模块的大规模存储系统。如附图1所示,存储系统1具有:多个用于保存数据的盘封装组2-0到2-25;多个(在本情况下是八个)设置在未示出的主计算机(数据处理单元)与多个盘封装组2-0到2-25之间的控制模块4-0到4-7;多个(在本情况下是八个)设置在多个控制模块4-0到4-7与多个盘封装组2-0到2-25之间的后端路由器(第一切换单元:在图中标为BRT,此后称其为BRT)5-0到5-7以及多个(在本情况下是两个)前端路由器(第二切换单元:在图中标为FRT,此后称之为FRT)6-0到6-1。Figure 1 shows a mass storage system with eight control modules as an example. As shown in the accompanying drawing 1, the storage system 1 has: a plurality of disk package groups 2-0 to 2-25 for storing data; data processing unit) and the control modules 4-0 to 4-7 between the plurality of disk packaging groups 2-0 to 2-25; To 4-7 and a plurality of disk encapsulation groups 2-0 to 2-25 back-end router (the first switching unit: marked as BRT in the figure, referred to as BRT hereinafter) 5-0 to 5-7 and many There are (two in this case) front-end routers (second switching unit: marked as FRT in the figure, hereinafter referred to as FRT) 6-0 to 6-1.

控制模块4-0到4-7中的每一个具有缓存管理器40、通道适配器(第一接口单元:在图中标为CA)41a到41d、盘适配器(第二接口单元:在图中标为DA)42a和42b以及DMA(直接存储器存取)引擎(通信单元:在图中标为DMA)43。Each of the control modules 4-0 to 4-7 has a cache manager 40, channel adapters (first interface unit: labeled CA in the figure) 41a to 41d, disk adapters (second interface unit: labeled DA in the figure ) 42a and 42b and a DMA (Direct Memory Access) engine (communication unit: marked as DMA in the figure) 43.

在附图1中,为了简化作图,仅对控制模块4-0标出了附图标记缓存管理器“40”,通道适配器“41a”、“41b”、“41c”和“41d”,盘适配器的“42a”和“42b”以及DMA的“43”,并且省略了其它控制模块4-1到4-7中的组成元件的这些附图标记。In the accompanying drawing 1, in order to simplify the drawing, only the control module 4-0 is marked with reference numerals buffer manager "40", channel adapters "41a", "41b", "41c" and "41d", disk "42a" and "42b" of the adapter and "43" of the DMA, and these reference numerals of the constituent elements in the other control modules 4-1 to 4-7 are omitted.

现在将参照附图2介绍控制模块4-0到4-7。缓存管理器40根据来自主计算机的处理请求(读请求或写请求)进行读/写处理,并且该缓存控制器40具有缓冲存储器40b和缓存控制单元40a。The control modules 4-0 to 4-7 will now be described with reference to FIG. 2 . The cache manager 40 performs read/write processing according to a processing request (read request or write request) from the host computer, and the cache controller 40 has a cache memory 40b and a cache control unit 40a.

缓冲存储器40b保存盘封装组2-0到2-25的多个盘中存储的数据的一部分,就是说,它起多个盘的缓存器的作用。The buffer memory 40b holds a part of the data stored in the plurality of disks of the disk package groups 2-0 to 2-25, that is, it functions as a buffer for the plurality of disks.

缓存控制单元40a控制缓冲存储器40b、通道适配器41、装置适配器42和DMA 43。为此,缓存控制单元40a具有一个或多个(在附图2中是2个)CPU 400和410以及存储器控制器420。存储器控制器420控制各个存储器的读/写,并且开关路径。The buffer control unit 40a controls the buffer memory 40b, the channel adapter 41, the device adapter 42, and the DMA 43. To this end, the cache control unit 40a has one or more (two in FIG. 2 ) CPUs 400 and 410 and a memory controller 420. The memory controller 420 controls read/write of each memory, and switches paths.

经存储器总线434与缓冲存储器40b相连接的存储器控制器420经CPU总线430和432与CPU 400和410相连接,并且还经后面将要提到的四路高速串行总线(例如,PCI-Express)440和442与盘适配器42a和42b相连接。同样,存储器控制器420经四路高速串行总线(例如,PCI-Express)443、444、445和446与通道适配器41a、41b、41c和41d相连接,并且经四路高速串行总线(例如,PCI-Express)447和448与DMA 43-a到43-b相连接。The memory controller 420 that is connected with the buffer memory 40b through the memory bus 434 is connected with the CPUs 400 and 410 through the CPU buses 430 and 432, and also through the four-way high-speed serial bus (for example, PCI-Express) that will be mentioned later. 440 and 442 are connected to disk adapters 42a and 42b. Likewise, the memory controller 420 is connected to the channel adapters 41a, 41b, 41c and 41d via four high-speed serial buses (for example, PCI-Express) 443, 444, 445 and 446, and is connected to the channel adapters 41a, 41b, 41c and 41d via four high-speed serial buses (for example, , PCI-Express) 447 and 448 are connected with DMA 43-a to 43-b.

后面将要介绍,这种高速总线(比如PCI-Express)以分组方式进行通信,并且通过设置多路串行总线,使得以很快的响应速度进行的通信成为可能,这样进行的通信具有很小的延迟,即具有很低的等待时间,即使在减少了信号线的数量的情况下也是如此。It will be introduced later that this high-speed bus (such as PCI-Express) communicates in groups, and by setting multiple serial buses, it becomes possible to communicate with a very fast response speed, and the communication carried out like this has a small Latency, i.e. having very low latency, even with reduced number of signal lines.

通道适配器41a到41d是用于主计算机的接口,并且通道适配器41a到41d分别与不同的主计算机相连接。优选地,借助总线(比如光纤通道和以太网(Ethernet))将通道适配器41a到41d分别与相应主计算机的接口单元相连接,并且在这种情况下,使用了光纤或同轴电缆作为总线。The channel adapters 41a to 41d are interfaces for host computers, and the channel adapters 41a to 41d are respectively connected to different host computers. Preferably, the channel adapters 41a to 41d are respectively connected to the interface units of the corresponding host computers by means of buses such as Fiber Channel and Ethernet( R) (Ethernet (R )), and in this case, optical fiber or coaxial cables are used as bus.

这些通道适配器41a到41d中的每一个构成为各个控制模块4-0到4-7的一部分,但是,作为相应主计算机与控制模块4-0到4-7之间的接口,必须要支持多种协议。由于所要安装的协议依相应的主计算机而不同,将作为控制模块4-0到4-7的主要单元的缓存管理器40安装在不同的印刷电路板上,如后面在附图7中介绍的那样,从而在需要的时候能够很容易地替换各个通道适配器41a到41d。Each of these channel adapters 41a to 41d constitutes a part of the respective control modules 4-0 to 4-7, but must support multiple kind of agreement. Since the protocol to be installed is different depending on the corresponding host computer, the buffer manager 40, which is the main unit of the control modules 4-0 to 4-7, is installed on different printed circuit boards, as described later in FIG. 7 In that way, the respective channel adapters 41a to 41d can be easily replaced when necessary.

通道适配器41a到41d应当支持的与主计算机的协议的示例是相应于上面提到的光纤通道和以太网的iSCSI(因特网小计算机系统接口)。各个通道适配器41a到41d经设计用于连接LSI(大规模集成电路)与印刷电路板的总线(比如前面提到的PCI-Express总线)直接与缓存管理器40连接。由此,能够实现各通道适配器41a到41d与缓存管理器40之间要求的高流量。Examples of protocols with the host computer that the channel adapters 41a to 41d should support are iSCSI (Internet Small Computer System Interface) corresponding to Fiber Channel and Ethernet (R ) mentioned above. The respective channel adapters 41 a to 41 d are directly connected to the cache manager 40 via a bus designed to connect LSI (Large Scale Integration) with a printed circuit board (such as the aforementioned PCI-Express bus). Thereby, the high throughput required between the respective channel adapters 41a to 41d and the cache manager 40 can be achieved.

盘适配器42a和42b是盘封装组2-0到2-25到盘驱动器的接口,并且与连接在盘封装组2-0到2-25上的BRT 5-0到5-7相连接,为了进行这一连接,使用了四个FC(光纤通道(Fibre Channel)))端口。各个盘适配器42a和42b通过上面提到的设计用于连接LSI(大规模集成电路)与印刷电路板的总线(比如PCI-Express)直接与缓存管理器40相连接。由此,能够实现各个盘适配器42a和42b与缓存管理器40之间要求的高流量。The disc adapters 42a and 42b are the interfaces of the disc encapsulation groups 2-0 to 2-25 to disc drives, and are connected to the BRTs 5-0 to 5-7 connected to the disc encapsulation groups 2-0 to 2-25, for For this connection, four FC (Fibre Channel)) ports are used. The respective disk adapters 42a and 42b are directly connected to the cache manager 40 through the above-mentioned bus (such as PCI-Express) designed to connect LSI (Large Scale Integration) with a printed circuit board. Thus, the high traffic required between the respective disk adapters 42a and 42b and the cache manager 40 can be achieved.

如附图1和附图3所示,BRT 5-0到5-7是多端口开关,它们选择性地开关并可通信地连接各个控制模块4-0到4-7的盘适配器42a和42b与各个盘封装组2-0到2-25。As shown in Figures 1 and 3, the BRTs 5-0 through 5-7 are multi-port switches that selectively switch and communicatively connect the disk adapters 42a and 42b of the respective control modules 4-0 through 4-7 Package groups 2-0 to 2-25 with individual discs.

如附图3所示,多个(在本情况下是两个)BRT 5-0到5-1与各个盘封装组2-0到2-7相连接。如附图4所示,各个盘封装组2-0具有多个盘驱动器200,这些盘驱动器200分别具有两个端口,并且这个盘封装组2-0还具有单位盘封装组20-0到23-0,这些单位盘封装组具有四个连接端口210、212、214和216。这些单位盘封装组是串联连接的,以便实现容量的增大。As shown in FIG. 3, a plurality (two in this case) of BRTs 5-0 to 5-1 are connected to respective disk package groups 2-0 to 2-7. As shown in FIG. 4 , each disk package group 2-0 has a plurality of disk drives 200 each having two ports, and this disk package group 2-0 also has unit disk package groups 20-0 to 23 -0, these unit disk package groups have four connection ports 210 , 212 , 214 and 216 . These unit disk package groups are connected in series in order to achieve an increase in capacity.

在盘封装组20-0到23-0中,各个盘驱动器200的各个端口经从两个端口210和212引出的一对FC缆线与这两个端口210和212相连接。这两个端口210和212与不同的BRT 5-0和5-1相连接,如附图3所示。In the disk package groups 20 - 0 to 23 - 0 , each port of each disk drive 200 is connected to two ports 210 and 212 via a pair of FC cables drawn from the two ports 210 and 212 . These two ports 210 and 212 are connected with different BRTs 5-0 and 5-1, as shown in FIG. 3 .

如附图1所示,各个控制模块4-0到4-7的盘适配器42a和42b分别与所有的盘封装组2-0到2-25相连接。换句话说,各个控制模块4-0到4-7的盘适配器42a分别与连接于盘封装组2-0到2-7上的BRT 5-0(见附图3)、连接于盘封装组2-8、2-9、——上的BRT 5-2、连接于盘封装组2-16、2-17、——上的BRT 5-4和连接于盘封装组2-24、2-25、——上的BRT 5-6相连接。As shown in FIG. 1, the disk adapters 42a and 42b of the respective control modules 4-0 to 4-7 are connected to all the disk package groups 2-0 to 2-25, respectively. In other words, the disk adapters 42a of the respective control modules 4-0 to 4-7 are respectively connected to the BRT 5-0 (see accompanying drawing 3) on the disk packaging groups 2-0 to 2-7, connected to the disk packaging groups BRT 5-2 on 2-8, 2-9, ——, BRT 5-4 connected to disk package group 2-16, 2-17, —— and connected to disk package group 2-24, 2- 25. Connect with BRT 5-6 on ——.

采用同样的方式,各个控制模块4-0到4-7的盘适配器42b分别与连接于盘封装组2-0到2-7上的BRT 5-1(见附图3)、连接于盘封装组2-8、2-9、——上的BRT 5-3、连接于盘封装组2-16、2-17、——上的BRT 5-5和连接于盘封装组2-24、2-25、——上的BRT 5-7相连接。In the same way, the disc adapters 42b of each control module 4-0 to 4-7 are respectively connected to the BRT 5-1 (see accompanying drawing 3) on the disc package group 2-0 to 2-7, connected to the disc package BRT 5-3 on groups 2-8, 2-9, -, connected to BRT 5-5 on reel package groups 2-16, 2-17, - and connected to reel package groups 2-24, 2 -25, - the BRT 5-7 on - is connected.

这样,将多个(在本情况下是两个)BRT连接到了各个盘封装组2-0到2-31上,并且同一控制模块4-0到4-7中的不同的盘适配器42a和42b分别与连接到同一盘封装组2-0到2-31的两个BRT相连接。Thus, multiple (in this case two) BRTs are connected to each disk package group 2-0 to 2-31, and different disk adapters 42a and 42b in the same control module 4-0 to 4-7 Two BRTs connected to the same tray package group 2-0 to 2-31 are respectively connected.

借助这种结构,每个控制模块4-0到4-7能够经盘适配器42a或42b对所有的盘封装组(盘驱动器)2-0到2-31进行存取。With this structure, each control module 4-0 to 4-7 can access all the disk package groups (disk drives) 2-0 to 2-31 via the disk adapter 42a or 42b.

这些构成为控制模块4-0到4-7的一部分的盘适配器42a和42b中的每一个安装在缓存管理器40的电路板上,该缓存管理器40是控制模块4-0到4-7的主要单元,各个盘适配器42a和42b借助例如PCI(外围部件互连)-Express总线与缓存管理器40直接连接,并且由此,能够实现各个盘适配器42a和42b与缓存管理器40之间要求的高流量。Each of these disk adapters 42a and 42b constituting a part of the control modules 4-0 to 4-7 is mounted on a circuit board of the cache manager 40, which is a part of the control modules 4-0 to 4-7. The main unit of each disk adapter 42a and 42b is directly connected with the cache manager 40 by means of, for example, a PCI (Peripheral Component Interconnect)-Express bus, and thus, the requirements between the respective disk adapters 42a and 42b and the cache manager 40 can be realized. of high traffic.

而且如附图2所示,各盘适配器42a和42b借助总线(比如光纤通道和以太网)与相应的BRT 5-0到5-7相连接。在这种情况下,该总线通过电子布线安装在背板的印刷电路板上,后面将作介绍。Also as shown in Figure 2, each disk adapter 42a and 42b is connected to a corresponding BRT 5-0 to 5-7 by means of a bus (such as Fiber Channel and Ethernet (R )). In this case, the bus is mounted on the printed circuit board of the backplane via electronic wiring, described later.

各个控制模块4-0到4-7的盘适配器42和42b和BRT 5-0到5-7进行一对一网形连接,从而连接到了所有的盘封装组上,如前所述,只要控制模块4-0到4-7的数量(换句话说,盘适配器42a和42b的数量)增加,连接线路的数量就会增加,并且连接关系会变得更加复杂,这就会造成物理安装困难。但是当将光纤通道(信号数量少,构成的接口小)用于盘适配器42a和42b与BRT 5-0到5-7之间的连接时,在印刷电路板上进行安装变得可行了。The disk adapters 42 and 42b of each control module 4-0 to 4-7 and the BRT 5-0 to 5-7 are connected in a one-to-one network, thereby being connected to all disk packaging groups. As previously mentioned, as long as the control As the number of modules 4-0 to 4-7 (in other words, the number of disk adapters 42a and 42b) increases, the number of connection lines increases and the connection relationship becomes more complicated, which makes physical installation difficult. But when using Fiber Channel (few number of signals, small interface formed) for connection between disk adapters 42a and 42b and BRT 5-0 to 5-7, mounting on printed circuit board became feasible.

当各个盘适配器42a和42b与相应的BRT 5-0到5-7通过光纤通道连接时,BRT 5-0到5-7就成为了光纤通道的开关。各个BRT 5-0到5-7与相应的盘封装组2-0到2-31也是例如通过光纤通道连接的,并且在这种情况下,由于模块不同,使用了光缆500和510进行连接。如附图1所示,DMA引擎43与其它控制模块4-0到4-7相互通信,并且负责与其它控制模块4-0到4-7的通信和数据传递处理。各个控制模块4-0到4-7的各个DMA引擎43构成为控制模块4-0到4-7的一部分,并且安装在缓存管理器40的电路板上,该缓存管理器40是控制模块4-0到4-7的主要单元。并且DMA引擎43借助前面提到的高速串行总线与缓存管理器40直接连接,并且经FRT 6-0和6-1与其它控制模块4-0到4-7的DMA引擎43相互通信。When the respective disk adapters 42a and 42b are connected to the corresponding BRTs 5-0 to 5-7 through Fiber Channel, the BRTs 5-0 to 5-7 become Fiber Channel switches. The individual BRTs 5-0 to 5-7 are also connected to the corresponding reel packaging groups 2-0 to 2-31, for example by fiber-optic channels, and in this case, due to the different modules, fiber optic cables 500 and 510 are used for the connection. As shown in FIG. 1, the DMA engine 43 communicates with other control modules 4-0 to 4-7, and is responsible for communication and data transfer processing with other control modules 4-0 to 4-7. The respective DMA engines 43 of the respective control modules 4-0 to 4-7 constitute part of the control modules 4-0 to 4-7, and are mounted on the circuit board of the buffer manager 40, which is the control module 4 -Major units from 0 to 4-7. And the DMA engine 43 is directly connected with the cache manager 40 by means of the aforementioned high-speed serial bus, and communicates with the DMA engines 43 of other control modules 4-0 to 4-7 via the FRTs 6-0 and 6-1.

FRT 6-0和6-1与多个(具体来说是三个或更多,在本情况下是八个)控制模块4-0到4-7的DMA引擎43相连接,并且选择性地开关并可通信地连接这些控制模块4-0到4-7。The FRTs 6-0 and 6-1 are connected to the DMA engines 43 of a plurality (specifically three or more, in this case eight) of the control modules 4-0 to 4-7, and selectively These control modules 4-0 to 4-7 are switched and communicatively connected.

借助这种结构,各个控制模块4-0到4-7的各个DMA引擎43执行通信和数据传递处理(例如,镜像处理),所述处理是依照来自主计算机的存取请求在连接到这个控制模块上的缓存管理器40与其它控制模块4-0到4-7的缓存管理器40之间经FRT 6-0和6-1发生的。With this structure, each DMA engine 43 of each control module 4-0 to 4-7 executes communication and data transfer processing (for example, mirroring processing) in connection with this control module in accordance with an access request from a host computer. Between the buffer manager 40 on the module and the buffer manager 40 of other control modules 4-0 to 4-7, it takes place via FRT 6-0 and 6-1.

如附图2所示,各个控制模块4-0到4-7的DMA引擎43由多个(在本情况下是两个)DMA引擎43-a和43-b构成,并且这两个DMA引擎43-a和43-b中的每一个都使用这两个FRT 6-0和6-1。As shown in accompanying drawing 2, the DMA engine 43 of each control module 4-0 to 4-7 is made up of multiple (in this case, two) DMA engines 43-a and 43-b, and these two DMA engines Each of 43-a and 43-b uses these two FRTs 6-0 and 6-1.

DMA引擎43-a和43-b通过例如上面提到的PCI-Express总线与缓存管理器40相连接,以便实现低等待时间。The DMA engines 43-a and 43-b are connected to the cache manager 40 through, for example, the above-mentioned PCI-Express bus in order to achieve low latency.

在各个控制模块4-0到4-7之间(换句话说,在各个控制模块4-0到4-7的缓存管理器40之间)进行通信和数据传递处理的情况下,数据传递量很高,优选地能减少通信所需的时间,并要求高的流量和低的等待时间(很快的响应速度)。因此,如附图1和附图2所示,各个控制模块4-0到4-7的DMA引擎43与FRT 6-0和6-1是通过使用高速串行传输(PCI-Express或Rapid-IO)的总线进行连接的,这种总线被设计成同时满足高流量和低等待时间的要求。In the case of communication and data transfer processing between the respective control modules 4-0 to 4-7 (in other words, between the buffer managers 40 of the respective control modules 4-0 to 4-7), the data transfer amount High, preferably to reduce the time required for communication, and require high throughput and low latency (fast response speed). Therefore, as shown in accompanying drawing 1 and accompanying drawing 2, the DMA engine 43 of each control module 4-0 to 4-7 and FRT 6-0 and 6-1 are by using high-speed serial transmission (PCI-Express or Rapid- IO) bus, which is designed to meet the requirements of high traffic and low latency at the same time.

PCI-Express和Rapid-IO使用2.5Gbps高速串行传输,并且使用了称为LVDS(低电压差分信令)的小幅度差分接口作为其总线接口。PCI-Express and Rapid-IO use 2.5Gbps high-speed serial transmission, and use a small amplitude differential interface called LVDS (Low Voltage Differential Signaling) as its bus interface.

读/写处理read/write processing

现在将介绍附图1到附图4中的数据存储系统的读取处理。附图5是示出了附图1和附图2中的结构的读取操作的示意图。The read processing of the data storage system in FIGS. 1 to 4 will now be described. FIG. 5 is a schematic diagram showing a read operation of the structure in FIG. 1 and FIG. 2 .

当缓存管理器40经相应的通道适配器41a到41d接收到来自一个主计算机的读取请求时,如果缓冲存储器40b保存着这一读取请求的目标数据,则缓存管理器40将保存在缓冲存储器40b中的这一目标数据经通道适配器41a到41d发送给主计算机。When the buffer manager 40 receives a read request from a host computer via the corresponding channel adapters 41a to 41d, if the buffer memory 40b stores the target data of this read request, the cache manager 40 will save the target data in the buffer memory 40b. This object data in 40b is sent to the host computer via channel adapters 41a to 41d.

如果这一数据没有保存在缓冲存储器40b中,则缓存控制单元40a将目标数据从保存着这一数据的盘驱动器200中读取到缓冲存储器40b中,然后将该目标数据发送给发出读取请求的主计算机。If this data is not stored in the buffer memory 40b, then the cache control unit 40a reads the target data from the disc drive 200 that has saved this data into the buffer memory 40b, and then sends the target data to the main computer.

将参照附图5介绍对盘驱动器进行的这一读取处理。This reading process for the disk drive will be described with reference to FIG. 5 .

(1)缓存管理器40的控制单元40a(CPU)在缓冲存储器40b的描述符区中创建FC报头和描述符。该描述符是向数据传递电路(DMA电路)请求数据(DMA)传递的指令,包括FC报头在缓冲存储器上的地址、数据传递到缓冲存储器上的地址、其数据字节数和数据传递的盘的逻辑地址。(1) The control unit 40a (CPU) of the cache manager 40 creates an FC header and a descriptor in the descriptor area of the cache memory 40b. The descriptor is an instruction to request data (DMA) transfer to the data transfer circuit (DMA circuit), including the address of the FC header on the buffer memory, the address of the data transferred to the buffer memory, the number of data bytes and the disk for data transfer logical address.

(2)启动盘适配器42的数据传递电路。(2) The data transfer circuit of the disk adapter 42 is started.

(3)盘适配器42的已启动数据传递电路从缓冲存储器40b中读取描述符。(3) The activated data transfer circuit of the disk adapter 42 reads the descriptor from the buffer memory 40b.

(4)盘适配器42的已启动数据传递电路从缓冲存储器40b中读取FC报头。(4) The activated data transfer circuit of the disk adapter 42 reads the FC header from the buffer memory 40b.

(5)盘适配器42的已启动数据传递电路对描述符进行分析,并且获得关于所请求的盘、首地址和字节数的数据,并且经光纤通道500(510)将FC报头传递给目标盘驱动器200。盘驱动器200读取所请求的目标数据,并且经光纤通道500(510)将其发送给盘适配器42的数据传递电路。(5) The started data delivery circuit of the disk adapter 42 analyzes the descriptor, and obtains data about the requested disk, the first address and the number of bytes, and delivers the FC header to the target disk via the Fiber Channel 500 (510) Drive 200. Disk drive 200 reads the requested target data and sends it to the data transfer circuitry of disk adapter 42 via Fiber Channel 500 (510).

(6)盘驱动器200读取所请求的目标数据,并且当传输完成时,经光纤通道500(510)向盘适配器42的数据传递电路发送完成通知。(6) The disk drive 200 reads the requested target data, and when the transfer is completed, sends a completion notification to the data transfer circuit of the disk adapter 42 via the fiber channel 500 (510).

(7)当接收到完成通知时,盘适配器42的已启动数据传递电路从盘适配器42的存储器中读取已读取数据并且将其存储到缓冲存储器40b中。(7) When the completion notification is received, the activated data transfer circuit of the disk adapter 42 reads the read data from the memory of the disk adapter 42 and stores it into the buffer memory 40b.

(8)当读取传递完成时,盘适配器42的已启动数据传递电路通过中断向缓存管理器40发送完成通知。(8) When the read transfer is completed, the activated data transfer circuit of the disk adapter 42 sends a completion notification to the cache manager 40 through an interrupt.

(9)当接收到来自盘适配器42的中断参数时,缓存管理器40的控制单元40a确认读取传递。(9) When receiving an interrupt parameter from the disk adapter 42, the control unit 40a of the cache manager 40 confirms the read delivery.

(10)缓存管理器40的控制单元42a检查盘适配器42的结束指针,并且确认读取传递完成。(10) The control unit 42a of the cache manager 40 checks the end pointer of the disk adapter 42, and confirms that the read transfer is completed.

所有连接线路必须具有高流量以实现足够高的性能,并且由于尤其是缓存控制单元40a与盘适配器42之间的信号交换是很频繁的(在附图5中是七次),需要具有特别低等待时间的总线。All connection lines must have high traffic to achieve sufficiently high performance, and since especially the handshaking between the cache control unit 40a and the disk adapter 42 is very frequent (seven times in FIG. 5 ), a particularly low Waiting time for the bus.

在这个例子中,使用了PCI-Express(四路)和光纤通道(4G)作为高流量连接线路,但是PCI-Express是低等待时间连接线路,而光纤通道连接具有相对较高的等待时间(数据传递花费时间)。In this example, PCI-Express (quad) and Fiber Channel (4G) are used as high-traffic connections, but PCI-Express is a low-latency connection, while Fiber Channel connections have relatively high latency (data transfer takes time).

在第二种现有技术的情况下,等待时间高的光纤通道不能用于CM 10与DA 13或CA 11之间的RT 14(见附图12),但是在本发明中,具有附图1中的结构,光纤通道可以用于BRT 5-0到5-7。In the case of the second prior art, fiber channel with high latency cannot be used for RT 14 between CM 10 and DA 13 or CA 11 (see accompanying drawing 12), but in the present invention, with accompanying drawing 1 In the structure, Fiber Channel can be used for BRT 5-0 to 5-7.

要实现低等待时间,总线的信号数量不能减少到小于某个数量,但是依据本发明,使用少量信号线的光纤通道可以用于盘适配器42与BRT5-0之间的连接,所以这减少了背板上的信号线数量,这对安装来说是很有效的。To achieve low latency, the number of signals on the bus cannot be reduced to less than a certain amount, but according to the present invention, Fiber Channel using a small number of signal lines can be used for the connection between the disk adapter 42 and the BRT5-0, so this reduces the overhead. The number of signal lines on the board, which is very efficient for the installation.

现在将介绍写入操作。当经相应的通道适配器41a到41d接收到来自主计算机之一的写入请求时,接收到写入请求命令和写入数据的通道适配器41a到41d向缓存管理器40询问写入数据应当写入的缓冲存储器40b的地址。Write operations will now be covered. When a write request from one of the host computers is received via the corresponding channel adapters 41a to 41d, the channel adapters 41a to 41d that have received the write request command and the write data ask the cache manager 40 where the write data should be written. The address of the buffer memory 40b.

当接收到来自缓存管理器40的响应时,通道适配器41a到41d将写入数据写入到缓存管理器40的缓冲存储器40b中,而且还将该写入数据写入到不同于这个缓存管理器40的至少一个缓存管理器40(换句话说,不同控制模块4-0到4-7中的缓存管理器40)中的缓冲存储器40b中。为此,通道适配器41a到41d启动DMA引擎43,并且经FRT 6-0和6-1将写入数据写入到另一个控制模块4-0到4-7中的缓存管理器40中的缓冲存储器40b中。When receiving a response from the cache manager 40, the channel adapters 41a to 41d write the write data into the buffer memory 40b of the cache manager 40, and also write the write data into a buffer memory other than this cache manager. In at least one cache manager 40 of 40 (in other words, the cache managers 40 in the different control modules 4-0 to 4-7) in the buffer memory 40b. To this end, the channel adapters 41a to 41d start the DMA engine 43 and write the write data to the buffer in the buffer manager 40 in another control module 4-0 to 4-7 via the FRTs 6-0 and 6-1. in memory 40b.

这里将写入数据写入到了至少两个不同的控制模块4-0到4-7的缓冲存储器40b中,这是因为要对数据进行复制(镜像),以便即使控制模块4-0到4-7或缓存管理器40发生了意外的硬件故障也能防止数据丢失。Here the write data is written in the buffer memory 40b of at least two different control modules 4-0 to 4-7, this is because the data will be copied (mirrored), so that even if the control modules 4-0 to 4-7 7 or an unexpected hardware failure of the cache manager 40 can also prevent data loss.

当写入数据向这多个缓冲存储器40b的写入正常结束时,通道适配器41a到41d向主计算机3-0到3-31发出完成通知,并且处理结束。When the writing of the write data to the plurality of buffer memories 40b ends normally, the channel adapters 41a to 41d issue a completion notification to the host computers 3-0 to 3-31, and the processing ends.

必须还要将这一写入数据反写到目标盘驱动器中(反写技术,writeback)。缓存控制单元40a按照内部进度将缓冲存储器40b的写入数据反写到保存这一目标数据的盘驱动器200中。将参照附图6描述这一对盘驱动器进行的写入处理。This written data must also be written back into the target disk drive (writeback technique, writeback). The cache control unit 40a reverse-writes the written data of the buffer memory 40b to the disk drive 200 storing this target data according to an internal schedule. The writing process performed by this pair of disk drives will be described with reference to FIG. 6. FIG.

(1)缓存管理器40的控制单元40a(CPU)在缓冲存储器40b的描述符区中创建FC报头和描述符。该描述符是向数据传递(DMA)电路请求数据传递(DMA)的指令,并且包括FC报头在缓冲存储器上的地址、数据要传递到缓冲存储器上的地址及其数据字节数,以及数据传递的盘的逻辑地址。(1) The control unit 40a (CPU) of the cache manager 40 creates an FC header and a descriptor in the descriptor area of the cache memory 40b. This descriptor is an instruction to request data transfer (DMA) to the data transfer (DMA) circuit, and includes the address of the FC header on the buffer memory, the address of the data to be transferred to the buffer memory and the number of data bytes thereof, and the data transfer The logical address of the disk.

(2)启动盘适配器42的数据传递电路。(2) The data transfer circuit of the disk adapter 42 is started.

(3)盘适配器42的已启动数据传递电路从缓冲存储器40b中读取描述符。(3) The activated data transfer circuit of the disk adapter 42 reads the descriptor from the buffer memory 40b.

(4)盘适配器42的已启动数据传递电路从缓冲存储器40b中读取FC报头。(4) The activated data transfer circuit of the disk adapter 42 reads the FC header from the buffer memory 40b.

(5)盘适配器42的已启动数据传递电路对描述符进行分析,并且获得关于所请求的盘、首地址和字节数的数据,并且从缓冲存储器40b中读取数据。(5) The activated data transfer circuit of the disk adapter 42 analyzes the descriptor, and obtains data on the requested disk, head address, and byte count, and reads the data from the buffer memory 40b.

(6)在读取完成之后,盘适配器42的数据传递电路经光纤通道500(510)将FC报头和数据传递给目标盘驱动器200。盘驱动器200将所传递的数据写入到内部盘上。(6) After the reading is completed, the data transfer circuit of the disk adapter 42 transfers the FC header and data to the target disk drive 200 via the fiber channel 500 (510). The disk drive 200 writes the transferred data on the internal disk.

(7)当数据的写入完成时,盘驱动器200经光纤通道500(510)向盘适配器42的数据传递电路发送完成通知。(7) When writing of data is completed, the disk drive 200 sends a completion notification to the data transfer circuit of the disk adapter 42 via the fiber channel 500 (510).

(8)当接收到完成通知时,盘适配器42的已启动数据传递电路借助中断向缓存管理器40发送完成通知。(8) When receiving the completion notification, the activated data transfer circuit of the disk adapter 42 sends a completion notification to the cache manager 40 by means of an interrupt.

(9)当接收到来自盘适配器42的中断参数时,缓存管理器40的控制单元40a确认写入操作。(9) When receiving an interrupt parameter from the disk adapter 42, the control unit 40a of the cache manager 40 confirms the write operation.

(10)缓存管理器40的控制单元42a检查盘适配器42的结束指针,并且确认写入操作完成。(10) The control unit 42a of the cache manager 40 checks the end pointer of the disk adapter 42, and confirms that the write operation is completed.

在附图6和附图5中,箭头标志都表示分组信息(比如数据)的传递,而U形弯箭头标志都表示将数据发送回数据请求端的数据读取。由于需要确认DA中控制电路的开始和结束状态,因此传递一次数据,要在CM 40与DA 42之间交换七次数据。数据在DA 42与盘200之间交换了两次。In Fig. 6 and Fig. 5, the arrow marks represent the transfer of packet information (such as data), and the U-shaped arrow marks represent the data reading that sends data back to the data requesting end. Since it is necessary to confirm the start and end states of the control circuit in the DA, data should be exchanged seven times between the CM 40 and the DA 42 once the data is transferred. Data is exchanged between DA 42 and disc 200 twice.

由此,可以理解,缓存控制单元40与盘适配器42之间的连接线路需要低等待时间,并且另一方面,具有少量信号线的接口可以用于盘适配器42和盘装置200。From this, it can be understood that the connection line between the cache control unit 40 and the disk adapter 42 requires low latency, and on the other hand, an interface with a small number of signal lines can be used for the disk adapter 42 and the disk device 200 .

安装结构installation structure

附图7是示出了依据本发明的控制模块的安装结构示例的示意图,附图8是示出了包括附图7中的控制模块和盘封装组的安装结构示例的示意图,而附图9和附图10是示出了具有这些安装结构的数据存储系统的框图。Accompanying drawing 7 is a schematic diagram showing an example of an installation structure of a control module according to the present invention, and accompanying drawing 8 is a schematic diagram showing an example of an installation structure including a control module and a disc package group in accompanying drawing 7, and accompanying drawing 9 and Figure 10 is a block diagram showing a data storage system having these mounting structures.

如附图8所示,四个盘封装组2-0、2-1、2-8和2-9安装在了存储装置主体的上侧。控制电路安装在了存储装置的下半部分中。这个下半部分由背板7分成了前后两个部分,如附图7所示。分别在背板7的前侧和后侧创建了插槽。在存储系统具有附图9中的大规模结构的情况下,在前侧设置了八个(八片)CM 4-0到4-7,而在后侧设置了两个(两片)FRT 6-0和6-1、八个(八片)BRT 5-0到5-7和负责电源控制的服务处理机SVC(附图1和附图9中未示出)。As shown in FIG. 8, four disk package groups 2-0, 2-1, 2-8, and 2-9 are mounted on the upper side of the memory device main body. The control circuit is installed in the lower half of the storage device. The lower half is divided into front and rear parts by the back plate 7, as shown in Figure 7 . Slots are created on the front and rear sides of the backplane 7, respectively. In the case of the storage system having the large-scale structure in accompanying drawing 9, eight (eight pieces) of CM 4-0 to 4-7 are provided on the front side, and two (two pieces) of FRT 6 are provided on the rear side -0 and 6-1, eight (eight slices) BRT 5-0 to 5-7 and the service processor SVC responsible for power control (not shown in accompanying drawing 1 and accompanying drawing 9).

在附图7中,八片CM 4-0到4-7和两片FRT 6-0和6-1是经背板7由四路PCI-Express连接的。PCI-Express具有四个(差分和双向)信号线,所以四路使用了16根信号线,这意味着信号线的数量为16×16=256。这八片CM 4-0到4-7与八片BRT 5-0到5-7是经背板7由光纤通道连接的。光纤通道(具有差分和双向信号线)具有1×2×2=4根信号线,所以信号线的数量是8×8×4=256。In accompanying drawing 7, eight slices of CM 4-0 to 4-7 and two slices of FRT 6-0 and 6-1 are connected by four-way PCI-Express through backplane 7. PCI-Express has four (differential and bidirectional) signal wires, so 16 signal wires are used for four channels, which means that the number of signal wires is 16×16=256. These eight pieces of CM 4-0 to 4-7 and eight pieces of BRT 5-0 to 5-7 are connected by optical fiber channels through the backplane 7. Fiber Channel (with differential and bidirectional signal lines) has 1 x 2 x 2 = 4 signal lines, so the number of signal lines is 8 x 8 x 4 = 256.

通过如上面所述依据连接位置而不同地使用总线,可以借助512根信号线来实现八片CM 4-0到4-7、两片FRT 6-0和6-1和八片BRT 5-0到5-7,即使在存储系统具有如附图9中所示的大规模结构的情况下也是如此。这一信号线的数量是能够足量安装在背板基板7上的信号线的数量,并且电路板的信号层数量是六个,这个数量是足够用的,并且处在从成本角度讲能够实现的范围之内。By using the bus differently depending on the connection position as described above, eight pieces of CM 4-0 to 4-7, two pieces of FRT 6-0 and 6-1 and eight pieces of BRT 5-0 can be realized by means of 512 signal lines to 5-7, even in the case where the storage system has a large-scale structure as shown in FIG. 9 . The number of this signal line is the number of signal lines that can be installed on the backplane substrate 7 in sufficient quantities, and the number of signal layers of the circuit board is six, which is sufficient and can be realized from the perspective of cost. within the range.

在附图8中,安装了四个盘封装组2-0、2-1、2-8和2-9(见附图9),而其它的盘封装组2-3到2-7和2-10到2-31安装在不同的主体上。In accompanying drawing 8, four disc packing groups 2-0, 2-1, 2-8 and 2-9 (see accompanying drawing 9) are installed, and other disc packing groups 2-3 to 2-7 and 2 -10 to 2-31 mounted on different bodies.

附图10中的中等规模存储系统也能够采用类似的结构来实现。换句话说,可以采用同样的体系结构来实现四个单元的CM 4-0到4-3、四个单元的BRT 5-0到5-3、两个单元的FRT 6-0到6-1和16个模块的盘封装组2-0到2-15的结构。The medium-scale storage system in FIG. 10 can also be implemented with a similar structure. In other words, the same architecture can be used to implement four-unit CM 4-0 to 4-3, four-unit BRT 5-0 to 5-3, two-unit FRT 6-0 to 6-1 and 16-module disk package groups 2-0 to 2-15 configurations.

各个控制模块4-0到4-7的盘适配器42a和42b借助BRT与所有的盘驱动器200相连接,从而各个控制模块4-0到4-7能够经盘适配器42a或42b对所有的盘驱动器进行存取。Disk adapters 42a and 42b of each control module 4-0 to 4-7 are connected with all disk drives 200 by means of BRT, so that each control module 4-0 to 4-7 can connect all disk drives via disk adapter 42a or 42b to access.

这些盘适配器42a和42b分别安装在缓存管理器40的电路板上,该缓存管理器40是控制模块4-0到4-7的主要单元,并且各个盘适配器42a和42b可以借助诸如PCI-Express之类的低等待时间总线直接与缓存管理器40相连接,所以能够实现高流量。These disk adapters 42a and 42b are respectively mounted on the circuit board of the cache manager 40, which is the main unit of the control modules 4-0 to 4-7, and the respective disk adapters 42a and 42b can be connected by means such as PCI-Express A low-latency bus such as the Cache Manager 40 is directly connected to the cache manager 40, so high traffic can be achieved.

各个控制模块4-0到4-7的盘适配器42a和42b与BRT 5-0到5-7进行一对一网形连接,所以即使系统的控制模块4-0到4-7的数量(换句话说,盘适配器42a和42b的数量)增加,光纤通道(具有构成接口的少量信号)也可以用于盘适配器42a和42b与BRT 5-0到5-7之间的连接,这解决了安装问题。The disk adapters 42a and 42b of each control module 4-0 to 4-7 are connected to the BRT 5-0 to 5-7 in a one-to-one mesh connection, so even if the number of control modules 4-0 to 4-7 of the system (change In other words, the number of disk adapters 42a and 42b) increases, and fiber channel (with a small number of signals constituting the interface) can also be used for connection between the disk adapters 42a and 42b and BRT 5-0 to 5-7, which solves the problem of installation question.

在各个控制模块4-0到4-7之间(换句话说,在各个控制模块4-0到4-7的缓存管理器40之间)进行通信和数据传递处理的情况下,数据传递量很高,并且最好能减少连接所需的时间,并且要求高的流量和低的等待时间(很快的响应速度),所以如附图2所示,各个控制模块4-0到4-7的DMA引擎43与FRT6-0和6-1是通过PCI-Express进行连接的,这种总线是使用高速串行传输的总线,被设计成同时满足高流量和低等待时间的要求。In the case of communication and data transfer processing between the respective control modules 4-0 to 4-7 (in other words, between the buffer managers 40 of the respective control modules 4-0 to 4-7), the data transfer amount It is very high, and it is best to reduce the time required for connection, and require high flow and low waiting time (very fast response speed), so as shown in Figure 2, each control module 4-0 to 4-7 The DMA engine 43 and FRT 6-0 and 6-1 are connected through PCI-Express, this bus is a bus using high-speed serial transmission, and is designed to meet the requirements of high traffic and low waiting time at the same time.

其它实施例other embodiments

依据上述的实施例,控制模块中的信号线是使用PCI-Express加以介绍的,但是也可以使用其它的高速串行总线,比如Rapid-IO。控制模块中的通道适配器和盘适配器的数量可以依据需要增加或减少。According to the above embodiments, the signal lines in the control module are introduced using PCI-Express, but other high-speed serial buses, such as Rapid-IO, can also be used. The number of channel adapters and disk adapters in the control module can be increased or decreased as required.

对于盘驱动器来说,可以使用诸如硬盘驱动器、光盘驱动器和磁光盘驱动器之类的存储装置。As the disk drive, storage devices such as hard disk drives, optical disk drives, and magneto-optical disk drives can be used.

本发明是利用具体实施例进行介绍的,但是本发明也可以采用处于本发明的基本特征的范围之内的各种不同的方式来具体实现,并且不应将这些方式排除于本发明的范围之外。The present invention is described using specific embodiments, but the present invention can also be embodied in various ways within the scope of the essential characteristics of the present invention, and these ways should not be excluded from the scope of the present invention outside.

由于各个控制模块的第二接口与多个第一切换单元相连接,因此所有的控制模块都能够维持访问所有存储装置的冗余度,并且即使控制模块的数量增加,也可以使用背板通过串行总线(具有少量构成接口的信号)将控制模块与第一切换单元相连接,因此安装到印刷电路板上成为可能,同时保持控制模块内的低等待时间通信。所以本发明对统一从大规模到小规模的体系结构来说是很有效果的,并且能够有助于降低装置的成本。Since the second interface of each control module is connected to a plurality of first switching units, all control modules can maintain the redundancy of accessing all storage devices, and even if the number of control modules increases, they can also use the backplane A row bus (with a small number of signals constituting the interface) connects the control module with the first switching unit, thus enabling mounting on a printed circuit board while maintaining low-latency communication within the control module. Therefore, the present invention is very effective for unifying the architecture from large scale to small scale, and can contribute to reducing the cost of the device.

相关申请的交叉引用Cross References to Related Applications

本申请基于2004年11月30日提交的在先日本专利申请No.2004-347411和2005年1月28日提交的在先日本专利申请No.2005-022121并且要求这些在先申请的优先权,这些在先申请的全部内容以引用的方式并入本文。This application is based on and claims the priority of the prior Japanese patent application No. 2004-347411 filed on November 30, 2004 and the prior Japanese patent application No. 2005-022121 filed on January 28, 2005, The entire contents of these prior applications are incorporated herein by reference.

Claims (20)

1. data-storage system, this data-storage system comprises:
A plurality of memory storages with one or more access port are used to store data; With
A plurality of control modules are used for carrying out the access control of described memory storage according to the access instruction that comes from main frame,
Wherein said control module further comprises:
Memory buffer is used for storing the data that a part is stored in described memory storage;
Caching control unit is used to control described memory buffer;
First interface unit, described first interface unit is connected with described caching control unit, is used to control the interface with described main frame;
Second interface unit, described second interface unit is connected with described caching control unit, is used to control the interface with described a plurality of memory storages;
And wherein said data-storage system also comprises:
A plurality of first switch units, be used between described a plurality of control modules and described a plurality of memory storages, being connected, and be used for optionally described second interface unit and described a plurality of memory storage of each control module of switch, each described control module links to each other with described a plurality of first switch units, and each described memory storage described first switch unit different with one or more is connected; With
Backboard is used for being connected with described a plurality of described first switch units by universal serial bus each described second interface unit with described a plurality of control modules.
2. data-storage system according to claim 1, wherein said caching control unit is connected by the high-speed serial bus with low latency with described second interface unit, and described second interface unit uses described backboard to be connected by universal serial bus with described a plurality of first switch units.
3. data-storage system according to claim 1, wherein said control module also comprise be used for the communication unit that communicates with the described control module of another one and
Described system also comprises second switch unit, is used for optionally connecting the communication unit of each described control module.
4. data-storage system according to claim 3, wherein the communication unit of each control module is to use described backboard to be connected with second switch unit.
5. data-storage system according to claim 1, wherein said first switch unit is connected by cable with described a plurality of memory storages.
6. data-storage system according to claim 1, wherein said memory storage comprises a plurality of access ports,
And wherein said a plurality of first different switch unit is connected with described a plurality of access ports.
7. data-storage system according to claim 2, wherein said caching control unit is connected by the multipath high-speed universal serial bus with described second interface unit,
And described second interface unit is to use described backboard to be connected by universal serial bus with described a plurality of first switch units.
8. data-storage system according to claim 2, wherein said high-speed serial bus are the PCI-Express buses.
9. data-storage system according to claim 2, wherein said universal serial bus is an optical-fibre channel.
10. data-storage system according to claim 2, wherein said control module connects described caching control unit and described first interface unit by the high-speed serial bus with low latency.
11. a data recording control apparatus is used for being used to store according to the access instruction that comes from main frame the access control of a plurality of memory storages of data, this data recording control apparatus comprises:
A plurality of control modules comprise:
Memory buffer is used for storing the data that a part is stored in described memory storage;
Caching control unit is used to control described memory buffer;
First interface unit, described first interface unit is connected with described caching control unit, is used to control and being connected of described main frame;
Second interface unit, described second interface unit is connected with described caching control unit, is used to control and being connected of described a plurality of memory storages,
A plurality of first switch units, be used between described a plurality of control modules and described a plurality of memory storages, being connected, and be used for optionally described second interface unit and described a plurality of memory storage of each control module of switch, each described control module links to each other with described a plurality of first switch units, and each described memory storage described first switch unit different with one or more is connected; With
Backboard is used for being connected with described a plurality of first switch units by universal serial bus each described second interface unit with described a plurality of control modules.
12. data recording control apparatus according to claim 11, wherein said caching control unit is connected by the high-speed serial bus with low latency with described second interface unit,
And described second interface unit uses described backboard to be connected by universal serial bus with described a plurality of first switch units.
13. also comprising, data recording control apparatus according to claim 11, wherein said control module be used for the communication unit that communicates with the described control module of another one,
And described device also comprises second switch unit, is used for optionally connecting the communication unit of each described control module.
14. data recording control apparatus according to claim 13, wherein the communication unit of each control module is to use described backboard to be connected with second switch unit.
15. data recording control apparatus according to claim 11, wherein said first switch unit is connected by cable with described a plurality of memory storages.
16. data recording control apparatus according to claim 11, wherein said a plurality of first different switch units are connected with each the described memory storage with a plurality of access ports respectively.
17. data recording control apparatus according to claim 12, wherein said caching control unit is connected by the multipath high-speed universal serial bus with described second interface unit,
And described second interface unit is to use described backboard to be connected by universal serial bus with described a plurality of first switch units.
18. data recording control apparatus according to claim 12, wherein said high-speed serial bus are the PCI-Express buses.
19. data recording control apparatus according to claim 12, wherein said universal serial bus is an optical-fibre channel.
20. data recording control apparatus according to claim 12, wherein said caching control unit is to be connected by the high-speed serial bus with low latency with described first interface unit.
CNB2005100796426A 2004-11-30 2005-06-23 Data storage system and data storage control device Expired - Fee Related CN100347655C (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2004347411A JP4404754B2 (en) 2004-11-30 2004-11-30 Data storage apparatus and information processing system
JP2004347411 2004-11-30
JP2005022121 2005-01-28

Publications (2)

Publication Number Publication Date
CN1782978A CN1782978A (en) 2006-06-07
CN100347655C true CN100347655C (en) 2007-11-07

Family

ID=36633589

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2005100796426A Expired - Fee Related CN100347655C (en) 2004-11-30 2005-06-23 Data storage system and data storage control device

Country Status (2)

Country Link
JP (1) JP4404754B2 (en)
CN (1) CN100347655C (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100550880C (en) 2006-11-30 2009-10-14 杭州华三通信技术有限公司 The middle backboard of communication equipment
US8145837B2 (en) * 2007-01-03 2012-03-27 Raytheon Company Computer storage system with redundant storage servers and at least one cache server
JP4362135B2 (en) 2007-02-13 2009-11-11 富士通株式会社 Data transfer apparatus and data transfer method
CN102163130B (en) * 2010-02-23 2013-01-02 慧荣科技股份有限公司 Method for managing memory read data and memory device
JP5545108B2 (en) 2010-08-04 2014-07-09 富士通株式会社 Storage system, control device, and control method
JP5736875B2 (en) 2011-03-18 2015-06-17 富士通株式会社 Storage device and storage device control method
JP6540204B2 (en) 2015-04-30 2019-07-10 富士通株式会社 Relay device
JP6604029B2 (en) 2015-04-30 2019-11-13 富士通株式会社 Control device, storage device, control program
JP6582523B2 (en) 2015-04-30 2019-10-02 富士通株式会社 Storage device, control device, control program
CN111522513B (en) * 2020-04-24 2022-08-12 上海航天计算机技术研究所 Extensible high-performance multi-interface satellite-borne storage system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1132805A2 (en) * 2000-03-10 2001-09-12 Hitachi, Ltd. Disk array controller and method for adapting the number of control units in a disk array controller
US6351838B1 (en) * 1999-03-12 2002-02-26 Aurora Communications, Inc Multidimensional parity protection system
US6385681B1 (en) * 1998-09-18 2002-05-07 Hitachi, Ltd. Disk array control device with two different internal connection systems
US6425049B1 (en) * 1999-02-08 2002-07-23 Hitachi, Ltd. Disk array system and method of changing the configuration of the disk array system
CN1447254A (en) * 2003-04-04 2003-10-08 清华大学 Networked mass storage device and implementation approach

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6385681B1 (en) * 1998-09-18 2002-05-07 Hitachi, Ltd. Disk array control device with two different internal connection systems
US6425049B1 (en) * 1999-02-08 2002-07-23 Hitachi, Ltd. Disk array system and method of changing the configuration of the disk array system
US6351838B1 (en) * 1999-03-12 2002-02-26 Aurora Communications, Inc Multidimensional parity protection system
EP1132805A2 (en) * 2000-03-10 2001-09-12 Hitachi, Ltd. Disk array controller and method for adapting the number of control units in a disk array controller
CN1447254A (en) * 2003-04-04 2003-10-08 清华大学 Networked mass storage device and implementation approach

Also Published As

Publication number Publication date
CN1782978A (en) 2006-06-07
JP2006155392A (en) 2006-06-15
JP4404754B2 (en) 2010-01-27

Similar Documents

Publication Publication Date Title
CN1975654A (en) Data storage system and data storage control apparatus
CN1270250C (en) Method and device for raising serial storage equipment reliability and elasticity
CN1828510A (en) Data storage system and data storage control device
CN1955940A (en) RAID system, RAID controller and reconstruction/copyback processing method thereof
CN100347655C (en) Data storage system and data storage control device
CN1924783A (en) Storage system and storage control method
CN1655111A (en) Storage System
CN101042632A (en) Computer system and method for controlling allocation of physical links
US20140223097A1 (en) Data storage system and data storage control device
CN1812693A (en) Dual bus interface circuit board components and assemble method thereof
CN1832489A (en) Method for accessing object magnetic dish and system for extensing disk content
CN101040251A (en) Method and system for transferring data directly between storage devices in a storage area network
CN1828574A (en) Computer system using serial connection bus and multi-CPU interconnection method
US20240134814A1 (en) Scaling midplane bandwidth between storage processors via network devices
CN100351766C (en) Disk array system
JP4985750B2 (en) Data storage system
US7426658B2 (en) Data storage system and log data equalization control method for storage control apparatus
JP4440127B2 (en) Data storage system and data storage control device
CN1892566A (en) Interface device and storage device
US20080005396A1 (en) Storage system and configuration-change method thereof
CN1790248A (en) Device with dual write-in functions and memory control device
CN1567146A (en) Input/output processing unit of storage device
GB2412205A (en) Data storage system with an interface in the form of separate components plugged into a backplane.

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20071107

Termination date: 20210623