[go: up one dir, main page]

US6941439B2 - Computer system - Google Patents

Computer system Download PDF

Info

Publication number
US6941439B2
US6941439B2 US10/282,863 US28286302A US6941439B2 US 6941439 B2 US6941439 B2 US 6941439B2 US 28286302 A US28286302 A US 28286302A US 6941439 B2 US6941439 B2 US 6941439B2
Authority
US
United States
Prior art keywords
command
disk array
attribute
address translation
translation server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related, expires
Application number
US10/282,863
Other versions
US20030225993A1 (en
Inventor
Ikuya Yagisawa
Naoto Matsunami
Yasuyuki Mimatsu
Akihiro Mannen
Kenji Muraoka
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hitachi Ltd
Original Assignee
Hitachi Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hitachi Ltd filed Critical Hitachi Ltd
Assigned to HITACHI LTD reassignment HITACHI LTD ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MANNEN, AKIHIRO, MURAOKA, KENJI, MATSUNAMI, NAOTO, MIMATSU, YASUYUKI, YAGISAWA, IKUYA
Publication of US20030225993A1 publication Critical patent/US20030225993A1/en
Assigned to HITACHI, LTD. reassignment HITACHI, LTD. CORRECTIVE COVERSHEET TO CORRECT SERIAL NUMBER 10/282,869 ON ASSIGNMENT DOCUMENT PREVIOUSLY RECORDED ON REEL 014060, FRAME 0522. Assignors: MANNON, AKIHIRO, MURAOKA, KENJI, MATSUNAMI, NAOTO, MIMATSU, YASUYUKI, YAGISAWA, IKUYA
Application granted granted Critical
Publication of US6941439B2 publication Critical patent/US6941439B2/en
Adjusted expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0629Configuration or reconfiguration of storage systems
    • G06F3/0635Configuration or reconfiguration of storage systems by changing the path, e.g. traffic rerouting, path reconfiguration
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/0604Improving or facilitating administration, e.g. storage management
    • G06F3/0605Improving or facilitating administration, e.g. storage management by facilitating the interaction with a user or administrator
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0629Configuration or reconfiguration of storage systems
    • G06F3/0631Configuration or reconfiguration of storage systems by allocating resources to storage systems
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0683Plurality of storage devices
    • G06F3/0689Disk arrays, e.g. RAID, JBOD

Definitions

  • the present invention relates to a storage device, and more particularly to a technique that virtually assigns a plurality of storage devices into one memory area and provides the area to a computer.
  • a storage device is usually constructed with disk arrays.
  • the disk array known as a Redundant Arrays of Independent Disks (RAID) and is a memory device where many disk drive devices are configured in arrays. Details of the disk array are described in: “A Case for Redundant Arrays of Inexpensive Disks (RAID).”
  • Japanese Patent Laid-open No. 6-161837 discloses a technique wherein, at the time of assigning a file to a user of a memory area available in a memory device, a storage device will automatically select a memory area that is optimum for assigning files, based on file attributes or file information such as type of space assignment.
  • a single storage device is used in a computer system
  • the user of the system can use a storage device that is suitable for the application of the data to be used.
  • a virtualization technique is introduced into the system, the user cannot specify and use a storage device that is suitable for the application of data to be used, since virtual memory areas, which are formed by a plurality of storage devices, are used.
  • an address translation server that realizes the virtualization recognizes and stores characteristics of a storage device that forms a storage pool. Further, the address translation server recognizes access characteristics from a host computer and creates and issues a command to the storage device, in view of the characteristics of the storage device to which the command is issued. In addition, the storage device provides a resource that matches the access characteristics required.
  • the present invention incorporates an external-storage-device attributes acquisition program where an address translation server that realizes the virtualization can dynamically execute an attribute change instruction to a storage device which forms a storage pool, according to changes in the access status from the host computer.
  • the storage device changes the internal control so that the attributes an be provided as required.
  • FIG. 1 is a system configuration diagram according to a first embodiment of the present invention
  • FIG. 2 is an explanatory diagram of a mapping table according to the first embodiment of the present invention.
  • FIG. 3 is an explanatory diagram of storage LU information according to the first embodiment of the present invention.
  • FIG. 4 is an explanatory diagram of commands according to the first embodiment of the present invention.
  • FIG. 5 is a flow chart of address translation and a command issuing operation according to the first embodiment of the present invention.
  • FIG. 6 is a structural diagram of Disk Array 400 ;
  • FIG. 7 is an explanatory diagram of a mapping table according to a second embodiment of the present invention.
  • FIG. 8 is an explanatory diagram of commands according to the second embodiment of the present invention.
  • FIG. 9 is an explanatory diagram of an access characteristics counter according to the second embodiment of the present invention.
  • FIG. 10 is an explanatory diagram of a cache management table according to the second embodiment of the present invention.
  • FIG. 11 is a flow chart of LU attributes designating the command issuing operation according to the second embodiment of the present invention.
  • FIG. 1 is a diagram showing a first embodiment of a computer system wherein the present invention is applied.
  • the computer system comprises a host computer 100 (hereinafter referred to as the “host 100 ”), an address translation computer 200 (hereinafter referred to as the “address translation server 200 ”), a plurality of storage devices 400 , and a tape device 500 .
  • the storage device 400 may be either a single disk device, or a storage device system, which is a combination of a plurality of disk devices, such as a RAID and a control device.
  • the plurality of storage devices 400 may be either a combination of the same storage devices or a combination of different storage devices.
  • a memory area that is available in each of the storage devices 400 is called a logical unit (LU).
  • the LUs of the plurality of storage devices 400 will be referred to as LU 10 , LU 11 , LU 12 and LU 13 , respectively.
  • the LU available in the storage device 400 may either be one that corresponds one-on-one to a disk device available in the storage devices 400 , or one that corresponds to a memory area that extends over the plurality of disk devices available in the storage devices 400 .
  • an attribute of an LU available in each of the storage devices 400 is recognized by the address translation server 200 .
  • the attribute of the LU may include not only functions that are realized by the storage device 400 , but basic characteristics of the storage device 400 , such as access performance from the host 100 and reliability.
  • An example of the functions might include the one to enable/disable the recognition of special commands issued by the address translation server 200 .
  • the tape device 500 is connected to the host 100 and is used to back up data.
  • the host 100 and the address translation server 200 and likewise, the address translation server 200 and the storage device 400 , are respectively connected to each other via a communication line.
  • the communication line used may be any of those lines, for example, where IP protocols are used, or where fiber channel protocols are used.
  • LU_ 300 is a virtual memory area that is to be recognized by the host 100 , and has an area A_ 310 and an area B_ 320 .
  • area A_ 310 is associated with LU 10
  • area B_ 320 is associated with LU 11 .
  • area A_ 310 and area B_ 320 are memory areas available in respective storage devices, which are practically different each other, LU_ 300 is recognized as a series of logical units by the host 100 .
  • the address translation server 200 has a CPU 220 , a memory 230 , a command receiving unit 210 , which receives commands from the host 100 , an access characteristics judging program 211 , which judges access characteristics of commands from the host 100 , an access characteristics receiving program 212 , which receives access characteristics information that is included in commands from the host 100 , an address translation program 213 , a mapping table 214 , a mapping table creating program 215 , an external storage device attributes acquisition program 216 , a command creating program 217 , a command issuing program 218 and an LU information storage unit 219 .
  • the CPU 220 executes the above-described programs.
  • the address translation program 213 is executed when the address of LU_ 300 designated by the host 100 is translated into addresses showing LU 10 , LU 11 , LU 12 and LU 13 .
  • mapping table 214 relationships with LU_ 300 ,_LUs 10 to 13 and attributes of LUs 10 to 13 are registered.
  • the mapping table creating program 215 is executed, according to the configuration of the storage device 400 that the computer system has, at the time the mapping table 214 is created.
  • the external-storage-device attributes acquisition program 216 is executed when information indicating attributes of the storage device 400 is acquired from the storage device.
  • the command creating program 217 is executed when commands, such as a command to write data to the storage device 400 , are created.
  • FIG. 2 is a diagram showing an example of the mapping table 214 .
  • the mapping table 214 registers an LU number (LUN) and a logical block address (LBA) of the virtual host LU_ 300 that can be recognized by the host 100 , the virtual LBAs of area A_ 310 and area B_ 320 , which are virtual areas, as well as the LUNs, LBAs and attributes of LU 10 , LU 11 , LU 12 and LU 13 , which are memory areas of respective storage devices 400 .
  • the mapping table 214 records the relationship among various registered items.
  • Attributes of the storage device 400 include sequential access performance, random access performance, evaluation of reliability, the function to enable/disable recognition of commands with attributes, etc. of the storage device 400 .
  • evaluation values for sequential access performance, random access performance and reliability are indicated in five steps, and a higher evaluation is given to those steps that have a higher number.
  • FIG. 3 is a diagram showing an example of storage LU information of respective storage devices 400 that is obtained by the address translation server 200 through the execution of the external storage device attributes acquisition program 216 .
  • the storage LU information includes storage attributes and evaluation values thereof
  • the storage attributes include LU number, capacity, sequential access performance, random access performance, reliability and the function to enable/disable the recognition of commands with attributes.
  • some values for information of attributes shown in FIG. 3 are not set.
  • FIG. 4 is a diagram showing the contents of commands issued by the host 100 and the address translation server 200 .
  • those corresponding to #1 to #3 are contents that are included in typical commands to be used for reading and writing. More specifically, operation details such as read/write are set to the item “Op codes”.
  • LBA values are set to the item “LBA”, and information indicating size is set to the item “Size.”
  • mapping table 214 is described.
  • the address translation server 200 executes the mapping table creating program 215 to retrieve the storage devices 400 , which are connected to the address translation server 200 . Thereafter, the address translation server 200 executes the external-storage-device attributes acquisition program 216 to acquire LU information from the storage device 400 , and stores it in the LU information storage unit 219 .
  • the address translation server 200 issues commands such as ModeSense to the storage device 400 to acquire the storage LU information. Thereafter, the address translation server 200 executes the mapping table creating program 215 , based on the storage LU information thus acquired, to record values for the storage device LU and storage device attributes in respective items of the mapping table 214 .
  • the address translation server 200 executes the external-storage-device attributes acquisition program 216 to issue read/write commands to the storage device 400 and measures performance values according to access characteristics. Thereafter, the address translation server 200 records values in items of storage device attributes based on the measurements.
  • the address translation server 200 executes the mapping table, creating program 215 to form a host LU by combining the available LUs of storage devices, based on storage attribute values such as the storage capacity, performance, and reliability of the LUs thus requested.
  • the storage capacity and the storage attribute values may be sent to the address translation server 200 from the host 100 .
  • FIG. 2 shows that a LUN 0 , which is a certain LU_ 300 , has a storage capacity of 200 blocks.
  • LUN_ 0 consists of an area A_ 310 and an area B_ 320 , and, further, area A_ 310 comprises LBA_ 0 to LBA_ 99 of LU 10 and area B_ 320 comprises LBA 0 to LBA_ 99 of LU 11 . Both LU 10 and LU 11 are associated with storage device attributes.
  • FIG. 5 is a flow chart showing processing of the address translation and the issuance of commands in the address translation server 200 .
  • sequential access is assumed as an access characteristic, but any other characteristics may be used.
  • the address translation server 200 receives a command issued by the host 100 at the command receiving unit 210 (step 1001 ).
  • the address translation server 200 executes the access characteristics judging program 211 to judge, based on a plurality of commands received, whether the access characteristic is sequential or not.
  • There are various methods for the judgment of the access characteristic and the judgment can easily be made with a known method. Therefore, the description of the methods has been omitted (step 1002 ).
  • the address translation server 200 creates a command which does not have any access attributes (step 1003 ), and issues a command to the storage device 400 (step 1004 ). Otherwise, if the characteristic of a received command is sequential, the address translation server 200 refers to the mapping table 214 (step 1006 ), and judges whether or not the LU of the storage device 400 which corresponds to the command can recognize a command with attributes (step 1007 ).
  • the address translation server 200 creates a command to which an access attribute showing a sequential characteristic is added (step 1008 ), and issues a command to the storage device 400 (step 1004 ). It should be noted that in order to add an attribute of sequential access to a command, a value which indicates “Yes” is filled in in the description column for “sequential property” of the command. In addition, a plurality of commands that are judged to be sequential maybe put together for a single command.
  • step 1007 if it is judged that the storage device 400 cannot recognize a command with attributes, the address translation server 200 creates a command that does not have any access attributes (step 1009 ), and issues a command to the storage device 400 .
  • a plurality of commands that are judged to be sequential may be put together for a single command (step 1004 ).
  • the address translation server 200 may, in step 1002 , judge the sequential characteristic of the command by using the access characteristics receiving program 212 instead of using the access characteristics judging program 211 .
  • a host 100 that issues a lot of random access may be assumed to be used, and further, a case where data that are stored in the storage device 400 are backed up in the tape device 500 may be assumed.
  • Efficient processing of commands maybe executed if the address translation server 200 instructs the storage device 400 that random access should be applied except for backup processing and that sequential access should be applied during backup processing, since the storage device 400 is able to recognize access characteristics of commands received in advance.
  • a storage device includes a disk device and a cache
  • the command received from the address translation server 200 is recognized to be sequential access, then it is possible to execute staging to the cache in advance by issuing commands that are expected in advance to be sequential to the disk device, and then data can be directly transferred from the cache in the phase where access to the staging-completed address is provided from the address translation server 200 .
  • the staging to a cache which will be executed in advance, can easily be realized with a known technique.
  • an item such as reliability may be used in addition to performance.
  • the number of access characteristics to be recorded in a command is at least one.
  • a storage device that matches an application can be assigned to the host 100 by allowing the address translation server 200 to keep attributes of the storage device 400 in a mapping table 214 .
  • the address translation server 200 can improve the access performance by creating and issuing a command to the storage device, after recognizing the access characteristic from the host 100 , while taking into consideration the characteristic of the storage device to which the command is issued.
  • An address translation server 200 incorporates, in addition to the components shown in FIG. 1 , an external storage device attributes setting program 222 , which sets an attribute to a storage device 400 , and an access characteristics counter 221 , which counts the frequency of a command that is received from the host 100 and has a certain access characteristic.
  • the access characteristic includes performance values such as the number of I/Os per unit time and the data transfer volume per unit time, in addition to the sequential property and the local property that are determined by an address and a size specified by a command.
  • the address translation server 200 judges the access characteristic by executing an access characteristics judging program 211 .
  • the address translation server 200 executes a command creating program 217 to create a ModeSelect command, which is used to instruct a change in an LU attribute to the storage device 400 .
  • FIG. 6 is a diagram showing an exemplary configuration of the storage device 400 according to the first embodiment and the second embodiment, as well, more specifically of a disk array 400 .
  • LU 10 is theoretically built within the storage device 400 shown in FIG. 6 .
  • the disk array 400 comprises a disk controller 650 , a disk group 651 , a CPU 610 , a memory 620 , a disk cache 690 and an external interface 640 .
  • the disk group 651 incorporates a plurality of disks 670 .
  • a RAID control program 601 In the memory 620 , a RAID control program 601 , a cache management program 605 , and a mirror management program 621 are stored. These programs are executed by the CPU 610 .
  • the memory 620 incorporates a cache management table 606 in which a bit map for the cache management will be stored.
  • the RAID control program 601 is executed when the CPU 610 controls a disk array.
  • Each of the disk groups 651 has a RAID 5 , or a redundant configuration using parity, provided that the number of disks in each of the disk groups 651 as well as the RAID configuration of each disk groups 651 maybe of another configuration such as a RAID 1 , etc.
  • Data that is temporarily stored on disk 670 is stored in the disk cache 690 .
  • An external I/F 640 interfaces with other devices, and in the second embodiment of the present invention, it constitutes an interface with the address translation server 200 .
  • LUs belonging to disk groups 651 are LU 10 and LU 10 ′, respectively.
  • the same data is stored in LU 10 and LU 10 ′ (hereinafter referred to as “mirroring”).
  • LU 10 is the mirror-source LU in which original data are stored
  • LU 10 ′ is the mirror-destination LU in which copies of the original data are to be stored. In case these LUs are not managed under mirroring status, each LU will be handled as an independent LU.
  • the mirror management program 621 of the disk array 400 incorporates an LU mirror subprogram 631 and a mirror synchronizing subprogram 632 .
  • the LU mirror subprogram 631 is executed by the CPU 610 when an update for a particular LU is applied also to an another LU that is specified in advance and mirroring is performed to write the same data in the two LUs.
  • the disk array 400 executes reading from an LU on either of the two LUs, thus reducing the load on a disk.
  • the mirror synchronizing program 632 is a program executed by the CPU 610 when an initial copy is carried out to the mirror-destination LU from the mirror-source LU when the disc array 400 performs the mirroring.
  • the cache management program 605 is used as a subprogram, and it incorporates a look-ahead control program 607 , a cache resident control program 608 and a cache inhibition control program 609 .
  • the look-ahead control program 607 is executed when the CPU 610 performs a look-ahead control.
  • the cache resident program 608 is executed when the CPU 610 controls the LU resident on the disk cache 690 .
  • the cache inhibition control program 609 is executed when the CPU 610 performs a control to inhibit caching to the disk cache 690 .
  • the disk array 400 reads data other than those requested from the disk group 651 to the disk cache 690 by executing the look-ahead control program 607 , thus forecasting data to be read out in advance for a command requested by the address translation server.
  • the disk array 400 executes the cache resident control program 608 to allow data included in an LU or part of an LU to be constantly stored in the disk cache 690 .
  • the LU to be constantly stored in the disk cache 690 includes, for example, an LU that is requested to respond to the address translation server 200 at a high speed.
  • the LU attributes command receiving program 602 is executed by the CPU 610 when the ModeSelect command, which is an LU attributes command from the address translation server 200 , is received.
  • the LU information setting program 603 is executed when an LU attribute is set based on the Mode Select command, which is the LU attributes command thus received.
  • LU attributes information is stored in an LU information table 604 .
  • An LU information table 604 exists for each LU.
  • FIG. 7 is a diagram showing an example of the mapping table 214 according to the second embodiment of the present invention. Differences from the first embodiment are that the columns under the LU attributes include “look-ahead amount”, “resident in cache”, “inhibition of cache” and “mirror.” In the column “look-ahead amount”, the number of logical blocks in which a look-ahead is executed is stored. It should be noted that the value to be stored may be not the number of blocks, but a unit showing other data amounts. In columns “resident in cache”, “inhibition of cache” and “mirror,” such information as ON, which indicates that functions corresponding to the respective items are enabled, or OFF, which indicates that those functions are disabled is stored.
  • FIG. 8 is a diagram showing an example of the ModeSelect command which is issued by the address translation server 200 to the disk array 400 , according to the second embodiment of the present invention.
  • the same information as that of the first embodiment is set.
  • Information showing the look-ahead amount, the cache resident, the cache inhibition and the mirroring, which correspond to attributes to be stored in the mapping table 214 are set for the respective items #3 to #6.
  • FIG. 9 is a diagram showing an example of an access characteristics counter 221 according to the second embodiment of the present invention.
  • the access characteristics counter 221 the number of commands to be issued to an LU corresponding to an LU number, the number of commands with sequential property, and a timer value are stored.
  • the timer value shows the period of time stored for operating the counter.
  • the disk array 400 calculates the percentage of associated sequential commands by comparing the number of commands for the LU within the time period set for the timer value with that of sequential commands, thus judging the sequential property of the access.
  • the disk array 400 verifies the number of I/Os within a certain period of time set for the timer value.
  • FIG. 10 is a diagram showing the contents of the cache management table 606 .
  • the cache management table 606 are stored a cache address showing an address in the disk cache 690 , an address of LBA corresponding to the cache address, a read cache hit bit showing if a caching to an area corresponding to the cache address is inhibited or not, and a resident bit showing if data in an area corresponding to the cache address is cache resident data or not.
  • FIG. 11 is a diagram showing an operation of the address translation server 200 according to the second embodiment of the present invention.
  • a case is assumed, due to the access characteristic of the host 100 , where random access without a sequential property occurs frequently during the initial stage, and, subsequently, access with a sequential property occurs frequently.
  • the address translation server 200 executes the external storage device attributes setting program 222 to create the ModeSelect command which is an LU attributes setting command. Then, by using the command issuing program 218 , the address translation server 200 issues the ModeSelect command to the disk array 400 .
  • the values to be set by the command are determined to be zero (0) for the look-ahead amount, ON for the cache resident, OFF for the cache inhibition, and OFF for the mirroring, assuming that random access will occur frequently. If the capacity of the disk cache 690 is taken into consideration, the cache resident may be set as OFF.
  • the disk array 400 executes the LU attributes command receiving program 602 to receive the ModeSelect command thus issued. Thereafter, the disk array 400 executes the LU information setting program 603 to set the LU attributes information included in the ModeSelect command thus received to the LU information table 604 (step 2001 ).
  • the address translation server 200 monitors the commands from the host 100 , and counts the access characteristics to the access characteristics counter 221 . In the second embodiment, counting is done to determine whether access has a sequential property. The method judging sequential performance may be the same as that for the first embodiment. It should be noted that counting to the counter is not shown in FIG. 11 since the attribution is executed without the synchronization with steps 2001 and thereafter.
  • the address translation server 200 refers to the access characteristics counter 221 from time to time (step 2003 ), and examines, with regard to the volume of commands received from the host 100 , whether sequential access has exceeded a certain level of threshold value (step 2004 ). If sequential access has not exceeded the threshold value, the address translation server 200 executes the processing of step 2003 .
  • the address translation server 200 executes the external storage device attributes setting program 222 to create the ModeSelect command, which is an LU attributes setting command. Then, the address translation server 200 executes the command issuing program 218 to issue the ModeSelect command to the disk array 400 .
  • the ModeSelect command to be created at this time is a value obtained by assuming that sequential access occurs frequently; the look-ahead amount may be, for example, 32 blocks, and the cache inhibition may be ON.
  • the reason why the cache inhibition may be set as ON, that is, why no caching is made, is that it is highly likely that the data requested may have been deleted from the disk cache 690 before the data could be reused, since accesses from the host 100 been sequential (step 2005 ).
  • the address translation server 200 executes the processing of the step 2003 after issuing the command.
  • the address translation server 200 can count, in similar processing procedures, changes in access characteristics by using the access characteristics counter 221 , and issue, depending on the status, a ModeSelect command, which is an LU attributes setting command, to the disk array 400 .
  • a ModeSelect command which is an LU attributes setting command
  • the address translation server 200 recognizes the access characteristic of a command received from the host 100 , the recognition may be performed by transmitting a command with attributes from the host 100 , as is the case with the flow of FIG. 5 described for the first embodiment.
  • the disk array 400 which receives a ModeSelect command that is an LU attributes setting command will be described.
  • the disk array 400 receives a ModeSelect command, which is an LU attributes setting command, by executing the LU attributes command receiving program 602 . Thereafter, the disk array 400 executes the LU information setting program 603 to store contents included in the LU attributes setting command in the LU information table 604 . Then, the disk array 400 determines control methods for the look-ahead amount, the cache resident, the cache inhibition and the mirroring functions respectively, according to the information stored in the LU information table 604 .
  • a ModeSelect command which is an LU attributes setting command
  • the disk array 400 executes the LU information setting program 603 to store contents included in the LU attributes setting command in the LU information table 604 . Then, the disk array 400 determines control methods for the look-ahead amount, the cache resident, the cache inhibition and the mirroring functions respectively, according to the information stored in the LU information table 604 .
  • the disk array 400 In case the look-ahead amount of the LU information table 604 is not equivalent to zero (0), responding to the command requested by the address translation server 200 and based on the look-ahead amount set in the LU information table 604 , the disk array 400 reads data covering the look-ahead amount thus set from the disk group 651 to the disk cache 690 . If the cache resident information of the LU information table 604 is ON, the disk array 400 controls the disk cache 690 to make the LU concerned or a part of the LU resident in the disk cache 690 . If the information is OFF, the disk array 400 controls the disk cache 690 make it non-resident.
  • the disk array 400 executes the RAID control program 601 to refer to the cache management table 606 , and judges if an address of LBA, which is stored in association with a cache address showing a position within the disk cache 690 , is an address showing the position of data that should be resident in the cache. If cache inhibition information in the LU information table 604 is ON, the disk array 400 controls the disk cache 690 to inhibit the caching of an LU or apart of an LU in the disk cache 690 . If the information is OFF, the disk array 400 controls and does not inhibit the caching.
  • the disk array 400 executes the RAID control program 601 to refer to the cache management table 606 , and the disk array 400 judges whether or not the LBA address stored in association with a cache address showing a position in the disk cache 690 is the address of data whose caching should be inhibited. Further, if mirroring information in the LU information table 604 is ON, the disk array 400 executes the mirror management program 621 to perform a mirroring of access to LU 10 over to LU 10 ′.
  • the disk array 400 first executes the mirror synchronizing program 632 to copy LU 10 to LU 10 ′. Thereafter, the disk array 400 performs the mirroring of access to LU 10 by using the LU mirror subprogram 631 . In a case where loads for random access become large, the disk array 400 can improve the value of reading performance by increasing the number of disks based on an instruction from the address translation server 200 , as a result of setting the mirror information in the LU information table 604 ON to change it to the mirror attribute.
  • the performance value of the disk array 400 is changed by performing the mirroring, but an another method may be used to change the performance value of the disk array 400 , for example, by changing the RAID level for LU 10 , or by adding disk devices in a RAID configuration.
  • the mirroring may be a multiple mirroring, for example, a triple mirroring or greater.
  • the disk array 400 can respond adequately to an access request from the host 100 when the address translation server 200 can dynamically designate attributes that are necessary for the disk array 400 . Further, in the second embodiment, in a case, for example, where the number of commands issued by the host 100 has increased along with an increased number of clients, the disk array 400 can increase the number of disk devices to be used for processing based on an instruction from the address translation server 200 , thus enabling response to requests from the host 100 .
  • an address translation server which realizes virtualization can assign a storage device that matches an intended application to a host computer. Furthermore, the use of the address translation server can improve the access performance of the whole system.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

An address translation server recognizes and stores a characteristic of a storage device which forms a memory area. The address translation server creates and issues a command to the storage device by recognizing an access characteristic from a host computer and taking into consideration a characteristic of a storage device to which the command is issued. In addition, the storage device provides a resource which matches a requested access characteristic. An external storage device attributes setting program is provided so that the address translation server can dynamically execute, in accordance with changes in the access status from the host computer, an instruction to change an attribute to a storage device which forms a memory area. Furthermore, the storage device incorporates an LU attribute command receiving program which is used to change the internal control for obtaining a requested attribute.

Description

BACKGROUND OF THE INVENTION
The present invention relates to a storage device, and more particularly to a technique that virtually assigns a plurality of storage devices into one memory area and provides the area to a computer.
There is known a technique called “virtualation,” which virtualizes a memory area available in a plurality of storage devices and provides the virtualized memory area as one or a plurality of virtual memory areas to a computer. A storage device is usually constructed with disk arrays. The disk array known as a Redundant Arrays of Independent Disks (RAID) and is a memory device where many disk drive devices are configured in arrays. Details of the disk array are described in: “A Case for Redundant Arrays of Inexpensive Disks (RAID).”
In addition, Japanese Patent Laid-open No. 6-161837 discloses a technique wherein, at the time of assigning a file to a user of a memory area available in a memory device, a storage device will automatically select a memory area that is optimum for assigning files, based on file attributes or file information such as type of space assignment.
SUMMARY OF THE INVENTION
If a single storage device is used in a computer system, the user of the system can use a storage device that is suitable for the application of the data to be used. However, if a virtualization technique is introduced into the system, the user cannot specify and use a storage device that is suitable for the application of data to be used, since virtual memory areas, which are formed by a plurality of storage devices, are used.
In addition, even if a storage device that is suitable for the application of data is assigned to the user in a specified manner, the computer system concerned cannot deal with changes if the application of data used by the user varies.
To solve the above-stated problems with the present invention, an address translation server that realizes the virtualization recognizes and stores characteristics of a storage device that forms a storage pool. Further, the address translation server recognizes access characteristics from a host computer and creates and issues a command to the storage device, in view of the characteristics of the storage device to which the command is issued. In addition, the storage device provides a resource that matches the access characteristics required.
Furthermore, in order to solve the above-described problems, the present invention incorporates an external-storage-device attributes acquisition program where an address translation server that realizes the virtualization can dynamically execute an attribute change instruction to a storage device which forms a storage pool, according to changes in the access status from the host computer. In addition, the storage device changes the internal control so that the attributes an be provided as required.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a system configuration diagram according to a first embodiment of the present invention;
FIG. 2 is an explanatory diagram of a mapping table according to the first embodiment of the present invention;
FIG. 3 is an explanatory diagram of storage LU information according to the first embodiment of the present invention;
FIG. 4 is an explanatory diagram of commands according to the first embodiment of the present invention;
FIG. 5 is a flow chart of address translation and a command issuing operation according to the first embodiment of the present invention;
FIG. 6 is a structural diagram of Disk Array 400;
FIG. 7 is an explanatory diagram of a mapping table according to a second embodiment of the present invention;
FIG. 8 is an explanatory diagram of commands according to the second embodiment of the present invention;
FIG. 9 is an explanatory diagram of an access characteristics counter according to the second embodiment of the present invention;
FIG. 10 is an explanatory diagram of a cache management table according to the second embodiment of the present invention; and
FIG. 11 is a flow chart of LU attributes designating the command issuing operation according to the second embodiment of the present invention.
DESCRIPTION OF THE PREFERRED EMBODIMENTS
FIG. 1 is a diagram showing a first embodiment of a computer system wherein the present invention is applied. In FIG. 1, the computer system comprises a host computer 100 (hereinafter referred to as the “host 100”), an address translation computer 200 (hereinafter referred to as the “address translation server 200”), a plurality of storage devices 400, and a tape device 500. It should be noted that the storage device 400 may be either a single disk device, or a storage device system, which is a combination of a plurality of disk devices, such as a RAID and a control device. In addition, the plurality of storage devices 400 may be either a combination of the same storage devices or a combination of different storage devices.
A memory area that is available in each of the storage devices 400 is called a logical unit (LU). Hereinafter, the LUs of the plurality of storage devices 400 will be referred to as LU10, LU11, LU12 and LU13, respectively. It should be noted, however, the LU available in the storage device 400 may either be one that corresponds one-on-one to a disk device available in the storage devices 400, or one that corresponds to a memory area that extends over the plurality of disk devices available in the storage devices 400.
In the present invention, an attribute of an LU available in each of the storage devices 400 is recognized by the address translation server 200. The attribute of the LU may include not only functions that are realized by the storage device 400, but basic characteristics of the storage device 400, such as access performance from the host 100 and reliability. An example of the functions might include the one to enable/disable the recognition of special commands issued by the address translation server 200.
The tape device 500 is connected to the host 100 and is used to back up data.
The host 100 and the address translation server 200, and likewise, the address translation server 200 and the storage device 400, are respectively connected to each other via a communication line. The communication line used may be any of those lines, for example, where IP protocols are used, or where fiber channel protocols are used.
LU_300 is a virtual memory area that is to be recognized by the host 100, and has an area A_310 and an area B_320. In the first embodiment, area A_310 is associated with LU10, while area B_320 is associated with LU11. Although area A_310 and area B_320 are memory areas available in respective storage devices, which are practically different each other, LU_300 is recognized as a series of logical units by the host 100.
The address translation server 200 has a CPU 220, a memory 230, a command receiving unit 210, which receives commands from the host 100, an access characteristics judging program 211, which judges access characteristics of commands from the host 100, an access characteristics receiving program 212, which receives access characteristics information that is included in commands from the host 100, an address translation program 213, a mapping table 214, a mapping table creating program 215, an external storage device attributes acquisition program 216, a command creating program 217, a command issuing program 218 and an LU information storage unit 219. The CPU 220 executes the above-described programs.
The address translation program 213 is executed when the address of LU_300 designated by the host 100 is translated into addresses showing LU10, LU11, LU12 and LU13. In the mapping table 214, relationships with LU_300,_LUs 10 to 13 and attributes of LUs 10 to 13 are registered. The mapping table creating program 215 is executed, according to the configuration of the storage device 400 that the computer system has, at the time the mapping table 214 is created.
The external-storage-device attributes acquisition program 216 is executed when information indicating attributes of the storage device 400 is acquired from the storage device. The command creating program 217 is executed when commands, such as a command to write data to the storage device 400, are created.
FIG. 2 is a diagram showing an example of the mapping table 214. The mapping table 214 registers an LU number (LUN) and a logical block address (LBA) of the virtual host LU_300 that can be recognized by the host 100, the virtual LBAs of area A_310 and area B_320, which are virtual areas, as well as the LUNs, LBAs and attributes of LU10, LU11, LU12 and LU13, which are memory areas of respective storage devices 400. In addition, the mapping table 214 records the relationship among various registered items.
Attributes of the storage device 400 include sequential access performance, random access performance, evaluation of reliability, the function to enable/disable recognition of commands with attributes, etc. of the storage device 400. In the first embodiment of the present invention, evaluation values for sequential access performance, random access performance and reliability are indicated in five steps, and a higher evaluation is given to those steps that have a higher number.
FIG. 3 is a diagram showing an example of storage LU information of respective storage devices 400 that is obtained by the address translation server 200 through the execution of the external storage device attributes acquisition program 216. The storage LU information includes storage attributes and evaluation values thereof The storage attributes include LU number, capacity, sequential access performance, random access performance, reliability and the function to enable/disable the recognition of commands with attributes. Depending on the storage LU information to be obtained from the storage device 400, some values for information of attributes shown in FIG. 3 are not set.
FIG. 4 is a diagram showing the contents of commands issued by the host 100 and the address translation server 200. Of the items shown in FIG. 4, those corresponding to #1 to #3 are contents that are included in typical commands to be used for reading and writing. More specifically, operation details such as read/write are set to the item “Op codes”. Moreover, LBA values are set to the item “LBA”, and information indicating size is set to the item “Size.”
For the items corresponding to #4 to #6, information showing access characteristics is set. For the item “sequential property”, information that shows whether the command for sequential data access is set. For the item “random property”, information that shows whether the command for random access is set. For the item “reliability”, information that shows what level of reliability is necessary for the data is set.
Hereinafter, how to create mapping table 214 is described.
At the time of the initial start up of the system, the address translation server 200 executes the mapping table creating program 215 to retrieve the storage devices 400, which are connected to the address translation server 200. Thereafter, the address translation server 200 executes the external-storage-device attributes acquisition program 216 to acquire LU information from the storage device 400, and stores it in the LU information storage unit 219.
Then, the address translation server 200 issues commands such as ModeSense to the storage device 400 to acquire the storage LU information. Thereafter, the address translation server 200 executes the mapping table creating program 215, based on the storage LU information thus acquired, to record values for the storage device LU and storage device attributes in respective items of the mapping table 214.
It should be noted that, depending on storage LU information, there are items to which the storage device 400 may not respond. If this is the case, the address translation server 200 executes the external-storage-device attributes acquisition program 216 to issue read/write commands to the storage device 400 and measures performance values according to access characteristics. Thereafter, the address translation server 200 records values in items of storage device attributes based on the measurements.
In addition, upon receiving the request for creating an LU from the host 100, the address translation server 200 executes the mapping table, creating program 215 to form a host LU by combining the available LUs of storage devices, based on storage attribute values such as the storage capacity, performance, and reliability of the LUs thus requested. At the time the creation of an LU is requested, the storage capacity and the storage attribute values may be sent to the address translation server 200 from the host 100.
FIG. 2 shows that a LUN0, which is a certain LU_300, has a storage capacity of 200 blocks. LUN_0 consists of an area A_310 and an area B_320, and, further, area A_310 comprises LBA_0 to LBA_99 of LU10 and area B_320 comprises LBA 0 to LBA_99 of LU11. Both LU10 and LU11 are associated with storage device attributes.
FIG. 5 is a flow chart showing processing of the address translation and the issuance of commands in the address translation server 200. In FIG. 5, sequential access is assumed as an access characteristic, but any other characteristics may be used.
First, the address translation server 200 receives a command issued by the host 100 at the command receiving unit 210 (step 1001). The address translation server 200 executes the access characteristics judging program 211 to judge, based on a plurality of commands received, whether the access characteristic is sequential or not. There are various methods for the judgment of the access characteristic, and the judgment can easily be made with a known method. Therefore, the description of the methods has been omitted (step 1002).
If the characteristic of a received command is not sequential, the address translation server 200 creates a command which does not have any access attributes (step 1003), and issues a command to the storage device 400 (step 1004). Otherwise, if the characteristic of a received command is sequential, the address translation server 200 refers to the mapping table 214 (step 1006), and judges whether or not the LU of the storage device 400 which corresponds to the command can recognize a command with attributes (step 1007).
If the storage device 400 can recognize a command with attributes, the address translation server 200 creates a command to which an access attribute showing a sequential characteristic is added (step 1008), and issues a command to the storage device 400 (step 1004). It should be noted that in order to add an attribute of sequential access to a command, a value which indicates “Yes” is filled in in the description column for “sequential property” of the command. In addition, a plurality of commands that are judged to be sequential maybe put together for a single command.
Further, in step 1007, if it is judged that the storage device 400 cannot recognize a command with attributes, the address translation server 200 creates a command that does not have any access attributes (step 1009), and issues a command to the storage device 400. In this case, a plurality of commands that are judged to be sequential may be put together for a single command (step 1004).
It should be noted, however, that if an access characteristic is recorded in the command received from the host 100, the address translation server 200 may, in step 1002, judge the sequential characteristic of the command by using the access characteristics receiving program 212 instead of using the access characteristics judging program 211.
For example, a host 100 that issues a lot of random access may be assumed to be used, and further, a case where data that are stored in the storage device 400 are backed up in the tape device 500 may be assumed. Efficient processing of commands maybe executed if the address translation server 200 instructs the storage device 400 that random access should be applied except for backup processing and that sequential access should be applied during backup processing, since the storage device 400 is able to recognize access characteristics of commands received in advance.
For a case where a storage device includes a disk device and a cache, if the command received from the address translation server 200 is recognized to be sequential access, then it is possible to execute staging to the cache in advance by issuing commands that are expected in advance to be sequential to the disk device, and then data can be directly transferred from the cache in the phase where access to the staging-completed address is provided from the address translation server 200. This makes it possible to make the command processing more efficient. The staging to a cache, which will be executed in advance, can easily be realized with a known technique.
For an access characteristic to be recorded in a command that is received from the host 100, an item such as reliability may be used in addition to performance. The number of access characteristics to be recorded in a command is at least one.
According to the first embodiment of the present invention, a storage device that matches an application can be assigned to the host 100 by allowing the address translation server 200 to keep attributes of the storage device 400 in a mapping table 214.
Furthermore, the address translation server 200 can improve the access performance by creating and issuing a command to the storage device, after recognizing the access characteristic from the host 100, while taking into consideration the characteristic of the storage device to which the command is issued.
Next, a system configuration of a second embodiment according to the present invention is described. Differences from the system shown in FIG. 1 are as follows: An address translation server 200 incorporates, in addition to the components shown in FIG. 1, an external storage device attributes setting program 222, which sets an attribute to a storage device 400, and an access characteristics counter 221, which counts the frequency of a command that is received from the host 100 and has a certain access characteristic. The access characteristic includes performance values such as the number of I/Os per unit time and the data transfer volume per unit time, in addition to the sequential property and the local property that are determined by an address and a size specified by a command.
The address translation server 200 judges the access characteristic by executing an access characteristics judging program 211. The address translation server 200 executes a command creating program 217 to create a ModeSelect command, which is used to instruct a change in an LU attribute to the storage device 400.
FIG. 6 is a diagram showing an exemplary configuration of the storage device 400 according to the first embodiment and the second embodiment, as well, more specifically of a disk array 400. LU10 is theoretically built within the storage device 400 shown in FIG. 6. The disk array 400 comprises a disk controller 650, a disk group 651, a CPU 610, a memory 620, a disk cache 690 and an external interface 640. The disk group 651 incorporates a plurality of disks 670.
In the memory 620, a RAID control program 601, a cache management program 605, and a mirror management program 621 are stored. These programs are executed by the CPU 610. In addition, the memory 620 incorporates a cache management table 606 in which a bit map for the cache management will be stored.
The RAID control program 601 is executed when the CPU 610 controls a disk array. Each of the disk groups 651 has a RAID5, or a redundant configuration using parity, provided that the number of disks in each of the disk groups 651 as well as the RAID configuration of each disk groups 651 maybe of another configuration such as a RAID1, etc. Data that is temporarily stored on disk 670 is stored in the disk cache 690. An external I/F 640 interfaces with other devices, and in the second embodiment of the present invention, it constitutes an interface with the address translation server 200.
Access to a memory area in the disk group 651 is made as a logical unit (LU) as determined in the SCSI standards. LUs belonging to disk groups 651 are LU10 and LU10′, respectively. In the second embodiment of the present invention, the same data is stored in LU10 and LU10′ (hereinafter referred to as “mirroring”). LU10 is the mirror-source LU in which original data are stored, and LU10′ is the mirror-destination LU in which copies of the original data are to be stored. In case these LUs are not managed under mirroring status, each LU will be handled as an independent LU.
The mirror management program 621 of the disk array 400 incorporates an LU mirror subprogram 631 and a mirror synchronizing subprogram 632. The LU mirror subprogram 631 is executed by the CPU 610 when an update for a particular LU is applied also to an another LU that is specified in advance and mirroring is performed to write the same data in the two LUs. In addition, the disk array 400 executes reading from an LU on either of the two LUs, thus reducing the load on a disk.
It should be noted that the disk array 400 performs the mirroring of LU10 to LU10′, but it is also possible to limit the data writing only to LU10, as is the normal case. The mirror synchronizing program 632 is a program executed by the CPU 610 when an initial copy is carried out to the mirror-destination LU from the mirror-source LU when the disc array 400 performs the mirroring.
The cache management program 605 is used as a subprogram, and it incorporates a look-ahead control program 607, a cache resident control program 608 and a cache inhibition control program 609. The look-ahead control program 607 is executed when the CPU 610 performs a look-ahead control. The cache resident program 608 is executed when the CPU 610 controls the LU resident on the disk cache 690. The cache inhibition control program 609 is executed when the CPU 610 performs a control to inhibit caching to the disk cache 690.
The disk array 400 reads data other than those requested from the disk group 651 to the disk cache 690 by executing the look-ahead control program 607, thus forecasting data to be read out in advance for a command requested by the address translation server. In addition, the disk array 400 executes the cache resident control program 608 to allow data included in an LU or part of an LU to be constantly stored in the disk cache 690. The LU to be constantly stored in the disk cache 690 includes, for example, an LU that is requested to respond to the address translation server 200 at a high speed.
Further, the LU attributes command receiving program 602 is executed by the CPU 610 when the ModeSelect command, which is an LU attributes command from the address translation server 200, is received. The LU information setting program 603 is executed when an LU attribute is set based on the Mode Select command, which is the LU attributes command thus received. LU attributes information is stored in an LU information table 604. An LU information table 604 exists for each LU.
FIG. 7 is a diagram showing an example of the mapping table 214 according to the second embodiment of the present invention. Differences from the first embodiment are that the columns under the LU attributes include “look-ahead amount”, “resident in cache”, “inhibition of cache” and “mirror.” In the column “look-ahead amount”, the number of logical blocks in which a look-ahead is executed is stored. It should be noted that the value to be stored may be not the number of blocks, but a unit showing other data amounts. In columns “resident in cache”, “inhibition of cache” and “mirror,” such information as ON, which indicates that functions corresponding to the respective items are enabled, or OFF, which indicates that those functions are disabled is stored.
FIG. 8 is a diagram showing an example of the ModeSelect command which is issued by the address translation server 200 to the disk array 400, according to the second embodiment of the present invention. For items #1 and #2, the same information as that of the first embodiment is set. Information showing the look-ahead amount, the cache resident, the cache inhibition and the mirroring, which correspond to attributes to be stored in the mapping table 214, are set for the respective items #3 to #6.
FIG. 9 is a diagram showing an example of an access characteristics counter 221 according to the second embodiment of the present invention. In the access characteristics counter 221, the number of commands to be issued to an LU corresponding to an LU number, the number of commands with sequential property, and a timer value are stored. The timer value shows the period of time stored for operating the counter.
For example, in a case where the disk array 400 examines a sequential property of access, the disk array 400 calculates the percentage of associated sequential commands by comparing the number of commands for the LU within the time period set for the timer value with that of sequential commands, thus judging the sequential property of the access.
Further, in a case where the disk array 400 examines a performance value requested by the host 100 using the access characteristics counter 221, the disk array 400 verifies the number of I/Os within a certain period of time set for the timer value.
FIG. 10 is a diagram showing the contents of the cache management table 606. In the cache management table 606 are stored a cache address showing an address in the disk cache 690, an address of LBA corresponding to the cache address, a read cache hit bit showing if a caching to an area corresponding to the cache address is inhibited or not, and a resident bit showing if data in an area corresponding to the cache address is cache resident data or not.
FIG. 11 is a diagram showing an operation of the address translation server 200 according to the second embodiment of the present invention. In FIG. 11, a case is assumed, due to the access characteristic of the host 100, where random access without a sequential property occurs frequently during the initial stage, and, subsequently, access with a sequential property occurs frequently.
For example, when the host 100 uses the address translation server 200 as storage for a database, random access will occur in typical database accessing. However, in a case where the host 100 backs up the data to tape device 500 or the like, it is necessary for the host 100 to perform an operation to read sequential addresses from the storage device it is using, thus making the access sequential
First, the address translation server 200 executes the external storage device attributes setting program 222 to create the ModeSelect command which is an LU attributes setting command. Then, by using the command issuing program 218, the address translation server 200 issues the ModeSelect command to the disk array 400. For the command to be generated at this time, the values to be set by the command are determined to be zero (0) for the look-ahead amount, ON for the cache resident, OFF for the cache inhibition, and OFF for the mirroring, assuming that random access will occur frequently. If the capacity of the disk cache 690 is taken into consideration, the cache resident may be set as OFF.
It should be noted that the disk array 400 executes the LU attributes command receiving program 602 to receive the ModeSelect command thus issued. Thereafter, the disk array 400 executes the LU information setting program 603 to set the LU attributes information included in the ModeSelect command thus received to the LU information table 604 (step 2001).
The address translation server 200 monitors the commands from the host 100, and counts the access characteristics to the access characteristics counter 221. In the second embodiment, counting is done to determine whether access has a sequential property. The method judging sequential performance may be the same as that for the first embodiment. It should be noted that counting to the counter is not shown in FIG. 11 since the attribution is executed without the synchronization with steps 2001 and thereafter.
The address translation server 200 refers to the access characteristics counter 221 from time to time (step 2003), and examines, with regard to the volume of commands received from the host 100, whether sequential access has exceeded a certain level of threshold value (step 2004). If sequential access has not exceeded the threshold value, the address translation server 200 executes the processing of step 2003.
If sequential access has exceeded the threshold value, the address translation server 200 executes the external storage device attributes setting program 222 to create the ModeSelect command, which is an LU attributes setting command. Then, the address translation server 200 executes the command issuing program 218 to issue the ModeSelect command to the disk array 400.
The ModeSelect command to be created at this time is a value obtained by assuming that sequential access occurs frequently; the look-ahead amount may be, for example, 32 blocks, and the cache inhibition may be ON. The reason why the cache inhibition may be set as ON, that is, why no caching is made, is that it is highly likely that the data requested may have been deleted from the disk cache 690 before the data could be reused, since accesses from the host 100 been sequential (step 2005). The address translation server 200 executes the processing of the step 2003 after issuing the command.
In addition to the example of sequential access described in FIG. 11, the address translation server 200 can count, in similar processing procedures, changes in access characteristics by using the access characteristics counter 221, and issue, depending on the status, a ModeSelect command, which is an LU attributes setting command, to the disk array 400. For example, in a case where the number of I/Os per unit time from the host 100 has increased and the capacity of disk devices currently in operation is insufficient to handle such increases, it is possible to set the item “mirror” to ON to increase the number of disk devices being used for the I/Os.
In addition, when the address translation server 200 recognizes the access characteristic of a command received from the host 100, the recognition may be performed by transmitting a command with attributes from the host 100, as is the case with the flow of FIG. 5 described for the first embodiment. Next, operations of the disk array 400 which receives a ModeSelect command that is an LU attributes setting command will be described.
The disk array 400 receives a ModeSelect command, which is an LU attributes setting command, by executing the LU attributes command receiving program 602. Thereafter, the disk array 400 executes the LU information setting program 603 to store contents included in the LU attributes setting command in the LU information table 604. Then, the disk array 400 determines control methods for the look-ahead amount, the cache resident, the cache inhibition and the mirroring functions respectively, according to the information stored in the LU information table 604.
In case the look-ahead amount of the LU information table 604 is not equivalent to zero (0), responding to the command requested by the address translation server 200 and based on the look-ahead amount set in the LU information table 604, the disk array 400 reads data covering the look-ahead amount thus set from the disk group 651 to the disk cache 690. If the cache resident information of the LU information table 604 is ON, the disk array 400 controls the disk cache 690 to make the LU concerned or a part of the LU resident in the disk cache 690. If the information is OFF, the disk array 400 controls the disk cache 690 make it non-resident.
More specifically, the disk array 400 executes the RAID control program 601 to refer to the cache management table 606, and judges if an address of LBA, which is stored in association with a cache address showing a position within the disk cache 690, is an address showing the position of data that should be resident in the cache. If cache inhibition information in the LU information table 604 is ON, the disk array 400 controls the disk cache 690 to inhibit the caching of an LU or apart of an LU in the disk cache 690. If the information is OFF, the disk array 400 controls and does not inhibit the caching.
More specifically, the disk array 400 executes the RAID control program 601 to refer to the cache management table 606, and the disk array 400 judges whether or not the LBA address stored in association with a cache address showing a position in the disk cache 690 is the address of data whose caching should be inhibited. Further, if mirroring information in the LU information table 604 is ON, the disk array 400 executes the mirror management program 621 to perform a mirroring of access to LU10 over to LU10′.
More specifically, the disk array 400 first executes the mirror synchronizing program 632 to copy LU10 to LU10′. Thereafter, the disk array 400 performs the mirroring of access to LU10 by using the LU mirror subprogram 631. In a case where loads for random access become large, the disk array 400 can improve the value of reading performance by increasing the number of disks based on an instruction from the address translation server 200, as a result of setting the mirror information in the LU information table 604 ON to change it to the mirror attribute.
It should be noted that, in the second embodiment, the performance value of the disk array 400 is changed by performing the mirroring, but an another method may be used to change the performance value of the disk array 400, for example, by changing the RAID level for LU10, or by adding disk devices in a RAID configuration. In addition, the mirroring may be a multiple mirroring, for example, a triple mirroring or greater.
According to the second embodiment of the present invention, the disk array 400 can respond adequately to an access request from the host 100 when the address translation server 200 can dynamically designate attributes that are necessary for the disk array 400. Further, in the second embodiment, in a case, for example, where the number of commands issued by the host 100 has increased along with an increased number of clients, the disk array 400 can increase the number of disk devices to be used for processing based on an instruction from the address translation server 200, thus enabling response to requests from the host 100.
As described above, according to the present invention, an address translation server which realizes virtualization can assign a storage device that matches an intended application to a host computer. Furthermore, the use of the address translation server can improve the access performance of the whole system.

Claims (8)

1. A system for storing data comprising:
a plurality of disk array systems (400 in FIGS. 1 and 6), each of which comprises plural disks (670 in FIG. 6) and a processor (610 in FIG. 6) controlling the plural disks; and
an address translation server (200 in FIG. 1) coupled to each of the plurality of disk array systems and a host computer (100 in FIG. 1),
wherein the address translation server includes a mapping table (214) which stores a correspondence between a virtual address designated by the host computer and a block address sent to a disk array system and which stores information indicating whether or not the disk array system having a storage area indicated by the corresponding block address can process a command having an associated attribute,
wherein the address translation server is configured to:
receive a first command from a host computer (FIG. 5 step 1001), the first command having an associated first attribute;
convert a virtual address included in the received command to a block address based on the mapping table;
determine whether or not a disk array system having a storage area indicated by the block address can process a command that is associated with an attribute based on the mapping table (FIG. 5 step 1007);
if the disk array system can process a command having an associated attribute (FIG. 5 step 1008), then create a second command that is associated with a second attribute; and
send the second command to the disk array system (FIG. 5 step 1004), the second attribute being based on the first attribute.
2. A system for storing data according to claim 1, wherein
the first command is a command including information indicating sequential access characteristics, and
the address translation server is further configured to determine whether or not the first command requests a sequential access (FIG. 5 step 1002), and if the first command requests a sequential access to determine whether or not a disk array system having a storage area indicated by the block address can process a command that is associated with an attribute.
3. A system for storing data according to claim 2, wherein
the first command includes information indicating access characteristics, and the address translation server is configured to check whether or not the first command host computer requests a sequential access based on the information included in the first command.
4. A system for storing data according to claim 2, wherein
the address translation server is further coupled to a tape device, and
the address translation server is further configured to send the second command including information indicating sequential access characteristic when data stored in the disk array system is backed up on the tape device.
5. A method for storing data comprising:
receiving a first command from a host computer;
translating an address indicated in the first command to a block-address, the block address identifying an area of storage in a disk array system;
if the first command is associated with an attribute, then determining whether the disk array system provides a command that corresponds to the first command and that can be invoked with the attribute;
if the disk array system provides a command that corresponds to the first command and that can be invoked with the attribute, then generating a second command which corresponds to the first command and which includes the attribute, and sending the second command to the disk array system; and
if the disk array system does not provide a command that corresponds to the first command and that can be invoked with the attribute, then generating a third command which corresponds to the first command and which does not include the attribute, and sending the third command to the disk array system.
6. A method for storing data according to claim 5, wherein the first command is a command including information indicating sequential access characteristics, the further comprising determining whether or not the first command requests a sequential access, and if the first command requests a sequential access, then determining whether or not a disk array system having a storage area indicated by the block address can process a command that is associated with an attribute.
7. A method for storing data according to claim 6, wherein the first command includes information indicating access characteristics, the method further comprising determining whether, or not the first command host computer requests a sequential access based on the information included in the first command.
8. A method for storing data according to claim 6, wherein the address translation server is further coupled to a tape device, the method further comprising sending the second command including information indicating sequential access characteristic when data stored in the disk array system is backed up on the tape device.
US10/282,863 2002-05-29 2002-10-28 Computer system Expired - Fee Related US6941439B2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2002-154945 2002-05-29
JP2002154945A JP2003345514A (en) 2002-05-29 2002-05-29 Computer system

Publications (2)

Publication Number Publication Date
US20030225993A1 US20030225993A1 (en) 2003-12-04
US6941439B2 true US6941439B2 (en) 2005-09-06

Family

ID=29561390

Family Applications (1)

Application Number Title Priority Date Filing Date
US10/282,863 Expired - Fee Related US6941439B2 (en) 2002-05-29 2002-10-28 Computer system

Country Status (2)

Country Link
US (1) US6941439B2 (en)
JP (1) JP2003345514A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050246490A1 (en) * 2003-02-04 2005-11-03 Hitachi, Ltd. Disk control system and control method of disk control system
US20060064541A1 (en) * 2004-09-17 2006-03-23 Hitachi Ltd. Method of and system for controlling attributes of a plurality of storage devices
US20100036897A1 (en) * 2004-06-15 2010-02-11 Seisuke Tokuda Storage system
US20120011317A1 (en) * 2010-07-06 2012-01-12 Fujitsu Limited Disk array apparatus and disk array control method

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8180872B1 (en) 2001-06-29 2012-05-15 Symantec Operating Corporation Common data model for heterogeneous SAN components
US7886031B1 (en) 2002-06-04 2011-02-08 Symantec Operating Corporation SAN configuration utility
US7194538B1 (en) 2002-06-04 2007-03-20 Veritas Operating Corporation Storage area network (SAN) management system for discovering SAN components using a SAN management server
US8019849B1 (en) 2002-09-13 2011-09-13 Symantec Operating Corporation Server-side storage area network management interface
US7401338B1 (en) 2002-09-27 2008-07-15 Symantec Operating Corporation System and method for an access layer application programming interface for managing heterogeneous components of a storage area network
US7885256B1 (en) 2003-05-30 2011-02-08 Symantec Operating Corporation SAN fabric discovery
JP4266725B2 (en) 2003-06-27 2009-05-20 株式会社日立製作所 Storage system
US7096325B2 (en) * 2004-03-29 2006-08-22 Hitachi, Ltd. Method and apparatus for multistage volume locking
JP2005302152A (en) * 2004-04-12 2005-10-27 Sony Corp Composite type storage device, data writing method, and program
US7782845B2 (en) * 2004-11-19 2010-08-24 International Business Machines Corporation Arbitrated loop address management apparatus method and system
JP4806556B2 (en) * 2005-10-04 2011-11-02 株式会社日立製作所 Storage system and configuration change method
US7721053B2 (en) 2005-10-24 2010-05-18 Hewlett-Packard Development Company, L.P. Intelligent logical unit provisioning
US20070180210A1 (en) * 2006-01-31 2007-08-02 Seagate Technology Llc Storage device for providing flexible protected access for security applications
JP4839133B2 (en) * 2006-05-22 2011-12-21 株式会社日立製作所 Data management method and computer system for storage apparatus
US7925758B1 (en) 2006-11-09 2011-04-12 Symantec Operating Corporation Fibre accelerated pipe data transport
JP5134915B2 (en) * 2007-11-02 2013-01-30 株式会社日立製作所 Storage area configuration optimization method, computer system, and management computer
JP2009116834A (en) 2007-11-09 2009-05-28 Sony Corp Data recording device, method for internal control of data recording device, and data recording system
JP5206103B2 (en) * 2008-05-09 2013-06-12 日本電気株式会社 Storage device, storage device control system, storage device control method, and program
US8464003B2 (en) * 2010-02-17 2013-06-11 Hitachi, Ltd. Method and apparatus to manage object based tier
US8711864B1 (en) 2010-03-30 2014-04-29 Chengdu Huawei Symantec Technologies Co., Ltd. System and method for supporting fibre channel over ethernet communication
US8255634B2 (en) * 2010-08-11 2012-08-28 Lsi Corporation Apparatus and methods for look-ahead virtual volume meta-data processing in a storage controller
US8176218B2 (en) 2010-08-11 2012-05-08 Lsi Corporation Apparatus and methods for real-time routing of received commands in a split-path architecture storage controller
US8261003B2 (en) 2010-08-11 2012-09-04 Lsi Corporation Apparatus and methods for managing expanded capacity of virtual volumes in a storage system
JPWO2020095583A1 (en) * 2018-11-09 2021-09-02 パナソニックIpマネジメント株式会社 Storage device
US12334108B1 (en) * 2024-04-29 2025-06-17 Seagate Technology Llc Speculative read regulation in hard disk drives having read reliability budgets

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH06161837A (en) 1992-11-20 1994-06-10 Hitachi Ltd System for selecting volume on external storage device
US5530821A (en) * 1991-08-02 1996-06-25 Canon Kabushiki Kaisha Method and apparatus including independent virtual address translation
US6272492B1 (en) * 1997-11-21 2001-08-07 Ibm Corporation Front-end proxy for transparently increasing web server functionality
US20010034733A1 (en) * 2000-03-03 2001-10-25 Michel Prompt System and method for providing access to databases via directories and other hierarchical structures and interfaces
US6567883B1 (en) * 1999-08-27 2003-05-20 Intel Corporation Method and apparatus for command translation and enforcement of ordering of commands
US20030182501A1 (en) * 2002-03-22 2003-09-25 Elizabeth George Method and system for dividing a plurality of existing volumes of storage into a plurality of virtual logical units of storage
US20040088432A1 (en) * 2002-10-31 2004-05-06 Hubbard Eric D. Management of attribute data

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5530821A (en) * 1991-08-02 1996-06-25 Canon Kabushiki Kaisha Method and apparatus including independent virtual address translation
JPH06161837A (en) 1992-11-20 1994-06-10 Hitachi Ltd System for selecting volume on external storage device
US6272492B1 (en) * 1997-11-21 2001-08-07 Ibm Corporation Front-end proxy for transparently increasing web server functionality
US6567883B1 (en) * 1999-08-27 2003-05-20 Intel Corporation Method and apparatus for command translation and enforcement of ordering of commands
US20010034733A1 (en) * 2000-03-03 2001-10-25 Michel Prompt System and method for providing access to databases via directories and other hierarchical structures and interfaces
US20030182501A1 (en) * 2002-03-22 2003-09-25 Elizabeth George Method and system for dividing a plurality of existing volumes of storage into a plurality of virtual logical units of storage
US20040088432A1 (en) * 2002-10-31 2004-05-06 Hubbard Eric D. Management of attribute data

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
David A. Patterson et al., A Case fpr Redundant Arrays of Inexpensive Disks (RAID) ; University of California, Berkeley, Computer Science Division, Copy right 1988, pp. 109-118.
Gonzalo Navarro, Proximal Nodes: A Model To Query Document Databases By Content And Structure, ACM Transactions on Information Systems, page(s) 400-435, 1997. *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050246490A1 (en) * 2003-02-04 2005-11-03 Hitachi, Ltd. Disk control system and control method of disk control system
US7127583B2 (en) 2003-02-04 2006-10-24 Hitachi, Ltd. Disk control system and control method of disk control system
US20100036897A1 (en) * 2004-06-15 2010-02-11 Seisuke Tokuda Storage system
US8234318B2 (en) 2004-06-15 2012-07-31 Hitachi, Ltd. Storage system
US20060064541A1 (en) * 2004-09-17 2006-03-23 Hitachi Ltd. Method of and system for controlling attributes of a plurality of storage devices
US7296115B2 (en) * 2004-09-17 2007-11-13 Hitachi, Ltd. Method of and system for controlling attributes of a plurality of storage devices
US20080040516A1 (en) * 2004-09-17 2008-02-14 Hitachi, Ltd. Method of and system for controlling attributes of a plurality of storage devices
US7441080B2 (en) 2004-09-17 2008-10-21 Hitachi, Ltd. Method of and system for controlling attributes of a plurality of storage devices
US20120011317A1 (en) * 2010-07-06 2012-01-12 Fujitsu Limited Disk array apparatus and disk array control method

Also Published As

Publication number Publication date
JP2003345514A (en) 2003-12-05
US20030225993A1 (en) 2003-12-04

Similar Documents

Publication Publication Date Title
US6941439B2 (en) Computer system
US7743206B2 (en) Dynamic loading of virtual volume data in a virtual tape server
US7930474B2 (en) Automated on-line capacity expansion method for storage device
US7680984B2 (en) Storage system and control method for managing use of physical storage areas
US7730275B2 (en) Information processing system and management device for managing relocation of data based on a change in the characteristics of the data over time
EP0801344B1 (en) An apparatus for reallocating logical to physical disk devices using a storage controller and method of the same
US7340571B2 (en) Storage system and data management device for storage system
US8347060B2 (en) Storage system, storage extent release method and storage apparatus
US8032689B2 (en) Techniques for data storage device virtualization
US8131969B2 (en) Updating system configuration information
US20070162692A1 (en) Power controlled disk array system using log storage area
US8359431B2 (en) Storage subsystem and its data processing method for reducing the amount of data to be stored in a semiconductor nonvolatile memory
US7222135B2 (en) Method, system, and program for managing data migration
US8694563B1 (en) Space recovery for thin-provisioned storage volumes
JP2009093571A (en) Storage controller, data archive method for storage controller, and storage system
US20070016749A1 (en) Disk control system and control method of disk control system
US9558111B1 (en) Storage space reclaiming for virtual provisioning
US20070113041A1 (en) Data processing system, storage apparatus and management console
US7966448B2 (en) Storage apparatus and data writing method

Legal Events

Date Code Title Description
AS Assignment

Owner name: HITACHI LTD, JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:YAGISAWA, IKUYA;MATSUNAMI, NAOTO;MIMATSU, YASUYUKI;AND OTHERS;REEL/FRAME:014060/0522;SIGNING DATES FROM 20020821 TO 20020827

AS Assignment

Owner name: HITACHI, LTD., JAPAN

Free format text: CORRECTIVE COVERSHEET TO CORRECT SERIAL NUMBER 10/282,869 ON ASSIGNMENT DOCUMENT PREVIOUSLY RECORDED ON REEL 014060, FRAME 0522.;ASSIGNORS:YAGISAWA, IKUYA;MATSUNAMI, NAOTO;MIMATSU, YASUYUKI;AND OTHERS;REEL/FRAME:015321/0759;SIGNING DATES FROM 20040821 TO 20040827

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 4

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

REMI Maintenance fee reminder mailed
LAPS Lapse for failure to pay maintenance fees
STCH Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362

FP Lapsed due to failure to pay maintenance fee

Effective date: 20130906