JP2004295914A

JP2004295914A - Data processing device

Info

Publication number: JP2004295914A
Application number: JP2004161157A
Authority: JP
Inventors: Eiji Sakakibara; 栄二榊原; Naomiki Mitsuishi; 直幹三ツ石; Hisashi Kajiwara; 久志梶原; Susumu Ue; 晋宇枝
Original assignee: Renesas Technology Corp; Hitachi Engineering Co Ltd
Current assignee: Renesas Technology Corp; Hitachi Industry and Control Solutions Co Ltd
Priority date: 2004-05-31
Filing date: 2004-05-31
Publication date: 2004-10-21

Abstract

<P>PROBLEM TO BE SOLVED: To provide a technology for obtaining exception handling performance corresponding to needs while maintaining compatibility and moreover reducing a logical scale. <P>SOLUTION: A combination of a plurality of assignable registers out of general-purpose registers is fixed to a control means (a CPU) for controlling an execution means for carrying out instructions, to have save/return instructions (STM instruction and LDM instruction) of the plurality of registers. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、データ処理装置に関し、特に、半導体集積回路装置によって構成される高速かつ小型のシングルチップマイクロコンピュータに利用して有効な技術に関するものである。 The present invention relates to a data processing device, and more particularly to a technology effective when used in a high-speed and small-sized single-chip microcomputer constituted by a semiconductor integrated circuit device.

半導体集積回路装置の製造技術の高度化に伴って、半導体単結晶からなるシングルチップに、中央演算処理装置（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ；以下、単にＣＰＵと称する）、プログラムを格納するＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、書き替え可能に各種データを格納するＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）等を含む構成素子を集積して製造した、小型のシングルチップマイクロコンピュータ（以下、単にマイクロコンピュータと称する）が広範囲に普及してきており、種々の目的のデータ処理装置として使用されてきている。このマイクロコンピュータは、ＣＰＵが同時に処理し得る情報の量によって性能が異なり、例えば４ビット、８ビット、１６ビット、３２ビット等のマイクロコンピュータとして区分されている。 2. Description of the Related Art With the advancement of the manufacturing technology of semiconductor integrated circuit devices, a central processing unit (hereinafter, simply referred to as a CPU) and a ROM (Read Only Memory) for storing programs on a single chip made of a semiconductor single crystal. 2. Description of the Related Art A small single-chip microcomputer (hereinafter simply referred to as a microcomputer) manufactured by integrating components including a RAM (Random Access Memory) for storing various data in a rewritable manner has become widespread. Have been used as data processing devices for various purposes. This microcomputer has different performance depending on the amount of information that can be simultaneously processed by the CPU, and is classified as, for example, a 4-bit, 8-bit, 16-bit, or 32-bit microcomputer.

このようなマイクロコンピュータは、アドレス空間の拡張や、命令セットの拡大、高速化等が図られてきている。また、ＣＰＵは、ソフトウェアによってその性能が定義されているから、前記のようにアドレス空間の拡張や、命令セットの拡大、高速化等を図ったマイクロコンピュータにおいても、既存のマイクロコンピュータのソフトウェア資産を有効に利用できることが望ましい。 In such a microcomputer, an address space is expanded, an instruction set is expanded, a speed is increased, and the like. Further, since the performance of the CPU is defined by software, even in a microcomputer in which the address space is expanded, the instruction set is expanded, and the speed is increased as described above, the software resources of the existing microcomputer are used. It is desirable that it can be used effectively.

このため、オブジェクトレベルで互換性を保ちつつ、アドレス空間の拡張や、命令セットの拡大、高速化等を実現した例として、例えば本出願人が先に提案した特許文献１、あるいは非特許文献１等がある。 For this reason, as an example of realizing expansion of an address space, expansion of an instruction set, speeding-up, etc. while maintaining compatibility at the object level, for example, Patent Document 1 proposed earlier by the present applicant or Non-Patent Document 1 Etc.

前記ＣＰＵは、システムクロックの２周期である、いわゆる２ステートで基本命令を実行している。これに対して、１ステートで基本命令を実行するようにし、さらに、ＣＰＵとは独立して乗算器を内蔵して高速化を図った例として、例えば非特許文献２、あるいは非特許文献３等がある。このような乗算器は積和演算と乗算に利用する。 The CPU executes a basic instruction in so-called two states, which are two periods of a system clock. On the other hand, as an example in which a basic instruction is executed in one state and a multiplier is built in independently of the CPU to increase the speed, for example, Non-Patent Document 2 or Non-Patent Document 3 There is. Such a multiplier is used for the product-sum operation and the multiplication.

このように高速化を図ることによって、マイクロコンピュータによって制御される各種機器の高速化や高性能化、あるいは、従来においては複数の半導体集積回路装置で構成していたものを、結合したりすることにより小型化を図ることができるようになる。 By increasing the speed in this way, it is possible to increase the speed and performance of various devices controlled by the microcomputer, or to combine components conventionally constituted by a plurality of semiconductor integrated circuit devices. Thereby, miniaturization can be achieved.

また、前記のような各種機器の高速化や高性能化、あるいは小型化は、アドレス空間が比較的小さく、命令セットが比較的小さいＣＰＵあるいはマイクロコンピュータにおいても要求されるから、前記特許文献１等に記載されるアドレス空間の広いＣＰＵと、アドレス空間の小さいＣＰＵが存在する場合には、その双方の高速化を図ることが望ましい。 In addition, high speed, high performance, and miniaturization of various devices as described above are also required for a CPU or a microcomputer having a relatively small address space and a relatively small instruction set. When there are a CPU having a large address space and a CPU having a small address space, it is desirable to speed up both of them.

このような観点から、上位ＣＰＵを開発し、これをベースにして下位ＣＰＵへ展開できれば都合が良い。これによって、開発効率を向上することができる。さらに、半導体集積回路装置によって構成されるＣＰＵ自体の他に、クロスアセンブラやＣコンパイラ、シミュレータ、リアルタイムＯＳ等の開発ツール等の開発も共通化して、開発効率を向上することが望ましい。 From such a viewpoint, it is convenient if the upper CPU can be developed and developed to the lower CPU based on this. Thereby, the development efficiency can be improved. Further, in addition to the CPU itself constituted by the semiconductor integrated circuit device, it is desirable that development of development tools such as a cross assembler, a C compiler, a simulator, a real-time OS, and the like be shared to improve development efficiency.

特開平６−５１９８１号公報JP-A-6-51981

平成５年６月（株）日立製作所発行、「Ｈ８／３００Ｈシリーズプログラミングマニュアル」"H8 / 300H Series Programming Manual" issued by Hitachi, Ltd. in June 1993 平成４年１１月日経ＢＰ社発行、「日経エレクトロニクスＮＯ．５６８」、ＰＰ９９〜ＰＰ１１２Published by Nikkei BP, November 1992, "Nikkei Electronics No. 568", PP99-PP112 平成５年３月（株）日立製作所発行、「ＳＨ７０３２、ＳＨ７０３４ハードウエアマニュアル」Published by Hitachi, Ltd. in March 1993, "SH7032, SH7034 Hardware Manual"

前記のようなマイクロコンピュータにおいて、乗算器は専用の資源を必要とするから、必ずしも積和演算や乗算の高速化を必要としない場合には、費用対効果の点で得策でない。また、例えば前記非特許文献３においては、乗算結果は専用のレジスタ（ＭＡＣ）に得られるから、これを利用する場合には、別の命令によってそれをＣＰＵの汎用レジスタに転送しなければならない。乗算器を内蔵して乗算自体を高速化しても、そのように乗算結果を使用するまでの時間が長くなっては意味がない。 In such a microcomputer, since the multiplier requires a dedicated resource, it is not advisable in terms of cost-effectiveness if the product-sum operation or the multiplication is not necessarily required to be performed at high speed. Further, for example, in Non-Patent Document 3, since the multiplication result is obtained in a dedicated register (MAC), if this is used, it must be transferred to a general-purpose register of the CPU by another instruction. Even if the multiplication itself is speeded up by incorporating a multiplier, there is no point in increasing the time required to use the multiplication result.

一方、従来のＣＰＵとの互換性を維持するためには、前記のように命令の追加は困難であり、追加する命令は最小限にしなければならない。また、演算結果等のフラグも互換性を保持する必要がある。積和演算についても、演算結果等のフラグを参照できれば使い勝手が良くなる。フラグの状態を判定して分岐する、いわゆる条件分岐命令などで演算結果を容易に判定し、処理の内容を変更することができるからである。かかるフラグには、オーバフロー（Ｖ）、ゼロ（Ｚ）、ネガティブ（Ｎ）などがある。 On the other hand, in order to maintain compatibility with a conventional CPU, it is difficult to add instructions as described above, and the added instructions must be minimized. Further, it is necessary to maintain compatibility of flags such as operation results. As for the product-sum operation, the usability is improved if a flag such as an operation result can be referred to. This is because it is possible to easily determine the operation result using a so-called conditional branch instruction or the like that determines the state of the flag and branches, and changes the content of the processing. Such flags include overflow (V), zero (Z), and negative (N).

本発明の目的は、互換性を維持しつつニーズに応じた乗算性能が得られ、しかも処理性能の向上を図ることが可能な技術を提供することにある。 An object of the present invention is to provide a technique capable of obtaining multiplication performance according to needs while maintaining compatibility and improving processing performance.

本発明の他の目的は、制御手段に乗算手段を内蔵して互換性を維持しつつ処理の高速化を図ることが可能な技術を提供することにある。 Another object of the present invention is to provide a technique capable of increasing the speed of processing while maintaining compatibility by incorporating a multiplying means in a control means.

本発明のその他の目的は、乗算手段を制御手段から独立して設け、しかも制御手段に乗算機能を備えさせることにより、製造費用の低減が可能な技術を提供することにある。 Another object of the present invention is to provide a technique capable of reducing the manufacturing cost by providing the multiplying means independently of the control means and providing the control means with a multiplying function.

本発明の前記ならびにそのほかの目的と新規な特長は、本発明書の記述および添付図面から明らかになるであろう。 The above and other objects and novel features of the present invention will become apparent from the description of the present specification and the accompanying drawings.

本願において開示される発明のうち代表的なものの概要を簡単に説明すれば下記の通りである。 The outline of a representative invention among the inventions disclosed in the present application will be briefly described as follows.

（１）本発明のデータ処理装置は、命令を実行する実行手段を制御する制御手段と乗算手段を設け、前記制御手段と乗算手段とが並列動作する第１の命令とともに、前記乗算手段が動作する第２の命令を有し、乗算手段は前記制御手段に内蔵されている。また、第１の命令は積和命令であるとともに、第２の命令は乗算命令になっている。積和命令のアドレッシングモードは、いわゆるポストインクリメントレジスタ間接とする。乗算手段には結果を判定するフラグ検出手段を設け、乗算命令時はフラグ検出結果を制御手段に供給して、保持させる手段を設ける。 (1) A data processing device according to the present invention includes a control unit for controlling an execution unit for executing an instruction, and a multiplication unit. The multiplication unit operates together with a first instruction in which the control unit and the multiplication unit operate in parallel. And a multiplication means is built in the control means. The first instruction is a multiply-accumulate instruction, and the second instruction is a multiply instruction. The addressing mode of the product-sum instruction is so-called post-increment register indirect. The multiplication means is provided with a flag detection means for judging the result, and is provided with means for supplying the flag detection result to the control means at the time of a multiplication instruction and holding the result.

（２）本発明のデータ処理装置は、命令を実行する実行手段を制御する制御手段と乗算手段を設け、前記制御手段と乗算手段とが並列動作する第１の命令とともに、前記乗算手段が動作する第２の命令を有し、乗算手段は前記制御手段から独立して設けられている。また、制御手段は乗算機能を備えている。 (2) The data processing device of the present invention includes a control unit for controlling an execution unit for executing an instruction and a multiplication unit, and the multiplication unit operates together with a first instruction in which the control unit and the multiplication unit operate in parallel. And a multiplying means is provided independently of the control means. The control means has a multiplication function.

（３）本発明のデータ処理装置は、命令を実行する実行手段を制御する制御手段に指定可能な複数のレジスタの組み合わせを固定にし、複数のレジスタの退避／復帰命令を有している。 (3) The data processing device of the present invention has a fixed combination of a plurality of registers that can be designated as a control unit that controls an execution unit that executes an instruction, and has a save / restore instruction for a plurality of registers.

（４）本発明のデータ処理装置は、命令を実行する実行手段を制御する制御手段に搭載されるコントロールレジスタの有効／無効を切り換える手段を有し、コントロールレジスタの有効時には、例外処理の遷移時、例外処理からの復帰時に、前記コントロールレジスタの待避／復帰を行い、前記コントロールレジスタの無効時には、例外処理の遷移時、例外処理からの復帰時に、前記コントロールレジスタの待避／復帰を行なわない。 (4) The data processing device of the present invention has means for switching the validity / invalidity of the control register mounted on the control means for controlling the execution means for executing the instruction. When returning from exception processing, the control register is saved / restored. When the control register is invalid, the control register is not saved / restored at the transition of exception processing or when returning from exception processing.

（５）本発明のデータ処理装置は、命令を実行する実行手段を制御する制御手段の搭載される固定的なスタックレジスタを設け、エミュレーションプログラムへの遷移時、エミュレーションプログラムからの復帰時に、前記固定的なスタックレジスタ用いて、ユーザが使用するスタックポインタを無視あるいは保持するかを指定する手段を有している。 (5) The data processing device of the present invention is provided with a fixed stack register in which a control means for controlling an execution means for executing an instruction is provided, and the fixed stack register is provided at the time of transition to the emulation program and at the time of return from the emulation program. A means for designating whether to ignore or hold the stack pointer used by the user by using a general stack register.

上記した（１）の手段によれば、乗算器（乗算手段）を内蔵することによって、アドレッシングモードの増加を最小限にして、かつ処理性能を低下させずに積和演算を実行可能にすることができる。また、ポストインクリメントレジスタ間接により、多数のデータの積和演算を連続して処理することができる。さらに、乗算の結果（積、フラグ）を直ちに利用できるから、実質的な乗算の実行速度を向上することができる。 According to the above-mentioned means (1), by incorporating a multiplier (multiplication means), it is possible to minimize the increase in the addressing mode and to execute the product-sum operation without lowering the processing performance. Can be. In addition, the product-sum operation of a large number of data can be continuously processed by the post-increment register indirect. Further, since the result (product, flag) of the multiplication can be used immediately, the execution speed of the actual multiplication can be improved.

乗算器とＣＰＵ（制御手段）を一体に構成して、乗算器・ＣＰＵ間の配線を短縮して、物理的規模を縮小する。また、高速化に寄与することができる。 The multiplier and the CPU (control means) are integrally configured to reduce the wiring between the multiplier and the CPU, thereby reducing the physical scale. In addition, it can contribute to speeding up.

上記した（２）の手段によれば、乗算器を取外し可能に（独立して）設けることによって、乗算器を取外した場合は、積和演算をサポートしないことによって、容易に下位ＣＰＵを実現し、論理的・物理的規模を縮小し、製造費用を低減した別のマイクロコンピュータを容易に開発することができる。また、乗算器を取外したＣＰＵにおいても、汎用的な乗算命令をサポートすることによって、使い勝手の低下を防止できる。さらに、乗算器使用するか使用しないかの制御信号（有効／無効）を与えて制御することによって、テスト性を向上したり、エミュレータを共通化したりすることができる。さらにまた、全体的な開発効率を向上することができる。 According to the above-mentioned means (2), the lower CPU can be easily realized by providing the multiplier detachably (independently) and not supporting the product-sum operation when the multiplier is removed. Further, another microcomputer having a reduced logical / physical scale and a reduced manufacturing cost can be easily developed. Further, even in the CPU from which the multiplier has been removed, by supporting a general-purpose multiplication instruction, it is possible to prevent a decrease in usability. Further, by giving and controlling a control signal (valid / invalid) for using or not using a multiplier, testability can be improved and an emulator can be shared. Furthermore, overall development efficiency can be improved.

乗算器を削除した場合、乗算は除算と同一のシーケンスで実行できる。積和演算はサポートせず、積和演算の特殊なシーケンスをサポートしないことによって論理規模の縮小を更に行なうことができる。テスト命令をサポートすることによって、論理規模の増加を最低限にして、テストの容易性を向上することができる。 If the multiplier is omitted, the multiplication can be performed in the same sequence as the division. Since the product-sum operation is not supported and the special sequence of the product-sum operation is not supported, the logical scale can be further reduced. By supporting test instructions, testability can be improved with minimal increase in logic size.

上記した（３）の手段によれば、複数レジスタの退避／復帰命令を持ち、この組み合わせを固定的にすることによって、論理規模の縮小を図ることができ、また、高速化を図ることができる。レジスタの本数の異なる命令を複数命令サポートすることによって、使い勝手の低下を防ぐことができる。 According to the above-mentioned means (3), a save / restore instruction for a plurality of registers is provided, and by fixing the combination, the logical scale can be reduced and the speed can be increased. . By supporting a plurality of instructions having different numbers of registers, it is possible to prevent a decrease in usability.

さらに、内部動作のパイプラインに対応して、入出力タイミングの異なるレジスタ選択回路を複数持つことにより、レジスタ間演算命令などの基本命令を実質的に１命令／１ステート実行を行なうことができる。 Further, by providing a plurality of register selection circuits having different input / output timings corresponding to the pipeline of the internal operation, it is possible to substantially execute one instruction / one state of a basic instruction such as an inter-register operation instruction.

上記した（４）の手段によれば、コントロールレジスタの有効／無効を切り換えることで、スタックの節約と、割込み応答時間の高速化に寄与することができる。また、互換性を維持することができる。 According to the above-described means (4), by switching between valid and invalid of the control register, it is possible to contribute to saving of the stack and shortening of the interrupt response time. In addition, compatibility can be maintained.

上記した（５）の手段によれば、エミュレータ専用の固定スタックポインタを持つことにより、エミュレータのサポートを容易にすることができる。また、論理規模の増加を最低限にして、エミュレータの設計を容易にすることができる。エミュレータ専用スタックポインタの一部のアドレスを、ＣＰＵ外部から与えるようにして、スタックレジスタをリロケータブルにし、マイクロコンピュータのアドレス配置などに容易に対応することができる。 According to the above-mentioned means (5), the support of the emulator can be facilitated by having a fixed stack pointer dedicated to the emulator. Further, the design of the emulator can be facilitated by minimizing the increase in the logical scale. By giving a part of the address of the emulator-dedicated stack pointer from outside the CPU, the stack register can be made relocatable, and it is possible to easily cope with the address arrangement of the microcomputer.

本題において開示される発明のうち代表的なものによって得られる効果を簡単に説明すれば下記の通りである。 The effects obtained by typical aspects of the invention disclosed in the subject will be briefly described as follows.

（１）乗算器をＣＰＵに内蔵することによって、アドレッシングモードの増加を最小限にして、かつ処理性能を低下させずに積和演算を実行可能にすることができる。乗算器による乗算の結果を汎用レジスタＣＣＲに反映させ、かかる結果を直ちに利用可能にして、処理速度を高速にすることができる。 (1) By incorporating the multiplier in the CPU, it is possible to minimize the increase in the addressing mode and execute the product-sum operation without lowering the processing performance. The result of the multiplication by the multiplier is reflected in the general-purpose register CCR, such a result can be immediately used, and the processing speed can be increased.

（２）乗算器を取外し可能に（独立して）設けることによって、乗算器を取外した場合は、積和演算をサポートしないことによって、容易に下位ＣＰＵを実現し、論理的・物理的規模を縮小し、製造費用の低減に寄与することができる。 (2) By providing the multiplier detachably (independently), when the multiplier is removed, the lower CPU can be easily realized by not supporting the product-sum operation, and the logical and physical scales can be reduced. This can contribute to a reduction in manufacturing cost.

（３）複数レジスタの退避／復帰命令を持ち、この組み合わせを固定的にすることによって、論理規模の縮小を図ることができ、また、高速化を図ることができる。 (3) By having a save / restore instruction for a plurality of registers and fixing this combination, the logical scale can be reduced, and the speed can be increased.

（４）コントロールレジスタの有効／無効を切り換えることで、スタックの節約と、割込み応答時間の高速化に寄与することができるとともに、互換性を維持することができる。 (4) By switching between valid / invalid of the control register, it is possible to save the stack and to shorten the interrupt response time, and to maintain compatibility.

（５）エミュレータ専用の固定スタックポインタを持つことにより、エミュレータをサポートすることができ、また、論理規模の増加を最低限にして、エミュレータの設計を容易にすることができ、さらにエミュレータ専用スタックポインタの一部のアドレスを、ＣＰＵ外部から与えるようにして、スタックレジスタをリロケータブルにし、マイクロコンピュータのアドレス配置などに容易に対応することができる。 (5) Emulators can be supported by having a fixed stack pointer dedicated to the emulator, an increase in logical scale can be minimized, the design of the emulator can be simplified, and a stack pointer dedicated to the emulator can be used. By giving a part of the address from outside the CPU, the stack register is made relocatable, and it is possible to easily cope with the address arrangement of the microcomputer.

以下、本発明について、図面を参照して実施の形態とともに詳細に説明する。 Hereinafter, the present invention will be described in detail along with embodiments with reference to the drawings.

なお、実施の形態を説明するための全図において、同一機能を有するものは同一符号を付け、その繰り返しの説明は省略する。 In all the drawings for describing the embodiments, components having the same function are denoted by the same reference numerals, and repeated description thereof will be omitted.

図１に、本発明の適用されたデータ処理装置の一例であるシングルチップマイクロコンピュータ（以下、単にマイクロコンピュータと称する）のブロック図を示す。マイクロコンピュータは、ＣＰＵ１、乗算器２、システムコントローラ（ＳＹＳＣ）３、割込コントローラ（ＩＮＴ）４、ＲＯＭ５、ＲＡＭ６、タイマＡ７、タイマＢ８、シリアルコミュニケーションインタフェース（ＳＣＩ）９、Ａ／Ｄ変換器１０、第１乃至第９入出力ポート（ＩＯＰ１〜ＩＯＰ９）１１Ａ〜１１Ｉ、クロック発振器（ＣＰＧ）１２の機能ブロック乃至はモジュールから構成され、公知の半導体製造技術により１つの半導体基板上に半導体集積回路装置として形成される。ＣＰＵ１は、乗算器２を内蔵してなる。システムコントローラ（ＳＹＳＣ）３は、システムコントロールレジスタ（ＳＹＳＣＲ）１３および制御レジスタ（ＣＰＵＣＲ）１４を内蔵している。 FIG. 1 shows a block diagram of a single-chip microcomputer (hereinafter simply referred to as a microcomputer) as an example of a data processing device to which the present invention is applied. The microcomputer includes a CPU 1, a multiplier 2, a system controller (SYSC) 3, an interrupt controller (INT) 4, a ROM 5, a RAM 6, a timer A7, a timer B8, a serial communication interface (SCI) 9, an A / D converter 10, It is composed of functional blocks or modules of first to ninth input / output ports (IOP1 to IOP9) 11A to 11I and a clock oscillator (CPG) 12, and is formed as a semiconductor integrated circuit device on one semiconductor substrate by a known semiconductor manufacturing technique. It is formed. The CPU 1 has a built-in multiplier 2. The system controller (SYSC) 3 includes a system control register (SYSCR) 13 and a control register (CPUCR) 14.

かかるマイクロコンピュータは、電源端子として、グランドレベル（Ｖｓｓ）、電源電圧レベル（Ｖｃｃ）、その他専用制御端子として、リセット（ＲＥＳ）、スタンバイ（ＳＴＢＹ）、モード制御（ＭＤ０〜２）、クロック入力（ＥＸＴＡＬ、ＸＴＡＬ）端子を有する。クロック入力（ＥＸＴＡＬ、ＸＴＡＬ）端子に接続される、図示はされない水晶振動子に基づいて、クロック発振器が生成するシステムクロック（φ１、φ２）に同期して、マイクロコンピュータは動作する。或は外部クロックをＥＸＴＡＬ端子に入力してもよい。システムクロックの１周期を１ステートと呼ぶ。 Such a microcomputer has a ground level (Vss) and a power supply voltage level (Vcc) as power supply terminals, and reset (RES), standby (STBY), mode control (MD0 to 2), and clock input (EXTAL) as other dedicated control terminals. , XTAL) terminals. The microcomputer operates in synchronization with system clocks (φ1, φ2) generated by a clock oscillator based on a crystal oscillator (not shown) connected to clock input (EXTAL, XTAL) terminals. Alternatively, an external clock may be input to the EXTAL terminal. One cycle of the system clock is called one state.

これらの機能ブロックは、内部バスによって相互に接続される。内部バスは内部アドレスバス（ＰＡＢ）・内部データバス（ＰＤＢ）の他、リード信号・ライト信号を含み、さらにバスサイズ信号或いはシステムクロック（φ１、φ２）などを含む。 These functional blocks are interconnected by an internal bus. The internal bus includes a read signal and a write signal in addition to an internal address bus (PAB) and an internal data bus (PDB), and further includes a bus size signal or a system clock (φ1, φ2).

入出力ポートは、外部バス信号、入出力回路の入出力信号と兼用とされている。これらは、動作モードあるいはソフトウェアの設定により、機能を選択されて、使用される。ＩＯＰ１〜３はアドレスバス出力、ＩＯＰ４、５はデータバス入出力、ＩＯＰ６はバス制御信号入出力信号と兼用されている。外部アドレスは、それぞれ、これらの入出力ポートに含まれるバッファ回路を介して内部アドレスバスと接続されている。 The input / output port is also used as an external bus signal and an input / output signal of an input / output circuit. These functions are selected and used depending on the operation mode or software setting. IOPs 1 to 3 are used for address bus output, IOPs 4 and 5 are used for data bus input / output, and IOP 6 is also used for bus control signal input / output signals. Each of the external addresses is connected to an internal address bus via a buffer circuit included in these input / output ports.

内部バスおよび外部バス共に１６ビットバス幅とし、バイトサイズ（８ビット）およびワードサイズ（１６ビット）のリード／ライトを可能にする。なお、内部バスおよび外部バスのいずれも８ビット幅とすることもできる。バス制御信号入出力信号には、アドレスストローブ信号ＡＳ、リード信号ＲＤ、ライト信号ＨＷＲ・ＬＷＲ、ウェイト信号ＷＡＩＴ、エリア０選択信号ＣＳ０などがある。割込信号は、タイマ・ＳＣＩ・ＩＯＰ８から要求され、割込コントローラ（ＩＮＴ）が調停して、ＣＰＵに割込を要求する。このとき、ＣＰＵに対し、割込要求信号とベクタ番号を与える。 Both the internal bus and the external bus have a 16-bit bus width, and enable reading / writing of byte size (8 bits) and word size (16 bits). Note that both the internal bus and the external bus may have an 8-bit width. The bus control signal input / output signals include an address strobe signal AS, a read signal RD, a write signal HWR / LWR, a wait signal WAIT, an area 0 selection signal CS0, and the like. The interrupt signal is requested from the timer / SCI / IOP8, the interrupt controller (INT) arbitrates, and requests the CPU for an interrupt. At this time, an interrupt request signal and a vector number are given to the CPU.

ＲＥＳ端子にリセット信号が加えられると、モード端子（ＭＤ０〜２）で与えられる動作モードを取り込み、マイクロコンピュータはリセット状態になる。モード端子で設定する動作モードは、シングルチップ／拡張、アドレス空間、内蔵ＲＯＭの有効／無効、データバス幅の初期値を８ビットまたは１６ビットから選択する。 When a reset signal is applied to the RES terminal, the operation mode given by the mode terminals (MD0 to MD2) is fetched, and the microcomputer is reset. The operation mode set by the mode terminal selects single chip / extension, address space, enable / disable of built-in ROM, and the initial value of the data bus width from 8 bits or 16 bits.

図２に、システムコントロールレジスタ（ＳＹＳＣＲ）３の構成を示す。各ビットの内容を表１乃至表４に示す。 FIG. 2 shows the configuration of the system control register (SYSCR) 3. Tables 1 to 4 show the contents of each bit.

なお、ビット２、１：リザーブビット
リードすると常に”０”が読み出される。ライトは無効である。 Bit 2, 1: Reserved bit
When read, "0" is always read. Light is invalid.

［表１］

[Table 1]

［表２］

[Table 2]

［表３］

[Table 3]

［表４］

[Table 4]

以下に、表５にＣＰＵ１の命令セットを示す。本実施の形態に用いられるＣＰＵ１の命令は合計で７１種類ある。表６に命令とアドレッシングモードとの組み合わせを示す。表７に以下の各表に使用される記号（オペレーションの記号）の意味を示す。表８乃至表１５に各命令の機能別一覧表を示す。 Table 5 shows an instruction set of the CPU 1. There are a total of 71 instructions of the CPU 1 used in the present embodiment. Table 6 shows combinations of instructions and addressing modes. Table 7 shows the meaning of symbols (operation symbols) used in the following tables. Tables 8 to 15 show a list of each instruction by function.

［表５］

[Table 5]

［表６］

[Table 6]

［表７］

[Table 7]

［表８］

[Table 8]

［表９］

[Table 9]

［表１０］

[Table 10]

［表１１］

[Table 11]

［表１２］

[Table 12]

［表１３］

[Table 13]

［表１４］

[Table 14]

［表１５］

[Table 15]

基本的な命令は平成５年６月（株）日立製作所発行『Ｈ８／３００Ｈシリーズプログラミングマニュアル』などに記載のＣＰＵと同様であり、いわゆる、ロードストアアーキテクチャを採用している。命令とアドレッシングモードの組み合わせを削減し、ＣＰＵの命令制御の論理規模・物理的規模を縮小できる。 The basic instructions are the same as those of the CPU described in "H8 / 300H Series Programming Manual" issued by Hitachi, Ltd. in June 1993, and employ a so-called load store architecture. The combination of the instruction and the addressing mode can be reduced, and the logical and physical scales of the instruction control of the CPU can be reduced.

本発明のＣＰＵは、上記従来ＣＰＵに対して命令実行時間の高速化を実現している。 The CPU of the present invention realizes faster instruction execution time than the conventional CPU.

ＣＰＵの命令は、２バイト（ワード）を単位にしている。各命令は下記のようなオペレーションフィールド（ｏｐ）、レジスタフィールド（ｒ）、ＥＡ拡張部（ＥＡ）、およびコンディションフィールド（ｃｃ）から構成されている。 CPU instructions are in units of 2 bytes (words). Each instruction includes an operation field (op), a register field (r), an EA extension (EA), and a condition field (cc) as described below.

（１）オペレーションフィールド
命令の機能を表し、アドレッシングモードの指定、オペランドの処理内容を指定する。命令の先頭４ビットを必ず含んでいる。２つのオペレーションフィールドを持つ場合もある。 (1) Operation field This field indicates the function of the instruction, and specifies the addressing mode and the processing contents of the operand. It always contains the first 4 bits of the instruction. It may have two operation fields.

（２）レジスタフィールド
汎用レジスタを指定する。アドレスレジスタのとき３ビット、データレジスタのとき３ビットまたは４ビットである。２つのレジスタフィールドを持つ場合、またはレジスタフィールドを持たない場合もある。 (2) Register field Specify a general-purpose register. The address register has 3 bits, and the data register has 3 bits or 4 bits. It may have two register fields or no register field.

（３）ＥＡ拡張部
イミディエイトデータ、絶対アドレスまたはディスプレースメントを指定する。８ビット、１６ビット、または３２ビットである。 (3) EA extension part Specifies immediate data, an absolute address, or a displacement. 8, 16 or 32 bits.

（４）コンディションフィールド
Ｂｃｃ命令の分岐条件を示す。 (4) Condition field Indicates the condition for branching the Bcc instruction.

図３に、命令の基本フォーマットの例を示す。 FIG. 3 shows an example of the basic format of the instruction.

図４に、マイクロコンピュータにおいて、ＣＰＵ１に対し乗算器２を取外し可能に設けた概略ブロック図を示す。命令レジスタ（ＩＲ）２１、命令デコーダ・制御回路（ＣＯＮＴ）２２、レジスタセレクタ（ＲＳＥＬ）２３、ライトデータバッファ（ＤＢＷ）２４、リードデータバッファ（ＤＢＲ）２５、演算器（ＡＬＵ）２６、演算器（ＩＮＣ）２７、汎用レジスタ（ＥＲ０〜ＥＲ７）２８Ａ〜２８Ｈ、エミュレータスタックポインタ（ＥＭＬＳＰ）２９、プログラムカウンタ（ＰＣ）３０、コンディションコードレジスタ（ＣＣＲ）３１、拡張レジスタ（ＥＸＲ）３２、アドレスバッファ（ＭＡＢ）３３からなる。乗算器２なしのＣＰＵ１はこれらによって構成される。各バッファやレジスタ、演算器の各ブロックの機能は、特開平５−２４１８２６号公報に記載のＣＰＵと概略同様である。また、乗算器２を含むＣＰＵ１は、更に、バススイッチ３４、乗算器２がある。 FIG. 4 is a schematic block diagram in which a multiplier 2 is detachably provided from a CPU 1 in a microcomputer. Instruction register (IR) 21, instruction decoder / control circuit (CONT) 22, register selector (RSEL) 23, write data buffer (DBW) 24, read data buffer (DBR) 25, arithmetic unit (ALU) 26, arithmetic unit ( INC) 27, general-purpose registers (ER0 to ER7) 28A to 28H, emulator stack pointer (EMLSP) 29, program counter (PC) 30, condition code register (CCR) 31, extension register (EXR) 32, address buffer (MAB) It consists of 33. The CPU 1 without the multiplier 2 is constituted by these components. The function of each buffer, register, and each block of the arithmetic unit is substantially the same as that of the CPU described in Japanese Patent Application Laid-Open No. 5-241826. The CPU 1 including the multiplier 2 further includes a bus switch 34 and the multiplier 2.

命令デコーダ・制御回路（ＣＯＮＴ）２２には、制御信号ＣＰＵＳ、制御信号ＩＮＴＭ１、そのほかの制御信号（割り込み要求など）が入力されている。ＣＯＮＴ２２は各部を制御するための、出力タイミングの相違する制御信号Ａ、Ｂ、Ｃを出力する。 The instruction decoder / control circuit (CONT) 22 receives a control signal CPUS, a control signal INTM1, and other control signals (such as an interrupt request). The CONT 22 outputs control signals A, B, and C having different output timings for controlling each unit.

なお、図中のＣ１およびＣ２は、当該信号の同期タイミングを示す。例えば、ＲＳＥＬ入力１のＣ１はφに同期して入力が行われることを示し、ＲＳＥＬ入力２のＣ２はφ＃（＃は論理反転）に同期して入力が行われることを示す。また、ＡＬＵ入力のＣ１は、φの期間に入力が行われることを示し、ＡＬＵ出力のＣ２は、φ＃の期間に出力が行われることを示す。ＡＬＵ２６とＩＮＣ２７は、それぞれ動作タイミングの異なった演算器であり、それぞれ、オーバラップしつつ演算可能である。 Note that C1 and C2 in the figure indicate the synchronization timing of the signal. For example, C1 of RSEL input 1 indicates that input is performed in synchronization with φ, and C2 of RSEL input 2 indicates that input is performed in synchronization with φ # (# is logical inversion). Further, C1 of the ALU input indicates that the input is performed during the period of φ, and C2 of the ALU output indicates that the output is performed during the period of φ #. The ALU 26 and the INC 27 are arithmetic units having different operation timings, and can perform arithmetic operations while overlapping each other.

そのほかのレジスタなどは、φ、φ＃の両方でデータを入出力可能である。ＧＢ、ＤＢ、ＷＢの各バスはφ、φ＃の両方で異なったデータを転送可能である。φ、φ＃は互いにノーオーバラップの関係の２相クロックとしてもよい。 Other registers can input / output data in both φ and φ #. The GB, DB, and WB buses can transfer different data in both φ and φ #. φ and φ # may be two-phase clocks having a no-overlap relationship with each other.

レジスタセレクタ（ＲＳＥＬ）２３には、ＩＲ２１乃至ＣＯＮＴ２２から命令コードの一部（レジスタ指定フィールド）が与えられる。この供給タイミングは、レジスタ指定フィールドの位置によって相違される。ＲＳＥＬ２３は出力タイミングの相違するレジスタ選択信号Ａ、Ｂを出力する。 The register selector (RSEL) 23 receives a part of the instruction code (register designation field) from the IR 21 to the CONT 22. This supply timing differs depending on the position of the register designation field. The RSEL 23 outputs register selection signals A and B having different output timings.

例えば、平成５年６月（株）日立製作所発行『Ｈ８／３００Ｈシリーズプログラミングマニュアル』に記載のＣＰＵにおいては、１６ビット単位の命令コードのビット７−４が、ＣＯＮＴ２２と同時に与えられ（ＲＳＥＬ入力１）、ビット１１−８および３−０（ＲＳＥＬ入力２）が、ＣＯＮＴ２２の内容と０．５ステート遅れて与えられる。ＲＳＥＬ入力２の反転制御信号をＲＳＥＬに与える。 For example, in the CPU described in "H8 / 300H Series Programming Manual" issued by Hitachi, Ltd. in June 1993, bits 7-4 of a 16-bit instruction code are given simultaneously with CONT 22 (RSEL input 1). ), Bits 11-8 and 3-0 (RSEL input 2) are provided 0.5 state later than the contents of CONT22. An inversion control signal of RSEL input 2 is provided to RSEL.

ＣＰＵ１内部のＤＢＷ２４、ＤＢＲ２５、ＡＬＵ２６、ＩＮＣ２７、ＥＲ０〜ＥＲ７（２８Ａ〜２８Ｈ）、ＰＣ３０、ＣＣＲ３１、ＶＡＧ、ＡＢは、ＧＢバス、ＤＢバス、ＷＢバスによって相互に接続されている。 The DBW 24, DBR 25, ALU 26, INC 27, ER0 to ER7 (28A to 28H), PC 30, CCR 31, VAG, and AB inside the CPU 1 are interconnected by a GB bus, a DB bus, and a WB bus.

２つの演算器ＡＬＵ２６、ＩＮＣ２７に対し、ＧＢ、ＤＢバスからデータを入力し、ＷＢバスにデータを出力する。それぞれの入出力バスの数に対応した数の内部バスとして、バス即ち配線の増加による物理的規模の増加を抑止している。 Data is input from the GB and DB buses to the two arithmetic units ALU26 and INC27, and data is output to the WB bus. As the number of internal buses corresponding to the number of input / output buses, an increase in physical scale due to an increase in buses, that is, wiring, is suppressed.

また、ライトデータバッファ（ＤＢＷ）２４は内部データバスへの出力、リードデータバッファ（ＤＢＲ）２５は内部データバスからの入力、アドレスバッファ（ＭＡＢ）３３は内部アドレスバスへの出力、命令レジスタは内部データバスからの入力が可能であり、それぞれ内部バスに接続されている。ライトデータバッファ（ＤＢＷ）２４およびリードデータバッファ（ＤＢＲ）２５は３２ビット構成とされる。ライトデータは３２ビット一括してライトデータバッファ（ＤＢＷ）２４に書き込むことができ、所定のタイミングで、１６ビットの内部データバスに出力される。また、内部データバスから読み出したデータを、リードデータバッファ（ＤＢＲ）２５に一旦格納して、３２ビットのリードデータを一括して出力することができる。ＭＡＢは＋２のインクリメント機能を有する。 The write data buffer (DBW) 24 outputs to the internal data bus, the read data buffer (DBR) 25 inputs from the internal data bus, the address buffer (MAB) 33 outputs to the internal address bus, and the instruction register stores the internal data bus. Input from the data bus is possible, and each is connected to the internal bus. The write data buffer (DBW) 24 and the read data buffer (DBR) 25 have a 32-bit configuration. Write data can be written to the write data buffer (DBW) 24 in a batch of 32 bits, and output to the 16-bit internal data bus at a predetermined timing. Further, data read from the internal data bus can be temporarily stored in the read data buffer (DBR) 25, and 32-bit read data can be output collectively. MAB has a +2 increment function.

命令デコーダ・制御回路（ＣＯＮＴ）２２が、ＩＲ２１からの入力、ＣＰＵＳ信号、ＩＮＴＭ１信号やそのほかの入力信号に基づいて、動作制御を行なう。制御回路の出力は所定のバッファを介して出力される。ＣＯＮＴ２２自身にも、ステート番号などがフィードバックされる。 An instruction decoder / control circuit (CONT) 22 controls the operation based on the input from the IR 21, the CPUS signal, the INTM1 signal, and other input signals. The output of the control circuit is output via a predetermined buffer. The state number and the like are also fed back to the CONT 22 itself.

アドレスバッファ（ＭＡＢ）３３はインクリメント機能（＋２）を有する。ＥＲ０〜ＥＲ７（２８Ａ〜２８Ｈは）データレジスタまたはアドレスレジスタとして使用することができる。 The address buffer (MAB) 33 has an increment function (+2). ER0 to ER7 (28A to 28H) can be used as data registers or address registers.

ＥＭＬＳＰ２９は、ユーザには公開されていないリソースで、エミュレータに搭載されて動作するとき、ユーザプログラムとエミュレーションプログラムの間の遷移時のスタックポインタとして使用する。その内容を指定するために、一部の内容が、ＣＰＵ外部から与えられる。 The EMLSP 29 is a resource that is not disclosed to the user, and is used as a stack pointer at the time of transition between a user program and an emulation program when operating on an emulator. In order to specify the contents, some contents are given from outside the CPU.

ＰＣ３０は３２ビットのカウンタであり、ＣＰＵ１が次に実行する命令のアドレスを示している。コンディションコードレジスタ（ＣＣＲ）３１は割り込みマスクビット（Ｉ）、キャリフラグ（Ｃ）、ゼロフラグ（Ｚ）、ネガティブフラグ（Ｎ）、オーバフローフラグ（Ｖ）を含んでいる。 The PC 30 is a 32-bit counter and indicates the address of an instruction to be executed next by the CPU 1. The condition code register (CCR) 31 includes an interrupt mask bit (I), a carry flag (C), a zero flag (Z), a negative flag (N), and an overflow flag (V).

ＣＰＵ１と乗算器２は、バススイッチ３４を介して接続されている。また、バススイッチ３４は内部データバスとのインタフェースも行なう。また、ＣＰＵ１から乗算器２への制御信号を与える。乗算器２のステータス信号ＢＵＳＹと、フラグ検出信号をＣＰＵ１に与える。ＴＥＳＴＭＯＤＥ信号を、例えば、ＳＹＳＣ３から与える。制御信号ＣＰＵＳは、ＳＹＳＣＲ１４あるいはそのほかのレジスタの制御ビットの出力にしてもよいし、マイクロコンピュータの制御端子のようなもので指定してもよい。 The CPU 1 and the multiplier 2 are connected via a bus switch 34. Bus switch 34 also interfaces with an internal data bus. Further, a control signal from the CPU 1 to the multiplier 2 is given. The status signal BUSY of the multiplier 2 and the flag detection signal are supplied to the CPU 1. The TESTMODE signal is supplied from, for example, SYSC3. The control signal CPUS may be a control bit output of the SYSCR 14 or another register, or may be designated by a control terminal of a microcomputer.

図５に、制御信号ＣＰＵＳを制御レジスタ（ＣＰＵＣＲ）１４の制御ビットで構成した具体的な例を示す。図は１ビットの構成を示している。ＣＰＵＣＲ１４は、フリップフロップで構成される。フリップフロップにはリセット信号が与えられる。フリップフロップのクロックは内部ライト信号と、アドレスをデコードして得られるＣＰＵＣＲ選択信号の論理積信号とされる。データ入力はデータバスのビット８とされる。出力がＣＰＵＳ信号とされる。また、クロックトバッファＣＢＦ６を介して、データバスに出力される。クロックトバッファＣＢＦ６のクロックは内部ライト信号とＣＰＵＣＲ選択信号の論理積信号とされる。 FIG. 5 shows a specific example in which the control signal CPUS is configured by the control bits of the control register (CPUCR) 14. The figure shows the configuration of one bit. The CPUCR 14 is configured by a flip-flop. A reset signal is supplied to the flip-flop. The clock of the flip-flop is a logical product signal of an internal write signal and a CPUCR selection signal obtained by decoding an address. The data input is bit 8 of the data bus. The output is a CPUS signal. The data is output to the data bus via the clocked buffer CBF6. The clock of the clocked buffer CBF6 is a logical product signal of the internal write signal and the CPUCR selection signal.

本レジスタのライトは、テストモードや、エミュレータに搭載した場合のブレークモードなどでのみライト可能にするとよい。ブレークモードなどについては、特開平６−１５００２６などに記載されている。同様に、ＴＥＳＴＭＯＤＥ信号を生成することができる。同一のレジスタに配置することができる。 This register can be written only in the test mode or in the break mode when the emulator is mounted. The break mode and the like are described in JP-A-6-150026 and the like. Similarly, a TESTMODE signal can be generated. They can be located in the same register.

図６に、制御信号ＣＰＵＳの設定方法の一例として、エミュレーション用プロセッサおよびエミュレータをブロック図で示す。エミュレーション用プロセッサ３８は、マイクロコンピュータ部分にエミュレーション用インタフェース３９を加えて構成される。エミュレーション用インタフェース３９には、エミュレーション用プロセッサ専用の制御レジスタ４１を有する。メモリ４２は、ＲＯＭ、ＲＡＭを含み、Ｉ／ＯはＩ／Ｏポート、タイマ、ＳＣＩなどを含む。 FIG. 6 is a block diagram showing an emulation processor and an emulator as an example of a method of setting the control signal CPUS. The emulation processor 38 is configured by adding an emulation interface 39 to a microcomputer part. The emulation interface 39 has a control register 41 dedicated to the emulation processor. The memory 42 includes a ROM and a RAM, and the I / O includes an I / O port, a timer, an SCI, and the like.

コネクタ部がマイクロコンピュータの代わりに応用システム（ユーザシステム）４３に装着される。エミュレーション用プロセッサ３８は上記コネクタ部とインタフェースケーブル４４を介し、ターゲットシステムインタフェースを用いて上記応用システム４３と信号の入出力を行なう。 The connector section is mounted on an application system (user system) 43 instead of the microcomputer. The emulation processor 38 inputs and outputs signals to and from the application system 43 using the target system interface via the connector section and the interface cable 44.

応用システム（ユーザシステム）４３には、特に制限はされないものの、ユーザバス４５が存在し、ユーザメモリ４６が接続される。エミュレーション用プロセッサ３８が出力し、インタフェースケーブル４４を介して供給されるユーザストローブ信号に従って、ユーザメモリ４６はリード／ライトされる。 The application system (user system) 43 includes, but is not limited to, a user bus 45 and a user memory 46. The user memory 46 is read / written according to a user strobe signal output from the emulation processor 38 and supplied via the interface cable 44.

一方、エミュレーション用プロセッサ３８は上記エミュレーションインタフェース３９を用いてエミュレーションバス４７に接続される。エミュレーションバス４７には図示はされない状態信号・制御信号などを含む。上記エミュレーションバス４７を用いて、エミュレーション用プロセッサ３８から、応用システム４３とエミュレーション用プロセッサ３８の内部状態に応じた情報などが出力され、また、エミュレーション用プロセッサ３８に対し、エミュレーションのための各種制御信号が入力される。エミュレーション用プロセッサ３８の、図示はされないエミュレートモード端子が電源レベルに固定され、エミュレーション用プロセッサ３８内部ではエミュレートモードが設定される。 On the other hand, the emulation processor 38 is connected to the emulation bus 47 using the emulation interface 39. The emulation bus 47 includes state signals and control signals (not shown). Using the emulation bus 47, information and the like corresponding to the internal state of the application system 43 and the emulation processor 38 are output from the emulation processor 38, and various control signals for emulation are sent to the emulation processor 38. Is entered. An emulation mode terminal (not shown) of the emulation processor 38 is fixed at the power supply level, and the emulation mode is set inside the emulation processor 38.

さらに、上記エミュレーションバス４７には、特に制限はされないものの、応用システム４３またはターゲットマイクロコンピュータ内蔵のメモリを代行するためのＲＡＭでなるようなエミュレーションメモリ４８がある。また、エミュレーション用プロセッサ３８の制御状態やエミュレーションバス４７の状態を監視して、その状態が予め設定された状態に達した時に、上記エミュレータ専用割込みを入力して、ＣＰＵによるユーザプログラムの実行を停止させ、エミュレーション用プログラム実行状態に遷移させる（ブレーク）ためのブレーク制御回路４９と、上記ＣＰＵのリード動作またはライト動作を示す信号、命令リード動作を示す信号などに基づき、エミュレーションバス４７に与えられるアドレスデータさらには制御情報を逐次蓄えるリアルタイムトレース回路５０などが接続される。 Further, the emulation bus 47 includes, although not particularly limited to, an emulation memory 48 such as a RAM for substituting the built-in memory of the application system 43 or the target microcomputer. Further, the control state of the emulation processor 38 and the state of the emulation bus 47 are monitored, and when the state reaches a preset state, the emulator-dedicated interrupt is input to stop the execution of the user program by the CPU. And a break control circuit 49 for causing a transition to an emulation program execution state (break), and an address given to the emulation bus 47 based on a signal indicating a read operation or a write operation of the CPU, a signal indicating an instruction read operation, and the like. A real-time trace circuit 50 for sequentially storing data and control information is connected.

上記エミュレーションバス４７が、エミュレーションメモリ４８、ブレーク制御回路４９、リアルタイムトレース回路５０などに、それぞれ接続される。これらでもってマイクロコンピュータ開発装置５５が構成されている。 The emulation bus 47 is connected to an emulation memory 48, a break control circuit 49, a real-time trace circuit 50, and the like. These components constitute the microcomputer development device 55.

上記エミュレーションメモリ４８、ブレーク制御回路４９、リアルタイムトレース回路５０はコントロールバス５１に接続され、コントロールバス５１を介してコントロールプロセッサ５２の制御を受けるようになっている。上記コントロールバス５１は、エミュレーション用プロセッサ制御回路に接続されるとともに、インタフェース回路を介して、特に制限はされないもののパーソナルコンピュータなどのシステム開発装置５４に接続される。例えば、システム開発装置５４から入力されたプログラムをエミュレーションメモリ４８に転送し、内蔵ＲＯＭ上に配置されるべきかかるプログラムをＣＰＵ１がリードすると、エミュレーションメモリ４８上のプログラムがリードされる。また、ブレーク条件や、リアルタイムトレース条件などもシステム開発装置５４から与えることができる。 The emulation memory 48, break control circuit 49, and real-time trace circuit 50 are connected to a control bus 51, and are controlled by a control processor 52 via the control bus 51. The control bus 51 is connected to an emulation processor control circuit, and is connected to a system development device 54 such as a personal computer, though not particularly limited, through an interface circuit. For example, when the program input from the system development device 54 is transferred to the emulation memory 48 and the CPU 1 reads such a program to be arranged in the built-in ROM, the program on the emulation memory 48 is read. Also, a break condition, a real-time trace condition, and the like can be given from the system development device 54.

コントロールプロセッサ５２は、ＣＰＵＳ信号をエミュレーション用プロセッサ３８に供給して、乗算器の使用／不使用の選択を行なうことができる。コントロールプロセッサ５２は、システム開発装置５４から入力された情報などに基づいて、ＣＰＵＳ信号を制御する。あるいは、図５のような制御レジスタを、エミュレーション用インタフェース３９内に制御レジスタに設けて、エミュレータ４０のソフトウェアをＣＰＵが実行して、前記制御レジスタを指定することによって、ＣＰＵＳ信号を生成するようにすることができる。この場合は、エミュレーション用ソフトウェアの実行モード、いわゆるブレークモードでのみライト可能にすると都合がよい。開発途上にあるユーザのソフトウェアの誤動作によって、誤った設定を行なうことがない。 The control processor 52 can supply the CPUS signal to the emulation processor 38 to select use / non-use of the multiplier. The control processor 52 controls the CPUS signal based on information input from the system development device 54 and the like. Alternatively, a control register as shown in FIG. 5 is provided in the control register in the emulation interface 39, and the CPU of the emulator 40 executes the software to specify the control register to generate the CPUS signal. can do. In this case, it is convenient to enable writing only in the execution mode of the emulation software, so-called break mode. Erroneous setting is not performed due to malfunction of software of a user under development.

エミュレーション用プロセッサ３８およびエミュレータ４０を複数のＣＰＵをサポート可能にすることによって、実際のマイクロコンピュータのみを開発すればよく、開発効率を向上することができる。なお、ＥＭＬＳＰ２９のアドレス指定情報も、エミュレーション用インタフェース３９内の制御レジスタで指定することができる。 By enabling the emulation processor 38 and the emulator 40 to support a plurality of CPUs, only an actual microcomputer needs to be developed, and development efficiency can be improved. Note that the address designation information of the EMLSP 29 can also be designated by a control register in the emulation interface 39.

エミュレーション用プロセッサ３８やエミュレータ４０については、特開平３−２７１８３４号公報、あるいは特開平６−１５００２６号公報などに記載されている。 The emulation processor 38 and the emulator 40 are described in JP-A-3-271834 and JP-A-6-150026.

図７に、制御信号ＣＰＵＳ設定方法の一例である、マイクロコンピュータの主要部をブロック図で示す。ＣＰＵＳ信号をレジスタによらず、ＣＭＯＳインバータ回路５８の出力とする。かかるＣＭＯＳインバータ回路５８は、Ｐチャネル型ＭＯＳトランジスタＱ１、Ｎチャネル型ＭＯＳトランジスタＱ２で構成される。このＣＭＯＳインバータ回路５８の入力は、抵抗Ｒを介して電源Ｖｄｄに接続されると共に、保護回路Ｑ３、Ｑ４を介して端子Ｐに結合される。端子Ｐは、ワイヤＷによってグランドレベル電源用リードＬに接続されるか、解放状態とされるかが選択され、ＣＰＵＳの設定を行なう。 FIG. 7 is a block diagram showing a main part of a microcomputer which is an example of a control signal CPUS setting method. The CPUS signal is output from the CMOS inverter circuit 58 without depending on the register. The CMOS inverter circuit 58 includes a P-channel MOS transistor Q1 and an N-channel MOS transistor Q2. The input of the CMOS inverter circuit 58 is connected to a power supply Vdd via a resistor R and to a terminal P via protection circuits Q3 and Q4. The terminal P is connected to the ground level power supply lead L by a wire W or is set in an open state, and the CPU S is set.

端子Ｐが解放状態とされれば、ＣＭＯＳインバータ回路５８の入力はハイレベルとなって、ＣＰＵＳ信号は非活性状態になる。一方、端子Ｐが、ワイヤＷによって、グランドレベル電源用リードＬに接続されれば、ＣＭＯＳインバータ回路５８の入力はロウレベルとなって、ＣＰＵＳ信号は活性状態になる。乗算器を使用可能にする。 When the terminal P is released, the input of the CMOS inverter circuit 58 becomes high level, and the CPUS signal becomes inactive. On the other hand, if the terminal P is connected to the ground level power supply lead L via the wire W, the input of the CMOS inverter circuit 58 becomes low level and the CPUS signal is activated. Enable the multiplier.

端子Ｐは対応するリードを持たず、例えばプラスティックパッケージに封止された場合には、対応する端子を持たない。 The terminal P does not have a corresponding lead, and for example, does not have a corresponding terminal when sealed in a plastic package.

これにより、半導体集積回路装置のパッケージの端子を直接利用することなく、乗算器の制御を設定できるため、一定のパッケージを用いた場合に、有効な端子数の減少防ぐことができる。この場合、端子Ｐをグランドレベル電源端子に隣接して配置すると都合がよい。 Thus, since the control of the multiplier can be set without directly using the terminals of the package of the semiconductor integrated circuit device, it is possible to prevent the effective number of terminals from decreasing when a fixed package is used. In this case, it is convenient to arrange the terminal P adjacent to the ground level power supply terminal.

あるいは、端子Ｐをグランドレベル電源用リードＬにワイヤＷによって接続するか、しないかの選択を、半導体集積回路装置の配線変更として実現してもよい。ＣＭＯＳインバータ回路５８の入力を、半導体集積回路装置内部の電源電圧またはグランドのいずれに接続するかを選択すればよい。このとき、抵抗Ｒ及び端子Ｐは削除することができる。または、ＣＰＵＳビットをＰＲＯＭ素子などで構成してもよい。この場合、製造者が設定を行なってもよいし、ユーザが設定を行なってもよい。 Alternatively, whether or not the terminal P is connected to the ground level power supply lead L by the wire W or not may be realized as a change in wiring of the semiconductor integrated circuit device. It is sufficient to select whether to connect the input of the CMOS inverter circuit 58 to the power supply voltage or the ground inside the semiconductor integrated circuit device. At this time, the resistor R and the terminal P can be deleted. Alternatively, the CPUS bit may be constituted by a PROM element or the like. In this case, the manufacturer may make the setting, or the user may make the setting.

図８および図９に、ＣＰＵの内部レジスタ構成を示す。これらのレジスタは、図８の汎用レジスタおよび図９のコントロールレジスタの２つに分割される。以下、各レジスタについて説明する。 8 and 9 show the internal register configuration of the CPU. These registers are divided into two, a general-purpose register in FIG. 8 and a control register in FIG. Hereinafter, each register will be described.

（１）汎用レジスタ
ＣＰＵはこの汎用レジスタを８本有している。この汎用レジスタは３２ビット長からなり、すべて同じ機能を有しており、アドレスレジスタとしてもデータレジスタとしても使用することができる。データレジスタとしては３２ビット、１６ビットおよび８ビットレジスタとして使用できる。 (1) General-purpose registers The CPU has eight general-purpose registers. This general-purpose register has a 32-bit length and has the same function, and can be used as both an address register and a data register. The data register can be used as a 32-bit, 16-bit and 8-bit register.

アドレスレジスタ及び３２ビットレジスタとしては、一括して汎用レジスタＥＲ（ＥＲ０〜ＥＲ７）として使用する。１６ビットレジスタとしては、汎用レジスタＥＲを分割して汎用レジスタＥ（Ｅ０〜Ｅ７）、汎用レジスタＲ（Ｒ０〜Ｒ７）として使用する。これらは同等の機能を有しており、１６ビットレジスタを最大１６本まで使用することができる。 The address register and the 32-bit register are collectively used as general-purpose registers ER (ER0 to ER7). As a 16-bit register, the general-purpose register ER is divided and used as general-purpose registers E (E0 to E7) and general-purpose registers R (R0 to R7). These have equivalent functions and can use up to 16 16-bit registers.

８ビットレジスタとしては、汎用レジスタＲを分割して汎用レジスタＲＨ（Ｒ０Ｈ〜Ｒ７Ｈ）、汎用レジスタＲＬ（Ｒ０Ｌ〜Ｒ７Ｌ）として使用する。これらは同等の機能を有しており、８ビットレジスタを最大１６本まで使用することができる。 As the 8-bit register, the general-purpose register R is divided and used as general-purpose registers RH (R0H to R7H) and general-purpose registers RL (R0L to R7L). These have the same function and can use up to 16 8-bit registers.

図１０に、汎用レジスタの使用方法を示す。各レジスタは独立して使用方法を選択することができる。 FIG. 10 shows how to use a general-purpose register. The usage of each register can be independently selected.

汎用レジスタＥＲ７には、汎用レジスタとしての機能に加えて、スタックポインタ（ＳＰ）としての機能が割り当てられており、例外処理やサブルーチン分岐などで暗黙的に使用される。図１１にスタックの状態を示す。 The general-purpose register ER7 is assigned a function as a stack pointer (SP) in addition to the function as the general-purpose register, and is used implicitly in exception processing and subroutine branching. FIG. 11 shows the state of the stack.

（２）コントロールレジスタ
コントロールレジスタは、２４ビットのプログラムカウンタ（ＰＣ）と８ビットの拡張レジスタ（エクステンドレジスタ）（ＥＸＲ）および８ビットのコンディションコードレジスタ（ＣＣＲ）を含んでいる。 (2) Control Register The control register includes a 24-bit program counter (PC), an 8-bit extension register (extended register) (EXR), and an 8-bit condition code register (CCR).

１．プログラムカウンタ（ＰＣ）
２４ビットのカウンタで、ＣＰＵが次に実行する命令のアドレスを示している。ＣＰＵの命令は、すべて２バイト（ワード）を単位としているため、最下位ビットは無効である。（命令コードのリード時には最下位ビットは”０”とみなされる）。 1. Program counter (PC)
A 24-bit counter indicates the address of an instruction to be executed next by the CPU. Since all CPU instructions are in units of 2 bytes (words), the least significant bit is invalid. (The least significant bit is regarded as "0" when reading the instruction code).

分岐命令の実行アドレスの上位８ビットは無視される。プログラム領域として使用できるのは、Ｈ’００００００００〜Ｈ’００ＦＦＦＦＦＦの領域である。 The upper 8 bits of the execution address of the branch instruction are ignored. The area of H'00000000 to H'00FFFFFF can be used as the program area.

２．拡張レジスタ（ＥＸＲ）
８ビットのレジスタで、トレースビット（Ｔ）、割込みマスクビット（Ｉ２〜Ｉ０）を含む８ビットで構成されている。 2. Extension register (EXR)
It is an 8-bit register, and is composed of 8 bits including a trace bit (T) and an interrupt mask bit (I2 to I0).

ビット７：トレースビット（Ｔ）
トレースビットか否かを指定する。本ビットが”０”にクリアされているときは命令を順次実行する。”１”にセットされているときは１命令実行する毎にトレース例外処理を実行する。 Bit 7: trace bit (T)
Specify whether it is a trace bit or not. When this bit is cleared to "0", instructions are sequentially executed. When set to "1", trace exception processing is executed each time one instruction is executed.

ビット６〜４：リザーブビット
リザーブビットである。 Bits 6-4: Reserved bits These are reserved bits.

ビット２〜０：割込みマスクビット（Ｉ２〜Ｉ０）
割込み要求マスクレベル（０〜７）を指定する。 Bits 2 to 0: interrupt mask bits (I2 to I0)
Specify the interrupt request mask level (0 to 7).

ＥＸＲは、ＬＤＣ、ＳＴＣ、ＡＮＤＣ、ＯＲＣ、ＸＯＲＣ命令で実行することができる。このうち、ＳＴＣを除く命令を実行した場合、実行終了後３ステートの間は、ＮＭＩを含めてすべての割込みは受け付けられない。 EXR can be executed with LDC, STC, ANDC, ORC, and XORC instructions. When instructions other than the STC are executed, all interrupts including the NMI are not accepted for three states after the execution is completed.

３．コンディションコードレジスタ（ＣＣＲ）
８ビットのレジスタで、ＣＰＵの内部状態を示す。割込みマスクビット（Ｉ）とハーフキャリ（Ｈ）、ネガティブ（Ｎ）、ゼロ（Ｚ）、オーバフロー（Ｖ）、キャリ（Ｃ）を含む８ビットで構成されている。 3. Condition code register (CCR)
An 8-bit register indicating the internal state of the CPU. It is composed of 8 bits including an interrupt mask bit (I), a half carry (H), a negative (N), a zero (Z), an overflow (V), and a carry (C).

ビット７：割込みマスクビット（Ｉ）
本ビットが”１”にセットされると、割込みがマスクされる。ただし、ＮＭＩはＩビットに関係なく受け付けられる。例外処理の実行が開始されたときに”１”にセットされる。 Bit 7: interrupt mask bit (I)
When this bit is set to "1", the interrupt is masked. However, NMI is accepted regardless of the I bit. It is set to "1" when the execution of the exception processing is started.

ビット６：ユーザビット／割込みマスクビット（ＵＩ）
ソフトウェア（ＬＤＣ、ＳＴＣ、ＡＮＤＣ、ＯＲＣ、ＺＯＲＣ命令）でリード／ライトできる。割込みマスクビットとしても使用可能である。 Bit 6: user bit / interrupt mask bit (UI)
It can be read / written by software (LDC, STC, ANDC, ORC, ZORC instructions). It can also be used as an interrupt mask bit.

ビット５：ハーフキャリフラグ（Ｈ）
ＡＤＤ．Ｂ、ＡＤＤＸ．Ｂ、ＳＵＢ．Ｂ、ＳＵＢＸ．Ｂ、ＣＭＰ．Ｂ、ＮＥＧ．Ｂ命令の実行により、ビット３にキャリまたはボローが生じたとき”１”にセットされ、生じなかったとき”０”にクリアされる。また、ＡＤＤ．Ｗ、ＳＵＢ．Ｗ、ＣＭＰ．Ｗ、ＮＥＧ．Ｗ命令の実行により、ビット１１にキャリまたはボローが生じたとき、ＡＤＤ．Ｌ、ＳＵＢ．Ｌ、ＣＭＰ．Ｌ、ＮＥＧ．Ｌ命令の実行により、ビット２７にキャリまたはボローが生じたとき、”１”にセットされ、生じなかったとき”０”にクリアされる。 Bit 5: Half carry flag (H)
ADD. B, ADDX. B, SUB. B, SUBX. B, CMP. B, NEG. The bit 3 is set to “1” when a carry or a borrow occurs in the bit 3 by execution of the B instruction, and cleared to “0” when no carry occurs. ADD. W, SUB. W, CMP. W, NEG. When a carry or borrow occurs in bit 11 due to execution of the ADD. L, SUB. L, CMP. L, NEG. This bit is set to “1” when a carry or borrow occurs in bit 27 due to execution of the L instruction, and cleared to “0” when no carry occurs.

ビット４：ユーザビット（Ｕ）
ソフトウェア（ＬＤＣ、ＳＴＣ、ＡＮＤＣ、ＯＲＣ、ＸＯＲＣ命令）でリート／ライトできる。 Bit 4: User bit (U)
REIT / WRITE can be performed by software (LDC, STC, ANDC, ORC, XORC instructions).

ビット３：ネガティブフラグ（Ｎ）
データの最上位ビットを符号ビットとみなし、最上位ビットの値を格納する。 Bit 3: Negative flag (N)
The most significant bit of the data is regarded as a sign bit, and the value of the most significant bit is stored.

ビット２：ゼロフラグ（Ｚ）
データがゼロのとき”１”にセットされ、ゼロ以外のとき”０”にクリアされる。 Bit 2: Zero flag (Z)
It is set to "1" when the data is zero, and cleared to "0" when it is not zero.

ビット１：オーバフローフラグ（Ｖ）
算術演算命令により、オーバフローが生じたとき”１”にセットされる。それ以外のとき”０”にクリアされる。 Bit 1: Overflow flag (V)
Set to "1" when an overflow occurs due to an arithmetic operation instruction. Otherwise, it is cleared to "0".

ビット０：キャリフラグ（Ｃ）
演算の実行により、キャリが生じたとき”１”にセットされ、生じなかったとき”０”にクリアされる。キャリには次の種類がある。 Bit 0: carry flag (C)
It is set to "1" when a carry is generated by execution of an operation, and is cleared to "0" when no carry is generated. There are the following types of carry.

（ａ）加算結果のキャリ
（ｂ）減算結果のボロー
（ｃ）シフト／ローテートのキャリ
また、キャリフラグには、ビットアキュムレータの機能があり、ビット操作命令で使用される。なお、命令によってはフラグが変化しない場合がある。ＣＣＲは、ＬＤＣ、ＳＴＣ、ＡＮＤＣ、ＯＲＣ、ＸＯＲＣ命令で操作することができる。また、Ｎ、Ｚ、Ｖ、Ｃの各フラグは、条件分岐命令（Ｂｃｃ）で使用される。 (A) Carry of addition result (b) Borrow of subtraction result (c) Carry of shift / rotate The carry flag has a bit accumulator function and is used in a bit manipulation instruction. Note that the flag may not change depending on the instruction. The CCR can be operated with LDC, STC, ANDC, ORC, and XORC instructions. The flags N, Z, V, and C are used in a conditional branch instruction (Bcc).

４．積和レジスタ（ＭＡＣ）
６４ビットのレジスタであり、積和演算結果を格納する。３２ビットのＭＡＣＨ、ＭＡＣＬから構成される。ＭＡＣＨは下位１０ビットが有効であり、上位は符号拡張されている。 4. Multiply-accumulate register (MAC)
This is a 64-bit register that stores the product-sum operation result. It is composed of 32-bit MACH and MACL. The lower 10 bits of the MACH are valid, and the upper bits are sign-extended.

図１２に、ＣＰＵの基本動作タイミングを示す。 FIG. 12 shows the basic operation timing of the CPU.

ＡＤＤ．ＷＲ０、Ｒ１のようなレジスタ間演算のタイミングである。特に制限はされないものの、内部データバスは１６ビットであって、内蔵ＲＯＭ、ＲＡＭのリード／ライトを１ステートでリード／ライト可能とする。 ADD. This is the timing of the operation between registers such as W R0 and R1. Although there is no particular limitation, the internal data bus is 16 bits, and the internal ROM and RAM can be read / written in one state.

Ｔ０のＣ２（φ＃同期。＃は反転論理を示す）で、ＣＰＵ１のアドレスバッファ（ＭＡＢ）３３からアドレスがＩＡＢに出力される。 The address is output to the IAB from the address buffer (MAB) 33 of the CPU 1 at C2 of T0 (φ # synchronization; # indicates inverted logic).

Ｔ１のＣ１（φ同期）で、ＩＡＢの内容がＰＡＢに出力され、リードサイクルが開始される。Ｃ２でリードデータが内部データバスに得られ、これをＩＲ２１にラッチする。以上の動作は以前の命令の実行の制御によって行われる。 At C1 (φ synchronization) of T1, the contents of IAB are output to PAB, and a read cycle is started. In C2, read data is obtained on the internal data bus, and this is latched in IR21. The above operation is performed by controlling the execution of the previous instruction.

直前の命令の実行が終了すると、最も早く命令の実行が開始される場合には、Ｔ２のＣ１で命令コードがＣＯＮＴ２２に入力されて、命令の内容が解読される。解読結果に従って、制御信号を出力して、各部の制御を行なう。命令の一部（レジスタ指定フィールド：ＲＳＥＬ入力信号１）がレジスタセレクタ２３に与えられる。 When the execution of the immediately preceding instruction is completed and the execution of the instruction is started earliest, the instruction code is input to the CONT 22 at C1 of T2, and the contents of the instruction are decoded. According to the decoding result, a control signal is output to control each unit. A part of the instruction (register designation field: RSEL input signal 1) is given to the register selector 23.

レジスタ間演算命令では、Ｔ２のＣ２で、ＰＣの内容を内部バスＧＢに読み出して、ＭＡＢ３３とＩＮＣ２７に入力する。ＭＡＢ３３からアドレスＩＡＢが出力される。レジスタセレクタ２３に制御信号を与える。ＲＳＥＬ入力信号１と制御信号Ａ（Ｒｓ−ＤＢ出力、Ｒｄ−ＧＢ出力）とに基づいて、レジスタ選択信号Ｂが生成される。ＲＳＥＬ入力信号２がレジスタセレクタ２３に与えられる。 In the inter-register operation instruction, the contents of the PC are read out to the internal bus GB at C2 of T2 and input to the MAB 33 and the INC 27. The address IAB is output from the MAB 33. A control signal is given to the register selector 23. The register selection signal B is generated based on the RSEL input signal 1 and the control signal A (Rs-DB output, Rd-GB output). The RSEL input signal 2 is provided to the register selector 23.

Ｔ３から、次の次の命令がリードされる。Ｔ３のＣ１で、ＩＮＣ２７でインクリメント（＋２）された結果が、内部バスＷＢを経由して、ＰＣ３０にライトされる。ＲＳＥＬ入力信号２と制御信号Ｂ（ＷＢ−Ｒｄ入力）とに基づいて、レジスタ選択信号Ｃが生成される。レジスタ選択信号Ｂがレジスタを選択して、ソース側、デスティネーション側のレジスタ（Ｓ、Ｄ）のデータをＡＬＵ２６に入力する。ＡＬＵ２６の演算内容はＣＯＮＴ２２が制御信号Ｃによって指示する。加減算・論理演算・シフトなどは１クロックで演算を行なうことができる。例えば、上記命令では１６ビットの加算を行なう。次の命令のＣＯＮＴ２２へのロードを指示する。ＲＳＥＬ入力信号２と制御信号Ｂ（ＷＢ−Ｒｄ入力）とに基づいて、レジスタ選択信号Ｃが生成される。 From T3, the next next instruction is read. At C1 of T3, the result incremented (+2) by INC27 is written to PC 30 via internal bus WB. A register selection signal C is generated based on the RSEL input signal 2 and the control signal B (WB-Rd input). The register selection signal B selects the register, and inputs the data of the source-side and destination-side registers (S, D) to the ALU 26. The contents of the operation of the ALU 26 are specified by the control signal C by the CONT 22. Operations such as addition, subtraction, logical operation, and shift can be performed in one clock. For example, the above instruction performs 16-bit addition. Instructs loading of the next instruction into CONT22. A register selection signal C is generated based on the RSEL input signal 2 and the control signal B (WB-Rd input).

Ｔ３のＣ２で、ＡＬＵ２６の演算結果（Ｒ）が、内部バスＷＢを経由して、レジスタ選択信号Ｃが選択したデスティネーション側のレジスタにライトされる。制御信号Ｃによって、ＣＣＲ３１の更新を行なう。更に次の次の命令をＩＲ２１に取り込む。同時に、次の命令の実行が開始され、例えば、ＰＣ３０の内容を読み出して、ＭＡＢ３３とＩＮＣ２７に入力される。レジスタ間演算を実質的に１ステートで実行できる。２つの演算器２６、２７の入出力バスの数に対応した数の内部バスとして（演算器に対応して、内部バスを増加させることなく）、バス即ち配線の増加による物理的規模の増加を抑止している。 At C2 in T3, the operation result (R) of the ALU 26 is written to the destination register selected by the register selection signal C via the internal bus WB. The CCR 31 is updated by the control signal C. Further, the next next instruction is taken into the IR 21. At the same time, the execution of the next instruction is started. For example, the contents of the PC 30 are read and input to the MAB 33 and the INC 27. The operation between registers can be executed in substantially one state. As the number of internal buses corresponding to the number of input / output buses of the two computing units 26 and 27 (without increasing the number of internal buses corresponding to the computing units), an increase in the physical scale due to an increase in the number of buses, that is, the number of wires, is reduced. It is deterred.

図１３に、ＣＰＵの基本動作タイミングを示す。 FIG. 13 shows the basic operation timing of the CPU.

ＭＯＶ．ＷＲ０、＠Ｒ１のような、レジスタ間接によるデータライトのタイミングである。 MOV. This is the timing of data write by register indirect, such as WR0 and ＠ R1.

Ｔ０のＣ２で、ＣＰＵ１のＭＡＢ３３からアドレスがＩＡＢに出力される。 At C2 of T0, the address is output from the MAB 33 of the CPU 1 to the IAB.

Ｔ１のＣ１で、アドレスがＰＡＢに出力され、リードサイクルが開始される。Ｃ２でリードデータが内部データバスに得られ、これをＩＲ２１にラッチする。 At C1 of T1, the address is output to PAB, and a read cycle is started. In C2, read data is obtained on the internal data bus, and this is latched in IR21.

直前の命令の実行が終了すると、Ｔ２のＣ１で命令コードがＣＯＮＴ２２に入力されて、命令の内容が解読され、各部の制御を行なう。命令の一部のレジスタ指定フィールド（ＲＳＥＬ入力信号１）がレジスタセレクタ２３に与えられる。レジスタ間接によるデータライトでは、制御信号ＡとＲＳＥＬ入力信号１とに基づいて、レジスタ選択信号Ａが与えられ、アドレスとして指定されたレジスタが選択される。 When the execution of the immediately preceding instruction is completed, the instruction code is input to the CONT 22 at C1 of T2, the content of the instruction is decoded, and each unit is controlled. A register designation field (RSEL input signal 1) of a part of the instruction is supplied to the register selector 23. In data writing by register indirect, a register selection signal A is provided based on the control signal A and the RSEL input signal 1, and a register specified as an address is selected.

Ｔ２のＣ２で、選択されたレジスタの内容（Ａ）を内部バスＧＢに読み出して、ＭＡＢ３３を経由してアドレスＩＡＢに出力される。ＲＳＥＬ入力信号２がレジスタセレクタ２３に与えられる。ＲＳＥＬ入力信号２と制御信号Ｂ（Ｒｄ−ＤＢ出力）とに基づいて、レジスタ選択信号Ｂが生成される。 At C2 of T2, the content (A) of the selected register is read out to the internal bus GB and output to the address IAB via the MAB 33. The RSEL input signal 2 is provided to the register selector 23. The register selection signal B is generated based on the RSEL input signal 2 and the control signal B (Rd-DB output).

Ｔ３のＣ１で、制御信号Ｃの一部がＣＯＮＴ２２に入力され、状態遷移が行われる（ステートマシンが構成される）。ＩＡＢの内容に基づいて、ライトサイクルが開始される。選択されたレジスタの内容（Ｄ）を内部バスＤＢに読み出して、データバッファ（ＤＢＷ）を経由して内部データバスに出力される。 At C1 of T3, a part of the control signal C is input to the CONT 22, and a state transition is performed (a state machine is configured). A write cycle is started based on the contents of IAB. The contents (D) of the selected register are read out to the internal bus DB and output to the internal data bus via the data buffer (DBW).

Ｔ３のＣ２で、ＰＣ３０の内容を内部バスＧＢに読み出して、ＭＡＢ３３とＩＮＣ２７に入力する。ＭＡＢ３３からアドレスＩＡＢが出力される。ＲＳＥＬ入力信号１と制御信号Ａ（Ｒｄ−ＧＢ出力）とに基づいて、レジスタ選択信号Ｂが生成される。 At C2 of T3, the contents of the PC 30 are read out to the internal bus GB and input to the MAB 33 and the INC 27. The address IAB is output from the MAB 33. The register selection signal B is generated based on the RSEL input signal 1 and the control signal A (Rd-GB output).

Ｔ４から、次の次の命令がリードされる。 From T4, the next next instruction is read.

Ｔ４のＣ１で、ＩＮＣ２７でインクリメント（＋２）された結果が、内部バスＷＢを経由して、ＰＣ３０にライトされる。レジスタ選択信号Ｂがレジスタを選択して、データレジスタ（Ｄ）のデータをＡＬＵ２６に入力する。ＡＬＵ２６の演算内容はＣＯＮＴ２２が制御信号Ｃによって指示する。転送の場合はデータのチェックのみを行なう。次の命令のＣＯＮＴ２２へのロードを指示する。 At C1 of T4, the result incremented (+2) by INC27 is written to PC 30 via internal bus WB. The register selection signal B selects the register, and inputs the data of the data register (D) to the ALU 26. The contents of the operation of the ALU 26 are specified by the control signal C by the CONT 22. In the case of transfer, only data check is performed. Instructs loading of the next instruction into CONT22.

Ｔ４のＣ２で、制御信号Ｃによって、チェックした結果によって、ＣＣＲ３１の更新を行なう。更に次の次の命令をＩＲ２１に取り込む。同時に、次の命令の実行が開始され、例えば、ＰＣ３０の内容を読み出して、ＭＡＢ３３とＩＮＣ２７に入力される。 At C2 of T4, the CCR 31 is updated according to the result checked by the control signal C. Further, the next next instruction is taken into the IR 21. At the same time, the execution of the next instruction is started. For example, the contents of the PC 30 are read and input to the MAB 33 and the INC 27.

ＲＳＥＬに入力するタイミングを、ＲＳＥＬ入力１（アドレスレジスタ、ソースレジスタ）とＲＳＥＬ入力２（データレジスタ、ディスティネーションレジスタ）のように、レジスタ指定フィールド毎にことなったタイミング（φ同期とφ＃同期）とすることにより、命令実行の高速化を実現することができる。 The timing of input to the RSEL is different from the timing of each register specification field (φ synchronization and φ # synchronization), such as RSEL input 1 (address register, source register) and RSEL input 2 (data register, destination register). By doing so, high-speed instruction execution can be realized.

表１６乃至表１９に、本発明に関係のある命令の説明を示す。 Tables 16 to 19 show descriptions of instructions related to the present invention.

表１６は命令コードを示し、表１８は命令の実行状態を示し、表１９はコンディションコードの変化を示している。表１７はレジスタフィールドと汎用レジスタの対応を示している。 Table 16 shows the instruction code, Table 18 shows the execution state of the instruction, and Table 19 shows the change of the condition code. Table 17 shows the correspondence between register fields and general-purpose registers.

［表１６］

[Table 16]

［表１７］

[Table 17]

［表１８］

[Table 18]

［表１９］

[Table 19]

積和演算を行なうＭＡＣ命令、ＭＡＣレジスタをクリアするＣＬＲＭＡＣ命令、汎用レジスタの内容をＭＡＣレジスタに転送するＬＤＭＡＣ命令、ＭＡＣレジスタの内容を汎用レジスタに転送するＳＴＭＡＣ命令がある。 There are a MAC instruction for performing a sum-of-products operation, a CLRMAC instruction for clearing a MAC register, an LDMAC instruction for transferring the contents of a general-purpose register to a MAC register, and an STMAC instruction for transferring the contents of a MAC register to a general-purpose register.

また、汎用レジスタの待避／復帰命令には、１本のレジスタの待避／復帰命令に、ＰＵＳＨ、ＰＯＰ命令が、複数レジスタの待避／復帰命令にＳＴＭ／ＬＤＭ命令がある。ＳＴＭ／ＬＤＭ命令には、指定するレジスタ本数に対応して３種類がある。 The save / restore instructions for general-purpose registers include PUSH and POP instructions for save / restore instructions for one register, and the STM / LDM instructions for save / restore instructions for multiple registers. There are three types of STM / LDM instructions corresponding to the number of designated registers.

図１４に、乗算器２の概略ブロック図を示す。 FIG. 14 is a schematic block diagram of the multiplier 2.

乗算器２は、入力ラッチ（Ｘ）６１、入力ラッチ（Ｙ）６２、部分積生成回路６３、マツチプレクサ６４、デコーダ６５Ａ、６５Ｂ、６５Ｃ、選択回路６６Ａ、６６Ｂ、６６Ｃ、加算器６７、フィードバック回路６８、乗算結果レジスタ６９などによって構成されている。 The multiplier 2 includes an input latch (X) 61, an input latch (Y) 62, a partial product generation circuit 63, a matching multiplexer 64, decoders 65A, 65B, 65C, selection circuits 66A, 66B, 66C, an adder 67, and a feedback circuit 68. , A multiplication result register 69 and the like.

乗算器２は１６×１６ビットの乗算を行なうことを基本動作とし、さらに、これを利用して、１６×１６ビット＋４２ビットの積和演算を可能としている。 The basic operation of the multiplier 2 is to perform 16 × 16-bit multiplication, and by using this, a product-sum operation of 16 × 16 bits + 42 bits can be performed.

乗算器は乗算動作は、２次のブースのデコードを用いて、１６ビット×６ビットを３回行なうようにされる。 The multiplier performs a 16-bit × 6-bit multiplication operation three times using a second-order Booth decoding.

１６ビットの乗数Ｙは、Ｙ＝−ｙ［１６］・２＾１５＋Σ（ｙ［ｉ］・２＾（ｉ−１））＝Σ（ｙ［２ｊ］＋ｙ［２ｊ＋１］−２・ｙ［２ｊ＋２］）・２＾２ｊと表現される。ｉ＝１〜１５、ｊ＝０〜７、ｙ［０］＝０である。 The 16-bit multiplier Y is Y = −y [16] · 2］ 15 + Σ (y [i] · 2 ＾ (i−1)) = Σ (y [2j] + y [2j + 1] −2 · y [2j + 2] ) · 2 ＾ 2j. i = 1 to 15, j = 0 to 7, and y [0] = 0.

被乗数Ｘとの乗算は、Ｘ・Ｙ＝Σ（ｙ［２ｊ］＋ｙ［２ｊ＋１］−２・ｙ［２ｊ＋２］）・Ｘ・２＾２ｊとなる。ｙ［２ｊ］＋ｙ［２ｊ＋１］−２・ｙ［２ｊ＋２］は、ｙ［２ｊ］、ｙ［２ｊ＋１］、ｙ［２ｊ＋２］の値の組み合わせにより、０、±１、±２の５種類があるから、部分積（ｙ［２ｊ］＋ｙ［２ｊ＋１］−２・ｙ［２ｊ＋２］）・Ｘは、０、±Ｘ、±２Ｘの５種類である。この内、０、Ｘは直ちに得られる。２Ｘは、１ビットの左シフト（最下位ビットは０）、−Ｘは２の補数であり、論理反転＋１で得られる。−２Ｘは、論理反転＋１の１ビットの左シフト（最下位ビットは０）で得る。 The multiplication with the multiplicand X is XY = Σ (y [2j] + y [2j + 1] −2 · y [2j + 2]) · X · 2 ＾ 2j. There are five types of y [2j] + y [2j + 1] −2 · y [2j + 2] depending on the combination of the values of y [2j], y [2j + 1], and y [2j + 2]. , Partial products (y [2j] + y [2j + 1] −2 · y [2j + 2]) · X are of five types: 0, ± X, ± 2X. Of these, 0 and X are obtained immediately. 2X is one bit left shift (the least significant bit is 0), -X is a two's complement, and is obtained by logical inversion +1. -2X is obtained by shifting the logical inversion +1 by one bit to the left (the least significant bit is 0).

Ｘ側は、採りうる０、±Ｘ、±２Ｘの５種類の部分積（１７ビット）を生成しておく。この５種類を部分積選択回路６６Ａ〜６６Ｃに与える。 The X side generates five types of partial products (17 bits) of 0, ± X, and ± 2X that can be taken. These five types are given to partial product selection circuits 66A to 66C.

一方、Ｙ入力のｙ［２ｊ］、ｙ［２ｊ＋１］、ｙ［２ｊ＋２］をデコードして、０、±１、±２を判定して、この結果によって、部分積選択回路６６Ａ〜６６Ｃを制御して、前記５種類の部分積を選択する。１回に２ビット単位３種類の選択を行なう。これを加算器６７で加算する。加算は、２ビットずつシフトしたそれぞれ１７ビットの部分積を加算して、２２ビット分の結果を得る。不足する上位ビットは符号拡張したデータとする。 On the other hand, y [2j], y [2j + 1], y [2j + 2] of the Y input are decoded to determine 0, ± 1, ± 2, and the partial product selection circuits 66A to 66C are controlled based on the result. Then, the five types of partial products are selected. Three types of selection are performed at a time in units of two bits. This is added by the adder 67. The addition is performed by adding 17-bit partial products shifted by 2 bits each to obtain a result of 22 bits. Insufficient upper bits are sign-extended data.

この内、下位６ビットは、乗算結果レジスタ６９のビット０〜５に格納される。上位１６ビットはフィードバック回路６８を介して、２回めの加算に含められる。２回目の処理では、前記同様に得られた部分積選択回路６６Ａ〜６６Ｃの出力である、２ビットずつシフトしたそれぞれ１７ビットの部分積と、１回めの処理の上位１６ビットを加算する。２２ビット分の結果を得る。不足する上位ビットは符号拡張したデータとする。この内、下位６ビットは、乗算結果レジスタ６９のビット１１〜６に格納される。上位１６ビットはフィードバック回路６８を介して、２回めの加算に含められる。 Of these, the lower 6 bits are stored in bits 0 to 5 of the multiplication result register 69. The upper 16 bits are included in the second addition via the feedback circuit 68. In the second processing, the 17-bit partial products, which are the outputs of the partial product selection circuits 66A to 66C obtained in the same manner and are shifted by 2 bits each, and the upper 16 bits of the first processing are added. The result for 22 bits is obtained. Insufficient upper bits are sign-extended data. Of these, the lower 6 bits are stored in bits 11 to 6 of the multiplication result register 69. The upper 16 bits are included in the second addition via the feedback circuit 68.

同様に３回めの処理が行われる。加算結果下位２０ビットが乗算結果レジスタ６９のビット３１〜１２に格納される。加算結果の最上位２ビットは無視する。 Similarly, a third process is performed. The lower 20 bits of the addition result are stored in bits 31 to 12 of the multiplication result register 69. The two most significant bits of the addition result are ignored.

積和演算の場合は、前回の結果が同時に加算されるようにする。 In the case of a product-sum operation, the previous results are added simultaneously.

更に、前回の結果の上位ビットとの４回目の加算を行って、４２ビットの結果を得る。１６ビット×１６ビットの積和の結果を４２ビットで得ることにより、約１０００回の積和演算を繰り返してもオーバフローしないことになる。 Furthermore, a fourth addition with the upper bit of the previous result is performed to obtain a 42-bit result. By obtaining the result of a 16-bit × 16-bit product-sum operation in 42 bits, no overflow occurs even if the product-sum operation is repeated about 1000 times.

内部論理の構成上は、４０ビットの結果とすれば、加算器が２２ビット長でよく、論理的規模を最適化できる。 In terms of the structure of the internal logic, if the result is 40 bits, the adder may be 22 bits long, and the logical scale can be optimized.

図１５に、上記のワードサイズ乗算（１６ビット×１６ビット）の演算方法を示す。 FIG. 15 shows an operation method of the word size multiplication (16 bits × 16 bits).

８ビット×８ビットのバイトサイズ乗算は、上位を拡張する。符号無しの場合０拡張、符号付きの場合符号拡張を行なう。いずれの場合も、上位は全ビット”０”か全ビット”１”かのいずれかであって、ブースのデコードは０になる。このため、３回目の処理は行なわずに、２回の処理で済む。 The byte size multiplication of 8 bits × 8 bits extends the high order. If there is no sign, 0 extension is performed, and if signed, sign extension is performed. In either case, the upper bit is either all bits “0” or all bits “1”, and the Booth decoding becomes “0”. Therefore, the second process is sufficient without performing the third process.

図４において、ＣＰＵ１から乗算器２に、乗算を示す信号として、ＭＵＬ信号（制御信号Ｂ）、符号付き／無しを示す信号としてＵＮＳＩＮＰ信号（制御信号Ｂ）、バイト／ワードサイズを示す信号として、ＢＹＴＥ信号（制御信号Ｂ）、積和演算の起動信号として、ＭＡＣ信号（制御信号Ｂ）、ＭＡＣＨからＣＰＵへのデータ転送要求信号として、ＳＴＭＡＣＨ信号（制御信号Ｃ）、ＭＡＣＬからＣＰＵへのデータ転送要求信号として、ＳＴＭＡＣＬ信号（制御信号Ｃ）、ＣＰＵからＭＡＣＨへのデータ転送要求信号として、ＬＤＭＡＣＨ信号（制御信号Ｂ）、ＣＰＵからＭＡＣＬへのデータ転送要求信号として、ＬＤＭＡＣＬ信号（制御信号Ｂ）、ＭＡＣレジスタのクリア信号として、ＣＬＲＭＡＣ信号（制御信号Ｂ）、乗数の転送信号として、ＳＴＸ信号（制御信号Ｂ）、被乗数の転送信号として、ＳＴＹ信号（制御信号Ｂ）、乗算結果の転送信号として、ＭＵＬＲＤ信号（制御信号Ｃ）を与える。 In FIG. 4, the MUL signal (control signal B) as a signal indicating multiplication, the UNSINP signal (control signal B) as a signal indicating signed / non-signed, and a signal indicating byte / word size from the CPU 1 to the multiplier 2 BYTE signal (control signal B), MAC signal (control signal B) as a start signal of the product-sum operation, STMACH signal (control signal C) as a data transfer request signal from MACH to CPU, data transfer from MACL to CPU As a request signal, a STMACL signal (control signal C), as a data transfer request signal from the CPU to the MACH, an LDMACH signal (control signal B), as a data transfer request signal from the CPU to the MACL, an LDMACL signal (control signal B), As a clear signal of the MAC register, a CLRMAC signal (control signal B), a transfer signal of a multiplier To, STX signal (control signal B), as multiplicand transfer signal, STY signal (control signal B), as the transfer signal of the multiplication result, give MULRD signal (control signal C).

また、乗算器２からＣＰＵ１へ、演算実行中を示す信号として、ＢＵＳＹ信号、フラグに反映すべきデータとして、ＶＦＬＡＧ、ＺＦＬＡＧ、ＮＦＬＡＧ信号が与えられる。 Further, the multiplier 2 supplies the CPU 1 with a BUSY signal as a signal indicating that the operation is being performed and a VFLAG, ZFLAG, and NFLAG signal as data to be reflected in the flag.

ＣＰＵ−乗算器の相互のデータ転送にＸバス、Ｙバスを使用する。 An X bus and a Y bus are used for mutual data transfer between the CPU and the multiplier.

また、ＳＹＳＣＲ１３から飽和演算の選択を示す信号として、ＦＩＸＥＤ信号が与えられる。また、テストモード信号として、ＴＥＳＴＭＯＤＥ信号が与えられる。ＴＥＳＴＭＯＤＥ信号が活性状態になって、テストモードが指示されると、乗算器は１回の処理のみで動作を終了するようにする。 Further, a FIXED signal is given from the SYSCR 13 as a signal indicating selection of a saturation operation. Also, a TESTMODE signal is provided as a test mode signal. When the TESTMODE signal is activated and the test mode is instructed, the multiplier terminates the operation only once.

処理を短縮することによって、ＣＰＵの命令実行ステートも短縮できる。入力データの組み合わせを種々変更してテストする場合に、テスト時間を短縮できる。加算を１回しか行なわないので、テスト設計を容易にすることができる。３回の処理を行って、加算結果が蓄積されて、所望の動作のテストの結果を演算結果として得にくくなることがない。 By shortening the processing, the instruction execution state of the CPU can also be shortened. When a test is performed with various combinations of input data, the test time can be reduced. Since the addition is performed only once, test design can be facilitated. By performing the processing three times, the addition result is accumulated, and the result of the test of the desired operation is not difficult to obtain as the operation result.

ＴＥＳＴＭＯＤＥ信号が非活性状態であっても、ＣＰＵ乃至乗算器のテストを行なうことができることは言うまでもない。ＴＥＳＴＭＯＤＥ信号は、前記のＣＰＵＳのようにレジスタの出力として供給することができる。 It goes without saying that the CPU or the multiplier can be tested even when the TESTMODE signal is inactive. The TESTMODE signal can be supplied as an output of a register, as in the CPUS described above.

表２０に乗算器２の内部のフラグの検出方式およびＣＰＵ１への転送方式を示す。 Table 20 shows a method of detecting a flag inside the multiplier 2 and a method of transferring the flag to the CPU 1.

［表２０］

[Table 20]

乗算器２のフラグ仕様は次のように、１．Ｖフラグおよび２．Ｎフラグ、Ｚフラグから構成されている。 The flag specifications of the multiplier 2 are as follows: V flag and 2. It consists of an N flag and a Z flag.

１．Ｖフラグ
セット条件はＭＡＣ命令実行中にオーバフローまたはアンダフローが発生したときである。 1. The V flag is set when an overflow or underflow occurs during the execution of the MAC instruction.

クリア条件はＬＤＭＡＣまたはＣＬＲＭＡＣ命令を実行したときである。 The clear condition is when the LDMAC or CLRMAC instruction is executed.

乗算器からＣＣＲへの転送は、ＳＴＭＡＣ実行時に行われる。 The transfer from the multiplier to the CCR is performed when STMAC is executed.

従って、一連の連続した積和演算中に１回でもオーバフローまたはアンダフローが発生すると、乗算器のＶフラグはセットされた状態を保持する。ＬＤＭＡＣまたはＣＬＲＭＡＣ命令を実行して、新しい一連の積和演算の開始が判断されると、乗算器のＶフラグはクリアされる。 Therefore, if an overflow or underflow occurs at least once during a series of successive product-sum operations, the V flag of the multiplier remains set. When the LDMAC or CLRMAC instruction is executed to determine the start of a new series of multiply-accumulate operations, the multiplier V flag is cleared.

２．Ｎフラグ、Ｚフラグ
ＭＵＬ命令用Ｎ、ＺフラグとＭＡＣ命令用Ｎ、Ｚフラグを別々に設けて出力する。 2. N flag and Z flag The N and Z flags for the MUL instruction and the N and Z flags for the MAC instruction are separately provided and output.

乗算器からＣＣＲへの転送は、乗算（ＭＵＬ）命令の場合、乗算結果の転送時、ＭＡＣ命令の場合ＳＴＭＡＣ実行時に行われる。 The transfer from the multiplier to the CCR is performed at the time of transfer of the multiplication result in the case of a multiplication (MUL) instruction, and at the time of execution of STMAC in the case of a MAC instruction.

なお、ＮフラグとＺフラグは、ＬＤＭＡＣ／ＣＬＲＭＡＣによって変化しない。 Note that the N flag and the Z flag are not changed by LDMAC / CLRMAC.

図１６にＶフラグ仕様の実現の概念図を示し、図１７にＮフラグ、Ｚフラグ仕様の実現の概念図を示す。 FIG. 16 shows a conceptual diagram of realizing the V flag specification, and FIG. 17 shows a conceptual diagram of realizing the N flag and Z flag specifications.

Ｖフラグはセットリセット型のフリップフロップ（ＲＳ−Ｆ／Ｆ）で構成され、一旦、オーバフローまたはアンダフローが発生すると、ＳＴＭＡＣにより読み出すまで状態を保持する。 The V flag is constituted by a set-reset type flip-flop (RS-F / F), and once an overflow or underflow occurs, keeps its state until it is read out by STMAC.

Ｎ、Ｖフラグはラッチ回路（Ｄ−Ｆ／Ｆ）とマルチプレクサ（ＭＰＸ）で構成される。ＭＡＣ命令実行時の演算結果はラッチ回路に保持され、マルチプレクサに与えられる。また、乗算命令実行時の演算結果は、直接マルチプレクサに与えられる。マルチプレクサはＳＴＭＡＣ命令のときラッチ回路の出力を出力し、それ以外のとき演算結果を直接出力する。 The N and V flags are composed of a latch circuit (DF / F) and a multiplexer (MPX). The operation result at the time of executing the MAC instruction is held in the latch circuit and supplied to the multiplexer. The operation result at the time of executing the multiplication instruction is directly supplied to the multiplexer. The multiplexer outputs the output of the latch circuit in the case of the STMAC instruction, and directly outputs the operation result otherwise.

ＭＡＣ命令とその他の命令は並列して動作する。ＭＡＣ命令のフラグを随時ＣＣＲに反映しては、並列実行中の命令のフラグ動作と矛盾してしまう。ＭＡＣ命令のフラグを乗算器内部で保持して、ＳＴＭＡＣ命令実行時にＣＣＲに転送するようにして、上記矛盾を回避することができる。 The MAC instruction and other instructions operate in parallel. If the flag of the MAC instruction is reflected in the CCR as needed, it contradicts the flag operation of the instruction being executed in parallel. The contradiction can be avoided by holding the flag of the MAC instruction inside the multiplier and transferring it to the CCR when the STMAC instruction is executed.

図１８に、バススイッチ３４のブロック図を示す。 FIG. 18 shows a block diagram of the bus switch 34.

バススイッチ３４は、選択回路７１Ａ、７１Ｂ、拡張回路７２Ａ、７２Ｂ、７２Ｃ、出力バッファ７３Ａ、７３Ｂ、７３Ｃ、７３Ｄから構成される。バススイッチは、ＣＰＵ内部バスのＧＢ、ＤＢ、ＷＢと、乗算器のＸバス、Ｙバスと、マイクロコンピュータの内部バスであるＩＤＢのインタフェースを行なう。 The bus switch 34 includes selection circuits 71A and 71B, extension circuits 72A, 72B and 72C, and output buffers 73A, 73B, 73C and 73D. The bus switch interfaces the GB, DB, and WB of the CPU internal bus, the X and Y buses of the multiplier, and the IDB that is the internal bus of the microcomputer.

乗算の開始時、及びＬＤＭＡＣ命令の場合は、ＧＢ、ＤＢからＸバス、Ｙバスに入力される。ＧＢ、ＤＢの入力は、選択回路７１Ａ、７１Ｂで選択される。これは汎用レジスタ及び内部バスが３２ビット構成であるために、乗数、被乗数が８ビットまたは１６ビットであるために、ＣＯＮＴ２２の制御信号Ａ及びレジスタ制御信号Ａに基づいて、所定の部分が選択される。 At the start of multiplication and in the case of an LDMAC instruction, data is input from GB and DB to the X bus and Y bus. The inputs of GB and DB are selected by the selection circuits 71A and 71B. This is because a general-purpose register and an internal bus have a 32-bit configuration, and since a multiplier and a multiplicand are 8 bits or 16 bits, a predetermined portion is selected based on the control signal A of the CONT 22 and the register control signal A. You.

選択回路７１Ａ、７１Ｂの出力は、拡張回路７２Ａ、７２Ｂに入力される。ＣＯＮＴ２２の制御信号Ａに基づいて、符号無しバイトサイズ乗算（ＭＵＬＸＵ．Ｂ）の場合、上位８ビットを０拡張する。また、符号付きバイトサイズ乗算（ＭＵＬＸＳ．Ｂ）の場合、上位８ビットを符号拡張する。ワードサイズの場合は、選択回路７１Ａ、７１Ｂの出力をそのまま出力する。 Outputs of the selection circuits 71A and 71B are input to extension circuits 72A and 72B. In the case of unsigned byte size multiplication (MULXU.B) based on the control signal A of the CONT 22, the upper 8 bits are extended to zero. In the case of signed byte size multiplication (MULXS.B), the upper 8 bits are sign-extended. In the case of the word size, the outputs of the selection circuits 71A and 71B are output as they are.

選択回路７１Ａ、７１Ｂの出力は出力バッファ７３Ｂ、７３Ｃに入力される。ＣＯＮＴ２２の制御信号Ａに基づいて、所定のタイミングで、拡張回路７２Ａ、７２Ｂの出力をＸバスまたはＹバスに出力する。 Outputs of the selection circuits 71A and 71B are input to output buffers 73B and 73C. Based on the control signal A of the CONT 22, the outputs of the extension circuits 72A and 72B are output to the X bus or the Y bus at a predetermined timing.

乗算の終了時、及びＳＴＭＡＣ命令の場合は、Ｘバス、ＹバスからＷＢへの出力が行われる。Ｘバス、Ｙバスの入力は拡張回路７２Ｃに入力される。これは、ＭＡＣＨの上位２２ビットを符号拡張する。拡張回路７２Ｃの出力は出力バッファに入力される。ＣＯＮＴ２２の制御信号Ｃに基づいて、所定のタイミングで、拡張回路７２Ｃの出力をＷＢに出力する。 At the end of the multiplication and in the case of the STMAC instruction, the output is performed from the X bus and the Y bus to the WB. Inputs of the X bus and the Y bus are input to the extension circuit 72C. This sign-extends the upper 22 bits of the MACH. The output of the extension circuit 72C is input to the output buffer. Based on the control signal C of the CONT 22, the output of the extension circuit 72C is output to WB at a predetermined timing.

ＭＡＣ命令のデータリード時、ＩＤＢからＸバス、Ｙバスへの入力が、それぞれ１回ずつ行われる。ＣＯＮＴ２２の制御信号Ａに基づいて、所定のタイミングで、ＩＤＢの内容をＸバスまたはＹバスに出力する。また、ＩＤＢは、ＤＢＷからの出力を入力可能とされ、ＤＢＲ及びＩＲへデータを入力可能とされる。 At the time of data reading of the MAC instruction, input from the IDB to the X bus and the Y bus is performed once each. The contents of the IDB are output to the X bus or the Y bus at a predetermined timing based on the control signal A of the CONT 22. The IDB can receive an output from the DBW and can input data to the DBR and the IR.

図１９、２０に、ＭＡＣ命令の動作タイミングを示す。 19 and 20 show the operation timing of the MAC instruction.

例えば、ＭＡＣ＠ＥＲ１＋，＠ＥＲ２＋命令などの例である。この場合のＥＲ１を第１のアドレスレジスタ、ＥＲ２を第２のアドレスレジスタとする。前記同様に、Ｔ２からＭＡＣ命令の実行が開始される。 For example, it is an example of a MAC $ ER1 +, $ ER2 + instruction. In this case, ER1 is a first address register, and ER2 is a second address register. As described above, the execution of the MAC instruction is started from T2.

まず、プリフィックスコードの実行を行い、ＰＣ３０の内容をアドレスとした命令のリードを行い、また、ＰＣ３０の内容のインクリメントを行なう。 First, a prefix code is executed, an instruction using the contents of the PC 30 as an address is read, and the contents of the PC 30 are incremented.

Ｔ３のφ＃で、レジスタ制御信号Ａに基づいて、第１のアドレスの内容をＧＢに読み出して、ＭＡＢ３３に転送し、ＩＡＢに出力する。 At φ # of T3, the contents of the first address are read to GB based on the register control signal A, transferred to the MAB 33, and output to the IAB.

Ｔ４のφで、第１のアドレスの内容をＧＢに読み出して、ＡＬＵ２６に入力し、インクリメントを行なう。 At φ of T4, the contents of the first address are read out to GB and input to the ALU 26 to perform the increment.

Ｔ４のφ＃で、インクリメント結果を、ＷＢ経由で、第１のアドレスレジスタに格納する。バススイッチ３４に入力した、第１のリードデータをＸバスに出力すると共に、制御信号Ｂに含まれるＳＴＸ信号を活性状態にし、乗算器２にこの内容を入力ラッチＸにラッチさせる。同時に、第２のアドレスの内容をＧＢに読み出して、ＭＡＢに転送し、ＩＡＢに出力する。 At φ # of T4, the increment result is stored in the first address register via WB. The first read data input to the bus switch 34 is output to the X bus, and the STX signal included in the control signal B is activated to cause the multiplier 2 to latch this content in the input latch X. At the same time, the contents of the second address are read to GB, transferred to MAB, and output to IAB.

Ｔ５のφで、第２のアドレスの内容をＧＢに読み出して、ＡＬＵ２６に入力し、インクリメントを行なう。Ｔ５のφ＃で、インクリメント結果を、ＷＢ経由で、第２のアドレスレジスタに格納する。バススイッチ３４に入力した、第２のリードデータをＹバスに出力すると共に、制御信号Ｂに含まれるＳＴＹ信号を活性状態にし、乗算器２にこの内容を入力ラッチＹにラッチさせる。ＭＡＣ信号を活性状態にして、積和演算動作の開始を指示する。同時に、ＰＣ３０の内容をアドレスとした命令のリードを行い、また、ＰＣ３０の内容のインクリメントを行なう。 At φ of T5, the content of the second address is read out to GB and input to the ALU 26 to perform the increment. At φ # of T5, the increment result is stored in the second address register via WB. The second read data input to the bus switch 34 is output to the Y bus, and the STY signal included in the control signal B is activated to cause the multiplier 2 to latch this content in the input latch Y. The MAC signal is activated to instruct the start of the product-sum operation. At the same time, an instruction using the contents of the PC 30 as an address is read, and the contents of the PC 30 are incremented.

Ｔ６のφで、インクリメント結果をＰＣ３０に格納する。一方、ＢＵＳＹ信号が活性状態になる。ＭＡＣ命令では、ＣＰＵ１は乗算器２とは並列に動作し、ＢＵＳＹ信号を無視して、次の命令の実行を開始する。 At φ of T6, the increment result is stored in the PC 30. On the other hand, the BUSY signal is activated. With the MAC instruction, the CPU 1 operates in parallel with the multiplier 2 and starts executing the next instruction ignoring the BUSY signal.

ＭＡＣ命令を連続して実行した場合も、次のＭＡＣ命令がアドレス計算を行っている間に、乗算器の動作が終了するために、ＭＡＣ命令実行にウェイトを挿入することはない。 Even when the MAC instruction is continuously executed, the operation of the multiplier is completed while the next MAC instruction is performing the address calculation, so that no wait is inserted into the execution of the MAC instruction.

プリフィックスコードを付した命令コードとすることにより、特開平−５１９８１号公報に記載されているように、互換性を保持しつつ命令セットを拡張することができる。また、乗算器動作中に、次の積和演算を行った場合、命令フェッチとデータのアクセスを行なうことができるから、命令長が長くなっても実行時間を低下させることがない。積和演算を連続的に高速に実行することができる。 By using an instruction code with a prefix code, an instruction set can be extended while maintaining compatibility, as described in Japanese Patent Application Laid-Open No. Hei 5-11981. In addition, when the next product-sum operation is performed during the operation of the multiplier, the instruction fetch and the data access can be performed, so that the execution time does not decrease even if the instruction length becomes long. The product-sum operation can be continuously executed at high speed.

乗算器２は、演算終了時点で、ＳＹＳＣＲ１３のＭＡＣＳビットを参照して、オーバフローが発生していれば、ＭＡＣレジスタの内容を、上限（Ｈ’７ＦＦＦＦＦＦＦ）または下限（Ｈ’８０００００００）に固定する。 At the end of the operation, the multiplier 2 refers to the MACS bit of the SYSCR 13 and fixes the contents of the MAC register to the upper limit (H'7FFFFFFF) or the lower limit (H'80000000) if an overflow has occurred.

図２１、２２に、ＳＴＭＡＣ、ＬＤＭＡＣ命令の動作タイミングを示す。 21 and 22 show the operation timing of the STMAC and LDMAC instructions.

例えば、ＳＴＭＡＣＭＡＣＨ，ＥＲ２命令などの例である。前記同様に、Ｔ２からＳＴＭＡＣ命令の実行が開始される。 For example, it is an example of an STMAC MACH, ER2 instruction. As described above, the execution of the STMAC instruction is started from T2.

まず、ＢＵＳＹ信号の状態をサンプリングする。ＢＵＳＹ信号が活性状態であれば、ウェイト状態になる。 First, the state of the BUSY signal is sampled. When the BUSY signal is in the active state, the state changes to the wait state.

Ｔ２のφ＃でＰＣの内容がＧＢに読み出され、ＭＡＢ３３に入力されて、ＩＡＢに出力される。また、ＩＮＣ２７に入力されて、インクリメント動作が開始される。 At φ # of T2, the contents of the PC are read to GB, input to MAB 33, and output to IAB. Also, the data is input to the INC 27, and the increment operation is started.

ＣＰＵ内部のクロックがロウレベルで固定され、ＣＰＵの動作を停止する。直前にＭＡＣ命令を実行した場合、ＢＵＳＹ信号は３ステートの期間活性状態であり、ＳＴＭＡＣ命令も３ステートウェイト状態になる。 The clock inside the CPU is fixed at the low level, and the operation of the CPU is stopped. When the MAC instruction is executed immediately before, the BUSY signal is active for three states, and the STMAC instruction is also in a three-state wait state.

Ｔ５でＢＵＳＹ信号が非活性状態になると、Ｔ６からクロックの動作が開始される。 When the BUSY signal becomes inactive at T5, the clock operation starts at T6.

Ｔ６のφで、インクリメント結果がＷＢに出力され、ＰＣ３０に格納される。ＳＴＭＡＣＨまたはＳＴＭＡＣＬ信号が活性状態になって、ＭＡＣレジスタの読み出しが指示される。ＭＡＣレジスタの内容がＸバス、Ｙバスに出力される。特に制限はされないものの、Ｘバスが上位、Ｙバスが下位の内容とされる。 At φ of T6, the increment result is output to WB and stored in the PC 30. The STMACH or STMACL signal becomes active, and an instruction to read the MAC register is issued. The contents of the MAC register are output to the X bus and the Y bus. Although there is no particular limitation, the X bus is the upper content and the Y bus is the lower content.

Ｔ６のφ＃で、Ｘバス、Ｙバスの内容がＷＢに出力されて、指定されたレジスタ（ＳＴＭＡＣＭＡＣＨ，ＥＲ２の場合は、ＥＲ２）に格納される。同時に、乗算器のフラグの内容がＣＣＲのＮ、Ｚ、Ｖフラグに格納される。 At φ # of T6, the contents of the X bus and the Y bus are output to the WB and stored in the designated register (ER2 in the case of STMAC MACH, ER2). At the same time, the contents of the flags of the multipliers are stored in the N, Z, and V flags of the CCR.

また、ＬＤＭＡＣＥＲ１，ＭＡＣＬ命令などの例である。 Also, examples are LDMAC ER1 and MACL instructions.

Ｔ８からＬＤＭＡＣ命令の実行が開始される。 The execution of the LDMAC instruction is started from T8.

Ｔ９のφで、指定されたレジスタ（ＬＤＭＡＣＥＲ１，ＭＡＣＬの場合は、ＥＲ１）の内容が読み出される。この内容がＸバス、Ｙバスに出力される。 At φ in T9, the contents of the specified register (ER1 in the case of LDMAC ER1, MACL) are read. This content is output to the X bus and the Y bus.

Ｔ９のφ＃で、ＬＤＭＡＣＨまたはＬＤＭＡＣＬ信号が活性状態になる。ＰＣ３０の内容がＧＢ経由で、ＭＡＢ３３とＩＮＣ２７に入力される。 At φ # of T9, the LDMACH or LDMACL signal is activated. The contents of the PC 30 are input to the MAB 33 and the INC 27 via GB.

Ｔ１０のφで、インクリメントされた結果がＷＢ経由で、ＰＣ３０に格納される。また、Ｘバス、Ｙバスの内容がＭＡＣレジスタに格納される。 At φ of T10, the incremented result is stored in the PC 30 via WB. The contents of the X bus and the Y bus are stored in the MAC register.

前記同様に、ＢＵＳＹ信号が活性状態の場合は、ＬＤＭＡＣ命令も活性状態になるようにしてもよい。 Similarly to the above, when the BUSY signal is active, the LDMAC instruction may be activated.

ＣＬＲＭＡＣ命令は、概略ＬＤＭＡＣ命令と同様の動作で、ＬＤＭＡＣ命令のＬＤＭＡＣＨ、Ｌ信号と同じタイミングで、ＣＬＲＭＡＣ信号を活性状態にするようにすればよい。 The CLRMAC command may be configured to activate the CLRMAC signal at the same timing as the LDMACH and L signals of the LDMAC command, in substantially the same operation as the LDMAC command.

図２３、２４に、乗算器を用いた乗算命令のタイミング図を示す。 23 and 24 show timing diagrams of a multiplication instruction using a multiplier.

なお、ＣＯＮＴ２２の部分に、内部のステートマシンのステップの番号を記載した。これは、基本的には、ＣＯＮＴ２２の出力のフィードバック信号で形成される。また、制御信号ＣＰＵＳを用いて、乗算器を使用するか使用しないかを選択する。例えば、ＭＵＬＸＵ．ＷＲ１，ＥＲ０などのバイトサイズ・符号無し乗算の例である。前記同様に、Ｔ２から実行を開始する。 The number of the step of the internal state machine is described in the part of CONT22. This is basically formed by the feedback signal of the output of the CONT 22. Further, using the control signal CPUS, whether to use or not use the multiplier is selected. For example, MULXU. This is an example of byte size / unsigned multiplication such as WR1, ER0. As described above, the execution starts from T2.

命令が解読されると、まず、Ｔ２のφ＃で、レジスタ制御信号Ａによって、汎用レジスタの読みだしを指示する。読出された結果は、ＧＢ、ＤＢおよびバススイッチ３４を介して、Ｘバス、Ｙバスに出力される。 When the instruction is decoded, first, the reading of the general-purpose register is instructed by the register control signal A at φ # of T2. The read result is output to the X bus and the Y bus via the GB, the DB, and the bus switch 34.

制御信号Ｂに含まれるＳＴＸ、ＳＴＹ信号に基づいて、Ｘバス、Ｙバスの内容は、Ｔ３のφで乗算器の入力ラッチにラッチされる。また、同時に、制御信号Ｂに含まれるＭＵＬ信号によって、乗算器に乗算を指示する。ＣＯＮＴ２２から、バイト／ワードの選択、符号付／符号無の選択、乗算／積和の選択を上記制御信号によって指示する。 Based on the STX and STY signals included in the control signal B, the contents of the X bus and Y bus are latched by the input latch of the multiplier at φ of T3. At the same time, the MUL signal included in the control signal B instructs the multiplier to perform multiplication. The control signal indicates from the CONT 22 the selection of byte / word, the selection of signed / unsigned, and the selection of multiplication / sum of products.

乗算器は、Ｔ３のφ＃で、ＢＵＳＹ信号を与える。また、マルチプレクサやデコーダを動作させる。Ｔ４のφで、１回目の加算を行なう。Ｔ４のφで部分積を乗算結果レジスタとフィードバックラッチに格納する。これを３回繰り返す。ＢＵＳＹ信号が活性状態になったことに呼応して、ＣＰＵはウェイト状態になる。 The multiplier provides a BUSY signal at φ # of T3. Also, a multiplexer and a decoder are operated. The first addition is performed at φ of T4. The partial product is stored in the multiplication result register and the feedback latch at φ of T4. This is repeated three times. The CPU enters the wait state in response to the BUSY signal being activated.

Ｔ５のφでＢＵＳＹが非活性状態になって、ＣＰＵは動作を再開し、Ｔ６のφで、制御信号Ｃに含まれる、ＭＵＬＲＤ信号を活性状態にして、乗算結果レジスタのリードを指示する。乗算結果レジスタの内容は、Ｘバス、Ｙバスおよびバススイッチ３４を経由して、Ｔ６のφ＃でＷＢを経由して、レジスタ制御信号Ｃによって指定されるレジスタに格納される。同時に、乗算の結果フラグがＣＣＲ３１に格納される。 At φ in T5, BUSY becomes inactive, the CPU resumes operation, and at φ in T6, the MULRD signal included in the control signal C is activated to instruct reading of the multiplication result register. The contents of the multiplication result register are stored in the register specified by the register control signal C via the X bus, the Y bus, and the bus switch 34, and via the WB at φ # of T6. At the same time, the result flag of the multiplication is stored in the CCR 31.

前記の通り、乗算命令はＢＵＳＹ信号によって、クロックが停止し、ウェイト状態となる。バイトサイズ符号無し乗算命令（ＭＵＬＸＵ．ＢＲ０Ｌ，Ｒ１など）は１ウェイトが挿入され、３ステートで実行される。ワードサイズ符号無し乗算命令（ＭＵＬＸＵ．ＷＲ０，ＥＲ１など）は２ウェイトが挿入され、４ステートで実行される。なお、符号付き乗算の場合は、それぞれ、プリフィックスコードの実行が付加される。 As described above, the clock of the multiplication instruction is stopped by the BUSY signal, and the multiplication instruction enters a wait state. A byte-size unsigned multiplication instruction (MULXU.B R0L, R1, etc.) is inserted in one wait and executed in three states. A word-size unsigned multiplication instruction (MULXU.WR0, ER1, etc.) is inserted in two wait states and executed in four states. In the case of signed multiplication, execution of a prefix code is added to each.

ＢＵＳＹ信号によって、演算実行の終了を判定することにより、制御回路（ＣＯＮＴ２２）の論理を縮小することができる。 The logic of the control circuit (CONT22) can be reduced by determining the end of the execution of the operation by the BUSY signal.

ＴＥＳＴＭＯＤＥ信号が活性状態になって、テストモードを指示された場合には、乗算器は１ステップの動作のみを行い、ＢＵＳＹ信号は非活性状態を保持する。ＣＰＵは１ステートで処理を終了する。 When the test mode is activated and the test mode is instructed, the multiplier performs only one-step operation, and the BUSY signal keeps the inactive state. The CPU ends the process in one state.

図２５、２６に、乗算器を用いない乗算命令のタイミング図を示す。乗算器を用いない乗算は、特に制限はされないものの、除算と類似のシーケンスで行なうようにする。 25 and 26 show timing diagrams of a multiplication instruction without using a multiplier. Multiplication without using a multiplier is performed in a sequence similar to division, although there is no particular limitation.

命令が解読されると、まず、汎用レジスタの読みだしを指示する。読出された結果は、符号判定を行なう。符号付／符号無の選択に対応して、符号判定を行い、除数は符号反転し、負数にする。そのほかは正数にする。 When the instruction is decoded, first, the instruction to read the general-purpose register is issued. A sign determination is performed on the read result. In accordance with the selection of signed / unsigned, sign judgment is performed, and the divisor is sign-inverted to a negative number. Others are positive numbers.

被乗数を上位、下位は０にして１ビットずつシフトし、シフトした結果によって、下位側に乗数を加算するかを決める。その結果に対して、さらにシフトを行い、シフトした結果によって、下位側に乗数を加算を行っていく。これを８または１６回繰返して、乗算結果の絶対値を得る。例えば、ＭＵＬＸＵ．ＢＲ１Ｌ，Ｒ０などのバイトサイズ・符号無し乗算の例である。前記同様に、Ｔ２から除算命令の実行が開始される。前記のような、所定の処理を行った後、Ｔ５から部分乗算を行なう。 The multiplicand is shifted one bit at a time, with the upper and lower bits set to 0, and whether to add the multiplier to the lower bit is determined based on the shifted result. The result is further shifted, and a multiplier is added to the lower side according to the shifted result. This is repeated eight or sixteen times to obtain the absolute value of the multiplication result. For example, MULXU. B is an example of byte size / unsigned multiplication such as R1L, R0. As described above, the execution of the division instruction is started from T2. After performing the predetermined processing as described above, partial multiplication is performed from T5.

部分乗算は、左シフト処理と加算で構成される。前回の加算と次回のシフト処理を同一のＡＬＵ処理で行なうようにする。 Partial multiplication includes left shift processing and addition. The previous addition and the next shift processing are performed by the same ALU processing.

Ｔ５のφ１に同期して、指定されたレジスタ（ディスティネーションレジスタＲｄ）から被乗数を読み込み、シフト処理を行なう。シフト処理の結果（部分積）がφ＃に同期してＷＢを経由して、Ｒｄにライトされる。また、シフトアウトされたキャリが内部で保持される。 In synchronization with φ1 of T5, the multiplicand is read from the designated register (destination register Rd) and shift processing is performed. The result (partial product) of the shift processing is written to Rd via WB in synchronization with φ #. Also, the shifted out carry is held internally.

Ｔ６のφ１に同期して、Ｒｄから部分積を読み込み、部分乗算処理を行なう。前回のキャリが”１”である場合、部分積の下位８ビットに乗数を加算し、１６ビットでシフト処理を行い、最下位ビットは”０”とする。前記以外の場合、部分積に１６ビットでシフト処理を行い、最下位ビットは”０”とする。かかる結果がφ＃に同期してＷＢを経由して、Ｒｄにライトされる。また、シフトアウトされたキャリが内部で保持される。この動作を７回繰り返す。 In synchronization with φ1 of T6, a partial product is read from Rd, and a partial multiplication process is performed. If the previous carry is "1", a multiplier is added to the lower 8 bits of the partial product, a shift process is performed with 16 bits, and the least significant bit is set to "0". In other cases, the partial product is shifted by 16 bits, and the least significant bit is set to “0”. The result is written to Rd via WB in synchronization with φ #. Also, the shifted out carry is held internally. This operation is repeated seven times.

Ｔ１３では、上記同様の判定を行い、前回のキャリが”１”である場合、部分積の下位８ビットに乗数を加算する。前記以外の場合、部分積を保持する。１６ビットの積が得られる。かかる結果がφ＃に同期してＷＢを経由して、Ｒｄにライトされる。符号付きの場合は、Ｔ１４で符号処理を行なう。また、ワードサイズの場合は、部分乗算処理が８回追加される。 At T13, the same determination as above is performed. If the previous carry is “1”, a multiplier is added to the lower 8 bits of the partial product. Otherwise, the partial product is retained. A 16-bit product is obtained. The result is written to Rd via WB in synchronization with φ #. If signed, sign processing is performed at T14. In the case of a word size, a partial multiplication process is added eight times.

先に保持した符号判定結果に基づいて、積の符号処理を行なう。すなわち、乗数・被乗数の一方が正数、他方が負数のときは、積の符号を反転する（０から積を引く）。 The sign processing of the product is performed based on the sign judgment result held earlier. That is, when one of the multiplier and the multiplicand is a positive number and the other is a negative number, the sign of the product is inverted (the product is subtracted from 0).

図２７に、乗算命令の状態遷移図を示す。例えば、ＭＵＬＸＵ．ＢＲ１Ｌ，Ｒ０などのバイトサイズ・符号無し乗算の例である。命令の実行が開始されると、ＣＰＵＳ信号の状態によって分岐する。 FIG. 27 shows a state transition diagram of the multiplication instruction. For example, MULXU. B is an example of byte size / unsigned multiplication such as R1L, R0. When the execution of the instruction is started, the operation branches depending on the state of the CPUS signal.

ＣＰＵＳ信号が活性状態であって、乗算器の使用が許可されると、図６の動作を行なう。即ち、ステップ１で、指定されたレジスタの内容を、ＧＢ、ＤＢを経由して、Ｘ、Ｙバスに出力して、乗算器に供給する。ＢＵＳＹ信号の状態を判定する。テストモードであれば、ＢＵＳＹ信号は非活性状態であって、直ちにステップ２に遷移する。 When the CPUS signal is active and the use of the multiplier is permitted, the operation of FIG. 6 is performed. That is, in step 1, the contents of the designated register are output to the X and Y buses via GB and DB, and supplied to the multiplier. The state of the BUSY signal is determined. In the test mode, the BUSY signal is in an inactive state, and the process immediately transits to step S2.

ＢＵＳＹ信号が活性状態であると、ＷＡＩＴ状態に遷移する。ＢＵＳＹ信号は非活性状態になるとステップ２に遷移する。 When the BUSY signal is in the active state, the state transitions to the WAIT state. When the BUSY signal goes into the inactive state, the flow goes to step 2.

ステップ２では、Ｘ、Ｙバスの内容をＷＢを経由して、指定されたレジスタにライトする。例えば、乗算器のフラグの内容をＣＣＲ３１に格納する。次の命令の実行を開始する。 In step 2, the contents of the X and Y buses are written to the designated register via the WB. For example, the contents of the flag of the multiplier are stored in the CCR 31. Start executing the next instruction.

ＣＰＵＳ信号が非活性状態であって、乗算器の使用が禁止されると、図２５、２６の動作を行なう。即ち、ステップ１、２でデータアライメントなどを行った後、ステップ３から、部分乗算処理を行なう。ステップ３では、ＧＢ上位に被乗数を出力し、これをシフトする。 When the CPUS signal is in the inactive state and the use of the multiplier is prohibited, the operations of FIGS. 25 and 26 are performed. That is, after performing data alignment and the like in steps 1 and 2, a partial multiplication process is performed from step 3. In step 3, the multiplicand is output to the upper GB, and the multiplicand is shifted.

ステップ４では、ＧＢに部分積を、ＤＢ下位に乗数を出力し、ＡＬＵ２６で加算を行なう。前のステップでシフトアウトしたビットが”１”であれば、加算した結果が選択され、シフトアウトしたビットが”０”であれば、ＧＢの内容が選択され、シフトを行なう。これをステップ１０まで繰り返す。 In step 4, the partial product is output to GB and the multiplier is output to the lower part of DB, and the ALU 26 performs addition. If the bit shifted out in the previous step is "1", the result of the addition is selected. If the bit shifted out is "0", the contents of GB are selected and the shift is performed. This is repeated until step 10.

ステップ１１では、ＧＢに部分積を、ＤＢ下位に乗数を出力し、ＡＬＵ２６で加算を行なう。前のステップでシフトアウトしたビットが”１”であれば、加算した結果が選択され、シフトアウトしたビットが”０”であれば、ＧＢの内容が選択される。シフトは行なわない。ステップ１２で、命令のリードを行なう。例えば、積を検査して、ＣＣＲに反映する。次の命令の実行を開始する。 In step 11, the partial product is output to GB and the multiplier is output to the lower part of DB, and the ALU 26 performs addition. If the bit shifted out in the previous step is "1", the addition result is selected. If the bit shifted out is "0", the content of GB is selected. No shift is performed. In step 12, an instruction is read. For example, the product is checked and reflected in the CCR. Start executing the next instruction.

図２５、２６に、除算命令のタイミング図を示している。例えば、ＤＩＶＸＵ．ＢＲ１Ｌ，Ｒ０などのバイトサイズ・符号無し除算の例である。前記同様に、Ｔ２から除算命令の実行が開始される。除数の符号反転などの、所定の処理を行った後、Ｔ５から部分除算を行なう。部分除算は、左シフト処理と減算で構成される。前回の減算と次回のシフト処理を同一のＡＬＵ処理で行なうようにする。 25 and 26 show timing diagrams of the division instruction. For example, DIVXU. B is an example of byte size / unsigned division such as R1L, R0. As described above, the execution of the division instruction is started from T2. After performing predetermined processing such as sign inversion of the divisor, partial division is performed from T5. Partial division includes left shift processing and subtraction. The previous subtraction and the next shift processing are performed by the same ALU processing.

Ｔ５のφ１に同期して、指定されたレジスタ（ディスティネーションレジスタＲｄ）から被除数を読み込み、シフト処理を行なう。シフト処理の結果（部分剰余）がφ＃に同期してＷＢを経由して、Ｒｄにライトされる。また、シフトアウトされたキャリが内部で保持される。 In synchronization with φ1 of T5, the dividend is read from the designated register (destination register Rd) and shift processing is performed. The result of the shift processing (partial remainder) is written to Rd via WB in synchronization with φ #. Also, the shifted out carry is held internally.

Ｔ６のφ１に同期して、Ｒｄから部分剰余を読み込み、部分除算処理を行なう。前回のキャリが”１”である場合、または、部分剰余の上位８ビットが除数以上である場合、部分剰余の上位８ビットから除数を減算（除数の符号反転を行っている場合、除数の反転を加算）し、１６ビットでシフト処理を行い、最下位ビットは”１”とする。前記以外の場合、部分剰余に１６ビットでシフト処理を行い、最下位ビットは”０”とする。かかる結果がφ＃に同期してＷＢを経由して、Ｒｄにライトされる。また、シフトアウトされたキャリが内部で保持される。この動作を７回繰り返す。 In synchronization with φ1 of T6, a partial remainder is read from Rd, and a partial division process is performed. When the previous carry is “1”, or when the upper 8 bits of the partial remainder are greater than or equal to the divisor, the divisor is subtracted from the upper 8 bits of the partial remainder (if the sign of the divisor is inverted, the divisor is inverted). Are added), and a shift process is performed with 16 bits, and the least significant bit is set to “1”. In other cases, the partial remainder is shifted by 16 bits, and the least significant bit is set to “0”. The result is written to Rd via WB in synchronization with φ #. Also, the shifted out carry is held internally. This operation is repeated seven times.

Ｔ１３では、上記同様の判定を行い、前回のキャリが”１”である場合、または、部分剰余の上位８ビットが除数以上である場合、部分剰余の上位８ビットから除数を減算し、下位８ビットでシフト処理を行い、最下位ビットは”１”とする。前記以外の場合、部分剰余に下位８ビットでシフト処理を行い、最下位ビットは”０”とする。いずれの場合も、ビット７の値は失われる。上位８ビットに剰余、下位８ビットに商が得られる。かかる結果がφ＃に同期してＷＢを経由して、Ｒｄにライトされる。 At T13, the same determination as above is performed, and if the previous carry is “1” or if the upper 8 bits of the partial remainder are greater than or equal to the divisor, the divisor is subtracted from the upper 8 bits of the partial remainder to obtain the lower 8 bits. Shift processing is performed on the bits, and the least significant bit is set to “1”. In other cases, the partial remainder is shifted by the lower 8 bits, and the least significant bit is set to “0”. In either case, the value of bit 7 is lost. The remainder is obtained in the upper 8 bits, and the quotient is obtained in the lower 8 bits. The result is written to Rd via WB in synchronization with φ #.

符号付きの場合は、Ｔ１４で符号処理を行なう。また、ワードサイズの場合は、部分除算処理が８回追加される。 If signed, sign processing is performed at T14. In the case of a word size, a partial division process is added eight times.

図２８に、ＡＬＵ２６の概略ブロック図を示す。ＡＬＵ２６は、算術論理演算回路７６と、選択回路７７、シフト回路７８、制御回路７９から構成される。乗除算に直接関係のない部分は省略している。 FIG. 28 shows a schematic block diagram of the ALU 26. The ALU 26 includes an arithmetic and logic operation circuit 76, a selection circuit 77, a shift circuit 78, and a control circuit 79. Parts not directly related to multiplication / division are omitted.

算術論理演算回路７６は、ＧＢとＤＢの内容を入力して、加算、減算、論理積、論理和、排他的論理和などの演算を行い、結果を出力する。選択回路７７は、算術論理演算回路の出力と、ＧＢの内容を入力して、いずれかを選択して出力する。シフト回路７８は、選択回路７７の出力を入力して、シフト処理を行なう。 The arithmetic and logic operation circuit 76 inputs the contents of GB and DB, performs operations such as addition, subtraction, logical product, logical sum, and exclusive logical sum, and outputs the result. The selection circuit 77 receives the output of the arithmetic and logic operation circuit and the content of GB, selects one of them, and outputs it. Shift circuit 78 receives the output of selection circuit 77 and performs a shift process.

選択回路７７、シフト回路７８は制御回路７９によって制御される。制御回路７９は、ＣＯＮＴ２２の与える制御信号と算術論理演算回路７６とシフト回路７８の出力によって、選択回路７７の選択とシフト回路７８のシフト入力を制御する。制御回路７９が、前記の部分乗算、部分除算の判定を行なう。条件が成立していれば、算術論理演算回路７６の出力を選択し、除算の場合、１をシフト回路に入力する。条件が不成立であれば、ＧＢの入力を選択し、除算の場合、０をシフト回路に入力する。乗算の場合のシフト回路の入力は、０とされる。 The selection circuit 77 and the shift circuit 78 are controlled by the control circuit 79. The control circuit 79 controls the selection of the selection circuit 77 and the shift input of the shift circuit 78 based on the control signal supplied from the CONT 22 and the outputs of the arithmetic and logic operation circuit 76 and the shift circuit 78. The control circuit 79 determines the above-described partial multiplication and partial division. If the condition is satisfied, the output of the arithmetic and logic operation circuit 76 is selected. In the case of division, 1 is input to the shift circuit. If the condition is not satisfied, GB input is selected, and in the case of division, 0 is input to the shift circuit. The input of the shift circuit in the case of multiplication is set to 0.

除算の部分除算と乗算の部分乗算の処理のシーケンス、及びＡＬＵ２６の回路構成を共通化する。除算と乗算を共通化して、ＣＯＮＴ２２の論理規模を縮小できる。 The processing sequence of partial division of division and partial multiplication of multiplication and the circuit configuration of the ALU 26 are shared. The logical scale of the CONT 22 can be reduced by sharing the division and the multiplication.

これにより、乗算器を持たないＣＰＵを容易に提供することができる。乗算器を持つＣＰＵにおいて不必要な乗算器を用いない乗算の論理を除算と共通化して、論理規模の増加を最低限にすることができる。 This makes it possible to easily provide a CPU having no multiplier. In a CPU having a multiplier, the logic of multiplication that does not use an unnecessary multiplier is shared with division, thereby minimizing the increase in the logic scale.

また、ＣＰＵＳによって、乗算器を用いない選択を可能にすることによって、テスト性を向上することができる。テスト時に、乗算器を用いるか用いないかを選択することに両方の論理をテストの対象にすることができる。 Further, the testability can be improved by enabling selection without using the multiplier by the CPUS. During testing, both logic can be tested by choosing to use or not use a multiplier.

複数命令の待避／復帰命令の命令コードは表１６の通りである。 Table 16 shows the instruction codes of the save / restore instructions of a plurality of instructions.

最初に使用するレジスタ番号が、命令コード中に指定される。例えば、昭和５年３月（株）日立製作所発行『Ｈ８／５００シリーズプログラミングマニュアル』に記載の複数命令の待避／復帰命令のように任意のレジスタの組み合わせを指定するのではなく、連続したレジスタ番号の固定の組み合わせとし、２、３、４本の固定の組み合わせとしている。命令コードも、レジスタ本数に応じて３種類を用意している。 The register number to be used first is specified in the instruction code. For example, instead of specifying an arbitrary combination of registers as in the save / restore instruction of multiple instructions described in the “H8 / 500 Series Programming Manual” issued by Hitachi, Ltd. , And two, three, and four fixed combinations. Three types of instruction codes are prepared according to the number of registers.

複数命令の待避命令は、待避するレジスタの本数に対応して、
ＳＴＭ（ＥＲｌ−ＥＲｌ＋１），＠−ＳＰ
ＳＴＭ（ＥＲｍ−ＥＲｍ＋２），＠−ＳＰ
ＳＴＭ（ＥＲｎ−ＥＲｎ＋３），＠−ＳＰ
の３種類を有する。ｌ＝０、２、４、６であり、ｍ、ｎ＝０、４である。指定した汎用レジスタをスタックに待避する。例えば、ＥＲ０とＥＲ１をスタックに待避する場合は、
ＳＴＭ（ＥＲ０−ＥＲ１），＠−ＳＰ
を用いる。ＥＲ０、ＥＲ１の順番でスタックにライトされ、スタックポインタ（ＥＲ７）は＋８される。命令コード中のレジスタ指定部は、最初に待避されるレジスタ番号にしてある。 The save instruction of multiple instructions corresponds to the number of registers to be saved,
STM (ERl-ERl + 1), ＠ -SP
STM (ERm-ERm + 2), ＠ -SP
STM (ERn-ERn + 3), ＠ -SP
There are three types. l = 0, 2, 4, 6 and m, n = 0, 4. Save the specified general-purpose register to the stack. For example, when saving ER0 and ER1 to the stack,
STM (ER0-ER1), ＠ -SP
Is used. The data is written to the stack in the order of ER0 and ER1, and the stack pointer (ER7) is incremented by +8. The register designation part in the instruction code is the register number to be saved first.

図２９、３０に複数レジスタの待避命令の実行シーケンスを示す。例えば、ＳＴＭ．ＬＥＲ０−ＥＲ１，＠−ＳＰなどの２本の汎用レジスタを待避する例である。レジスタ指定フィールドはＢ’０００である（Ｂ’は２進数を示す）。 29 and 30 show an execution sequence of a save instruction of a plurality of registers. For example, in STM. This is an example in which two general-purpose registers such as LER0-ER1 and $ -SP are saved. The register designation field is B'000 (B 'indicates a binary number).

前記同様に、Ｔ２から除算命令の実行が開始される。特に制限はされないものの、命令コードの第１ワードはプリフィックスコードであり、次の命令コードの動作を指定し、ＰＣをインクリメントするほかの動作は行なわない。 As described above, the execution of the division instruction is started from T2. Although not particularly limited, the first word of the instruction code is a prefix code, specifies the operation of the next instruction code, and does not perform any other operation for incrementing the PC.

第２ワードの命令コードは、ＰＵＳＨ命令と共通にされる。 The instruction code of the second word is made common to the PUSH instruction.

Ｔ４のφで、ＳＰの内容をＧＢに読み出し、ＡＬＵ２６に入力する。ＡＬＵ２６では−４の演算を行なう。なお、前記の通り、実行前のＳＰはスタックの先頭アドレスを示しているとする。 At φ of T4, the contents of the SP are read out to GB and input to the ALU 26. The ALU 26 performs the operation of -4. As described above, it is assumed that the SP before execution indicates the top address of the stack.

Ｔ４のφ＃で演算結果がＷＢとＧＢに出力される。ＷＢからＳＰに書き込まれ、ＧＢからＭＡＢ３３に格納される。ＭＡＢ３３の内容がＩＡＢに出力される。また、第１の制御信号ＢとＲＳＥＬ２（＝Ｂ’０００）とによって、待避されるレジスタが選択され、レジスタ制御信号Ｂが生成される。 The calculation result is output to WB and GB at φ # of T4. The data is written from WB to SP, and stored from GB to MAB 33. The contents of MAB 33 are output to IAB. The register to be saved is selected by the first control signal B and RSEL2 (= B'000), and the register control signal B is generated.

Ｔ５のφで、選択されたレジスタ（ＥＲ０）の内容がＤＢ経由で、ＤＢＷ２４に転送される。 At φ of T5, the contents of the selected register (ER0) are transferred to the DBW 24 via the DB.

Ｔ５のφ＃で、転送されたデータ（ＥＲ０の内容）の上位１６ビット（Ｅレジスタの内容）が内部データバスに出力される。また、ＭＡＢ３３のインクリメント機能によって、ＩＡＢの出力値を＋２とする。 At φ # of T5, the upper 16 bits (contents of the E register) of the transferred data (contents of ER0) are output to the internal data bus. Further, the output value of IAB is set to +2 by the increment function of MAB33.

Ｔ６のφで、更に、ＳＰの内容をＧＢに読み出し、ＡＬＵ２６に入力する。ＡＬＵ２６では−４の演算を行なう。 At φ of T6, the contents of the SP are further read to GB and input to the ALU 26. The ALU 26 performs the operation of -4.

Ｔ６のφ＃で、ＤＢＷ２４に転送されたデータの下位１６ビット（Ｒレジスタの内容）が内部データバスに出力される。ＡＬＵ２６の演算結果がＷＢとＧＢに出力される。ＷＢからＳＰに書き込まれ、ＧＢからＭＡＢに格納される。ＭＡＢ３３の内容がＩＡＢに出力される。また、第２の制御信号ＢによってＲＳＥＬのビット０が反転される。第１の制御信号とＲＳＥＬ２（＝Ｂ’００１）とによって、待避されるレジスタが選択され、レジスタ制御信号Ｂが生成される。 At φ # of T6, the lower 16 bits (contents of the R register) of the data transferred to the DBW 24 are output to the internal data bus. The operation result of the ALU 26 is output to WB and GB. The data is written from WB to SP and stored from GB to MAB. The contents of MAB 33 are output to IAB. Also, bit 0 of RSEL is inverted by the second control signal B. The register to be saved is selected by the first control signal and RSEL2 (= B'001), and the register control signal B is generated.

Ｔ７のφで、選択されたレジスタ（ＥＲ１）の内容がＤＢ経由で、ＤＢＷ２４に転送される。 At φ of T7, the contents of the selected register (ER1) are transferred to the DBW 24 via the DB.

Ｔ７のφ＃で、転送されたデータ（ＥＲ１の内容）の上位１６ビット（Ｅレジスタの内容）が内部データバスに出力される。また、ＭＡＢ３３のインクリメント機能によって、ＩＡＢの出力値を＋２とする。 At φ # of T7, the upper 16 bits (contents of the E register) of the transferred data (contents of ER1) are output to the internal data bus. Further, the output value of IAB is set to +2 by the increment function of MAB33.

Ｔ８のφ＃で、ＤＢＷ２４に転送されたデータの下位１６ビット（Ｒレジスタの内容）が内部データバスに出力される。 At φ # of T8, the lower 16 bits (contents of the R register) of the data transferred to the DBW 24 are output to the internal data bus.

Ｔ８のφ＃以降で、前記同様に、次の次の命令の読み出しと、ＰＣ３０のインクリメント（＋２）を行なう。 After φ # at T8, the next next instruction is read and the PC 30 is incremented (+2) in the same manner as described above.

レジスタ３本を指定した場合は、実行ステート数が２ステート長くなり、ＳＰのデクリメント（−４）と、ＲＳＥＬのビット１が反転される。ＲＳＥＬは、レジスタ指定フィールドが０００の場合、０１０とされ、汎用レジスタＥＲ２が選択される。ライト動作が２回行われる。 When three registers are specified, the number of execution states increases by two states, and the SP decrement (−4) and the bit 1 of the RSEL are inverted. RSEL is set to 010 when the register designation field is 000, and the general-purpose register ER2 is selected. The write operation is performed twice.

レジスタ４本を指定した場合は、更に、実行ステート数が２ステート長くなり、ＳＰのデクリメント（−４）と、ＲＳＥＬのビット１とビット０が反転される。ＲＳＥＬは、レジスタ指定フィールドが０００の場合、０１１とされ、汎用レジスタＥＲ２３が選択される。ライト動作が２回行われる。 When four registers are specified, the number of execution states is further increased by two states, and the SP decrement (−4) and the bit 1 and bit 0 of RSEL are inverted. RSEL is set to 011 when the register designation field is 000, and the general-purpose register ER23 is selected. The write operation is performed twice.

レジスタ番号の下位ビットが固定であるので、これを命令処理の実行に従って、変更させることが容易である。例えば、２本のレジスタを待避する場合、命令コード上のレジスタ指定フィールドの下位ビットは０であるので、１回めのレジスタ指定は、レジスタ指定フィールドの値に従い、２回のレジスタ指定は、ＣＯＮＴ２２の制御に従って、レジスタ指定フィールドの下位１ビットを１に変更して、行なうようにする。 Since the lower bits of the register number are fixed, it is easy to change this in accordance with the execution of the instruction processing. For example, when saving two registers, the lower bit of the register specification field on the instruction code is 0, so the first register specification is performed according to the value of the register specification field, and the second register specification is performed using CONT22. Is performed by changing the lower 1 bit of the register designation field to 1 in accordance with the control of (1).

一方、ＰＵＳＨ命令はレジスタ１本の待避であり、前記の２回目の待避動作を行なわないようにされ、実行動作の共通化を図っている。 On the other hand, the PUSH instruction saves one register, so that the second save operation is not performed, and the execution operation is shared.

図３１、３２に複数レジスタの復帰命令の実行シーケンスを示す。例えば、ＬＤＭ．Ｌ＠ＥＲ７＋，ＥＲ０−ＥＲ１などの２本の汎用レジスタを待避する例である。レジスタ指定フィールドは００１である。 FIGS. 31 and 32 show the execution sequence of the return instruction of a plurality of registers. For example, LDM. In this example, two general registers such as L @ ER7 + and ER0-ER1 are saved. The register designation field is 001.

図３３、３４に、ＲＳＥＬ２入力制御回路の具体的な構成、およびその動作説明を示す。この制御回路は、アンド回路７５Ａ、７５Ｂ、オア回路８０Ａ、８０Ｂから構成される。 33 and 34 show a specific configuration of the RSEL2 input control circuit and an operation description thereof. This control circuit includes AND circuits 75A and 75B and OR circuits 80A and 80B.

ビット２には、オペコードのレジスタ指定フィールドのビット２がそのまま入力される。ビット１、０には、オアゲートとアンドゲートを介して入力される。オアゲートの他方の入力はＳＴＭ制御信号１、０であり、アンドゲートの他方の入力はＬＤＭ制御信号１、０の反転とされる。ＳＴＭ制御信号１、０およびＬＤＭ制御信号１、０は、ＣＯＮＴ２２の出力である制御信号Ｂに含まれる。 In the bit 2, the bit 2 of the register designation field of the operation code is input as it is. Bits 1 and 0 are input via an OR gate and an AND gate. The other input of the OR gate is the STM control signal 1, 0, and the other input of the AND gate is the inverse of the LDM control signal 1, 0. The STM control signals 1 and 0 and the LDM control signals 1 and 0 are included in the control signal B that is the output of the CONT 22.

ＳＴＭ制御信号が活性状態になると、当該ＲＳＥＬビットは１になる。また、ＬＤＭ制御信号が活性状態になると、当該ＲＳＥＬビットは０になる。ＳＴＭ、ＬＤＭ命令と指定したレジスタ本数に従って、ＳＴＭ制御信号、ＬＤＭ制御信号が生成される。 When the STM control signal is activated, the RSEL bit becomes 1. When the LDM control signal is activated, the RSEL bit becomes 0. An STM control signal and an LDM control signal are generated according to the STM and LDM instructions and the designated number of registers.

これにより、レジスタ選択回路をそのほかの命令と共通化することができる。共通化によって、物理的規模の増加を抑止できる。 Thereby, the register selection circuit can be shared with other instructions. Commonization can suppress an increase in physical scale.

図３５、３６に、Ｃ言語で書かれた関数と、これをＣＰＵの命令に変換したリストの概略を示す。このリストには、オフセット（相対アドレス）、命令コード、Ｃラベル、Ｃソース及びアセンブラ命令の各項目が示されている。 35 and 36 schematically show functions written in the C language and lists converted from the functions into CPU instructions. This list shows items of offset (relative address), instruction code, C label, C source, and assembler instruction.

Ｃ言語からＣＰＵの命令へのコンパイルについては、例えば、平成４年９月（株）日立製作所発行『Ｈ８／３００シリーズＣコンパイラ』に記載されている。引数を汎用レジスタＥＲ０、ＥＲ１に設定しておくことができる。 Compilation from C language to CPU instructions is described in, for example, "H8 / 300 Series C Compiler" issued by Hitachi, Ltd., September, 1992. Arguments can be set in general-purpose registers ER0 and ER1.

関数Ｐｒｏｃ１では、引数をレジスタ渡しとし、これをＥＲ０に割り当てている。関数内の処理で、ＥＲ２、３、４、６を使用するため、関数処理の先頭で、
ＳＴＭ（ＥＲ２−ＥＲ３），＠−ＳＰ
ＳＴＭ（ＥＲ４−ＥＲ６），＠−ＳＰ
を実行して、関数の最後で、
ＬＤＭ＠ＳＰ＋，（ＥＲ４−ＥＲ６）
ＬＤＭ＠ＳＰ＋，（ＥＲ２−ＥＲ３）
を実行して、サブルーチンからリターン（ＲＴＳ）している。 In the function Proc1, an argument is passed to a register and is assigned to ER0. Since ER2, ER3, ER4, ER6 are used in the processing within the function, at the beginning of the function processing,
STM (ER2-ER3), ＠ -SP
STM (ER4-ER6), ＠ -SP
And at the end of the function,
LDM @ SP +, (ER4-ER6)
LDM @ SP +, (ER2-ER3)
And returns from the subroutine (RTS).

ＥＲ０、ＥＲ１は引数領域のため、関数内では使用せず、内容の待避／復帰も行なわない。 Since ER0 and ER1 are argument areas, they are not used in the function and the contents are not saved / restored.

また、この関数内で呼び出される関数Ｐｒｏｃ３は、引数をレジスタ渡しとし、これをＥＲ０に割り当てている。関数内の処理で、ＥＲ５を使用するため、関数処理の先頭で、１レジスタの待避
ＰＵＳＨ．ＬＥＲ５
を実行して、関数の最後で、
ＰＯＰ．ＬＥＲ５
を実行して、サブルーチンからリターン（ＲＴＳ）している。 The function Proc3 called in this function passes an argument by register and assigns it to ER0. Since ER5 is used in the processing in the function, one register is saved at the beginning of the function processing. L ER5
And at the end of the function,
POP. L ER5
And returns from the subroutine (RTS).

スタックポインタはＥＲ７と兼用であるから、ＥＲ７を待避／復帰することは意味がない。従って、タスク切替えを行なう場合に使用可能なすべてのレジスタを待避する場合には、
ＳＴＭ＠ＳＰ＋，（ＥＲ０−ＥＲ３）
ＳＴＭ＠ＳＰ＋，（ＥＲ４−ＥＲ６）
の２命令を用いる。ＥＲ０からＥＲ６の順番でスタックに待避される。同様に、復帰する場合には、
ＬＤＭ＠ＳＰ＋，（ＥＲ４−ＥＲ６）
ＬＤＭ＠ＳＰ＋，（ＥＲ０−ＥＲ３）
の２命令を用いる。ＥＲ６からＥＲ０の順番でスタックから復帰される。 Since the stack pointer is also used as ER7, it is meaningless to save / restore ER7. Therefore, when saving all the registers that can be used when performing task switching,
STM @ SP +, (ER0-ER3)
STM @ SP +, (ER4-ER6)
Are used. It is saved on the stack in the order of ER0 to ER6. Similarly, when returning,
LDM @ SP +, (ER4-ER6)
LDM @ SP +, (ER0-ER3)
Are used. Return from the stack in the order of ER6 to ER0.

前記のように任意の組み合わせを指定できないが、予め、レジスタの割当てを行っておくことにより、実質的な制約にはなりにくい。７本のレジスタを待避／復帰する場合に２命令を用いることになるが、全体的な実行ステート数やプログラム容量に対しては影響が小さい。少なくとも、１本のレジスタずつの待避／復帰命令を用いるより効果がある。後者の場合、４バイト×７、５ステート×７であるのに対して、前者では、４バイト×２、９＋１１ステートで実行できる。少なくとも、命令リードのためのリードサイクルや、アドレス計算のための内部動作の分を短縮して、高速化を図ることができる。 Although an arbitrary combination cannot be specified as described above, by allocating registers in advance, it is unlikely to be a substantial constraint. Two instructions are used to save / restore the seven registers, but this has little effect on the total number of execution states and the program capacity. At least, it is more effective to use the save / restore instruction for each register. In the latter case, 4 bytes × 7, 5 states × 7, whereas in the former, 4 bytes × 2, 9 + 11 states. At least, a read cycle for instruction read and an internal operation for address calculation can be shortened to achieve high speed.

前記Ｃ言語で書かれたプログラムのように、関数乃至サブルーチンを多く用いるプログラムの高速化を実現することができる。 Like a program written in the C language, it is possible to realize a high-speed program using many functions or subroutines.

また、上記のような関数乃至サブルーチンの場合のほかに、割り込み処理ルーチンにおいても、同様のレジスタの待避／復帰を行なう必要がある。マイクロコンピュータが機器制御などを行なう場合には、割り込み処理については、割り込みのイベントが発生してから、実際の割り込み処理を実行するまでの時間を短縮することによって、リアルタイム制御性を向上することができる。複数レジスタの待避を高速に実行可能にすることにより、かかるリアルタイム制御性の向上に効果がある。 In addition to the above-described functions and subroutines, it is necessary to perform the same save / restore of registers in an interrupt processing routine. When a microcomputer performs device control, etc., it is possible to improve real-time controllability by reducing the time from the occurrence of an interrupt event to the actual execution of interrupt processing. it can. By making it possible to save a plurality of registers at high speed, it is effective in improving such real-time controllability.

また、固定の組み合わせにし、各命令の実行ステート数を固定にすることにより、内部の条件分岐を行なうことをなくし、内部論理を簡潔にし、論理規模を縮小できる。マイクロプログラムによらず、ワイアードロジックなどでも容易に実現できる。マイクロプログラムによらず、ワイアードロジックなどとすることにより、論理回路の高速化に寄与することができる。特に、Ｃ言語など関数乃至サブルーチンを多く用いるプログラムを高速に実行することができる。 Further, by using a fixed combination and fixing the number of execution states of each instruction, it is possible to eliminate internal conditional branching, simplify the internal logic, and reduce the logic scale. It can be easily realized by wired logic or the like without using a microprogram. By using wired logic or the like regardless of the microprogram, it is possible to contribute to speeding up of a logic circuit. In particular, a program that uses many functions or subroutines such as the C language can be executed at high speed.

図３７、３８に、割込み例外処理のシーケンスを示す。 37 and 38 show the sequence of the interrupt exception handling.

図３９に、例外処理の状態遷移図を示す。 FIG. 39 shows a state transition diagram of the exception processing.

前記同様に、Ｔ２から割り込み例外処理の実行が開始される。プリフェッチした命令はキャンセルされ、図示されない割り込み要求信号に呼応して、ＣＯＮＴ２２の入力が切り換えられる。 In the same manner as described above, the execution of the interrupt exception handling is started from T2. The prefetched instruction is canceled, and the input of the CONT 22 is switched in response to an interrupt request signal (not shown).

ステップ１の動作として、ＰＣ３０のデクリメントを行なう。Ｔ２のφ＃で、ＰＣ３０の内容を読み出して、ＧＢ経由で、ＩＮＣ２７でデクリメント（−４）を行なう。これはプリフェッチをキャンセルしたことに対応して、待避すべきＰＣ３０の値を算出する。 As an operation of step 1, the PC 30 is decremented. At φ2 of T2, the contents of the PC 30 are read, and the decrement (−4) is performed by the INC 27 via GB. This calculates the value of the PC 30 to be saved in response to the cancellation of the prefetch.

Ｔ３のφで、デクリメントした結果を、ＷＢ経由で一旦ＰＣ３０に格納する。 The result of the decrement at φ of T3 is temporarily stored in the PC 30 via WB.

ステップ２で、ＳＰをデクリメントし、この内容をアドレスとして、ＰＣ３０の内容をデータとして、ライト動作を行なう。即ち、Ｔ３のφで、同時に、ＳＰの内容を読み出して、ＧＢ経由でＡＬＵ２６でデクリメント（−２）を行なう。 In step 2, the SP is decremented, and a write operation is performed using the contents as an address and the contents of the PC 30 as data. That is, at φ of T3, the contents of SP are read out at the same time, and the ALU 26 decrements (−2) via GB.

Ｔ３のφ＃で、デクリメントした結果を、ＷＢ経由でＳＰに格納するとともに、ＧＢ経由でＭＡＢ３３に転送し、ＩＡＢに出力させる。 At φ # of T3, the decremented result is stored in the SP via the WB, transferred to the MAB 33 via the GB, and output to the IAB.

Ｔ４のφで、ＰＣ３０の内容をＤＢ経由でＤＢＷ２４に転送する。ＤＢＷ２４の内容は、Ｔ４のφ＃から、内部データバスに出力される。 At φ of T4, the contents of the PC 30 are transferred to the DBW 24 via the DB. The contents of the DBW 24 are output from φ # of T4 to the internal data bus.

ステップ３で、ＳＰをデクリメントし、この内容をアドレスとして、ＰＣ３０の上位８ビットとＣＣＲ３１の内容をデータとして、ライト動作を行なう。 In step 3, the SP is decremented, and the write operation is performed using the contents as an address and the upper 8 bits of the PC 30 and the contents of the CCR 31 as data.

Ｔ４のφで、同時に、ＳＰの内容を読み出して、ＧＢ経由でＡＬＵ２６でデクリメント（−２）を行なう。 At φ of T4, the contents of the SP are read at the same time, and the ALU 26 decrements (−2) via GB.

Ｔ４のφ＃で、デクリメントした結果を、ＷＢ経由でＳＰに格納するとともに、ＧＢ経由でＭＡＢ３３に転送し、ＩＡＢに出力させる。ＤＢＷ２４に保持したＰＣ３０の内容下位１６ビットを内部データバスに出力する。 At φ # of T4, the decremented result is stored in SP via WB, transferred to MAB 33 via GB, and output to IAB. The lower 16 bits of the content of the PC 30 held in the DBW 24 are output to the internal data bus.

Ｔ５のφで、ＣＣＲ３１の内容をＤＢ経由でＤＢＷ２４に転送する。Ｔ４で格納したＰＣ３０の上位８ビットは保持される。ＤＢＷ２４の内容は、Ｔ５のφ＃から、内部データバスに出力される。 At φ of T5, the contents of the CCR 31 are transferred to the DBW 24 via the DB. The upper 8 bits of the PC 30 stored at T4 are held. The contents of the DBW 24 are output to the internal data bus from φ # of T5.

ＩＮＴＭ１信号が非活性状態であれば、ステップ４に遷移する。ＩＮＴＭ１信号が活性状態であれば、ステップ１２に遷移し、ＳＰをデクリメントし、この内容をアドレスとして、ＥＸＲの内容をデータとして、ライト動作を行なう。 If the INTM1 signal is in the inactive state, the process proceeds to step 4. If the INTM1 signal is in the active state, the flow goes to step 12 to decrement SP and perform a write operation using this content as an address and the EXR content as data.

Ｔ５のφで、ＳＰの内容を読み出して、ＧＢ経由でＡＬＵ２７でデクリメント（−２）を行なう。 At φ of T5, the contents of SP are read, and decrement (-2) is performed by ALU 27 via GB.

Ｔ５のφ＃で、デクリメントした結果を、ＷＢ経由でＳＰに格納する。 At φ5 of T5, the result of the decrement is stored in the SP via WB.

Ｔ６のφで、ＥＸＲの内容をＤＢ経由でＤＢＷ２４に転送する。Ｔ６のφ＃から、内部データバスに出力される。 At φ of T6, the contents of EXR are transferred to DBW 24 via DB. From φ # of T6, it is output to the internal data bus.

ステップ４で、ベクタアドレスの内容をリードする。 In step 4, the contents of the vector address are read.

Ｔ５のφ＃で、同時に、ＶＡＧの内容をＧＢ経由でＭＡＢ３３に転送し、ＩＡＢに出力させる。ＶＡＧには、図示されない、割り込みコントローラから与えられるベクタ番号に基づいて、ベクタアドレスを生成する。 At φ # of T5, the contents of VAG are simultaneously transferred to MAB 33 via GB and output to IAB. The VAG generates a vector address based on a vector number provided from an interrupt controller (not shown).

ステップ５で、ベクタアドレスのリード動作の終了を待つ。 In step 5, the process waits for the end of the vector address read operation.

ステップ６で、ＤＢＲ２５に格納した、ベクタアドレスの内容をアドレスとして、命令のリードを行なう。ＤＢＲ２５の内容をインクリメントし、ＰＣ３０に格納する。 In step 6, an instruction is read using the contents of the vector address stored in the DBR 25 as an address. The contents of the DBR 25 are incremented and stored in the PC 30.

Ｔ８のφ＃で、ＤＢＲ２５に格納したベクタアドレスのリード内容（分岐先の先頭アドレス）をＧＢ経由で、ＭＡＢ３３に転送し、ＩＡＢに出力させ、ＡＬＵ２６でインクリメント（＋２）する。 At φ # of T8, the read content (start address of the branch destination) of the vector address stored in the DBR 25 is transferred to the MAB 33 via GB, output to the IAB, and incremented (+2) by the ALU 26.

Ｔ９のφで、インクリメントした結果を、ＷＢ経由でＰＣ３０に格納する。 At φ of T9, the incremented result is stored in the PC 30 via WB.

ステップ７で、ＰＣ３０の内容をアドレスとして、命令のリードを行い、ＰＣ３０のインクリメントを行なう。 In step 7, an instruction is read using the contents of the PC 30 as an address, and the PC 30 is incremented.

Ｔ９のφ＃で、ＰＣ３０の内容（分岐先の先頭アドレス）をＧＢ経由で、ＭＡＢ３３に転送し、ＩＡＢに出力させ、ＡＬＵ２６でインクリメント（＋２）する。リードした命令をＩＲ２１に格納する。 At φ # of T9, the contents of the PC 30 (the leading address of the branch destination) are transferred to the MAB 33 via GB, output to the IAB, and incremented (+2) by the ALU 26. The read instruction is stored in the IR 21.

Ｔ１０のφで、インクリメントした結果を、ＷＢ経由でＰＣ３０に格納する。 At φ of T10, the increment result is stored in the PC 30 via WB.

次の命令の実行を開始させる。 Starts execution of the next instruction.

制御信号ＩＮＴＭ１に従って、ステップ１２を行なうか、行なわないかが選択され、スタックを２回行なうか、３回行なうかが選択される。スタックを２回行なう場合には、ＰＣとＣＣＲ３１のみが待避される。ＳＰは−４となる。上記Ｔ５の動作に相当する部分（ステップ１２）が実行されない。３回行なう場合には、ＰＣ３０とＣＣＲ３１及びＥＸＲが待避される。ＳＰは−６となる。 According to control signal INTM1, whether to perform step 12 or not is selected, and whether to perform stacking twice or three times is selected. When stacking is performed twice, only the PC and the CCR 31 are saved. SP is -4. The part corresponding to the operation of the above T5 (step 12) is not executed. When the operation is performed three times, the PC 30, CCR 31, and EXR are saved. SP is -6.

なお、ステップ１単位の動作が複数ステートにまたがっているのは、１つのＣＯＮＴ２２の入力に対応して、複数の異なるタイミングの制御信号Ａ、Ｂ、Ｃ及びレジスタ選択信号Ａ、Ｂ、Ｃが生成されるのに対応する。 It should be noted that the operation in units of steps spans a plurality of states because a plurality of control signals A, B, C and register selection signals A, B, C at different timings are generated in response to the input of one CONT 22. Corresponding to being done.

図４０、４１に、例外処理後のスタックの状態を示す。図４１はノーマルモードを示し、図４２はアドバンストモードを示している。 40 and 41 show the state of the stack after the exception processing. FIG. 41 shows the normal mode, and FIG. 42 shows the advanced mode.

図４２に、ＲＴＥ命令の実行シーケンスを示す。 FIG. 42 shows an execution sequence of the RTE instruction.

図４３に、例外処理の状態遷移図を示す。 FIG. 43 shows a state transition diagram of the exception processing.

前記同様に、Ｔ２からＲＴＥ命令の実行が開始される。 As described above, the execution of the RTE instruction is started from T2.

ステップ１の動作として、ＳＰの内容をアドレスとして、スタックのリードを行なう。Ｔ２のφ＃で、ＳＰの内容を読み出して、ＧＢ経由で、ＭＡＢ３３に転送し、ＩＡＢに出力させる。 As an operation of step 1, the stack is read using the contents of the SP as an address. At φ # of T2, the contents of the SP are read, transferred to the MAB 33 via GB, and output to the IAB.

Ｔ３のφで、ＳＰの内容を読み出して、ＧＢ経由で、ＡＬＵ２７でインクリメント（＋２）する。ＩＡＢのアドレスでスタックをリードする。Ｔ３のφ＃で、リードした内容をＤＢＲ２５に格納する。 At φ of T3, the contents of the SP are read and incremented (+2) by the ALU 27 via GB. Read the stack with the IAB address. At φ # of T3, the read contents are stored in the DBR 25.

ＩＮＴＭ１信号が非活性状態であれば、ステップ２に遷移する。ＩＮＴＭ１信号が活性状態であれば、ステップ１０に遷移し、リードした結果をＥＸＲに格納する。ＳＰをインクリメントし、この内容でリードを行なう。 If the INTM1 signal is in the inactive state, the process proceeds to step 2. If the INTM1 signal is in the active state, the flow goes to step 10 to store the read result in EXR. SP is incremented, and reading is performed with this content.

Ｔ３のφ＃で、同時に、インクリメントした結果を、ＷＢ経由でＳＰに格納する。また、ＧＢを経由して、ＭＡＢ３３に転送し、ＩＡＢに出力させる。 At φ # of T3, the incremented result is simultaneously stored in SP via WB. Further, the data is transferred to the MAB 33 via the GB and output to the IAB.

Ｔ４のφで、ＤＢＲ２５の内容をＧＢに読み出し、これをＡＬＵ２７に入力する。Ｔ４のφ＃で、ＡＬＵ２７はＧＢから入力した内容をそのまま、ＷＢに出力し、ＥＸＲに格納する。 At φ of T4, the contents of the DBR 25 are read to GB and input to the ALU 27. At φ # of T4, the ALU 27 outputs the content input from GB as it is to WB and stores it in EXR.

ステップ２で、リードした結果をＣＣＲ３１に格納する。ＭＡＢ３３に格納した内容を、ＭＡＢ３３でインクリメントさせる。この内容でリードを行なう。なお、ＭＡＢ３３のインクリメント機能は、特開平４−３３３１５３号公報などに記載されている。 In step 2, the read result is stored in the CCR 31. The contents stored in the MAB 33 are incremented by the MAB 33. A read is performed with these contents. The increment function of the MAB 33 is described in, for example, JP-A-4-333153.

Ｔ５のφで、ＤＢＲ２５の内容をＧＢに読み出し、これをＡＬＵ２７に入力する。Ｔ５のφ＃で、ＡＬＵ２７はＧＢから入力した内容をそのまま、ＷＢに出力し、ＣＣＲ３１に格納する。 At φ of T5, the contents of the DBR 25 are read to GB, and this is input to the ALU 27. At φ # of T5, the ALU 27 outputs the content input from the GB as it is to the WB and stores it in the CCR 31.

ステップ３で、ＳＰの内容をインクリメント（＋４）する。 In step 3, the content of the SP is incremented (+4).

Ｔ６のφで、ＳＰの内容を読み出して、ＧＢ経由で、ＡＬＵ２６でインクリメント（＋４）する。 At φ of T6, the contents of the SP are read and incremented (+4) by the ALU 26 via GB.

Ｔ６のφ＃で、インクリメントした結果を、ＷＢ経由でＳＰに格納する。 At φ6 of T6, the increment result is stored in the SP via WB.

ステップ４で、ＤＢＲ２５に格納した、スタックから復帰したＰＣ３０の内容をアドレスとして、命令のリードを行なう。ＤＢＲ２５の内容をインクリメントし、ＰＣ３０に格納する。 In step 4, an instruction is read using the contents of the PC 30 restored from the stack stored in the DBR 25 as an address. The contents of the DBR 25 are incremented and stored in the PC 30.

Ｔ６のφ＃で、同時に、ＤＢＲ２５に格納したベクタアドレスのリード内容（分岐先の先頭アドレス）をＧＢ経由で、ＭＡＢに転送し、ＩＡＢに出力させ、ＡＬＵでインクリメント（＋２）する。 At φ # of T6, at the same time, the read contents of the vector address (the leading address of the branch destination) stored in the DBR 25 are transferred to the MAB via GB, output to the IAB, and incremented (+2) by the ALU.

Ｔ７のφで、インクリメントした結果を、ＷＢ経由でＰＣ３０に格納する。 At φ of T7, the increment result is stored in the PC 30 via WB.

ステップ５で、ＰＣ３０の内容をアドレスとして、命令のリードを行い、ＰＣ３０のインクリメントを行なう。 In step 5, an instruction is read using the contents of the PC 30 as an address, and the PC 30 is incremented.

Ｔ７のφ＃で、ＰＣ３０の内容（分岐先の先頭アドレス）をＧＢ経由で、ＭＡＢ３３に転送し、ＩＡＢに出力させ、ＡＬＵ２６でインクリメント（＋２）する。リードした命令をＩＲに格納する。 At φ # of T7, the contents of the PC 30 (the leading address of the branch destination) are transferred to the MAB 33 via GB, output to the IAB, and incremented (+2) by the ALU 26. The read instruction is stored in the IR.

Ｔ８のφで、インクリメントした結果を、ＷＢ経由でＰＣ３０に格納する。次の命令の実行を開始させる。 At φ of T8, the increment result is stored in the PC 30 via WB. Starts execution of the next instruction.

例えば、ＩＮＴＭ１信号が０レベルの場合には、前記従来ＣＰＵ（例えば、前記平成５年６月（株）日立製作所発行『Ｈ８／３００Ｈシリーズプログラミングマニュアル』に記載のＣＰＵ）と同一のスタックの構造とされる。命令コードが共通であることと相俟って、従来ＣＰＵによって書かれたプログラムをそのまま実行することができる。 For example, when the INTM1 signal is at the 0 level, the same stack structure as that of the conventional CPU (for example, the CPU described in "H8 / 300H Series Programming Manual" issued by Hitachi, Ltd., June 1993) is used. Is done. The program written by the conventional CPU can be directly executed in combination with the common instruction code.

新たな、コンディションコードや割込みマスクビットやトレースビットなどを追加する場合には、これに対応したプログラムを作成することになるから、スタックの構造が異なっても実質的な問題はない。割込みマスクビットを追加するなどして、使い勝手を向上することができる。 When a new condition code, interrupt mask bit, trace bit, or the like is added, a program corresponding to this is created, so that there is no substantial problem even if the stack structure is different. Usability can be improved by adding an interrupt mask bit or the like.

なお、前記の通りＩＮＴＭ１ビットがＳＹＳＣＲに存在し、このビットの状態がＩＮＴＭ１信号に反映されるようになっている。リセット後に、かかるＳＹＳＣＲの設定を行なうことにより、ＥＸＲを使用するかしないかが選択される。 As described above, the INTM1 bit exists in the SYSCR, and the state of this bit is reflected in the INTM1 signal. After the reset, the setting of the SYSCR is performed to select whether to use the EXR.

図４４にＣＯＮＴ２２の一部の論理を示す。このＣＯＮＴ２２は、アンド回路８６Ａ〜８６Ｄによって構成される。 FIG. 44 shows a part of the logic of the CONT 22. The CONT 22 includes AND circuits 86A to 86D.

ＥＸＲを使用しない、すなわち、ＩＮＴＭ１ビットを”０”にクリアすると、ＥＸＲのビットは全て”０”とみなされ、設定値は、無視されるようにされる。 When EXR is not used, that is, when the INTM1 bit is cleared to “0”, all bits of EXR are regarded as “0” and the set value is ignored.

次に、図４におけるエミュレーション用インタフェース３９に含まれる制御レジスタ４１の構成を示す。この制御レジスタ４１は、以下説明するように、（１）ＡＳＥコントロールレジスタＤ（ＡＳＥＣＲＤ）、（２）ブレークコントロールレジスタＡＢ（ＢＲＣＲＡ、Ｂ）、（３）ブレークアドレスレジスタＡ、Ｂ（ＢＡＲＡ、Ｂ）、（４）ブレークアドレスマスクレジスタＡ、Ｂ（ＢＡＭＲＡ、Ｂ）、および（５）ＡＳＥ専用スタックレジスタ（ＢＲＫＳＴＫＲ）から構成されている。 Next, the configuration of the control register 41 included in the emulation interface 39 in FIG. 4 will be described. As described below, this control register 41 includes (1) ASE control register D (ASECRD), (2) break control register AB (BRCR, B), and (3) break address register A, B (BARA, B). ), (4) Break address mask register A, B (BAMRA, B), and (5) ASE dedicated stack register (BRKSTKR).

図４５に、（１）ＡＳＥコントロールレジスタＤ（ＡＳＥＣＲＤ）の構成を示す。このレジスタは８ビットリード／ライト可能なレジスタで、シングルステップの設定、ＲＴＢ命令実行後の割込制御、多重ブレークの許可禁止、ウインドウ機能を指定する。各ビットの内容を表２１乃至表２４に示す。 FIG. 45 shows the configuration of (1) ASE control register D (ASECRD). This register is an 8-bit readable / writable register, and specifies single step setting, interrupt control after execution of an RTB instruction, permission / prohibition of multiple breaks, and a window function. Tables 21 to 24 show the contents of each bit.

［表２１］

[Table 21]

［表２２］

[Table 22]

［表２３］

[Table 23]

［表２４］

[Table 24]

図４６に、（２）ブレークコントロールレジスタＡＢ（ＢＲＣＲＡ、Ｂ）の構成を示す。このレジスタは、（ａ）ＢＲＣＲＡ、（ｂ）ＢＲＣＲＢからなり、各々は８ビットのリード／ライトが可能なレジスタで、それぞれＰＣブレークのチャネルＡ、Ｂの制御を行なう。各ビットの内容を表２５乃至表２８に示す。 FIG. 46 shows the configuration of (2) Break Control Register AB (BRCR, B). This register is composed of (a) BRCRA and (b) BRCRB, each of which is an 8-bit readable / writable register and controls the PC break channels A and B, respectively. Tables 25 to 28 show the contents of each bit.

［表２５］

[Table 25]

［表２６］

[Table 26]

［表２７］

[Table 27]

［表２８］

[Table 28]

図４７に、（３）ブレークアドレスレジスタＡ、Ｂ（ＢＡＲＡ、Ｂ）の構成を示す。このレジスタは、（ａ）ＢＡＲＡ、（ｂ）ＢＡＲＢからなり、各々は３２ビットのリード／ライトが可能なレジスタで、それぞれＰＣブレークのチャネルＡ、Ｂのアドレスを指定する。３２ビットのレジスタをバイトサイズに分割して、ＢＡＲＲ、Ｅ、Ｈ、Ｌと表記される場合もある。最上位のＢＡＲＲはリザーブされている。リードすると不定値が読み出される。ライトは無効である。 FIG. 47 shows the configuration of (3) break address registers A and B (BARA, B). This register is composed of (a) BARA and (b) BARB, each of which is a 32-bit readable / writable register and specifies the address of a channel A or B of a PC break, respectively. A 32-bit register may be divided into byte sizes and denoted as BARR, E, H, and L. The top BARR is reserved. When read, an undefined value is read. Light is invalid.

図４８に、（４）ブレークアドレスマスクレジスタＡ、Ｂ（ＢＡＭＲＡ、Ｂ）の構成を示す。このレジスタは、（ａ）ＢＡＭＲＡ、（ｂ）ＢＡＭＲＢからなり、各々は３２ビットのリード／ライトが可能なレジスタで、それぞれＰＣブレークのチャネルＡ、Ｂのアドレス比較のマスクを行なうビットを指定する。ＢＡＭＲのビットを”１”にセットすると、このビットに対応するアドレスのビットは、アドレス比較対象から除外される。３２ビットのレジスタをバイトサイズに分割して、ＢＡＭＲＲ、Ｅ、Ｈ、Ｌと表記される場合もある。最上位のＢＡＲＲはリザーブされている。リードすると不定値が読み出される。ライトは無効である。 FIG. 48 shows the configuration of (4) Break address mask registers A and B (BAMRA, B). This register comprises (a) BAMRA and (b) BAMRB, each of which is a 32-bit readable / writable register and specifies a bit for masking the address comparison of channels A and B of a PC break, respectively. When the bit of BAMR is set to "1", the bit of the address corresponding to this bit is excluded from the address comparison target. In some cases, a 32-bit register is divided into byte sizes and denoted as BAMRR, E, H, and L. The top BARR is reserved. When read, an undefined value is read. Light is invalid.

図４９に、（５）ＡＳＥ専用スタックレジスタ（ＢＲＫＳＴＫＲ）の構成を示す。このレジスタは、６バイト（４８ビット）のリード／ライトが可能なレジスタで、ユーザモード⇔ブレークモードの遷移時に、スタック領域として使用する。ユーザのＳＰは使用せず、保持される。スタックされるリソースおよびスタックの構造は、ＭＣＵ動作モード（ノーマルモード／アドバンストモード）および、制御レジスタの設定（ＳＹＳＣＲのＩＮＴＭ１ビット）によって相違される。表２９にこのレジスタの使用方法を示す。 FIG. 49 shows the configuration of (5) ASE dedicated stack register (BRKSTKR). This register is a 6-byte (48-bit) readable / writable register, and is used as a stack area when transitioning from user mode to break mode. The user's SP is not used and is retained. The resources to be stacked and the structure of the stack differ depending on the MCU operation mode (normal mode / advanced mode) and the setting of the control register (INTM1 bit of SYSCR). Table 29 shows how to use this register.

［表２９］

[Table 29]

エミュレーション用ソフトウェアの実行状態への遷移（ブレーク）時には、固定アドレスのブレークスタックレジスタを使用するようにする。ブレーク例外処理や、ブレークからのリターン命令時には、ユーザのスタックポインタ（ＥＲ７）を使用せず、固定的なスタックアドレスを生成する。かかるスタックアドレスの生成はＥＭＬＳＰ２９による。 At the time of transition (break) to the execution state of the emulation software, a break stack register at a fixed address is used. At the time of break exception handling or a return instruction from a break, a fixed stack address is generated without using the user's stack pointer (ER7). The generation of such a stack address is performed by the EMLSP 29.

図５０に、ＥＭＬＳＰ２９の構成を示す。このＥＭＬＳＰ２９は、クロックトバッファで構成される。 FIG. 50 shows the configuration of the EMLSP 29. This EMLSP 29 is composed of a clocked buffer.

かかるクロックトバッファの内、ビット２３〜１０は１固定、ビット５、４、０は０固定、ビット９〜６は、外部からの指定を入力する。また、ビット３、２、１はＣＯＮＴ２２の制御信号を入力する。ＣＭＯＳ回路で構成する場合、必要に応じて論理反転を用いればよい。クロックトバッファの出力はＧＢに接続されている。また制御信号ｍは、ＣＯＮＴの制御信号とクロック（φ＃）の論理積信号である。 Of the clocked buffers, bits 23 to 10 are fixed at 1, bits 5, 4, and 0 are fixed at 0, and bits 9 to 6 are input from outside. Bits 3, 2, and 1 receive the control signal of CONT22. When a CMOS circuit is used, logical inversion may be used as needed. The output of the clocked buffer is connected to GB. The control signal m is a logical product signal of the control signal of CONT and the clock (φ #).

通常のレジスタ回路が、データを保持するためのラッチ回路を持たなければならないが、ＥＭＬＳＰ２９は、これを持たず、小型化を図っている。 A normal register circuit must have a latch circuit for holding data, but the EMLSP 29 does not have this and is trying to reduce the size.

従って、ブレークスタックレジスタの先頭アドレスは、Ｂ’００００００であって、６４ｋバイト単位で１６通りのアドレスを選択可能とされる。マイクロコンピュータの内部Ｉ／Ｏレジスタの配置によって、アドレスを変更できる。 Therefore, the start address of the break stack register is B'000000, and 16 addresses can be selected in units of 64 kbytes. The address can be changed depending on the arrangement of the internal I / O registers of the microcomputer.

ブレーク例外処理の実行シーケンスは、図３７、３８と同様であり、そこでのＳＰ（ＥＲ７）の読み出しに代わって、ＥＭＬＳＰ２９を読み出すようにする。
この場合、最初（Ｔ３のφ）は下位アドレスをＢ’０００１１０として、デクリメントした内容のＢ’０００１００がスタックのアドレスとされる。 The execution sequence of the break exception process is the same as that shown in FIGS. 37 and 38, and the EMLSP 29 is read instead of reading the SP (ER7) there.
In this case, at the beginning (φ of T3), the lower address is B'000110, and the decremented content B'000100 is set as the address of the stack.

２回目（Ｔ４のφ）は下位アドレスをＢ’０００１００として、デクリメントした内容のＢ’００００１０がスタックのアドレスとされる。 In the second time (φ of T4), the lower address is set to B'000100, and the decremented content B'000010 is set as the stack address.

３回目（Ｔ５のφ）は下位アドレスをＢ’００００１０として、デクリメントした内容のＢ’００００００がスタックのアドレスとされる。３回目は、ＩＮＴＭ信号が活性状態のときに有効である。読み出されるビット２、１は、ＣＯＮＴ２２の制御信号によって選択する。 For the third time (φ of T5), the lower address is set to B'000010, and the decremented content B'000000 is set as the address of the stack. The third time is effective when the INTM signal is in the active state. The bits 2 and 1 to be read are selected by a control signal of the CONT 22.

リターン命令の実行シーケンスは、図４２と同様であり、ＳＰ（ＥＲ７）の読み出しに代わって、ＥＭＬＳＰ２９を読み出すようにする。 The execution sequence of the return instruction is the same as that in FIG. 42, and the EMLSP 29 is read instead of reading the SP (ER7).

最初（Ｔ２のφ＃）はＩＮＴＭ１信号によって異なり、ＩＮＴＭ１信号が非活性状態であれば、下位アドレスをＢ’００００１０として、ＩＮＴＭ１信号が活性状態であれば、下位アドレスをＢ’００００００として、読み出す。これらがスタックのアドレスとなる。 The first (φ # of T2) differs depending on the INTM1 signal. When the INTM1 signal is inactive, the lower address is read as B'000010, and when the INTM1 signal is active, the lower address is read as B'000000. These are the addresses of the stack.

２回目（Ｔ３のφ）は、ＩＮＴＭ１信号が活性状態である場合に有効であり、下位アドレスをＢ’００００１０として、デクリメントした結果をアドレスとしてリードを行なう。 The second time (φ of T3) is effective when the INTM1 signal is in the active state. The lower address is set to B'000010, and the result of the decrement is read as the address.

３回目はＭＡＢ３３のインクリメントによってアドレスを生成し、ＥＭＬＳＰ２９は使用しない。 In the third time, an address is generated by incrementing the MAB 33, and the EMLSP 29 is not used.

これにより、固定的な出力回路として論理規模を縮小できる。ユーザに公開されない資源による論理規模の増大を最小限にすることができる。 Thereby, the logic scale can be reduced as a fixed output circuit. An increase in logical scale due to resources not disclosed to the user can be minimized.

図５１に、ブレーク例外処理の実行タイミングを示す。実行シーケンスは図３７、３８の例外処理タイミングと同様である。 FIG. 51 shows the execution timing of the break exception handling. The execution sequence is the same as the exception processing timing in FIGS.

前記同様に、Ｔ２から割り込み例外処理の実行が開始される。プリフェッチした命令はキャンセルされ、図示されないブレーク要求信号に呼応して、ＣＯＮＴ２２の入力が切り換えられる。 In the same manner as described above, the execution of the interrupt exception handling is started from T2. The prefetched instruction is canceled, and the input of the CONT 22 is switched in response to a break request signal (not shown).

Ｔ３のバス動作が行われない期間に、ブレークモードを示す信号ＢＲＫＡＫ＃が活性状態になる。 During a period in which the bus operation of T3 is not performed, the signal BRKAK # indicating the break mode is activated.

図５２に、ブレーク制御論理の回路構成を示す。この回路は、アンド回路８１Ａ乃至８１Ｇ、オア回路８２Ａ乃至８２Ｄ、フリップフロップ８３から構成されている。 FIG. 52 shows a circuit configuration of the break control logic. This circuit includes AND circuits 81A to 81G, OR circuits 82A to 82D, and a flip-flop 83.

ＣＰＵに対するブレーク要求は、３要因が存在する。第１はＢＲＫ端子による要求である。第２はアドレス比較Ａによる要求であり、これは、ＢＲＫＣＲのＢＩＥＡビットによって許可される。第３はアドレス比較Ｂによる要求であり、これは、ＢＲＫＣＲのＢＩＥＢビットによって許可される。なお、かかるアドレス比較は、前記の通り、（ＣＡ２３・ＡＲ２３＋¬ＣＡ２３・¬ＡＲ２３＋ＡＭＲ２３）・…・（ＣＡｎ・ＡＲｎ＋¬ＣＡｎ・¬ＡＲｎ＋ＡＭＲｎ）・…・（ＣＡ０・ＡＲ０＋¬ＣＡ０・¬ＡＲ０＋ＡＭＲ０）と表現される。（¬は論理反転を示す）。 There are three causes for a break request to the CPU. The first is a request by the BRK terminal. The second is a request by address comparison A, which is granted by the BIEA bit of BRKCR. Third is a request by address compare B, which is granted by the BIEB bit of BRKCR. As described above, this address comparison is expressed as (CA23 / AR23 + @ CA23 / @ AR23 + AMR23)... (CAn / ARn + @ CAn / @ ARn + AMRn)... (CA0.AR0 + @ CA0. @ AR0 + AMR0). You. (¬ indicates logical inversion).

これらの論理和信号が、ブレーク要求として、ＣＰＵに与えられる。ブレークモードでは、ＭＢＩＥビットの状態によってＢＲＫ端子によるブレーク要求の許可禁止が選択される。即ち、ブレークモードでＭＢＩＥビットが”０”にクリアされている場合は、ブレーク要求が抑止される。アドレス比較によるブレーク要求は、ブレークモードで禁止される。 These OR signals are supplied to the CPU as a break request. In the break mode, permission / prohibition of the break request by the BRK terminal is selected according to the state of the MBIE bit. That is, when the MBIE bit is cleared to "0" in the break mode, the break request is suppressed. Break requests by address comparison are prohibited in the break mode.

また、ＭＢＩＥビットは、フリップフロップで構成され、ＢＲＫＡＫ信号の反転信号で”０”にクリアされる。かかるフリップフロップの入力は、所定のデータバスのビットであって、クロックは、ブレークモード信号とアドレスデコード信号とライト信号の論理積信号とされる。かかるアドレスデコード信号は、ＣＰＵの出力するアドレスがＢＲＫＣＲの存在するアドレスになったとき、活性状態とされる。即ち、ブレークモードでのみライト可能とされる。 The MBIE bit is formed by a flip-flop, and is cleared to “0” by an inverted signal of the BRKAK signal. The input of the flip-flop is a bit of a predetermined data bus, and the clock is an AND signal of a break mode signal, an address decode signal, and a write signal. Such an address decode signal is activated when the address output from the CPU becomes an address where BRKCR exists. That is, writing is enabled only in the break mode.

即ち、ブレークモードに遷移した直後は、ブレーク要求が禁止状態であって、不所望のブレークの多重例外処理（スタックした内容の破壊）が禁止される。また、ＢＲＫＣＲのＳＳＴＰビットと、ＢＲＫＡＫ信号の反転信号との論理積が、シングルステップブレーク要求として、ＣＰＵに与えられる。 That is, immediately after the transition to the break mode, the break request is in a disabled state, and the multiple exception processing of the undesired break (destruction of the stacked contents) is prohibited. The logical product of the SSTP bit of BRKCR and the inverted signal of the BRKAK signal is given to the CPU as a single step break request.

シングルステップブレーク要求と、ＲＴＢ命令実行信号との論理積信号と、ブレーク要求が、ＣＰＵ内部で、ＣＰＵブレーク例外処理要求として認識される。これらの例外処理の内容は共通とされる。 The logical product signal of the single step break request, the RTB instruction execution signal, and the break request are recognized inside the CPU as a CPU break exception processing request. The contents of these exception processes are common.

ＣＰＵはＲＴＢ命令実行時には、かかるシングルステップブレーク要求を無視する。 When executing the RTB instruction, the CPU ignores the single step break request.

上記実施の形態によれば、以下の作用効果を得るものである。 According to the above embodiment, the following operation and effect can be obtained.

（１）既存の命令セットと互換性を維持しつつ、乗算器を内蔵することに当っては、ポストインクリメントレジスタ間接のアドレッシングモードのみをサポートすることによって、アドレッシングモードの増加を最小限にして、かつ処理性能を低下させずに積和演算を実行可能にすることができる。また、アドレスレジスタの補正をレジスタ間演算命令で行い、これを１ステートで実行することができ、アドレッシングの柔軟性を向上することができる。さらに、積和演算をＣＰＵの内部動作（ポストインクリメントのアドレス計算）と並行に行なうことによって、実行ステート数の短縮を行なうことができる。 (1) Incorporating a multiplier while maintaining compatibility with the existing instruction set, by supporting only the post-increment register indirect addressing mode, the increase in the addressing mode is minimized. In addition, the product-sum operation can be executed without lowering the processing performance. Further, correction of the address register can be performed by an inter-register operation instruction, and this can be executed in one state, so that the flexibility of addressing can be improved. Further, by performing the product-sum operation in parallel with the internal operation of the CPU (post-increment address calculation), the number of execution states can be reduced.

（２）乗算器を利用して乗算命令を実行することにあたっては、乗算の結果（積、フラグ）を直接汎用レジスタ、ＣＣＲに格納するようにして、直ちに結果を利用できるようにし、実質的な乗算の実行速度を向上することができる。また、積和演算の結果（ＭＡＣ）をリード（ＳＴＭＡＣ）すると同時に、乗算器内部で保持したフラグをＣＣＲに格納することによって、積和演算結果の利用や判定を容易に行なうことができ、使い勝手を向上することができる。 (2) In executing a multiplication instruction using a multiplier, the result of multiplication (product, flag) is directly stored in a general-purpose register or CCR so that the result can be used immediately, The execution speed of the multiplication can be improved. In addition, by reading the result (MAC) of the product-sum operation (STMAC) and storing the flag held in the multiplier in the CCR at the same time, the result of the product-sum operation can be easily used and determined. Can be improved.

（３）乗算器を取外し可能にすることによって、乗算器を取外した場合は、積和演算をサポートしないことによって、容易に下位ＣＰＵを実現し、論理的・物理的規模を縮小し、製造費用を低減した別のマイクロコンピュータを容易に開発することができる。また、汎用的な乗算命令を、乗算器によらずにサポートすることによって、かかる別のマイクロコンピュータにおける使い勝手の低下を防止できる。さらに、乗算器によらない乗算命令を除算と同一のシーケンスで実行するようにして、乗算器を持つマイクロコンピュータにおいても冗長な論理を最低限にすることができる。 (3) By making the multiplier removable, when the multiplier is removed, the lower CPU can be easily realized by not supporting the product-sum operation, thereby reducing the logical and physical scale, and the manufacturing cost. It is possible to easily develop another microcomputer in which the number is reduced. Further, by supporting a general-purpose multiplication instruction without using a multiplier, it is possible to prevent the usability of such another microcomputer from being reduced. Further, by executing a multiplication instruction which does not depend on the multiplier in the same sequence as the division, redundant logic can be minimized even in a microcomputer having a multiplier.

また、乗算器使用するか使用しないかの制御信号を与えて制御することによって、テスト性を向上したり、エミュレータを共通化したりすることができる。全体的な開発効率を向上することができる。さらにまた、乗算器を削除し、小型化したＣＰＵを用いて、マイクロコンピュータを構成することによって、半導体集積回路の論理規模・物理的規模を縮小して、製造費用の縮小を図ることができる。 In addition, by providing a control signal indicating whether a multiplier is used or not, control can be performed to improve testability and to use a common emulator. Overall development efficiency can be improved. Furthermore, by eliminating the multiplier and configuring the microcomputer using a downsized CPU, the logical scale and the physical scale of the semiconductor integrated circuit can be reduced, and the manufacturing cost can be reduced.

（４）乗算器とＣＰＵを一体に構成して、乗算器・ＣＰＵ間の配線を短縮して、物理的規模を縮小する。また、高速化に寄与することができる。 (4) The multiplier and the CPU are integrally formed to reduce the wiring between the multiplier and the CPU, thereby reducing the physical scale. In addition, it can contribute to speeding up.

（５）乗算器のテストモードを設定して、このときの乗算器の処理を１ステップのみにすることによって、論理規模の増加を最低限にして、テストの容易性を向上することができる。テストステップを短縮することができる。 (5) By setting the test mode of the multiplier and performing only one step of the processing of the multiplier at this time, it is possible to minimize the increase in the logic scale and improve the testability. Test steps can be shortened.

（６）内部動作のパイプラインに対応して、入出力タイミングの異なるレジスタ選択回路を複数持つことにより、実質的に１命令／１ステート実行を行なうことができる。 (6) By providing a plurality of register selection circuits having different input / output timings corresponding to the pipeline of the internal operation, one instruction / one state can be executed substantially.

（７）複数レジスタの退避／復帰命令を持ち、この組み合わせを固定的にすることによって、論理規模の縮小を図ることができる。レジスタの本数の異なる命令を複数命令サポートすることによって、使い勝手の低下を防ぐことができる。また、複数レジスタの退避／復帰命令を関数（サブルーチン）の入り口／出口で実行することによって、Ｃ言語などで記述された場合のように、関数の使用頻度が高い場合に、処理速度を特に向上することができる。さらに、割り込み例外処理ルーチンの先頭で複数レジスタの退避を用いることにより、リアルタイム性の向上を図ることができる。 (7) By having a save / restore instruction for a plurality of registers and fixing this combination, the logical scale can be reduced. By supporting a plurality of instructions having different numbers of registers, it is possible to prevent a decrease in usability. Also, by executing save / restore instructions of a plurality of registers at the entrance / exit of a function (subroutine), the processing speed is particularly improved when the function is frequently used, such as when the function is described in C language or the like. can do. Further, by using the saving of a plurality of registers at the beginning of the interrupt exception handling routine, the real-time property can be improved.

（８）ＥＸＲの有効／無効を切り換えることで、互換性を維持することと、機能拡張とを両立することができる。ＥＸＲを無効とし、例外処理において退避／復帰を行なわないようにすることで、スタックの節約と、割込み応答時間の高速化に寄与することができる。また、互換性を維持する。さらに、ＥＸＲを有効とすることで、割り込みマスクレベルを拡張したり、トレース機能を追加したりして、使い勝手を向上することができる。 (8) By switching between valid / invalid of EXR, compatibility can be maintained and function expansion can be achieved at the same time. By disabling EXR and not saving / restoring in exception processing, it is possible to contribute to saving the stack and shortening the interrupt response time. Also, maintain compatibility. Further, by enabling the EXR, the usability can be improved by extending the interrupt mask level and adding a trace function.

（９）エミュレータ用の固定的なスタックポインタを持つことによって、ユーザプログラムとエミュレーションプログラムの遷移時に、ユーザのスタックポインタとは独立して、固定的なアドレスに対して退避および復帰が行われるから、エミュレータのソフトウェア、ハードウェアの開発を容易にすることができる。 (9) By having a fixed stack pointer for the emulator, at the transition between the user program and the emulation program, saving and restoring to a fixed address are performed independently of the user's stack pointer. Emulator software and hardware development can be facilitated.

また、エミュレータ用のスタックポインタを固定的にすることによって、ユーザに公開しない資源を最小限の論理的・物理的規模にすることができる。ユーザプログラムからエミュレーションプログラムへの遷移（ブレーク）を多重に行なうことを禁止することを可能にすることによって、不所望のスタックの内容の破壊を防止することができる。 In addition, by fixing the stack pointer for the emulator, resources that are not disclosed to the user can be reduced to a minimum logical and physical scale. By making it possible to prohibit multiple transitions (breaks) from the user program to the emulation program, it is possible to prevent undesired destruction of the contents of the stack.

以上本発明者等によってなされた発明を実施の形態に限定されるものではなく、その要旨を逸脱しない範囲において種々変更可能である。実施の形態を相互に組み合せて使用することもできる。 The invention made by the inventors of the present invention is not limited to the embodiment, but can be variously changed without departing from the gist of the invention. The embodiments may be used in combination with each other.

例えば、ＣＰＵの命令セットやレジスタ構成は変更可能である。内部バス幅なども変更可能である。但し、命令の大部分の命令コード長より、小さいバス幅でないことが望ましい。 For example, the instruction set and register configuration of the CPU can be changed. The internal bus width can be changed. However, it is desirable that the bus width is not smaller than the instruction code length of most of the instructions.

また、乗算器の内部構成なども種々変更可能である。１６ビット×６ビットを３回繰り返すのではなく、１６ビット×４ビットを４下位繰り返すようにしてもよい。命令実行ステートと論理的規模に鑑みて選択すればよい。 Further, the internal configuration of the multiplier can be variously changed. Instead of repeating 16 bits × 6 bits three times, 16 bits × 4 bits may be repeated four lower orders. The selection may be made in consideration of the instruction execution state and the logical scale.

さらに、飽和演算の指定、乗算器あり／なしの指定、テストモードの指定方法が種々変更可能であることは言うまでもない。 Further, it goes without saying that the designation of the saturation operation, the designation of the presence / absence of the multiplier, and the designation method of the test mode can be variously changed.

さらにまた、乗算器に限らず除算器を内蔵するものであっても良い。 Furthermore, the present invention is not limited to the multiplier, and may include a divider.

また、互換性を維持すべき対象は、前記例に限定されない。一般的に、ＣＰＵの例外処理時に、コントロールレジスタの内容を待避することは行われており、そのほかのＣＰＵについても、本発明を適用して、互換性を維持しつつ、コントロールレジスタの機能を拡張することができる。 Further, the object for which compatibility should be maintained is not limited to the above example. Generally, the contents of the control register are saved during exception processing of the CPU, and the functions of the control register are extended to other CPUs while maintaining compatibility by applying the present invention. can do.

さらに、シングルチップマイクロコンピュータのその他の機能ブロックについても何等制約されない。 Further, the other functional blocks of the single-chip microcomputer are not restricted at all.

以上の説明では主として本発明者によってなされた発明をその背景となった利用分野であるシングルチップマイクロコンピュータに適用した場合について説明したが、それに限定されるものではなく、その他のデータ処理装置にも適用可能であり、本発明は少なくとも、複数の動作モードを選択して動作するデータ処理装置に適用することができる。 In the above description, the case where the invention made by the present inventor is mainly applied to a single-chip microcomputer, which is a field of use as a background, has been described. The present invention is applicable to at least a data processing device that operates by selecting a plurality of operation modes.

本発明の実施の形態によるマイクロコンピュータの構成を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration of a microcomputer according to an embodiment of the present invention. 本実施の形態のマイクロコンピュータのシステムコントロールレジスタの構成図であるFIG. 3 is a configuration diagram of a system control register of the microcomputer according to the present embodiment. 本実施の形態のマイクロコンピュータに用いられるＣＰＵの命令フォーマットの構成図である。FIG. 2 is a configuration diagram of an instruction format of a CPU used in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータに用いられるＣＰＵと乗算器を示すブロック図である。FIG. 2 is a block diagram illustrating a CPU and a multiplier used in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおける制御レジスタの１ビットの構成図である。FIG. 2 is a configuration diagram of one bit of a control register in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおける制御信号の設定方法の一例を示すブロックである。3 is a block diagram illustrating an example of a control signal setting method in the microcomputer according to the present embodiment. 本実施の形態のマイクロコンピュータにおける制御信号の設定方法の一例を示す概略図である。FIG. 4 is a schematic diagram illustrating an example of a control signal setting method in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータに用いられるＣＰＵのレジスタの構成図である。FIG. 2 is a configuration diagram of a register of a CPU used in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータに用いられるＣＰＵのレジスタの構成図である。FIG. 2 is a configuration diagram of a register of a CPU used in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータに用いられるＣＰＵのレジスタの使用方法を説明するフロック図である。FIG. 4 is a block diagram illustrating a method of using a register of a CPU used in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータに用いられるＣＰＵのレジスタのスタックの状態の説明図である。FIG. 3 is an explanatory diagram of a state of a stack of registers of a CPU used in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータに用いられるＣＰＵの基本動作のタイミング図である。FIG. 3 is a timing chart of a basic operation of a CPU used in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータに用いられるＣＰＵの基本動作のタイミング図である。FIG. 3 is a timing chart of a basic operation of a CPU used in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータに用いられる乗算器の構成を示すブロック図である。FIG. 2 is a block diagram illustrating a configuration of a multiplier used in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータに用いられる乗算器による演算方法の説明図である。FIG. 4 is an explanatory diagram of a calculation method by a multiplier used in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータに用いられる乗算器のフラグの実現方法の説明図である。FIG. 3 is an explanatory diagram of a method of realizing a flag of a multiplier used in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータに用いられる乗算器のフラグの実現方法の説明図である。FIG. 3 is an explanatory diagram of a method of realizing a flag of a multiplier used in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータに用いられるバススイッチの構成を示すブロック図である。FIG. 2 is a block diagram illustrating a configuration of a bus switch used in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおけるＭＡＣ命令の動作のタイミング図である。FIG. 4 is a timing chart of an operation of a MAC instruction in the microcomputer of the embodiment. 図１９に連続する動作のタイミング図である。FIG. 20 is a timing chart of the operation following FIG. 19. 本実施の形態のマイクロコンピュータにおけるＳＴＭＡＣ命令およびＬＤＭＡＣ命令の動作のタイミング図である。FIG. 5 is a timing chart of the operation of the STMAC instruction and the LDMAC instruction in the microcomputer of the embodiment. 図２１に連続する動作のタイミング図である。FIG. 22 is a timing chart of the operation following FIG. 21. 本実施の形態のマイクロコンピュータにおいて乗算器を用いた場合の乗算命令の動作のタイミング図である。FIG. 5 is a timing chart of an operation of a multiplication instruction when a multiplier is used in the microcomputer of the present embodiment. 図１９に連続する動作のタイミング図である。FIG. 20 is a timing chart of the operation following FIG. 19. 本実施の形態のマイクロコンピュータにおいて乗算器を用いない場合の乗算命令の動作のタイミング図である。FIG. 5 is a timing chart of an operation of a multiplication instruction when a multiplier is not used in the microcomputer of the present embodiment. 図２５に連続する動作のタイミング図である。FIG. 26 is a timing chart of the operation following FIG. 25. 本実施の形態のマイクロコンピュータにおける乗算命令の状態遷移図である。FIG. 4 is a state transition diagram of a multiplication instruction in the microcomputer of the embodiment. 本実施の形態のマイクロコンピュータに用いられる演算器の構成を示すブロック図である。FIG. 2 is a block diagram illustrating a configuration of an arithmetic unit used in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおける複数レジスタの退避命令の実行シーケンス図である。FIG. 5 is an execution sequence diagram of a save instruction of a plurality of registers in the microcomputer according to the present embodiment. 図２９に連続する動作のタイミング図である。FIG. 30 is a timing chart of the operation following FIG. 29. 本実施の形態のマイクロコンピュータにおける複数レジスタの復帰命令の実行シーケンス図である。FIG. 5 is an execution sequence diagram of a return instruction of a plurality of registers in the microcomputer of the present embodiment. 図３１に連続する動作のタイミング図である。FIG. 32 is a timing chart of the operation following FIG. 31. 本実施の形態のマイクロコンピュータにおけるＲＳＥＬ２入力制御回路の構成を示すブロック図である。FIG. 2 is a block diagram illustrating a configuration of an RSEL2 input control circuit in the microcomputer of the present embodiment. 図３３の動作の説明図である。FIG. 34 is an explanatory diagram of the operation in FIG. 33. 本実施の形態のマイクロコンピュータに適用されるＣ言語による変換リストの概略例である。3 is a schematic example of a conversion list in C language applied to the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータに適用されるＣ言語による変換リストの概略例である。3 is a schematic example of a conversion list in C language applied to the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおける割込例外処理の実行シーケンス図である。FIG. 4 is an execution sequence diagram of interrupt exception processing in the microcomputer of the present embodiment. 図３７に連続する動作のタイミング図である。FIG. 38 is a timing chart of the operation following FIG. 37. 本実施の形態のマイクロコンピュータにおける例外処理の状態遷移図である。FIG. 6 is a state transition diagram of exception processing in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおける例外処理後のスタックの状態の説明図である。FIG. 4 is an explanatory diagram of a state of a stack after exception processing in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおける例外処理後のスタックの状態の説明図である。FIG. 4 is an explanatory diagram of a state of a stack after exception processing in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおけるＲＴＥ命令の実行シーケンス図である。FIG. 3 is an execution sequence diagram of an RTE instruction in the microcomputer of the embodiment. 本実施の形態のマイクロコンピュータにおける例外処理の状態遷移図である。FIG. 6 is a state transition diagram of exception processing in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおける制御回路の構成の説明図である。FIG. 2 is an explanatory diagram of a configuration of a control circuit in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおける制御レジスタの構成図である。FIG. 2 is a configuration diagram of a control register in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおける制御レジスタの構成図である。FIG. 2 is a configuration diagram of a control register in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおける制御レジスタの構成図である。FIG. 2 is a configuration diagram of a control register in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおける制御レジスタの構成図である。FIG. 2 is a configuration diagram of a control register in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおける制御レジスタの構成図である。FIG. 2 is a configuration diagram of a control register in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおけるエミュレーションスタックポインタの構成の説明図である。FIG. 3 is an explanatory diagram of a configuration of an emulation stack pointer in the microcomputer according to the present embodiment. 本実施の形態のマイクロコンピュータにおけるブレーク割込みシーケンスの実行タイミング図である。FIG. 4 is a timing chart of the execution of a break interrupt sequence in the microcomputer of the present embodiment. 本実施の形態のマイクロコンピュータにおけるブレーク制御処理の回路構成図である。FIG. 3 is a circuit configuration diagram of a break control process in the microcomputer of the present embodiment.

Explanation of reference numerals

１…ＣＰＵ、２…乗算器、３…システムコントローラ（ＳＹＳＣ）、４…割込コントローラ（ＩＮＴ）、５…ＲＯＭ、６…ＲＡＭ、９…シリアルコミュニケーションインターフェース（ＳＣＩ）、１３…システムコントロールレジスタ（ＳＹＳＣＲ）、１４…制御レジスタ（ＣＰＵＣＲ）、２１…命令レジスタ（ＩＲ）、２２…命令デコーダ・制御回路（ＣＯＮＴ）、２３…レジスタセレクタ（ＲＳＥＬ）、２４…ライトデータバッファ（ＤＢＷ）、２５…リードデータバッファ（ＤＢＲ）、２６、２７…演算器、２９…エミュレータスタックポインタ（ＥＭＬＳＰ）、３０…プログラムカウンタ（ＰＣ）、３１…コンディションレジスタ（ＣＣＲ）、３２…拡張レジスタ（ＥＸＲ）、３３…アドレスバッファ（ＭＡＢ）、３４…バススイッチ、３８…エミュレーション用プロセッサ、３９…エミュレーション用インタフェース、４４…インタフェースケーブル、４８…エミュレーションメモリ、４９…ブレーク制御回路、５０…リアルタイムトレース回路、５８…ＣＭＯＳインバータ回路、６５Ａ〜６５Ｃ…デコーダ、６６Ａ〜６６Ｃ、７１Ａ、７１Ｂ、７７…選択回路、６７…加算器、７２Ａ〜７２Ｃ…拡張回路、７６…算術論理演算回路、７８…シフト回路、７９…制御回路、７５Ａ、７５Ｂ、８１Ａ〜８１Ｇ、８６Ａ〜８６Ｄ…アンド回路、８０Ａ、８０Ｂ、８２Ａ〜８２Ｄ…オア回路、８３…フリップフロップ。
DESCRIPTION OF SYMBOLS 1 ... CPU, 2 ... Multiplier, 3 ... System controller (SYSC), 4 ... Interrupt controller (INT), 5 ... ROM, 6 ... RAM, 9 ... Serial communication interface (SCI), 13 ... System control register (SYSCR) ), 14: control register (CPUCR), 21: instruction register (IR), 22: instruction decoder / control circuit (CONT), 23: register selector (RSEL), 24: write data buffer (DBW), 25: read data Buffers (DBR), 26, 27 arithmetic unit, 29 emulator stack pointer (EMLSP), 30 program counter (PC), 31 condition register (CCR), 32 extension register (EXR), 33 address buffer ( MAB), 34 ... bus switch 38, emulation processor, 39, emulation interface, 44, interface cable, 48, emulation memory, 49, break control circuit, 50, real-time trace circuit, 58, CMOS inverter circuit, 65A to 65C, decoder, 66A to 66C , 71A, 71B, 77: selection circuit, 67: adder, 72A to 72C: expansion circuit, 76: arithmetic and logic operation circuit, 78: shift circuit, 79: control circuit, 75A, 75B, 81A to 81G, 86A to 86D ... AND circuits, 80A, 80B, 82A to 82D ... OR circuits, 83 ... flip-flops.

Claims

A data processing device that sequentially executes a predetermined instruction,
A data processing apparatus, wherein a combination of a plurality of registers that can be specified is fixed to a control unit that controls an execution unit that executes the instruction, and a save / restore instruction for the plurality of registers is provided.

2. The data processing apparatus according to claim 1, wherein the save / restore instructions are a plurality of different types of register combinations that can be specified.

3. The data processing device according to claim 1, wherein the save instruction and the return instruction have the same combination of registers that can be specified.

4. The data processing device according to claim 1, wherein the plurality of registers to be saved / restored are a plurality of registers having consecutive register numbers.

5. The data processing apparatus according to claim 4, wherein the register designator of the instruction code of the save / restore instruction represents a combination of a plurality of registers to be saved / restored by a register number saved first.

A data processing device wherein a combination of registers that can be designated by a save / restore instruction is fixed.