[go: up one dir, main page]

CN119623384B - Power consumption optimization method, integrated circuit and chip - Google Patents

Power consumption optimization method, integrated circuit and chip Download PDF

Info

Publication number
CN119623384B
CN119623384B CN202510157183.6A CN202510157183A CN119623384B CN 119623384 B CN119623384 B CN 119623384B CN 202510157183 A CN202510157183 A CN 202510157183A CN 119623384 B CN119623384 B CN 119623384B
Authority
CN
China
Prior art keywords
register
load
icg
registers
power consumption
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202510157183.6A
Other languages
Chinese (zh)
Other versions
CN119623384A (en
Inventor
王泽坤
宋柯贤
黄现
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Taorun Semiconductor Co ltd
Original Assignee
Shanghai Taorun Semiconductor Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Taorun Semiconductor Co ltd filed Critical Shanghai Taorun Semiconductor Co ltd
Priority to CN202510157183.6A priority Critical patent/CN119623384B/en
Publication of CN119623384A publication Critical patent/CN119623384A/en
Application granted granted Critical
Publication of CN119623384B publication Critical patent/CN119623384B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/30Circuit design
    • G06F30/32Circuit design at the digital level
    • G06F30/337Design optimisation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Geometry (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Power Sources (AREA)

Abstract

本发明提供了一种功耗优化方法、集成电路及芯片,该方法包括:基于EDA的时序报告获取负载寄存器中有时序违例的第一终点寄存器;获取由ICG驱动的所有第一负载寄存器;根据第一负载寄存器和第一终点寄存器,获取由ICG驱动且限制电路整体性能的第二终点寄存器;按组合逻辑将各个负载寄存器划分为若干个逻辑上互不相干的寄存器堆;为每一个寄存器堆复制一份ICG用于驱动该寄存器堆内各个负载寄存器的CP端。该方案通过利用EDA工具的ICG复制功能,且实现对影响电路整体性能的关键寄存器的ICG复制以及放置,能够在不增加流程复杂度以及保持对集成电路节省功耗的前提下,使负载寄存器之间路径的时序能够得到极限的优化。

The present invention provides a power consumption optimization method, integrated circuit and chip, the method comprising: obtaining a first terminal register with timing violation in a load register based on an EDA timing report; obtaining all first load registers driven by an ICG; obtaining a second terminal register driven by the ICG and limiting the overall performance of the circuit according to the first load register and the first terminal register; dividing each load register into a plurality of logically unrelated register stacks according to combinational logic; and copying an ICG for each register stack to drive the CP end of each load register in the register stack. The solution utilizes the ICG copy function of the EDA tool and realizes the ICG copy and placement of the key registers that affect the overall performance of the circuit, so that the timing of the path between the load registers can be optimized to the extreme without increasing the complexity of the process and maintaining the power consumption of the integrated circuit.

Description

Power consumption optimization method, integrated circuit and chip
Technical Field
The present invention relates to the field of integrated circuits, and more particularly, to a power consumption optimization method, an integrated circuit, and a chip.
Background
With the continuous development of integrated circuits, not only the circuit performance requirements are continuously improved, but also the constraint on power consumption is more and more strict. In very large scale integrated circuits, the consumption of dynamic power consumption, i.e. the power consumption caused by the inversion of the data and clock signals, accounts for 95% of the overall power consumption, wherein the dynamic power consumption on the clock tree accounts for about 35% of the overall dynamic power consumption. If a part of the functional circuits in the whole design do not need to work for a period of time, which means that the data signal of the data port (D end) of the register will not be flipped, the clock signal of the clock port (CP end) of the register will not need to be flipped, so that the clock will not be flipped, and the power consumption caused by flipping on the clock tree can be avoided.
However, as the scale of the integrated circuit design itself is larger and larger, the functions of the sub-module circuit become complex, so too is the number of load registers in the module circuit, and how to control so many load registers to avoid the power consumption caused by signal inversion becomes a problem to be solved.
Disclosure of Invention
The invention aims to provide a power consumption optimization method, an integrated circuit and a chip, which can realize ICG replication and placement of key registers affecting the overall performance of a circuit by utilizing the ICG replication function of an EDA tool, and can enable the time sequence of paths between load registers to be optimized to the utmost extent on the premise of not increasing the complexity of a flow and keeping the power consumption of the integrated circuit, thereby improving the performance of the chip.
The technical scheme provided by the invention is as follows:
The invention provides a power consumption optimization method, which is used for time sequence control of a load register in a submodule circuit of an integrated circuit, and comprises the following steps:
acquiring a first endpoint register with timing violations in the load register based on a timing report of the submodule circuit acquired by EDA;
Acquiring all first load registers driven by ICG in the load registers;
Acquiring a second end point register which is driven by the ICG and limits the overall performance of the circuit in the load register according to the first load register and the first end point register;
Load registers with combination logic with the second endpoint registers are respectively obtained, and each load register is divided into a plurality of logically mutually incoherent register files according to the combination logic;
one ICG is replicated for each of the register files by the EDA tool for driving the CP end of each of the load registers within the register file.
According to the scheme, through the time sequence report of the submodule circuit obtained by EDA, the first end point register with time sequence violations in the load registers can be judged, and then through determining all the first load registers driven by the ICG in the load registers, the second end point register which is driven by the ICG and limits the overall performance of the circuit in the load registers can be obtained, the load registers with combined logic with the second end point register are found out according to the logic relation, each load register can be divided into a plurality of register files which are not mutually coherent logically, and through copying one ICG for each register file to drive the CP end of each load register in the register file, the fact that the start point register and the end point register which are logically connected and are the critical paths share one ICG as much as possible can be realized, so that the extra pessimistic degree is required to be considered on the clock tree which is caused by premature clock bifurcation because of the ICG is not shared is reduced, the time sequence performance of the critical paths is improved, and the performance of the whole chip is improved.
In some embodiments, the separately obtaining load registers having combinational logic with each of the second endpoint registers further includes:
and acquiring a starting point register positioned at a logic starting point in a load register with combination logic of each second ending point register.
In some embodiments, the copying, by the EDA tool, of one ICG for each of the register files further comprises:
calculating the physical center of each register file according to the physical distribution range from the starting register to the second ending register in each register file;
the ICG is placed at the physical center.
In some embodiments, the CP terminal of each ICG is connected to a clock source, the Q terminal of each ICG is connected to the CP terminal of each load register in the register file, and the E terminal of each ICG is configured to receive a gating signal.
In some embodiments, further comprising performing ICG replication for other load registers of the first load register not belonging to each of the register files by EDA tool according to preset conditions, and driving a plurality of the other load registers by one ICG.
In some embodiments, the preset condition is a physical distribution of the other load registers, and a number of drives per ICG.
In some implementations, the first load register and the first endpoint register are interleaved to obtain the second endpoint register of the load registers that is driven by ICG and limits overall circuit performance.
In some embodiments, after dividing each load register into a plurality of register files that are logically mutually incoherent according to the combinational logic, the method further includes:
And re-acquiring a time sequence report of the sub-module circuit, judging whether the time sequence of each load register with the combinational logic in each register file is accurate or not, and re-dividing the register file when judging that the time sequence of each load register with the combinational logic in each register file is not accurate.
In a second aspect, the present application provides an integrated circuit, where the integrated circuit uses the power consumption optimization method described in the first aspect to perform timing control of a load register in a sub-module circuit.
In a third aspect, the application provides a chip comprising the integrated circuit of the second aspect.
According to the power consumption optimization method, the integrated circuit and the chip, the ICG replication function of the EDA tool is utilized, ICG replication and placement of key registers affecting the overall performance of the circuit are realized, and the time sequence of paths between load registers can be optimized to the greatest extent on the premise that the complexity of a flow is not increased and the power consumption of the integrated circuit is saved, so that the performance of the chip is improved.
Drawings
The above features, technical features, advantages and implementation modes of the present invention will be further described in the following description of preferred embodiments with reference to the accompanying drawings in a clear and understandable manner.
FIG. 1 is a schematic diagram of a control architecture of a clock gating cell of the present invention;
FIG. 2 is a schematic waveform diagram of a clock gating cell of the present invention with its clock signal turned off;
FIG. 3 is a schematic diagram of ICG control of the present invention;
FIG. 4 is a schematic diagram of the data path and clock path of a gated control register to an ICG;
FIG. 5 is a schematic diagram of a prior art circuit after ICG replication;
FIG. 6 is a schematic diagram showing the disadvantages of the prior art ICG replication;
FIG. 7 is a flow chart of a power consumption optimization method according to one embodiment of the invention;
FIG. 8 is a schematic diagram of register file partitioning according to one embodiment of the present invention;
Fig. 9 is a schematic circuit diagram of an ICG replicated according to one embodiment of the present invention.
Detailed Description
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the following description will explain the specific embodiments of the present invention with reference to the accompanying drawings. It is evident that the drawings in the following description are only examples of the invention, from which other drawings and other embodiments can be obtained by a person skilled in the art without inventive effort.
For the sake of simplicity of the drawing, the parts relevant to the present invention are shown only schematically in the figures, which do not represent the actual structure thereof as a product. Additionally, in order to simplify the drawing for ease of understanding, components having the same structure or function in some of the drawings are shown schematically with only one of them, or only one of them is labeled. Herein, "a" means not only "only this one" but also "more than one" case.
With the continuous development of integrated circuits, not only the circuit performance requirements are continuously improved, but also the constraint on power consumption is more and more strict. In very large scale integrated circuits, the consumption of dynamic power consumption, i.e. the power consumption caused by the inversion of the data and clock signals, accounts for 95% of the overall power consumption, wherein the dynamic power consumption on the clock tree accounts for about 35% of the overall dynamic power consumption. If a part of the functional circuits in the whole design do not need to work for a period of time, which means that the data signal of the data port (D end) of the register will not be flipped, the clock signal of the clock port (CP end) of the register will not need to be flipped, so that the clock will not be flipped, and the power consumption caused by flipping on the clock tree can be avoided.
Referring to fig. 1 of the specification, by using a gating control signal generated by a gating unit control logic to enter an E terminal of a gating unit (ICG), a clock sent from a clock source (clock port) is turned off and turned into a turned-off clock, and then the turned-off clock is used as a clock input (clock pin) of a sub-module, so that a clock signal of a clock port (CP terminal) of a register can be realized without turning over.
The turn-off means that the clock signal turned over all the time every cycle becomes a fixed 0 as shown in fig. 2, so that the CP terminals of the devices (Buffer) and Registers (REGS) on the clock tree are not turned over, and thus the power consumption is reduced. The control logic of the gating unit is built by a register, and the gating control signal is set to 0 by detecting that the submodule does not need to work, so that the clock behind the gating unit can be turned off.
Detailed description of the structure in fig. 1 as shown in fig. 3, registers in the sub-module circuit are referred to as load registers, and registers controlling the logic of the ICG/E port are referred to as gating control registers.
However, as the scale of the integrated circuit design itself increases, the functions of the sub-module circuit become complex, and the number of load registers in the module circuit increases, so that in order to build a clock tree from the ICG/Q to the CP end of all load register groups, the clock tree delay is required to be larger, and the delay from the clock source to the ICG/CP is smaller. However, the gating control registers may be distributed among the load register groups, so the delay from the clock source to the CP of the gating control registers is large, as well as the delay of the data from the gating control registers to the ICG/E is also large. As shown in fig. 4, the transmit clock path delay is longer, the transmit data path delay is longer, but the capture clock path delay is more, and the setup time from the gating control register to the ICG is easily problematic according to the definition of the setup time, and does not meet the performance requirements of the circuit.
Meanwhile, in the prior art, when the distribution problem of ICG is solved for the situation that the number of load registers is large, a common method is to make automatic ICG copying through an EDA tool. For example, as shown in fig. 5, ICG is copied into icg_cp1 and icg_cp2, and the E terminals of the ICG after copying are commonly connected to the original gating control signal, that is, the condition that ICG is turned off is the same, without changing the function of the circuit. But icg_cp1 and icg_cp2 may control a part of the load registers, respectively, as load register group a and load register group B are split in the figure. After the ICGs are replicated in this way, the establishment time is advantageously satisfied because the number of load registers behind each ICG is reduced, resulting in a reduced ICG/Q to load register clock delay, i.e., an increased clock delay from the clock source to the ICG/CP port, corresponding to a longer capture clock path delay in fig. 4, which contributes to improved circuit performance.
However, in the prior art, the automatic ICG replication of the tool is mainly based on the physical distribution position of the load register, and meanwhile, the EDA tool adopts a heuristic algorithm, so that the ICG replication scheme has certain randomness. As shown in fig. 6, in a random ICG replication scheme, registers that are physically close to each other but not logically interacted with each other may be allocated under the same ICG, e.g., registers within load register group B/a have no logically interacted with each other, but load register group B has logically interacted with load register group a. Furthermore, in the timing check of the digital integrated circuit, because the clock paths from icg_cp2/CP to the load register group B (transmit clock path) and icg_cp1/CP to the load register group a (capture clock path) are physically located at different positions, the Voltage/Temperature/Process conditions in which they are located are different, and in the worst case, all devices on the transmit clock path are in slower Process conditions, lower Voltage and higher Temperature, and all devices on the capture clock path are in faster Process conditions, higher Voltage and normal Temperature, resulting in a large delay of the transmit clock path and a small delay of the capture clock path. To cover such a practical scenario, the delay of the transmit clock path needs to be additionally multiplied by a factor greater than 1, and the delay of the capture clock path needs to be additionally multiplied by a factor less than 1, i.e., the transmit clock path delay is increased and the capture clock delay is decreased, so that the timing performance from register group B to register group a is poor. But if register group B through register group a are themselves critical paths that are difficult to meet timing requirements, such paths will be referred to as bottlenecks in performance improvement of the entire chip.
To solve such a problem, the present application will have a logically connected and critical path start and end registers sharing as much as one ICG, so that their shared paths will not be under different voltages/samples/processes, and no additional pessimistic coefficients need to be multiplied to cover such a scenario, which is helpful for overall timing improvement. The present application will be described in detail with reference to the accompanying drawings:
in one embodiment, referring to fig. 7 of the specification, the present invention provides a power consumption optimization method for timing control of load registers in a sub-module circuit of an integrated circuit, comprising the steps of:
S100, acquiring a first end point register with timing violations in a load register based on a timing report of the sub-module circuit acquired by EDA.
Timing violations are situations where the signal fails to meet timing constraints in digital circuit designs, and typically occur when the clock frequency is too high or the data path delay is too long. Timing violations are mainly classified into Setup Time (Setup Time) violations, where data fails to stabilize before a clock edge arrives, resulting in data that may be captured erroneously, hold Time (Hold Time) violations, where data changes too fast after a clock edge, resulting in data that may be captured erroneously, multi-cycle path violations, where some paths are delayed by more than one clock cycle, and clock domain violations, where signal transmission between clock domains of different frequencies or phases does not meet timing requirements. In the current design without clock tree, the timing report of the submodule circuit of the integrated circuit can be automatically generated based on EDA, and the END point register with timing violations in each load register can be found out, wherein the END point register is the most critical END point register (which can be expressed as VIO_END_REGS) for limiting the timing of the overall performance.
S200, acquiring all first load registers driven by the ICG in the load registers. In particular, all LOAD registers driven by the ICG (which may be denoted icg_load_regs) among the LOAD registers may be found using the underlying commands of the EDA itself.
S300, according to the first load register and the first end point register, acquiring a second end point register which is driven by the ICG and limits the overall performance of the circuit in the load register.
In one particular implementation, a second endpoint register (which may be denoted as vio_icg_load_end_regs) driven by the ICG and limiting the overall performance of the circuit may be obtained by performing an intersection operation on the first LOAD register and the first endpoint register.
S400, load registers with combination logic with the second endpoint registers are obtained respectively, and each load register is divided into a plurality of logically mutually incoherent register files according to the combination logic.
In one embodiment, each of the LOAD registers having combinational logic with each of the second endpoint registers is obtained, and further comprising obtaining a START register (which may be denoted as vio_icg_load_start_regs) located at the logic START point in the LOAD register having combinational logic with each of the second endpoint registers vio_icg_load_end_regs. Several logically mutually independent register files, each load register divided by combinational logic, are shown in fig. 8.
S500, copying an ICG for each register file through an EDA tool to drive the CP end of each load register in the register file.
As shown in FIG. 9, the load registers with combinational logic share one ICG for driving, so that the need of considering extra pessimistic degree on a clock tree caused by premature clock bifurcation due to the fact that the ICG is not shared can be reduced, and the time sequence performance of a critical path is improved, and the performance of the whole chip is improved. The CP end of each ICG is connected with a clock source, the Q end of each ICG is connected with the CP end of each load register in the corresponding register file, and the E end of each ICG is used for receiving a gating signal.
According to the scheme, through the time sequence report of the submodule circuit obtained by EDA, the first end point register with time sequence violations in the load registers can be judged, and then through determining all the first load registers driven by the ICG in the load registers, the second end point register which is driven by the ICG and limits the overall performance of the circuit in the load registers can be obtained, the load registers with combined logic with the second end point register are found out according to the logic relation, each load register can be divided into a plurality of register files which are not mutually coherent logically, and through copying one ICG for each register file to drive the CP end of each load register in the register file, the fact that the start point register and the end point register which are logically connected and are the critical paths share one ICG as much as possible can be realized, so that the extra pessimistic degree is required to be considered on the clock tree which is caused by premature clock bifurcation because of the ICG is not shared is reduced, the time sequence performance of the critical paths is improved, and the performance of the whole chip is improved.
In one embodiment, based on the previous embodiment, copying one ICG for each register file by the EDA tool further comprises:
Calculating the physical center of each register file according to the physical distribution range from the starting register to the second ending register in each register file, and placing the ICG at the physical center.
Specifically, for each register file, the respective LOAD registers from the START register vio_icg_load_start_regs_k to the second END register vio_icg_load_end_regs_k will have one copy of icg_k replicated by the EDA tool to drive their CP ENDs, while the physical center of the register file is calculated from the physical distribution of the START register vio_icg_load_start_regs_k to the second END register vio_icg_load_end_regs_k, with icg_k being placed in this physical center position to ensure that all registers in the register file can share the longest clock length.
By utilizing the ICG copying function of the EDA tool and realizing ICG copying and placement of key registers affecting the overall performance of the circuit, the time sequence of paths between load registers can be optimized to the utmost extent on the premise of not increasing the complexity of the flow and keeping the power consumption of the integrated circuit, and further the performance of the chip is improved.
In an embodiment, on the basis of the previous embodiment, further comprising performing ICG copy for other load registers not belonging to each register file in the first load register by the EDA tool according to a preset condition, so that a plurality of other load registers are driven by one ICG. The preset conditions are the physical distribution of the other load registers, and the number of drives per ICG. The number of drives per ICG may be determined by the user and entered into the EDA tool before replication.
Because not all load registers in the submodule circuit are provided with combinational logic, a plurality of load registers are independent, and at the moment, the ICG replication can be carried out on the load registers which are driven by the ICG and have no combinational logic according to physical distribution by adopting a conventional EDA replication method, so that the time sequence control of all load registers which are driven by the ICG in the submodule circuit can be realized.
In one embodiment, after dividing each load register into a plurality of logically mutually incoherent register files according to the combinational logic based on the previous embodiment, the method further comprises the steps of re-acquiring a time sequence report of the sub-module circuit, judging whether the time sequence of each load register with combinational logic in each register file is accurate or not, and re-dividing the register files when judging whether the time sequence of each load register with combinational logic is accurate or not, so that the combinational logic of the load registers in each register file is accurate, and ICG control errors are avoided, and circuit performance is affected.
The power consumption optimization method of the application can be realized in the form of script to be suitable for the existing EDA tool. In addition, all replicated icg_ks may be set to dont _touch to ensure that their connection is not changed by the later tool's policy to automatically replicate ICGs, while all replicated icg_ks may be set to soft_fixed to ensure that their physical locations are not moved by the tool.
In one embodiment, the present application provides an integrated circuit that uses the power consumption optimization method of the foregoing embodiment to perform timing control of a load register in a sub-module circuit.
In one embodiment, the present application provides a chip comprising the integrated circuit of the previous embodiment.
It should be noted that the above embodiments can be freely combined as needed. The foregoing is merely a preferred embodiment of the present invention and it should be noted that modifications and adaptations to those skilled in the art may be made without departing from the principles of the present invention, which are intended to be comprehended within the scope of the present invention.

Claims (10)

1. A method for optimizing power consumption for timing control of load registers in a sub-module circuit of an integrated circuit, comprising the steps of:
acquiring a first endpoint register with timing violations in the load register based on a timing report of the submodule circuit acquired by EDA;
Acquiring all first load registers driven by ICG in the load registers;
Acquiring a second end point register which is driven by the ICG and limits the overall performance of the circuit in the load register according to the first load register and the first end point register;
Load registers with combination logic with the second endpoint registers are respectively obtained, and each load register is divided into a plurality of logically mutually incoherent register files according to the combination logic;
one ICG is replicated for each of the register files by the EDA tool for driving the CP end of each of the load registers within the register file.
2. The power consumption optimizing method as claimed in claim 1, wherein said respectively acquiring load registers having combinational logic with each of said second endpoint registers, further comprises:
and acquiring a starting point register positioned at a logic starting point in a load register with combination logic of each second ending point register.
3. The power consumption optimization method of claim 2, wherein said copying, by the EDA tool, one ICG for each of said register files, further comprises:
calculating the physical center of each register file according to the physical distribution range from the starting register to the second ending register in each register file;
the ICG is placed at the physical center.
4. The power consumption optimization method according to claim 1, wherein CP terminals of each ICG are connected to a clock source, Q terminals of each ICG are respectively connected to CP terminals of each load register in the corresponding register file, and E terminals of each ICG are each configured to receive a gating signal.
5. The power consumption optimizing method of claim 1, further comprising performing ICG copy for other load registers of the first load register not belonging to each of the register files by EDA tool according to a preset condition, and driving a plurality of the other load registers by one ICG.
6. The power consumption optimizing method according to claim 5, wherein the preset condition is a physical distribution of the other load registers and a driving number of each ICG.
7. The power consumption optimization method according to claim 1, wherein the first load register and the first end point register are subjected to an intersection operation, and the second end point register which is driven by ICG and limits the overall performance of a circuit in the load registers is obtained.
8. The power consumption optimization method according to claim 1, wherein after dividing each load register into a plurality of register files which are logically mutually incoherent according to the combinational logic, further comprising:
And re-acquiring a time sequence report of the sub-module circuit, judging whether the time sequence of each load register with the combinational logic in each register file is accurate or not, and re-dividing the register file when judging that the time sequence of each load register with the combinational logic in each register file is not accurate.
9. An integrated circuit, characterized in that the integrated circuit employs the power consumption optimization method of any of claims 1-8 for timing control of load registers in sub-module circuits.
10. A chip comprising the integrated circuit of claim 9.
CN202510157183.6A 2025-02-13 2025-02-13 Power consumption optimization method, integrated circuit and chip Active CN119623384B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202510157183.6A CN119623384B (en) 2025-02-13 2025-02-13 Power consumption optimization method, integrated circuit and chip

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202510157183.6A CN119623384B (en) 2025-02-13 2025-02-13 Power consumption optimization method, integrated circuit and chip

Publications (2)

Publication Number Publication Date
CN119623384A CN119623384A (en) 2025-03-14
CN119623384B true CN119623384B (en) 2025-04-18

Family

ID=94907490

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202510157183.6A Active CN119623384B (en) 2025-02-13 2025-02-13 Power consumption optimization method, integrated circuit and chip

Country Status (1)

Country Link
CN (1) CN119623384B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116341437A (en) * 2023-02-24 2023-06-27 山东云海国创云计算装备产业创新中心有限公司 Method, system, equipment and storage medium for solving ICG timing sequence convergence
CN116501160A (en) * 2023-04-26 2023-07-28 西安理工大学 Low-power consumption clock tree processing method for GPU module

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8850371B2 (en) * 2012-09-14 2014-09-30 Nvidia Corporation Enhanced clock gating in retimed modules
CN116720459A (en) * 2023-06-19 2023-09-08 海光信息技术股份有限公司 Method, system, device and medium for inserting gating clock between blocks
CN117311446A (en) * 2023-09-27 2023-12-29 山东云海国创云计算装备产业创新中心有限公司 Clock gating method, system, storage medium and device based on register grouping
CN118643775B (en) * 2024-08-14 2024-10-18 山东启芯软件科技有限公司 Integrated circuit design power consumption optimization method and device with multi-level clock gating inserted

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116341437A (en) * 2023-02-24 2023-06-27 山东云海国创云计算装备产业创新中心有限公司 Method, system, equipment and storage medium for solving ICG timing sequence convergence
CN116501160A (en) * 2023-04-26 2023-07-28 西安理工大学 Low-power consumption clock tree processing method for GPU module

Also Published As

Publication number Publication date
CN119623384A (en) 2025-03-14

Similar Documents

Publication Publication Date Title
US8286025B1 (en) Selection of port adapters for clock crossing boundaries
US8627252B2 (en) Method for selectively implementing low threshold voltage transistors in digital logic designs
US7461187B2 (en) Bus system and data transfer method
US7284143B2 (en) System and method for reducing clock skew
WO2021134220A1 (en) Chip, chip testing method and electronic device
US6643829B1 (en) Reducing clock skew in clock gating circuits
JP2009163758A (en) Data transfer between differently clocked busses
WO2017058457A1 (en) Interactive multi-step physical synthesis
CN111078614A (en) Function module based on FPGA
CN118508933A (en) Time sequence adjusting circuit, delay path determining method and terminal equipment
US9665377B2 (en) Processing apparatus and method of synchronizing a first processing unit and a second processing unit
CN115017845A (en) Bus-Driven Chip Simulation Stimulus Model for IP Cell-Level Verification
CN114997087B (en) Clock tree optimization method, optimization device and related equipment
Pontes et al. Hermes-glp: A gals network on chip router with power control techniques
CN119623384B (en) Power consumption optimization method, integrated circuit and chip
US7945806B2 (en) Data processing apparatus and method for controlling a transfer of payload data over a communication channel
CN119323208A (en) Connection relation establishment method and device, storage medium and electronic equipment
US7082582B1 (en) Reducing clock skew in clock gating circuits
CN105334906A (en) Multistage gated clock network optimization method in nanometer technology
US7415685B2 (en) Method of verifying the power off effect of a design entity at register transfer level and method of modeling the power off effect
US20090150709A1 (en) Reducing Inefficiencies of Multi-Clock-Domain Interfaces Using a Modified Latch Bank
CN113449477B (en) Digital circuit connection method, digital circuit connection device, electronic equipment and storage medium
CN112580278B (en) Optimization method, optimization device and storage medium of logic circuit
Semba et al. A study on the design of interface circuits between synchronous-asynchronous modules using click elements
Agiwal et al. An architecture and a wrapper synthesis approach for multi-clock latency-insensitive systems

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant