[go: up one dir, main page]

CN112542442B - A low-cost multi-chip high-speed and high-bandwidth interconnect structure - Google Patents

A low-cost multi-chip high-speed and high-bandwidth interconnect structure Download PDF

Info

Publication number
CN112542442B
CN112542442B CN202011563925.9A CN202011563925A CN112542442B CN 112542442 B CN112542442 B CN 112542442B CN 202011563925 A CN202011563925 A CN 202011563925A CN 112542442 B CN112542442 B CN 112542442B
Authority
CN
China
Prior art keywords
chip
interconnection
bumps
speed
flip
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011563925.9A
Other languages
Chinese (zh)
Other versions
CN112542442A (en
Inventor
谢慧琴
李力游
蔡宗宇
小约翰·罗伯特·罗兰
陈希恒
韦红芳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Lanyang Intelligent Technology Co ltd
Original Assignee
Nanjing Lanyang Intelligent Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Lanyang Intelligent Technology Co ltd filed Critical Nanjing Lanyang Intelligent Technology Co ltd
Priority to CN202011563925.9A priority Critical patent/CN112542442B/en
Publication of CN112542442A publication Critical patent/CN112542442A/en
Application granted granted Critical
Publication of CN112542442B publication Critical patent/CN112542442B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01LSEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
    • H01L23/00Details of semiconductor or other solid state devices
    • H01L23/48Arrangements for conducting electric current to or from the solid state body in operation, e.g. leads, terminal arrangements ; Selection of materials therefor
    • H01L23/488Arrangements for conducting electric current to or from the solid state body in operation, e.g. leads, terminal arrangements ; Selection of materials therefor consisting of soldered or bonded constructions
    • H01L23/498Leads, i.e. metallisations or lead-frames on insulating substrates, e.g. chip carriers
    • H01L23/49811Additional leads joined to the metallisation on the insulating substrate, e.g. pins, bumps, wires, flat leads
    • H01L23/49816Spherical bumps on the substrate for external connection, e.g. ball grid arrays [BGA]
    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01LSEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
    • H01L23/00Details of semiconductor or other solid state devices
    • H01L23/48Arrangements for conducting electric current to or from the solid state body in operation, e.g. leads, terminal arrangements ; Selection of materials therefor
    • H01L23/488Arrangements for conducting electric current to or from the solid state body in operation, e.g. leads, terminal arrangements ; Selection of materials therefor consisting of soldered or bonded constructions
    • H01L23/498Leads, i.e. metallisations or lead-frames on insulating substrates, e.g. chip carriers
    • H01L23/49838Geometry or layout
    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01LSEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
    • H01L23/00Details of semiconductor or other solid state devices
    • H01L23/48Arrangements for conducting electric current to or from the solid state body in operation, e.g. leads, terminal arrangements ; Selection of materials therefor
    • H01L23/488Arrangements for conducting electric current to or from the solid state body in operation, e.g. leads, terminal arrangements ; Selection of materials therefor consisting of soldered or bonded constructions
    • H01L23/498Leads, i.e. metallisations or lead-frames on insulating substrates, e.g. chip carriers
    • H01L23/49838Geometry or layout
    • H01L23/49844Geometry or layout for individual devices of subclass H10D
    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01LSEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
    • H01L23/00Details of semiconductor or other solid state devices
    • H01L23/52Arrangements for conducting electric current within the device in operation from one component to another, i.e. interconnections, e.g. wires, lead frames
    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01LSEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
    • H01L23/00Details of semiconductor or other solid state devices
    • H01L23/52Arrangements for conducting electric current within the device in operation from one component to another, i.e. interconnections, e.g. wires, lead frames
    • H01L23/522Arrangements for conducting electric current within the device in operation from one component to another, i.e. interconnections, e.g. wires, lead frames including external interconnections consisting of a multilayer structure of conductive and insulating layers inseparably formed on the semiconductor body
    • H01L23/528Layout of the interconnection structure
    • H01L23/5283Cross-sectional geometry
    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01LSEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
    • H01L23/00Details of semiconductor or other solid state devices
    • H01L23/52Arrangements for conducting electric current within the device in operation from one component to another, i.e. interconnections, e.g. wires, lead frames
    • H01L23/535Arrangements for conducting electric current within the device in operation from one component to another, i.e. interconnections, e.g. wires, lead frames including internal interconnections, e.g. cross-under constructions
    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01LSEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
    • H01L23/00Details of semiconductor or other solid state devices
    • H01L23/52Arrangements for conducting electric current within the device in operation from one component to another, i.e. interconnections, e.g. wires, lead frames
    • H01L23/538Arrangements for conducting electric current within the device in operation from one component to another, i.e. interconnections, e.g. wires, lead frames the interconnection structure between a plurality of semiconductor chips being formed on, or in, insulating substrates
    • H01L23/5386Geometry or layout of the interconnection structure

Landscapes

  • Physics & Mathematics (AREA)
  • Condensed Matter Physics & Semiconductors (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Microelectronics & Electronic Packaging (AREA)
  • Power Engineering (AREA)
  • Geometry (AREA)
  • Wire Bonding (AREA)

Abstract

本发明公开了一种低成本多芯片高速高带宽互连结构,包括:复数个有源芯片、互连基板以及倒装互连凸点;所述互连基板用以实现各个有源芯片之间的金属互连和机械支撑;所述各个有源芯片之间通过超高速接口实现数据交换,共同承担人工智能算力;有源芯片通过所述倒装互连凸点与互连基板以倒装焊的方式连接。本发明不仅可以大幅提升人工智能系统的算力,还能极大地降低人工智能系统的制作成本。

The present invention discloses a low-cost multi-chip high-speed high-bandwidth interconnection structure, comprising: a plurality of active chips, an interconnection substrate and a flip-chip interconnection bump; the interconnection substrate is used to realize metal interconnection and mechanical support between each active chip; the active chips realize data exchange through an ultra-high-speed interface and jointly bear artificial intelligence computing power; the active chips are connected to the interconnection substrate in a flip-chip welding manner through the flip-chip interconnection bump. The present invention can not only greatly improve the computing power of the artificial intelligence system, but also greatly reduce the production cost of the artificial intelligence system.

Description

Low-cost multi-chip high-speed high-bandwidth interconnection structure
Technical Field
The invention discloses a low-cost multi-chip high-speed high-bandwidth interconnection structure, and relates to the technical field of system-level integration of artificial intelligent modules.
Background
Artificial intelligence (ARTIFICIAL INTELLIGENCE), abbreviated AI, is a scientific technology for studying and developing theory, methods, techniques and applications for simulating, extending and expanding human intelligence. With the continuous development of artificial intelligence technology in the fields of machine learning, high-performance computing, image recognition, genetic engineering and the like, the amount of data required to be processed by the artificial intelligence technology is increasing in a burst manner. In certain AI training applications, the bandwidth requirements of a single SoC may exceed a few TBs/s. Conventional chip and system integration methods have failed to meet the needs of artificial intelligence technology. At present, the technology of the silicon-based adapter board (Silicon Interposer) is a traditional technology of artificial intelligence system integration, and compared with known products, namely Tesla A100 of NVIDIA, radeon VII, google chip TPU3.0 and the like.
TW201826403A Methods of forming cowos structures discloses a method for producing a cumulative power COWOS (Chip on Wafer on Substrate). In order to improve the integration level, the SOC and HBM (High Bandwidth Memory) are integrated by adopting a silicon-based adapter plate to form a high-bandwidth computing module. However, silicon-based interposer technology is difficult and costly to manufacture. Also, this integration approach is theoretically limited in its upper limit, which is limited by the data processing capabilities of a single SOC.
The traditional artificial intelligence system integration technology is too dependent on the development of moore's law, designs a super chip with strong computing power by continuously improving the integration level of transistors of a single chip, selects a high-cost silicon-based adapter plate technology, and performs traditional packaging after being integrated with an HBM. However, as moore's law goes to end, the size of the transistor gets closer to the limit and the corresponding fabrication costs get higher. In addition, silicon-based interposer is not naturally suitable for large area integration due to the material properties of silicon.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides a low-cost multi-chip high-speed high-bandwidth interconnection structure.
The invention adopts the following technical scheme for solving the technical problems:
the low-cost multi-chip high-speed high-bandwidth interconnection structure comprises a plurality of active chips, an interconnection substrate and flip-chip interconnection bumps, wherein the interconnection substrate is used for realizing metal interconnection and mechanical support among the active chips, the active chips are used for realizing data exchange through an ultra-high-speed interface and jointly bearing artificial intelligence calculation force, and the active chips are connected with the interconnection substrate in a flip-chip welding mode through the flip-chip interconnection bumps.
The plurality of active chips are used for bearing the calculation force of the artificial intelligence system. If the calculation requirement of the system exceeds the data transmission quantity and calculation force of a single active chip, all the active chips can simultaneously acquire external transmission data, and all the active chips realize data exchange through an ultra-high speed interface and jointly bear artificial intelligence calculation force. The main advantage of the parallel computing is that the parallel computing using a plurality of identical active chips is creatively proposed, so that the computational power requirement of a single chip can be reduced, and the design and manufacturing cost of the chip are greatly reduced.
As a preferred scheme, the structures and functions of the active chips are identical, and the active chips are separate small systems which can be independently interconnected with external data or can be integrated according to an array. The number of integrated chips can be flexibly designed according to the calculation force requirement of the system. Different from different system integration of different chips, the array type integration mode greatly reduces the integration difficulty.
Preferably, the ultra-high speed interface for interconnecting the active chips comprises NRZ, PAM4, PAM8 and other high speed interconnection interfaces. The high-speed interface rate of a single channel which is commercially used at present can reach 112Gbps, and the high-speed high-bandwidth interconnection requirement can be met.
The metal interconnection comprises a horizontal interconnection and a vertical interconnection, wherein the horizontal interconnection is mainly provided by a metal wire, the metal interconnection can be manufactured by an exposure and etching method, and the vertical interconnection is manufactured by a laser blind buried hole or mechanical through hole method.
In particular embodiments, the choices available for the flip-chip interconnect bumps include C4 bumps, copper bumps, or other metal bumps.
Preferably, the flip-chip interconnection bumps are provided with a specific bump arrangement mode, wherein each pair of differential signal bumps are not adjacent, and the middle of each pair of differential signal bumps are separated by a pair of ground bumps. The specific bump array can greatly reduce crosstalk between different channels while not affecting the single channel interconnect performance.
Compared with the prior art, the technical scheme has the following technical effects that the multi-chip is integrated by utilizing the low-cost substrate technology, so that the parallel computation of the multi-chip is realized, and the computation power of the artificial intelligent system is greatly improved. So that the improvement of the system performance is not solely dependent on the improvement of the computing power of a single SOC. Under the condition of the same calculation power, compared with a single SOC chip architecture, the unique multi-chip parallel architecture can greatly reduce the design difficulty and the production cost.
Drawings
FIG. 1 is a cross-sectional view of a package structure in one embodiment of the invention;
FIG. 2 is a top view of a package structure including a plurality of active chips and their interconnections in accordance with one embodiment of the present invention;
FIG. 3 illustrates a single active die and bump arrangement in accordance with an embodiment of the present invention;
FIG. 4 is a graph of simulation results using a three-dimensional full wave electromagnetic field simulation tool in accordance with the present invention;
FIG. 5 is a diagram showing the results of an eye simulation at the receiving end in accordance with the present invention;
the semiconductor device comprises 101-active chips, 102-flip-chip interconnection bumps of the chips, 103-metal interconnection between the active chips, 104-substrate vertical electric interconnection, 105-substrate horizontal electric interconnection, 106-solder balls, 201-signal bumps and 202-power supply bumps.
Detailed Description
Embodiments of the present invention are described in detail below, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals refer to the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the drawings are exemplary only for explaining the present invention and are not to be construed as limiting the present invention.
The technical scheme of the invention is further described in detail below with reference to the accompanying drawings:
Referring to fig. 1-2, the present invention describes in detail a low-cost multi-chip high-speed high-bandwidth interconnection structure suitable for artificial intelligence system integration, which comprises a plurality of active chips, a low-cost interconnection substrate and flip-chip interconnection bumps, wherein the schematic structure is shown in fig. 1, and the top view is shown in fig. 2. The plurality of active chips 101 are each mounted on a low cost interconnect substrate for providing metal interconnection and mechanical support between the individual active chips 101. The low-cost interconnection substrate comprises an organic medium substrate or a low-temperature co-fired ceramic substrate or a high-temperature co-fired ceramic substrate. The metal interconnections 103 between the active chips may be implemented on an interconnection substrate, including a substrate horizontal electrical interconnection 105 in a horizontal direction and a substrate vertical electrical interconnection 104 in a vertical direction. And data exchange is realized among the active chips through an ultra-high speed interface, wherein the ultra-high speed interface comprises NRZ, PAM4, PAM8 and other high-speed interconnection interfaces. The flip-chip interconnect bumps 102 of the chip include C4 bumps, copper bumps, or other metal bumps through which the active chip is connected to the interconnect substrate. The flip-chip interconnect bumps should have a specific arrangement, as shown in fig. 2, where each pair of differential signal bumps are not adjacent, and are separated by a pair of ground bumps. The specific bump array can greatly reduce crosstalk between different channels while not affecting the single channel interconnect performance. The single active chip and its bump arrangement are shown in fig. 3, where signal bumps 201 are spaced apart from power bumps 202.
The invention adopts the three-dimensional full-wave electromagnetic field simulation tool to model the passive interconnection structure, and can accurately compare the transmission characteristics of different structures based on the traditional bump arrangement and the specific bump arrangement. According to the simulation result shown in fig. 4, in the passive structure corresponding to the specific bump array, the crosstalk between different channels is only below-40 dB at the nyquist frequency of 2 times, which is far better than the isolation of-30 dB based on the conventional bump array. Simulation results demonstrate that a particular bump arrangement facilitates interconnection between active chips.
The invention utilizes a time domain eye diagram simulation tool ADS to cascade a chip active model IBIS AMI model and S parameter models of all passive channels, and simulates an eye diagram result at a receiving end. According to the simulation result shown in fig. 5, when the transmission speed of a single channel is 112Gbps (only one differential pair of transmission data, and when the channel contains multiple differential pairs, the channel transmission data is multiplied), the eye diagram of the receiving end is still clearly open, which proves the possibility of high-speed and high-bandwidth transmission between active chips.
The embodiments of the present invention have been described in detail with reference to the drawings, but the present invention is not limited to the above embodiments, and various changes can be made within the knowledge of those skilled in the art without departing from the spirit of the present invention. The present invention is not limited to the preferred embodiments, and the present invention is described above in any way, but is not limited to the preferred embodiments, and any person skilled in the art will appreciate that the present invention is not limited to the embodiments described above, while the invention has been described with respect to specific examples including presently preferred modes of carrying out the invention, those skilled in the art will appreciate that there are numerous variations and permutations of the above described embodiments that fall within the spirit and scope of the invention as set forth in the appended claims.

Claims (4)

1.一种低成本多芯片高速高带宽互连结构,其特征在于,所述互连结构包括:复数个有源芯片、互连基板以及倒装互连凸点,其中,1. A low-cost multi-chip high-speed high-bandwidth interconnect structure, characterized in that the interconnect structure comprises: a plurality of active chips, an interconnect substrate and flip-chip interconnect bumps, wherein: 所述互连基板用以实现各个有源芯片之间的金属互连和机械支撑;The interconnection substrate is used to realize metal interconnection and mechanical support between various active chips; 所述金属互连包括水平方向互连和垂直方向互连;The metal interconnection includes horizontal interconnection and vertical interconnection; 水平方向通过金属导线,以曝光和刻蚀的方法制作,实现电互连;In the horizontal direction, metal wires are made by exposure and etching to achieve electrical interconnection; 垂直方向制作激光盲埋孔或者机械通孔,实现电互连;Make laser blind buried vias or mechanical through holes in the vertical direction to achieve electrical interconnection; 所述各个有源芯片之间通过超高速接口实现数据交换,共同承担人工智能算力;The active chips exchange data through ultra-high-speed interfaces and jointly undertake artificial intelligence computing power; 复数个有源芯片,每个有源芯片的架构和功能一致,每个有缘芯片独立与外部数据进行互连、独立承担人工智能算力,通过超高速接口并行传输和计算;Multiple active chips, each with the same architecture and functions, each independently interconnects with external data, independently undertakes artificial intelligence computing power, and transmits and calculates in parallel through ultra-high-speed interfaces; 有源芯片通过所述倒装互连凸点与互连基板以倒装焊的方式连接;The active chip is connected to the interconnection substrate in a flip-chip soldering manner through the flip-chip interconnection bumps; 所述倒装互连凸点的排列方式为:每一对差分信号凸点均不相邻,中间均使用一对地凸点隔开;The arrangement of the flip-chip interconnect bumps is as follows: each pair of differential signal bumps are not adjacent to each other and are separated by a pair of ground bumps; 单个有源芯片中,信号凸点和电源凸点间隔排列。In a single active chip, signal bumps and power bumps are arranged alternately. 2.如权利要求1所述的一种低成本多芯片高速高带宽互连结构,其特征在于:所述互连基板包括有机介质基板、低温共烧陶瓷基板或者高温共烧陶瓷基板。2. A low-cost multi-chip high-speed high-bandwidth interconnect structure as described in claim 1, characterized in that: the interconnect substrate comprises an organic dielectric substrate, a low-temperature co-fired ceramic substrate or a high-temperature co-fired ceramic substrate. 3.如权利要求1所述的一种低成本多芯片高速高带宽互连结构,其特征在于:所述倒装互连凸点,包括C4凸点、铜凸点或者其他金属凸点。3. A low-cost multi-chip high-speed high-bandwidth interconnect structure as described in claim 1, characterized in that: the flip-chip interconnect bumps include C4 bumps, copper bumps or other metal bumps. 4.如权利要求1所述的一种低成本多芯片高速高带宽互连结构,其特征在于:所述超高速接口包括NRZ接口、PAM4接口或者PAM8接口,及其他PAMN接口,N为整数。4. A low-cost multi-chip high-speed and high-bandwidth interconnect structure as described in claim 1, characterized in that: the ultra-high-speed interface includes an NRZ interface, a PAM4 interface or a PAM8 interface, and other PAMN interfaces, and N is an integer.
CN202011563925.9A 2020-12-25 2020-12-25 A low-cost multi-chip high-speed and high-bandwidth interconnect structure Active CN112542442B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011563925.9A CN112542442B (en) 2020-12-25 2020-12-25 A low-cost multi-chip high-speed and high-bandwidth interconnect structure

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011563925.9A CN112542442B (en) 2020-12-25 2020-12-25 A low-cost multi-chip high-speed and high-bandwidth interconnect structure

Publications (2)

Publication Number Publication Date
CN112542442A CN112542442A (en) 2021-03-23
CN112542442B true CN112542442B (en) 2024-12-13

Family

ID=75018180

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011563925.9A Active CN112542442B (en) 2020-12-25 2020-12-25 A low-cost multi-chip high-speed and high-bandwidth interconnect structure

Country Status (1)

Country Link
CN (1) CN112542442B (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101047160A (en) * 2006-03-29 2007-10-03 台湾积体电路制造股份有限公司 Semiconductor wiring packaging structure and connection method with integrated circuit

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10173002A (en) * 1996-12-06 1998-06-26 Toshiba Corp Bare chip flip chip bonding method
US7342310B2 (en) * 2004-05-07 2008-03-11 Avago Technologies General Ip Pte Ltd Multi-chip package with high-speed serial communications between semiconductor die
JP2012033786A (en) * 2010-07-31 2012-02-16 Kyocer Slc Technologies Corp Wiring board

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101047160A (en) * 2006-03-29 2007-10-03 台湾积体电路制造股份有限公司 Semiconductor wiring packaging structure and connection method with integrated circuit

Also Published As

Publication number Publication date
CN112542442A (en) 2021-03-23

Similar Documents

Publication Publication Date Title
US20240321833A1 (en) Fully Interconnected Heterogeneous Multi-layer Reconstructed Silicon Device
Chen et al. A novel system in package with fan-out WLP for high speed SERDES application
CN107112301A (en) Electronic component integration on the dorsal part of semiconductor chip
US11652035B2 (en) Multi-pitch ball grid array
Cao et al. Advanced fanout packaging technology for hybrid substrate integration
Koyanagi Heterogeneous 3D integration—Technology enabler toward future super-chip
KR101780145B1 (en) Method of component partitions on system on chip and device thereof
US10249596B1 (en) Fan-out in ball grid array (BGA) package
Liu et al. Fan-out embedded bridge solution in HPC application
CN114400209B (en) Interposer, semiconductor packaging device and electronic device
CN112542442B (en) A low-cost multi-chip high-speed and high-bandwidth interconnect structure
CN115458479A (en) Manufacturing method of high-bandwidth bare chip and high-bandwidth bare chip
Chujo et al. Bumpless build cube (BBCube) 3D: Heterogeneous 3D integration using WoW and CoW to provide TB/s bandwidth with lowest bit access energy
CN113097179B (en) Packaging structure with medium layer
CN117497516B (en) High-density optoelectronic integrated three-dimensional packaging structure and manufacturing method thereof
CN118260235B (en) Computing power accelerator card, design method and computing power server
Hossen et al. Thermomechanical analysis and package-level optimization of mechanically flexible interconnects for interposer-on-motherboard assembly
CN118748170A (en) Module-based functional wafer implementation method and a functional wafer
Zoschke et al. Key Technologies and Design Aspects for Wafer Level Packaging of High Performance Computing Modules
CN116314151B (en) Chip packaging components and electronic devices
CN117457619A (en) A semiconductor device based on high-bandwidth interconnection technology
Huang et al. HBM3 PPA performance evaluation by TSV model with micro-bump and hybrid bonding
TWI811103B (en) Communication interface structure and die-to-die package
CN219800840U (en) A power supply structure for integrated chip systems
Araga et al. Landside capacitor efficacy among multi-chip-module using Si-interposer

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant