CN117370255A - Error reporting structure of multi-port PCIe bridging chip - Google Patents
Error reporting structure of multi-port PCIe bridging chip Download PDFInfo
- Publication number
- CN117370255A CN117370255A CN202311334869.5A CN202311334869A CN117370255A CN 117370255 A CN117370255 A CN 117370255A CN 202311334869 A CN202311334869 A CN 202311334869A CN 117370255 A CN117370255 A CN 117370255A
- Authority
- CN
- China
- Prior art keywords
- error reporting
- pcie
- error
- port
- module
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
- 
        - G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F13/00—Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
- G06F13/38—Information transfer, e.g. on bus
- G06F13/42—Bus transfer protocol, e.g. handshake; Synchronisation
- G06F13/4282—Bus transfer protocol, e.g. handshake; Synchronisation on a serial bus, e.g. I2C bus, SPI bus
 
- 
        - G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2213/00—Indexing scheme relating to interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
- G06F2213/0026—PCI express
 
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Debugging And Monitoring (AREA)
Abstract
Description
技术领域Technical field
本发明涉及集成电路技术领域,特别涉及一种多端口PCIe桥接芯片错误上报架构。The invention relates to the technical field of integrated circuits, and in particular to a multi-port PCIe bridge chip error reporting architecture.
背景技术Background technique
PCI-Express是一种高速串行计算机扩展总线标准,因其高数据传输速率、高带宽、高数据传输可靠性和数据完整性、高兼容性以及支持并行数据通道的特性广泛的应用于计算机的显卡、网络适配器和存储等硬件设备。PCIe Switch(交换机)是PCIe中最常见的设备类型之一,它可以将一个PCIe总线分成多个子总线,其有1个上游端口和大于等于2个下游端口,可将数据从一个端点设备传输到另一个端点设备,同时控制数据的流向和速度。PCIe Switch芯片可以在CPU提供的PCIe通道数量不足时扩展系统中的PCIe通道数量,提高扩展能力和灵活性。PCI-Express is a high-speed serial computer expansion bus standard. It is widely used in computers because of its high data transfer rate, high bandwidth, high data transmission reliability and data integrity, high compatibility and support for parallel data channels. Hardware devices such as graphics cards, network adapters, and storage. PCIe Switch (switch) is one of the most common device types in PCIe. It can divide a PCIe bus into multiple sub-buses. It has 1 upstream port and more than or equal to 2 downstream ports. It can transmit data from an endpoint device to Another endpoint device that controls both the direction and speed of data flow. The PCIe Switch chip can expand the number of PCIe lanes in the system when the number of PCIe lanes provided by the CPU is insufficient, improving expansion capabilities and flexibility.
在PCIe传输过程中会出现一些错误,这些错误大致可分为可校正错误和不可校正错误,不可校正错误又分为致命错误和非致命错误。其中可校正错误是可以自动被硬件识别并被自动校正或恢复。非致命错误可能会导致特定的传输变得不可靠,但是链路和硬件的其它功能不受影响。而致命错误则会导致链路和硬件异常,只有通过系统进行复位操作才能实现恢复。如果这些错误不及时解决,大量报错会造成系统宕机,给PCIe传输造成极大影响。Some errors will occur during the PCIe transmission process. These errors can be roughly divided into correctable errors and uncorrectable errors. Uncorrectable errors are further divided into fatal errors and non-fatal errors. Among them, correctable errors can be automatically recognized by the hardware and automatically corrected or restored. Nonfatal errors may cause a specific transmission to become unreliable, but other functions of the link and hardware are not affected. Fatal errors will cause link and hardware abnormalities, and recovery can only be achieved through a system reset. If these errors are not resolved in time, a large number of error reports will cause system downtime and have a great impact on PCIe transmission.
多端口PCIe桥接芯片中包括了多个PCIe Switch控制器,且多端口PCIe桥接芯片还有Virtual Switch(虚拟交换机)模式,该模式拥有多个PCIe Switch(交换机)上游端口,上下游端口间的通信变化较多,产生PCIe错误的情况会更复杂。因此针对多端口PCIe桥接芯片中每个端口接收或者检测到的PCIe错误进行集中上报的问题,需要专门提供一种错误上报设计来实现整个PCIe桥接芯片的错误上报功能。The multi-port PCIe bridge chip includes multiple PCIe Switch controllers, and the multi-port PCIe bridge chip also has a Virtual Switch mode. This mode has multiple PCIe Switch upstream ports and communication between the upstream and downstream ports. There are more changes, and the situations that cause PCIe errors will be more complicated. Therefore, in order to solve the problem of centralized reporting of PCIe errors received or detected by each port in the multi-port PCIe bridge chip, a special error reporting design needs to be provided to implement the error reporting function of the entire PCIe bridge chip.
发明内容Contents of the invention
本发明的目的在于提供一种多端口PCIe桥接芯片错误上报架构,以解决现有技术无法将PCIE Switch的多个下游端口收到或者检测到的PCIE Error上报给RC(RootComplex,相应根联合体)的问题。The purpose of the present invention is to provide a multi-port PCIe bridge chip error reporting architecture to solve the problem that the existing technology cannot report the PCIE Error received or detected by multiple downstream ports of the PCIE Switch to the RC (RootComplex, corresponding root complex) The problem.
为解决上述技术问题,本发明提供了一种多端口PCIe桥接芯片错误上报架构,包括错误上报接口模块、错误上报总线模块和错误上报模块,其中:In order to solve the above technical problems, the present invention provides a multi-port PCIe bridge chip error reporting architecture, including an error reporting interface module, an error reporting bus module and an error reporting module, wherein:
所述错误上报接口模块连接多端口PCIe桥接芯片中PCIe虚拟交换机的上下游端口,所述错误上报接口模块对从多个PCIE交换机下游端口接收到的信号进行跨时钟和缓存处理;The error reporting interface module is connected to the upstream and downstream ports of the PCIe virtual switch in the multi-port PCIe bridge chip, and the error reporting interface module performs cross-clock and cache processing on signals received from the downstream ports of multiple PCIe switches;
所述错误上报总线模块连接所述错误上报接口模块,所述错误上报模块连接所述错误上报总线模块。The error reporting bus module is connected to the error reporting interface module, and the error reporting module is connected to the error reporting bus module.
在一种实施方式中,所述错误上报接口模块连接多个PCIe交换机的下游端口,在虚拟交换机模式下支持多个PCIe虚拟交换机的错误上报。In one implementation, the error reporting interface module is connected to downstream ports of multiple PCIe switches, and supports error reporting of multiple PCIe virtual switches in virtual switch mode.
在一种实施方式中,所述错误上报接口模块将与之相连的PCIe虚拟交换机的下游端口收到的错误信息、检测到的不同级别的PCIe错误以及端口信息缓存至所述错误上报接口模块的Ingress FIFO中,同时会从所述错误上报接口模块的Egress FIFO中将错误上报请求和错误信息发送至对应PCIe虚拟交换机的上游端口;In one implementation, the error reporting interface module caches the error information received by the downstream port of the PCIe virtual switch connected to it, the detected PCIe errors of different levels, and the port information to the error reporting interface module. In the Ingress FIFO, the error reporting request and error information will be sent from the Egress FIFO of the error reporting interface module to the upstream port of the corresponding PCIe virtual switch;
所述错误上报接口模块同时进行跨时钟处理,使PCIe虚拟交换机下游端口时钟域的信号处理后同步到相应的上游端口的时钟域。The error reporting interface module simultaneously performs cross-clock processing, so that the signals in the clock domain of the downstream port of the PCIe virtual switch are synchronized to the clock domain of the corresponding upstream port after processing.
在一种实施方式中,所述错误上报总线模块将所述Ingress FIFO中存储的PCIe虚拟交换机下游端口的不同级别的错误信息和错误检测提示信号输出至所述错误上报模块进行处理,同时将所述错误上报模块输出的错误信息存储至所述Egress FIFO中,达到管理PCIe虚拟交换机的端口数据通路互连的目的。In one implementation, the error reporting bus module outputs different levels of error information and error detection prompt signals of the PCIe virtual switch downstream ports stored in the Ingress FIFO to the error reporting module for processing, and at the same time The error information output by the error reporting module is stored in the Egress FIFO to achieve the purpose of managing the port data path interconnection of the PCIe virtual switch.
在一种实施方式中,所述错误上报模块将所述Ingress FIFO中存储的错误信息和PCIe错误信号进行转换与路由处理产生相应的错误信息和错误上报请求,再对错误上报请求进行仲裁和反压处理后发送给所述错误上报总线模块。In one implementation, the error reporting module converts and routes the error information and PCIe error signals stored in the Ingress FIFO to generate corresponding error information and error reporting requests, and then arbitrates and responds to the error reporting requests. After pressure processing, it is sent to the error reporting bus module.
在一种实施方式中,根据端口配置信息和PCIe错误类型将PCIe虚拟交换机下游端口检测到的PCIe错误产生对应的错误信息和错误上报请求,并将请求发送给所述错误上报总线模块,或将下游端口收到的错误信息直接路由给所述错误上报总线模块。In one implementation, the PCIe error detected on the downstream port of the PCIe virtual switch generates corresponding error information and error reporting request according to the port configuration information and PCIe error type, and sends the request to the error reporting bus module, or sends the request to the error reporting bus module. The error information received by the downstream port is directly routed to the error reporting bus module.
在一种实施方式中,根据所述Egress FIFO的非满信号对多个错误上报请求进行仲裁,同时根据PCIe虚拟交换机上游端口的反馈信号进行反压,在收到上游端口的反馈信号前缓存后续的错误上报请求,在收到上游端口的反馈信号后读取发送后续的错误上报请求。In one implementation, multiple error reporting requests are arbitrated based on the not-full signal of the Egress FIFO, and at the same time, back pressure is performed based on the feedback signal from the upstream port of the PCIe virtual switch, and subsequent errors are cached before receiving the feedback signal from the upstream port. Error reporting request, after receiving the feedback signal from the upstream port, read and send subsequent error reporting requests.
本发明提供的一种多端口PCIe桥接芯片错误上报架构,可以将多个PCIe虚拟交换机下游端口检测到的PCIe错误进行缓存,根据缓存信息产生错误上报请求和相应的错误信息,通过对应PCIe虚拟交换机的上游端口上报给RC,或者将多个PCIe虚拟交换机下游端口收到的错误信息路由转发给对应PCIe虚拟交换机的上游端口,再通过对应PCIe虚拟交换机的上游端口上报给RC,方便RC及时定位、分析和纠正错误,避免因为错误造成系统宕机问题,增强系统的健壮性和可靠性,可用于实现多端口PCIe桥接芯片的错误上报功能。The invention provides a multi-port PCIe bridge chip error reporting architecture that can cache PCIe errors detected by the downstream ports of multiple PCIe virtual switches, generate error reporting requests and corresponding error information based on the cached information, and pass the corresponding PCIe virtual switch The upstream port of the corresponding PCIe virtual switch is reported to the RC, or the error information received by the downstream ports of multiple PCIe virtual switches is routed and forwarded to the upstream port of the corresponding PCIe virtual switch, and then reported to the RC through the upstream port of the corresponding PCIe virtual switch, which facilitates the timely positioning of the RC. Analyze and correct errors to avoid system downtime caused by errors, enhance the robustness and reliability of the system, and can be used to implement the error reporting function of multi-port PCIe bridge chips.
附图说明Description of the drawings
图1为本发明提供的多端口PCIE桥接芯片错误上报架构的总体框架示意图;Figure 1 is a schematic diagram of the overall framework of the multi-port PCIE bridge chip error reporting architecture provided by the present invention;
图2为本发明提供的错误上报模块的设计架构示意图;Figure 2 is a schematic diagram of the design architecture of the error reporting module provided by the present invention;
图3为本发明提供的多端口PCIE桥接芯片错误上报流程示意图。Figure 3 is a schematic diagram of the multi-port PCIE bridge chip error reporting process provided by the present invention.
具体实施方式Detailed ways
以下结合附图和具体实施例对本发明提出的一种多端口PCIe桥接芯片错误上报架构作进一步详细说明。根据下面说明,本发明的优点和特征将更清楚。需说明的是,附图均采用非常简化的形式且均使用非精准的比例,仅用以方便、明晰地辅助说明本发明实施例的目的。The multi-port PCIe bridge chip error reporting architecture proposed by the present invention will be further described in detail below with reference to the accompanying drawings and specific embodiments. The advantages and features of the present invention will become clearer from the following description. It should be noted that the drawings are in a very simplified form and use imprecise proportions, and are only used to conveniently and clearly assist in explaining the embodiments of the present invention.
如图1所示为本发明提供的多端口PCIE桥接芯片错误上报架构的总体框架示意图,主要包括错误上报接口模块,错误上报总线模块和错误上报模块,其中错误上报接口模块连接多端口PCIe桥接芯片中PCIE Virtual Switch(虚拟交换机)的上下游端口,该错误上报接口模块对从多个PCIE Virtual Switch下游端口接收到的信号进行跨时钟和缓存处理,跨时钟处理的目的是以防PCIE Virtual Switch上下游端口的时钟频率不同,因此需要将从下游端口接收到的信号从下游端口时钟域同步到需要上报的上游端口的时钟域,方便通过上游端口将ErrorMessage(错误信息)上报给RC。缓存处理的逻辑是根据端口信息、PCIE下游端口接收到的不同级别的Error Message、检测到的不同级别的PCIE错误和异步FIFO的非空信号产生Ingress FIFO的写使能信号和输入信号,再根据异步FIFO的非空信号产生Engress FIFO的读使能信号,将Engress FIFO中的ErrorMessage发送给上游端口。Figure 1 is a schematic diagram of the overall framework of the error reporting architecture of the multi-port PCIE bridge chip provided by the present invention. It mainly includes an error reporting interface module, an error reporting bus module and an error reporting module. The error reporting interface module is connected to the multi-port PCIe bridge chip. The error reporting interface module performs cross-clock and cache processing on the signals received from multiple PCIE Virtual Switch downstream ports. The purpose of cross-clock processing is to prevent the PCIE Virtual Switch from being blocked. The clock frequencies of the downstream ports are different, so the signal received from the downstream port needs to be synchronized from the downstream port clock domain to the clock domain of the upstream port that needs to be reported, so that the ErrorMessage (error message) can be reported to the RC through the upstream port. The logic of cache processing is to generate the write enable signal and input signal of the Ingress FIFO based on the port information, the different levels of Error Messages received by the PCIE downstream port, the different levels of detected PCIE errors and the non-empty signal of the asynchronous FIFO, and then based on The non-empty signal of the asynchronous FIFO generates the read enable signal of the Engress FIFO, and sends the ErrorMessage in the Engress FIFO to the upstream port.
错误上报总线模块中的Ingress FIFO读控制逻辑是根据异步FIFO的非空信号产生Ingress FIFO的读使能信号,将Ingress FIFO中缓存的信号发送给错误上报模块进行处理。Engress FIFO写控制逻辑根据错误上报模块中的仲裁结果产生Engress FIFO的写使能信号,将错误上报模块处理完的信号写入Engress FIFO中。The Ingress FIFO read control logic in the error reporting bus module generates the Ingress FIFO read enable signal based on the non-empty signal of the asynchronous FIFO, and sends the buffered signal in the Ingress FIFO to the error reporting module for processing. The Engress FIFO write control logic generates the write enable signal of the Engress FIFO based on the arbitration result in the error reporting module, and writes the signal processed by the error reporting module into the Engress FIFO.
错误上报模块主要将从Ingress FIFO中读取的PCIE错误信息进行转化和仲裁处理后再写入Engress FIFO中。The error reporting module mainly converts and arbitrates the PCIE error information read from the Ingress FIFO and then writes it into the Ingress FIFO.
图2为多端口PCIE桥接芯片中错误上报模块的设计架构图,错误上报模块根据端口配置信号和PCIE错误类型将PCIE Virtual Switch下游端口检测到的错误产生对应的ErrorMessage和错误上报请求,或将下游端口收到的Error Message产生错误上报请求,同时该错误上报模块会对多个错误上报请求进行仲裁,并根据上下游端口配置信号判断在Virtual Switch模式下ErrorMessage的来源和去向,再通过对应VS(Virtual Switch)的错误上报子模块发送给错误上报总线模块。同时根据PCIE Virtual Switch上游端口的反馈信号进行反压,在收到上游端口的反馈信号前缓存后续的Error Message请求,在收到上游端口的反馈信号后读取发送后续的Error Message请求。Figure 2 is the design architecture diagram of the error reporting module in the multi-port PCIE bridge chip. The error reporting module generates corresponding ErrorMessage and error reporting requests for errors detected by the PCIE Virtual Switch downstream port according to the port configuration signal and PCIE error type, or reports the error to the downstream port. The Error Message received by the port generates an error reporting request. At the same time, the error reporting module arbitrates multiple error reporting requests, and determines the source and destination of the Error Message in Virtual Switch mode based on the upstream and downstream port configuration signals, and then passes the corresponding VS( The error reporting sub-module of Virtual Switch sends it to the error reporting bus module. At the same time, back pressure is performed based on the feedback signal of the PCIE Virtual Switch upstream port, and subsequent Error Message requests are cached before receiving the feedback signal from the upstream port. After receiving the feedback signal from the upstream port, subsequent Error Message requests are read and sent.
图3为本发明的多端口PCIE桥接芯片错误上报流程示意图,主要包括了PCIE上下游端口检测或者接收到的PCIE错误的处理及上报流程。Figure 3 is a schematic diagram of the error reporting process of the multi-port PCIE bridge chip of the present invention, which mainly includes the processing and reporting process of PCIE errors detected or received by PCIE upstream and downstream ports.
上述描述仅是对本发明较佳实施例的描述,并非对本发明范围的任何限定,本发明领域的普通技术人员根据上述揭示内容做的任何变更、修饰,均属于权利要求书的保护范围。The above description is only a description of the preferred embodiments of the present invention, and does not limit the scope of the present invention in any way. Any changes or modifications made by those of ordinary skill in the field of the present invention based on the above disclosure shall fall within the scope of the claims.
Claims (7)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title | 
|---|---|---|---|
| CN202311334869.5A CN117370255A (en) | 2023-10-16 | 2023-10-16 | Error reporting structure of multi-port PCIe bridging chip | 
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title | 
|---|---|---|---|
| CN202311334869.5A CN117370255A (en) | 2023-10-16 | 2023-10-16 | Error reporting structure of multi-port PCIe bridging chip | 
Publications (1)
| Publication Number | Publication Date | 
|---|---|
| CN117370255A true CN117370255A (en) | 2024-01-09 | 
Family
ID=89394063
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date | 
|---|---|---|---|
| CN202311334869.5A Pending CN117370255A (en) | 2023-10-16 | 2023-10-16 | Error reporting structure of multi-port PCIe bridging chip | 
Country Status (1)
| Country | Link | 
|---|---|
| CN (1) | CN117370255A (en) | 
- 
        2023
        - 2023-10-16 CN CN202311334869.5A patent/CN117370255A/en active Pending
 
Similar Documents
| Publication | Publication Date | Title | 
|---|---|---|
| CN101814060B (en) | Method and apparatus to facilitate system to system protocol exchange in back to back non-transparent bridges | |
| KR100611268B1 (en) | Enhanced General I / O Architecture and Related Methods for Virtual Channel Setup | |
| US7065672B2 (en) | Apparatus and methods for fault-tolerant computing using a switching fabric | |
| US8751714B2 (en) | Implementing quickpath interconnect protocol over a PCIe interface | |
| US6263397B1 (en) | Mechanism for delivering interrupt messages | |
| US7143227B2 (en) | Broadcast bridge apparatus for transferring data to redundant memory subsystems in a storage controller | |
| US7821919B2 (en) | Data processing apparatus and data processing method | |
| US7653766B1 (en) | Time-gap defect detection apparatus and method | |
| CN117692535B (en) | PCIe protocol message order preserving device | |
| US6182267B1 (en) | Ensuring accurate data checksum | |
| WO2012143953A4 (en) | Optimized multi-root input output virtualization aware switch | |
| JP2020035419A (en) | Error checking for primary signal transmitted between first and second clock domains | |
| WO2006128105A2 (en) | Method of preventing error propagation in a pci / pci-x / pci express link | |
| US7181561B2 (en) | Ordering rule controlled command storage | |
| CN105357147A (en) | High-speed and high-reliability network-on-chip adapter unit | |
| US6256699B1 (en) | Reliable interrupt reception over buffered bus | |
| CN111858413B (en) | Data scheduling method and device for PCIE switching chip port | |
| CN117785783A (en) | High-reliability high-speed interconnected core particle for aerospace | |
| KR20040041644A (en) | Error forwarding in an enhanced general input/output architecture | |
| US7624324B2 (en) | File control system and file control device | |
| US20180081838A1 (en) | Multi-source data insertion using an elastic fifo | |
| US6604161B1 (en) | Translation of PCI level interrupts into packet based messages for edge event drive microprocessors | |
| US20070030862A1 (en) | Request processing between failure windows | |
| US7272745B2 (en) | Data protection system | |
| CN117370255A (en) | Error reporting structure of multi-port PCIe bridging chip | 
Legal Events
| Date | Code | Title | Description | 
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination |