[go: up one dir, main page]

WO2007068139A1 - System and method for controlling the media stream of the multiparty video communication - Google Patents

System and method for controlling the media stream of the multiparty video communication Download PDF

Info

Publication number
WO2007068139A1
WO2007068139A1 PCT/CN2005/002152 CN2005002152W WO2007068139A1 WO 2007068139 A1 WO2007068139 A1 WO 2007068139A1 CN 2005002152 W CN2005002152 W CN 2005002152W WO 2007068139 A1 WO2007068139 A1 WO 2007068139A1
Authority
WO
WIPO (PCT)
Prior art keywords
unit
real
communication terminal
communication
server
Prior art date
Application number
PCT/CN2005/002152
Other languages
French (fr)
Chinese (zh)
Inventor
Zhengxun Xia
Jindong Wang
Jianzhang Ye
Chengjin Pan
Original Assignee
Zte Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zte Corporation filed Critical Zte Corporation
Priority to PCT/CN2005/002152 priority Critical patent/WO2007068139A1/en
Priority to CN2005800516476A priority patent/CN101273631B/en
Publication of WO2007068139A1 publication Critical patent/WO2007068139A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • H04N7/152Multipoint control units therefor
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/23424Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving splicing one content stream with another content stream, e.g. for inserting or substituting an advertisement
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/23614Multiplexing of additional data and video streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H04N21/4348Demultiplexing of additional data and video streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44016Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving splicing one content stream with another content stream, e.g. for substituting a video clip
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • H04N21/6437Real-time Transport Protocol [RTP]

Definitions

  • the present invention relates to the field of multimedia technologies, and relates to a multi-party video communication media stream control system and method, and more particularly to a static image and a real-time stream in real-time multi-party video communication.
  • TS stream Transport Stream
  • Streaming media refers to continuous time-based media, such as audio, video or multimedia files, that use streaming technology in the network.
  • Streaming media technology has been greatly used in video telephony and video conferencing systems.
  • each communication terminal communicates through a streaming media server.
  • a communication terminal usually has a plurality of communication links to receive audio and video data of other communication terminals. As a communication center, it is very burdensome.
  • an object of the present invention is to provide a multi-party video communication media stream control system and method.
  • the invention fully considers the characteristics of the multi-party video communication system, and proposes a System and method for hybrid transmission of static images and TS streams and a system and method for seamlessly switching and controlling media streams, which can provide sufficient information while reducing the load of servers and communication terminals, thereby simplifying the process of multi-party video communication .
  • the present invention mainly proposes a system and method for solving the following three problems:
  • the communication terminal receives the static image of the other communication terminal, and selects the main media source concerned by it;
  • a multi-party video communication media stream control system of the present invention includes a media stream transmission control path for transmitting audio and video data and a control signal; and a communication terminal responsible for transmitting and receiving the audio and video data and the control signal;
  • the server is responsible for providing the media stream service for the video communication terminal in response to the above control signal;
  • the communication terminal includes: Real Time Transport Control Protocol (RTCP) unit, Real Time Transport Protocol (hereinafter) RTP) output unit and RTP input unit, TS coding unit, intra-framed frame (I-frame, also called intra picture) (hereinafter referred to as I frame) coding unit, media buffer unit, TS decoding unit, I a frame decoding unit and a flow control unit;
  • the server includes: an I frame storage unit, a flow control unit, and a communication subunit, wherein the communication subunit includes: an RTCP transmission unit, an RTP output unit, and an RTP input unit, an I frame extraction unit, and an RTP Unpacking unit, media buffer Meta and media source
  • the RTP output unit of the communication terminal is responsible for transmitting the audio and video data of the communication terminal to the RTP input unit of the communication subunit corresponding to the communication terminal in the server; the RTP input unit of the communication terminal is responsible for receiving from the server Audio and video data of other communication terminals;
  • the TS coding unit of the communication terminal is responsible for TS stream coding of the communication terminal;
  • the I frame coding unit of the communication terminal is responsible for periodically encoding the static image of the communication terminal, and forming I frame data output to the RTP output unit of the communication terminal, and using a specific Payload type in the above RTP packet to mark the I frame data of the communication terminal;
  • the media buffer unit of the communication terminal is responsible for receiving data of the RTP input unit of the communication terminal;
  • the TS decoding unit of the communication terminal is responsible for extracting the TS stream from the media buffer unit and decoding it;
  • the I frame decoding unit of the communication terminal is configured to extract the RTP packet having the specific Payload type from the media buffer unit, and decode the I frame data to form a still image;
  • the flow control unit of the communication terminal is responsible for the flow with the server.
  • the control unit communicates, notifies the server of the main media source currently used by the communication terminal or the instruction of the TS coding unit that starts the communication terminal of the server;
  • the I frame storage unit of the server is responsible for storing I frame data of each communication terminal;
  • the flow control unit of the server is responsible for controlling the link of the audio and video streams of each communication terminal;
  • the communication subunit of the server is responsible for establishing a communication link with the communication terminal. Transmitting the audio and video data and the control signal, and outputting the video I frame data to the I frame storage unit of the server, and acquiring the media from the RTP unpacking unit of the I frame storage unit of the server and other communication subunits of the server
  • Source data is input to a media source unit of the communication subunit;
  • the RTP output unit of the communication subunit of the server is responsible for transmitting audio and video data of other communication terminals to the communication terminal corresponding to the communication subunit;
  • the RTP input unit of the communication subunit of the server is responsible for receiving audio and video data of the communication terminal;
  • the I frame extracting unit of the communication subunit of the server is responsible for extracting I frame data according to a specific Payload type of the RTP packet, and storing it in the I frame storage unit of the server; if the I frame extracting unit is not found For the above type of RTP packet, the I frame data is directly extracted from the data in the RTP unpacking unit of the communication subunit of the server; for example: if the current video adopts the MPEG4 encoding format, the vop-coding-type or group_of may be adopted.
  • the -vop_start_code field finds the I frame data;
  • the RTP unpacking unit of the communication subunit of the server is responsible for removing the RTP header of the received audio and video data after filtering out the data packet required by the I frame extracting unit of the communication subunit of the server, and outputting the RTP header. Go to the media buffer unit of the communication subunit;
  • the media buffer unit of the communication subunit of the server is responsible for receiving data output by the RTP unpacking unit of the communication subunit of the server;
  • the media source unit of the communication subunit of the server is responsible for acquiring audio and video data required by the communication terminal corresponding to the communication subunit from the I frame storage unit of the server and the media buffer unit of the communication subunit corresponding to the other communication terminal.
  • the RTP output unit of the communication terminal is respectively connected to the TS coding unit and the I frame coding unit of the communication terminal;
  • the RTP input unit of the communication terminal is connected to the media buffer unit of the communication terminal; the media buffer unit of the communication terminal is respectively connected to the TS decoding unit and the I frame decoding unit of the communication terminal;
  • the I frame extracting unit of the communication subunit of the server is respectively connected to the I frame storage unit of the server, the RTP input unit of the communication subunit of the server, and the RTP unpacking unit; the RTP unpacking unit of the communication subunit of the server respectively Connecting to the RTP input unit, the I frame extracting unit, and the media buffer unit of the communication subunit of the server;
  • the media source units of the communication subunits of the server are respectively connected to the RTP output unit of the communication subunit of the server, the I frame storage unit of the server, and the flow control unit.
  • the communication terminal marks the I frame data of the communication terminal by defining a specific load type (hereinafter referred to as a Payload type) in the RTP packet, and the RTP packet of the type includes only the I frame data of the audio and video data of the communication terminal; Because it is a communication system, the communication terminal and the server are two-way audio and video stream communication; when the multi-party video communication is established, the server includes a plurality of the communication subunits, and each of the communication subunits and each of the communication terminals Corresponding; After the multi-party video communication is established, the above communication terminal and the server can transmit audio and video data through at least the RTP/RTCP transmission channel.
  • a specific load type hereinafter referred to as a Payload type
  • a multi-party video communication media stream control method of the present invention includes the following steps: Step 101: The RTP output ⁇ input unit of the communication terminal establishes a pair of data communication links with the RTP input/output unit of the communication subunit corresponding to the communication terminal in the server, wherein the static image and the TS stream pass through the pair. Data communication link transmission;
  • Step 102 The I frame encoding unit of the communication terminal periodically encodes the static image into I frame data, and transmits the static image to the RTP output unit of the communication terminal.
  • Step 103 If the RTP output unit of the communication terminal receives the I frame data in the I frame coding unit, package it into an RTP packet with a specific Payload type, and send it immediately; if the RTP output unit of the communication terminal receives The data of the TS coding unit of the communication terminal is encapsulated into an RTP packet, and then sent according to a time stamp (hereinafter referred to as TimeStamp); wherein the RTP packet of the specific Payload type is different from the RTP packet of the TS stream; 104: If the RTP input unit of the communication subunit corresponding to each communication terminal in the server receives the RTP packet of the specific Payload type, extract the I frame data and store it in the I frame storage in the server. In the unit; if the TS stream data is received, the packet header is removed by the RTP unpacking unit of the communication subunit, and then stored in the media buffer unit of the communication subunit;
  • TimeStamp time stamp
  • Step 105 The media source unit timing of the communication subunit corresponding to each communication terminal in the server acquires I frame data of another communication terminal from the I frame storage unit, and the main media source determined by each communication terminal Obtaining the TS stream data in a media buffer unit of the corresponding communication subunit;
  • Step 106 If the RTP output unit of the communication subunit corresponding to each communication terminal in the server receives the RTP packet of the specific Payload type, send it immediately; if the RTP packet of the TS stream is received, press The above TimeStamp is sent;
  • Step 107 If the RTP input module of the communication terminal receives the RTP packet of the specific Payload type, transmit it to the I frame decoding unit of the communication terminal to form a still image of another communication terminal; if the TS is received The streamed RTP packet is transmitted to the TS decoding unit of the communication terminal to form a TS stream of the primary media source.
  • the data communication link in the above step 101 can be, for example, an RTP/RTCP transmission. Road.
  • the method for selecting a primary media source for a still image received through a communication terminal includes the following steps:
  • Step 201 When initially starting multi-party video communication, the communication terminal only performs I frame coding, and the I frame data is periodically sent to the server through an RTP packet of a specific Payload type;
  • Step 202 The server receives the I frame data of each communication terminal, and stores the data in the I frame storage unit of the server.
  • Step 203 The communication subunit corresponding to each communication terminal in the server acquires I frame data of another communication terminal from the I frame storage unit, and sends the I frame data to the communication terminal.
  • Step 205 The server checks whether the primary media source selected by the communication terminal is transmitting a TS stream, and if not, notifies the flow control unit of the primary media source to start the TS coding unit;
  • the primary media source is another communication terminal different from the communication terminal.
  • the method for switching a communication terminal from a current primary media source to another primary media source includes the following steps: Step 301: Flow control of the communication terminal 4
  • the unit notifies the server's flow control unit to switch its current primary media source from 4, A 2 , ..., 4 to 4;
  • Step 302 The server checks whether the primary media source is transmitting the TS stream, and if not, notifies the flow control unit of the primary media source to activate the TS encoding unit of the primary media source.
  • Step 303 The flow control unit of the server sends a control signal to the communication terminal.
  • the media source unit of the corresponding communication subunit obtains the TS stream data from the media buffer unit of the communication subunit corresponding to the above main media source y .
  • the current primary media sources 4, ..., 4 and the primary media source are communication terminals different from the communication terminal; the subscripts x, y, and n are different from each other.
  • the invention Compared with the existing video communication system, the invention has the following advantages:
  • the communication terminal When the multi-party video communication is established in the initial stage, the communication terminal only periodically transmits the video I frame data, thereby shortening the time for establishing the multi-party video communication;
  • the communication terminal can receive the TS stream data of the main media source of the communication terminal concerned by the current communication terminal while displaying the static image of the other communication terminal that is updated regularly, and can save the bandwidth while ensuring the maximum amount of information acquired;
  • FIG. 1 is a system configuration diagram of the present invention
  • FIG. 2 is a schematic diagram of an application environment according to a preferred embodiment of the present invention
  • FIG. 3 is a schematic diagram of system processing during three-party communication in the system of the present invention
  • FIG. 4 is a schematic flowchart of a process when a communication terminal periodically displays a still image of another communication terminal according to the present invention
  • FIG. 5 is a flowchart of processing when the communication terminal A of the present invention confirms the communication terminal B as its main media source;
  • FIG. 6 is a flowchart of processing when the communication terminal A of the present invention switches its main media source from the communication terminal B to the communication terminal C.
  • FIG. 1 is a block diagram of the system of the present invention.
  • the system of the present invention includes a media stream transmission control path, a communication terminal, and a server, wherein the communication terminal includes: an RTCP unit, an RTP output unit, and an RTP input unit, a TS coding unit, an I frame coding unit, and a media.
  • the server includes: an I frame storage unit, a flow control unit and a communication subunit, wherein the communication subunit comprises: an RTCP transmission unit, an RTP output unit, and an RTP input Unit, I frame extraction unit, RTP unpacking unit, media buffer unit and media source unit.
  • FIG. 2 is a schematic diagram of an application environment according to a preferred embodiment of the present invention.
  • communication terminals A, B, and C are connected to the video communication system server through the mobile network, the Internet (Internet), and the intranet (intranet) respectively.
  • Internet the Internet
  • intranet intranet
  • the server creates three communication sub-units respectively corresponding to the three communication terminals A, B, and C; each of the communication terminals establishes an RTCP control link and an RTP input/output link for each of the communication sub-units to transmit Audio and video data, and a flow control link is established to transmit primary media source control information.
  • the communication terminal A receives the audio and video data sent from the server, extracts the I frame data therein to form a static image of the timing update of the other communication terminal, and determines that the main media source is the communication terminal B according to the display screen, so the above
  • the communication terminal B needs to simultaneously output TS stream data and timed still image data, and the communication terminal A receives the TS stream data of the communication terminal B while receiving the static image data of the timing of the communication terminals B and C.
  • the server can smoothly switch the TS stream from the communication terminal B to the communication terminal C without the communication terminal A performing the switching operation.
  • the operation is seamless switching.
  • the present invention must have the following functions after being successfully implemented:
  • the I frame data sent by the communication terminals B and C, the communication terminal A can receive and display; 2. After the communication terminal A determines that the communication terminal B is its main media source according to the display image, the communication terminal B starts to transmit its TS stream, and the communication terminal A can receive the static image sent by the communication terminal C at the same time.
  • the server can switch the TS stream from the communication terminal B to the communication terminal C without the above-mentioned communication terminal A being aware of it.
  • each communication terminal participating in the communication only periodically transmits the I frame data, and each communication terminal can display the static image of the timing update of the other communication terminals.
  • the general process of this process is shown in Figure 4. Please refer to Figure 2, Figure 3 and Figure 4 for the specific implementation steps as follows - Step 1:
  • the server creates a corresponding communication sub-unit for each communication terminal.
  • the communication terminal and the communication subunit establish an RTP/RTCP transmission channel;
  • Step 2 The I frame encoding unit of the communication terminal VIII, B, and C periodically (for example, every 10 seconds) acquires the current still image of the communication terminal, encodes it into I frame data, and transmits it to the above communication terminals.
  • Step 3 When receiving the data of the I frame coding unit, the RTP output unit of the communication terminals A, B, and C encapsulates an RTP header for the data, where the SSRC field of the RTP header and the RTP header of the TS stream data of the communication terminal SSRC, which marks the source of the data; meanwhile, the Payload field of the RTP header is defined as a specific 7-bit data to mark the static image data packet from the TS stream;
  • Step 4 After the RTP output unit of the communication terminal adds the RTP header to the static image data, immediately sends the RTP header to the communication subunit A corresponding to each communication terminal, B, C in the server through the RTP transmission channel. , B, C RTP input unit;
  • Step 5 The I frame extracting unit of the communication subunits A, B, and C corresponding to each of the communication terminals A, B, and C in the server periodically periodically receives the Payload of the RTP header from the RTP input unit of the communication subunit. Field extracting still image data and transmitting it to the I frame storage unit of the above server; Step 6: The I frame storage unit of the server distinguishes static image data from different communication terminals according to different SSRC fields in the RTP header of each data packet;
  • Step 7 The media source units of the communication subunits A, B, and C corresponding to the communication terminals A, B, and C in the server periodically acquire static image data of other communication terminals from the I frame storage unit. And immediately transmit it to the RTP output unit of its communication subunit, the RTP output unit immediately transmits such data to the RTP input unit of the communication terminal corresponding to its communication subunit;
  • Step 8 The media buffer unit of the communication terminals A, B, and C receives data from the RTP input unit of the communication terminal, extracts still image data according to the Payload field of the RTP packet, and distinguishes other data according to the SSRC field of the RTP packet. Data of different communication terminals to form complete I frame data of different communication terminals;
  • Step 9 The I frame decoding unit of the communication terminals A, B, and C decodes the I frame data, and distinguishes different sources of the decoded image according to the SSRC field, so that the communication terminals VIII, B, and C can See the static image of the timing update of other communication terminals.
  • the terminal A For the communication terminal A, it is now possible to see the static images of the communication terminals B and C that are regularly updated, and to select the main media source B of interest according to the static images of the B and C of the communication terminal, so that the communication is performed.
  • the terminal A can receive the TS stream of the communication terminal B while receiving the static image updated by the timing of the communication terminals B and C.
  • FIG. 5 Please refer to FIG. 2, FIG. 3 and FIG. 5.
  • the specific implementation steps of this process are as follows:
  • Step 1 The communication terminal A selects the communication terminal B as its current main media source, and the flow control unit of the communication terminal A sends a control signal to the flow control unit of the server to request the current primary media source of the communication terminal A to be the communication terminal B;
  • Step 2 After receiving the request that the primary media source is the communication terminal B, the flow control unit of the server first checks whether the TS stream access count of the current communication terminal B is 0, and if the communication terminal B The TS stream access count is 0, indicating that the communication terminal B starts to establish multi-party video communication, and the flow control unit of the server sends a TS stream start request to the flow control unit of the communication terminal B; otherwise, step 6 is performed; Step 3: After receiving the TS stream start request sent by the server, the flow control unit of the communication terminal B notifies the TS coding unit of the communication terminal to start working, and the TS coding unit is working on the current communication terminal.
  • the audio and video data of B is encoded, and the encoded data is transmitted to the RTP output unit of the communication terminal B, and the RTP output unit adds the RTP header to the encoded data, and transmits it according to TimeStamp;
  • Step 4 After receiving the TS stream data sent by the communication terminal B, the RTP input unit of the communication subunit B in the server transmits the TS stream data to the RTP unpacking unit of the communication subunit, where the Payload field according to the RTP packet is used.
  • the difference is static image data or TS stream data;
  • Step 5 The RTP unpacking unit of the communication subunit B in the server removes the RTP header of the TS stream data and stores the data in the media buffer unit of the communication subunit.
  • Step 6 The flow control unit notification of the server The media source unit of the communication subunit A takes data from the media buffer unit of the communication subunit B, and the TS stream access count of the communication terminal B is incremented by one; • the I frame data of the TS stream is started when the data is started. Start fetching, if it is MPEG4 compression encoding, you can find the I frame by vop-coding_type or group-of-vop-start_code field;
  • Step 7 The media source unit of the communication subunit A of the server obtains the TS stream data from the media buffer unit of the communication subunit B, and transmits the TS stream data to the RTP output unit of the communication subunit;
  • Step 8 The RTP output unit of the communication subunit A of the server adds an RTP header to the TS stream data according to the RTP session attribute currently established with the communication terminal A, that is, the SSRC field of the RTP header is the communication terminal A and the communication.
  • Subunit A establishes the SSRC of the RTP session, wherein the TimeStamp and the serial number (hereinafter referred to as SqucneceNumber) are independent of the source of the data source; the RTP output unit of the communication subunit transmits the TS stream data according to the TimeStamp;
  • Step 9 The RTP input unit of the communication terminal A receives the TS stream data and transmits it to the TS decoding unit of the communication terminal, and the TS decoding unit decodes the real-time audio and video stream data of the communication terminal B, and Display images and play sounds.
  • the communication terminal A can receive and play the real-time audio and video data of the communication terminal B while receiving the static image updated by the timing of the communication terminals B and C.
  • the communication terminal A wants to switch the current main media source from the communication terminal B to the communication terminal C, the communication terminal C only needs to be selected according to the static image, and the server uses the main media source of the communication terminal A from the communication terminal.
  • Step 1 The flow control unit of the communication terminal A sends a flow switching instruction to the flow control unit of the server, requesting that its main media source be switched from the communication terminal B to the communication terminal C;
  • Step 2 After receiving the flow switching instruction of the communication terminal A, the flow control unit of the server first checks the current TS flow access count of the communication terminal C, and if the TS flow access count of the communication terminal C is > 0, the communication is indicated. If the terminal C has started to transmit its TS stream, step 3 is performed. If the TS stream access count of the communication terminal C is 0, indicating that the communication terminal C does not transmit the TS stream, the flow control unit of the server provides the communication terminal C. The flow control unit sends a TS stream start request, as described above, the communication terminal C starts to send its TS stream, and the media buffer unit of the communication subunit C stores the real-time audio and video data of the communication terminal C;
  • Step 3 The flow control unit of the server reduces the TS flow access count of the communication terminal B by one, and checks whether the current TS terminal access count of the communication terminal B is 0. If the count is 0, it indicates that no communication terminal needs the communication terminal.
  • the flow control unit of the server In the TS stream of B, the flow control unit of the server notifies the flow control unit of the communication terminal B to stop its current TS stream coding operation, and the flow control unit of the communication terminal B sends a stop real-time coding to the TS coding unit of the communication terminal.
  • step 4 the flow control unit of the server notifies that the media source unit of the communication subunit A is taken from the media buffer unit of the communication subunit C
  • the TS stream access count of the communication terminal C is incremented by 1; the data is started from the I frame data of the TS stream, and if it is MPEG4 compression code, it can be vopt-coding-type or group-of-vop.
  • start— code The PT/CN2005/002152 field finds the I frame;
  • Step 5 The media source unit of the communication subunit A of the server obtains the TS stream data from the media buffer unit of the communication subunit C and transmits the TS stream data to the RTP output unit of the communication subunit;
  • Step 6 The RTP output unit of the communication subunit A of the server adds an RTP header to the TS stream data according to the RTP session attribute currently established with the communication terminal A, that is, the SSRC field of the RTP header is the communication terminal A.
  • the SSRC, the TimeStam and the SequenceNumber of the communication subunit A are independent of the source of the data source, and the RTP output unit of the communication subunit transmits the TS stream data according to the TimeStamp;
  • the RTP input unit of the communication terminal A receives After the TS stream data is transmitted to the TS decoding unit of the communication terminal, the TS decoding unit decodes the real-time audio and video stream data of the communication terminal C, and displays an image and a playback sound.
  • the media stream switching method according to the present invention is characterized by:
  • the switching can be completed by selecting different data sources, without changing the TimeStamp, SequenceNumber, and SSRC in the RTP session;
  • Switching to one or more streams to the same media source is the same for media sources.
  • the present invention may also have other various embodiments, such as transmitting static image data by using a Transmission Control Protocol (hereinafter referred to as TCP protocol), or using static image data and TS stream data differently.
  • TCP protocol Transmission Control Protocol
  • static image data and TS stream data may also have other various embodiments, such as transmitting static image data by using a Transmission Control Protocol (hereinafter referred to as TCP protocol), or using static image data and TS stream data differently. Transmission channel transmission.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses a system and a method for controlling the media stream of the multiparty video communication. The system includes a transmitting control path for media stream, a communication terminal and a server. Therein, the communication terminal includes a RTCP unit, a RTP output unit, and a RTP input unit, a TS encoding unit, an I frame encoding unit, a media buffer unit, a TS decoding UNIT, an I frame decoding unit and a stream controlling unit. The server includes an I frame memory unit, a stream controlling unit and a communication sub-unit. The communication sub-unit includes a RTCP transmitting unit, a RTP output unit and a RTP input unit, an I frame extraction unit, a RTP de-packeting unit, a media buffer unit and a media source unit. The invention also discloses a method for transmitting the static picture mixed with the TS stream and a method for seamless switching and controlling the media stream, which reduce the burden of the server and the communication terminal and provide enough information in order to reduce the process of the multiparty video communication.

Description

一种多方视频通讯媒体流控制系统和方法 技术领域 本发明属于多媒体技术领域,涉及一种多方视频通讯媒体流控制系统 和方法, 更具体地涉及一种实时多方视频通讯时将静态图像与实时流 TECHNICAL FIELD The present invention relates to the field of multimedia technologies, and relates to a multi-party video communication media stream control system and method, and more particularly to a static image and a real-time stream in real-time multi-party video communication.
(Transport Stream) (以下简称 TS流)混合传输的系统和方法及媒体流控制和 切换的系统和方法。 背景技术 随着网络技术的发展, 一种新的媒体技术应运而生, 这就是流媒体技 术。 流媒体是指在网络中使用流式传输技术的连续时基媒体, 如音频、 视 频或多媒体文件。流媒体技术在视频电话和视频会议系统中得到了极大的 应用。在现有的视频通讯系统中,各通讯终端通过流媒体服务器进行通讯, 在多方视频通讯时,一个通讯终端通常要建若干对通讯链接以接收其他通 讯终端的音视频数据, 这种方式的服务器作为通讯中心负担很大, 作为通 讯终端可能并不关心所有参与通讯的通讯终端的视频信息,所以在此基础 上业界又提出了若干减轻服务器负担的方法, 例如在视频会议系统中, 各 通讯终端仅接收正在发言的通讯终端的音视频数据。虽然上述方法部分地 减轻了服务器的负担, 但仍然存在以下不足-(Transport Stream) (hereinafter referred to as TS stream) system and method for hybrid transmission and system and method for media stream control and handover. BACKGROUND With the development of network technology, a new media technology emerges as the times require. This is streaming media technology. Streaming media refers to continuous time-based media, such as audio, video or multimedia files, that use streaming technology in the network. Streaming media technology has been greatly used in video telephony and video conferencing systems. In the existing video communication system, each communication terminal communicates through a streaming media server. In a multi-party video communication, a communication terminal usually has a plurality of communication links to receive audio and video data of other communication terminals. As a communication center, it is very burdensome. As a communication terminal, it may not care about the video information of all the communication terminals participating in the communication. Therefore, on the basis of this, the industry has proposed several methods to reduce the burden on the server, for example, in the video conference system, each communication terminal. Only the audio and video data of the communication terminal that is speaking is received. Although the above method partially relieves the burden on the server, there are still the following shortcomings -
1、 通讯终端从一个媒体源切换到另一个媒体源的切换步骤复杂;1. The switching procedure of the communication terminal switching from one media source to another is complicated;
2、 在一个多方视频通讯过程中, 如果通讯终端仅接收某一个通讯源 的数据, 就会丧失信息的完整性。 发明内容 针对上述情形,本发明的目的在于提出一种多方视频通讯媒体流控制 系统和方法。本发明充分考虑了多方视频通讯系统的特点, 提出了一种将 静态图像与 TS流混合传输的系统和方法及一种媒体流无缝切换和控制的 系统和方法, 在降低服务器和通讯终端负载的同时能够提供足够多的信 息, 从而简化了多方视频通讯的过程。 2. In a multi-party video communication process, if the communication terminal only receives data from a certain communication source, the integrity of the information will be lost. SUMMARY OF THE INVENTION In view of the above circumstances, an object of the present invention is to provide a multi-party video communication media stream control system and method. The invention fully considers the characteristics of the multi-party video communication system, and proposes a System and method for hybrid transmission of static images and TS streams and a system and method for seamlessly switching and controlling media streams, which can provide sufficient information while reducing the load of servers and communication terminals, thereby simplifying the process of multi-party video communication .
本发明主要提出了解决以下三个问题的系统和方法:  The present invention mainly proposes a system and method for solving the following three problems:
1、 静态图像和 TS流同时传输时, 如何识别这 2种数据;  1. How to identify these two kinds of data when the still image and the TS stream are simultaneously transmitted;
2、 多方通讯建立时, 通讯终端接收其他通讯终端的静态图像, 并据 此选择其所关心的主媒体源;  2. When the multi-party communication is established, the communication terminal receives the static image of the other communication terminal, and selects the main media source concerned by it;
3、 如何无缝切换主媒体源。  3. How to seamlessly switch the main media source.
为实现上述目的, 本发明的一种多方视频通讯媒体流控制系统, 包括 媒体流传输控制路径, 负责传输音视频数据和控制信号; 通讯终端, 负责 发送和接收上述音视频数据和控制信号; 和服务器, 负责响应上述控制信 号, 为视频通讯终端提供媒体流服务; 其中上述通讯终端包括: 实时传输 控制协议 (Realtime Transport Control Protocol) (以下简称 RTCP)单元、 实时 传输协议 (Realtime Transport Protocol) (以下简称 RTP)输出单元和 RTP输入 单元, TS编码单元, 帧内编码帧 (I-frame, 又叫内部画面 (intra picture)) (以 下简称 I帧)编码单元, 媒体缓冲单元, TS解码单元, I帧解码单元及流控 制单元; 上述服务器包括: I帧存储单元, 流控制单元及通讯子单元, 其 中该通讯子单元包括: RTCP传输单元、 RTP输出单元和 RTP输入单元, I帧提取单元, RTP解包单元, 媒体缓冲单元及媒体源单元。  To achieve the above object, a multi-party video communication media stream control system of the present invention includes a media stream transmission control path for transmitting audio and video data and a control signal; and a communication terminal responsible for transmitting and receiving the audio and video data and the control signal; The server is responsible for providing the media stream service for the video communication terminal in response to the above control signal; wherein the communication terminal includes: Real Time Transport Control Protocol (RTCP) unit, Real Time Transport Protocol (hereinafter) RTP) output unit and RTP input unit, TS coding unit, intra-framed frame (I-frame, also called intra picture) (hereinafter referred to as I frame) coding unit, media buffer unit, TS decoding unit, I a frame decoding unit and a flow control unit; the server includes: an I frame storage unit, a flow control unit, and a communication subunit, wherein the communication subunit includes: an RTCP transmission unit, an RTP output unit, and an RTP input unit, an I frame extraction unit, and an RTP Unpacking unit, media buffer Meta and media source unit.
其中,上述通讯终端的 RTP输出单元负责将本通讯终端的音视频数据 传递给上述服务器中的与本通讯终端对应的通讯子单元的 RTP输入单元; 上述通讯终端的 RTP输入单元负责从上述服务器接收其他通讯终端 的音视频数据;  The RTP output unit of the communication terminal is responsible for transmitting the audio and video data of the communication terminal to the RTP input unit of the communication subunit corresponding to the communication terminal in the server; the RTP input unit of the communication terminal is responsible for receiving from the server Audio and video data of other communication terminals;
上述通讯终端的 TS编码单元负责本通讯终端的 TS流编码; 上述通讯终端的 I帧编码单元负责定时对本通讯终端的静态图像进行 编码, 形成 I帧数据输出到该通讯终端的 RTP输出单元, 并用上述 RTP 包中一特定的 Payload类型来标记该通讯终端的 I帧数据; 上述通讯终端的媒体缓冲单元负责接收上述通讯终端的 RTP输入单 元的数据; The TS coding unit of the communication terminal is responsible for TS stream coding of the communication terminal; the I frame coding unit of the communication terminal is responsible for periodically encoding the static image of the communication terminal, and forming I frame data output to the RTP output unit of the communication terminal, and using a specific Payload type in the above RTP packet to mark the I frame data of the communication terminal; The media buffer unit of the communication terminal is responsible for receiving data of the RTP input unit of the communication terminal;
上述通讯终端的 TS解码单元负责从上述媒体缓冲单元中提取 TS流, 并将其解码;  The TS decoding unit of the communication terminal is responsible for extracting the TS stream from the media buffer unit and decoding it;
上述通讯终端的 I帧解码单元负责从上述媒体缓冲单元中提取上述有 特定 Payload类型的 RTP包, 并对 I帧数据进行解码, 形成静态图像; 上述通讯终端的流控制单元负责与上述服务器的流控制单元通讯,通 知该服务器本通讯终端当前要使用的主媒体源或接收该服务器的启动本 通讯终端的 TS编码单元的指令;  The I frame decoding unit of the communication terminal is configured to extract the RTP packet having the specific Payload type from the media buffer unit, and decode the I frame data to form a still image; the flow control unit of the communication terminal is responsible for the flow with the server. The control unit communicates, notifies the server of the main media source currently used by the communication terminal or the instruction of the TS coding unit that starts the communication terminal of the server;
上述服务器的 I帧存储单元负责存储各通讯终端的 I帧数据; 上述服务器的流控制单元负责控制各通讯终端的音视频流的链接; 上述服务器的通讯子单元负责与上述通讯终端建立通讯链路传递上 述音视频数据和控制信号, 负责向上述服务器的 I帧存储单元输出视频 I 帧数据, 和负责从上述服务器的 I帧存储单元和上述服务器的其他通讯子 单元的 RTP解包单元中获取媒体源数据输入到该通讯子单元的媒体源单 元中; 其中  The I frame storage unit of the server is responsible for storing I frame data of each communication terminal; the flow control unit of the server is responsible for controlling the link of the audio and video streams of each communication terminal; the communication subunit of the server is responsible for establishing a communication link with the communication terminal. Transmitting the audio and video data and the control signal, and outputting the video I frame data to the I frame storage unit of the server, and acquiring the media from the RTP unpacking unit of the I frame storage unit of the server and other communication subunits of the server Source data is input to a media source unit of the communication subunit;
上述服务器的通讯子单元的 RTP 输出单元负责将其他通讯终端的音 视频数据传递给与该通讯子单元对应的通讯终端;  The RTP output unit of the communication subunit of the server is responsible for transmitting audio and video data of other communication terminals to the communication terminal corresponding to the communication subunit;
上述服务器的通讯子单元的 RTP 输入单元负责接收上述通讯终端的 音视频数据;  The RTP input unit of the communication subunit of the server is responsible for receiving audio and video data of the communication terminal;
上述服务器的通讯子单元的 I帧提取单元负责根据上述 RTP包的一特 定的 Payload类型来提取 I帧数据, 并将其存储到上述服务器的 I帧存储 单元中; 若该 I帧提取单元没有找到上述类型的 RTP包, 则其直接从上述 服务器的通讯子单元的 RTP解包单元中的数据中提取 I帧数据; 例如: 如 果当前视频采用 MPEG4 编码格式, 则可以通过 vop— coding— type 或 group_of—vop— start— code字段找到 I帧数据; 上述服务器的通讯子单元的 RTP解包单元负责在过滤掉上述服务器 的通讯子单元的 I帧提取单元需要的数据包后, 将接收到的音视频数据的 RTP包头移除,·并将其输出到该通讯子单元的媒体缓冲单元中; The I frame extracting unit of the communication subunit of the server is responsible for extracting I frame data according to a specific Payload type of the RTP packet, and storing it in the I frame storage unit of the server; if the I frame extracting unit is not found For the above type of RTP packet, the I frame data is directly extracted from the data in the RTP unpacking unit of the communication subunit of the server; for example: if the current video adopts the MPEG4 encoding format, the vop-coding-type or group_of may be adopted. The -vop_start_code field finds the I frame data; The RTP unpacking unit of the communication subunit of the server is responsible for removing the RTP header of the received audio and video data after filtering out the data packet required by the I frame extracting unit of the communication subunit of the server, and outputting the RTP header. Go to the media buffer unit of the communication subunit;
上述服务器的通讯子单元的媒体缓冲单元负责接收上述服务器的通 讯子单元的 RTP解包单元输出的数据;  The media buffer unit of the communication subunit of the server is responsible for receiving data output by the RTP unpacking unit of the communication subunit of the server;
上述服务器的通讯子单元的媒体源单元负责从上述服务器的 I帧存储 单元和与其他通讯终端对应的通讯子单元的媒体缓冲单元中获取与该通 讯子单元对应的通讯终端需要的音视频数据。  The media source unit of the communication subunit of the server is responsible for acquiring audio and video data required by the communication terminal corresponding to the communication subunit from the I frame storage unit of the server and the media buffer unit of the communication subunit corresponding to the other communication terminal.
此外, 上述通讯终端的 RTP输出单元分别与上述通讯终端的 TS编码 单元及 I帧编码单元连接;  In addition, the RTP output unit of the communication terminal is respectively connected to the TS coding unit and the I frame coding unit of the communication terminal;
上述通讯终端的 RTP输入单元与上述通讯终端的媒体缓冲单元连接; 上述通讯终端的媒体缓冲单元分别与上述通讯终端的 TS解码单元及 I帧解码单元连接; 及  The RTP input unit of the communication terminal is connected to the media buffer unit of the communication terminal; the media buffer unit of the communication terminal is respectively connected to the TS decoding unit and the I frame decoding unit of the communication terminal;
上述服务器的通讯子单元的 I帧提取单元分别与上述服务器的 I帧存 储单元、上述服务器的通讯子单元的 RTP输入单元及 RTP解包单元连接; 上述服务器的通讯子单元的 RTP解包单元分别与上述服务器的通讯 子单元的 RTP输入单元、 I帧提取单元及媒体缓冲单元连接;  The I frame extracting unit of the communication subunit of the server is respectively connected to the I frame storage unit of the server, the RTP input unit of the communication subunit of the server, and the RTP unpacking unit; the RTP unpacking unit of the communication subunit of the server respectively Connecting to the RTP input unit, the I frame extracting unit, and the media buffer unit of the communication subunit of the server;
上述服务器的通讯子单元的媒体源单元分别与上述服务器的通讯子 单元的 RTP输出单元、 上述服务器的 I帧存储单元及流控制单元连接。  The media source units of the communication subunits of the server are respectively connected to the RTP output unit of the communication subunit of the server, the I frame storage unit of the server, and the flow control unit.
其中, 上述通讯终端通过定义 RTP包中一特定的负载类型 (以下简称 Payload类型)来标记该通讯终端的 I帧数据, 该类型的 RTP包仅包含该通 讯终端的音视频数据的 I帧数据; 由于是通讯系统, 因此上述通讯终端与 服务器均为双向的音视频流通讯; 在多方视频通讯建立时, 上述服务器包 含多个上述通讯子单元,且每一个上述通讯子单元与每一个上述通讯终端 相对应; 在多方视频通讯建立后, 上述通讯终端与服务器至少可以通过 RTP/RTCP传输通道传输音视频数据。  The communication terminal marks the I frame data of the communication terminal by defining a specific load type (hereinafter referred to as a Payload type) in the RTP packet, and the RTP packet of the type includes only the I frame data of the audio and video data of the communication terminal; Because it is a communication system, the communication terminal and the server are two-way audio and video stream communication; when the multi-party video communication is established, the server includes a plurality of the communication subunits, and each of the communication subunits and each of the communication terminals Corresponding; After the multi-party video communication is established, the above communication terminal and the server can transmit audio and video data through at least the RTP/RTCP transmission channel.
为实现上述目的, 本发明的一种多方视频通讯媒体流控制方法, 对于 静态图像与 TS流混合传输的方法包括以下步骤: 步骤 101 : 通讯终端的 RTP输出 \输入单元与服务器中的与该通讯终 端对应的通讯子单元的 RTP输入 \输出单元建立一对数据通讯链路, 其中 上述静态图像与上述 TS流都通过这对数据通讯链路传输; To achieve the above object, a multi-party video communication media stream control method of the present invention, the method for hybrid transmission of a static image and a TS stream includes the following steps: Step 101: The RTP output\input unit of the communication terminal establishes a pair of data communication links with the RTP input/output unit of the communication subunit corresponding to the communication terminal in the server, wherein the static image and the TS stream pass through the pair. Data communication link transmission;
步骤 102: 上述通讯终端的 I帧编码单元定时将上述静态图像编码为 I帧数据, 并将其传输到上述通讯终端的 RTP输出单元中;  Step 102: The I frame encoding unit of the communication terminal periodically encodes the static image into I frame data, and transmits the static image to the RTP output unit of the communication terminal.
步骤 103 : 如果上述通讯终端的 RTP输出单元接收到上述 I帧编码单 元中的 I帧数据, 则将其封装成有特定 Payload类型的 RTP包, 并立即发 送; 如果上述通讯终端的 RTP输出单元接收到的是上述通讯终端的 TS编 码单元的数据, 则将其封装成 RTP包后, 按时间戳 (以下简称 TimeStamp) 发送; 其中上述特定 Payload类型的 RTP包区别于上述 TS流的 RTP包; 步骤 104: 如果上述服务器中的与上述各通讯终端对应的通讯子单元 的 RTP输入单元接收到上述特定 Payload类型的 RTP包,则将上述 I帧数 据提取出来并将其存储到服务器中的 I帧存储单元中;如果接收到的是 TS 流数据,则经上述通讯子单元的 RTP解包单元去除包头后,将其存储在上 述通讯子单元的媒体缓冲单元内;  Step 103: If the RTP output unit of the communication terminal receives the I frame data in the I frame coding unit, package it into an RTP packet with a specific Payload type, and send it immediately; if the RTP output unit of the communication terminal receives The data of the TS coding unit of the communication terminal is encapsulated into an RTP packet, and then sent according to a time stamp (hereinafter referred to as TimeStamp); wherein the RTP packet of the specific Payload type is different from the RTP packet of the TS stream; 104: If the RTP input unit of the communication subunit corresponding to each communication terminal in the server receives the RTP packet of the specific Payload type, extract the I frame data and store it in the I frame storage in the server. In the unit; if the TS stream data is received, the packet header is removed by the RTP unpacking unit of the communication subunit, and then stored in the media buffer unit of the communication subunit;
步骤 105: 上述服务器中的与上述各通讯终端对应的通讯子单元的媒 体源单元定时从上述 I帧存储单元中获取其他通讯终端的 I帧数据, 且从 上述各通讯终端确定的主媒体源所对应的通讯子单元的媒体缓冲单元中 获取上述 TS流数据;  Step 105: The media source unit timing of the communication subunit corresponding to each communication terminal in the server acquires I frame data of another communication terminal from the I frame storage unit, and the main media source determined by each communication terminal Obtaining the TS stream data in a media buffer unit of the corresponding communication subunit;
步骤 106: 如果上述服务器中的与上述各通讯终端对应的通讯子单元 的 RTP输出单元接收到上述特定 Payload类型的 RTP包,则立即发送;如 果接收到的是上述 TS流的 RTP包, 则按上述 TimeStamp发送;  Step 106: If the RTP output unit of the communication subunit corresponding to each communication terminal in the server receives the RTP packet of the specific Payload type, send it immediately; if the RTP packet of the TS stream is received, press The above TimeStamp is sent;
步骤 107:如果上述通讯终端的 RTP输入模块接收到上述特定 Payload 类型的 RTP包, 则将其传送给上述通讯终端的 I帧解码单元, 形成其他通 讯终端的静态图像; 如果接收到的是上述 TS流的 RTP包, 则将其传送给 上述通讯终端的 TS解码单元, 形成上述主媒体源的 TS流。  Step 107: If the RTP input module of the communication terminal receives the RTP packet of the specific Payload type, transmit it to the I frame decoding unit of the communication terminal to form a still image of another communication terminal; if the TS is received The streamed RTP packet is transmitted to the TS decoding unit of the communication terminal to form a TS stream of the primary media source.
其中上述步骤 101 中的数据通讯链路例如可以为 RTP/RTCP传输通 道。 The data communication link in the above step 101 can be, for example, an RTP/RTCP transmission. Road.
为实现上述目的, 本发明的一种多方视频通讯媒体流控制方法, 对于 通过通讯终端接收到的静态图像选择主媒体源的方法包括以下步骤:  To achieve the above object, a multi-party video communication media stream control method of the present invention, the method for selecting a primary media source for a still image received through a communication terminal includes the following steps:
步骤 201 : 最初开始多方视频通讯时, 通讯终端仅进行 I 帧编码, I 帧数据通过特定 Payload类型的 RTP包定时发送给服务器;  Step 201: When initially starting multi-party video communication, the communication terminal only performs I frame coding, and the I frame data is periodically sent to the server through an RTP packet of a specific Payload type;
步骤 202: 上述服务器接收各通讯终端的上述 I帧数据, 并将其存储 在上述服务器的 I帧存储单元中;  Step 202: The server receives the I frame data of each communication terminal, and stores the data in the I frame storage unit of the server.
步骤 203 : 上述服务器中的与上述各通讯终端对应的通讯子单元从上 述 I帧存储单元中获取其他通讯终端的 I帧数据并发送给该通讯终端; 步骤 204: 上述通讯终端根据显示的上述其他通讯终端的静态图像, 选择主媒体源,之后该通讯终端的流控制单元通知上述服务器的流控制单 元该通讯终端选择的主媒体源;  Step 203: The communication subunit corresponding to each communication terminal in the server acquires I frame data of another communication terminal from the I frame storage unit, and sends the I frame data to the communication terminal. Step 204: The communication terminal is configured according to the foregoing a static image of the communication terminal, selecting a primary media source, and then the flow control unit of the communication terminal notifying the flow control unit of the server of the primary media source selected by the communication terminal;
步骤 205: 上述服务器检查上述通讯终端所选择的主媒体源是否在传 送 TS流, 如果没有, 则通知该主媒体源的流控制单元启动 TS编码单元; 步骤 206: 上述服务器的流控制单元发送控制信号给与上述通讯终端 对应的通讯子单元的媒体源单元让其从与上述主媒体源对应的通讯子单 元的媒体缓冲单元中获取 TS流数据。  Step 205: The server checks whether the primary media source selected by the communication terminal is transmitting a TS stream, and if not, notifies the flow control unit of the primary media source to start the TS coding unit; Step 206: The flow control unit of the server sends a control The signal is supplied to the media source unit of the communication subunit corresponding to the communication terminal to acquire the TS stream data from the media buffer unit of the communication subunit corresponding to the main media source.
其中上述主媒体源为与上述通讯终端不同的另一通讯终端。  The primary media source is another communication terminal different from the communication terminal.
为实现上述目的, 本发明的一种多方视频通讯媒体流控制方法, 对于 一通讯终端从当前的主媒体源切换至另一主媒体源的方法包括以下步骤: 步骤 301: 通讯终端 4的流控制单元通知服务器的流控制单元将其当 前的主媒体源由 4 、 A2 、 ...、 4,切换至 4 ; To achieve the above object, a multi-party video communication media stream control method of the present invention, the method for switching a communication terminal from a current primary media source to another primary media source includes the following steps: Step 301: Flow control of the communication terminal 4 The unit notifies the server's flow control unit to switch its current primary media source from 4, A 2 , ..., 4 to 4;
步骤 302: 上述服务器检查上述主媒体源 是否在传送 TS流, 如果 没有, 则通知该主媒体源 ^的流控制单元启动该主媒体源 ^的 TS编码单 元;  Step 302: The server checks whether the primary media source is transmitting the TS stream, and if not, notifies the flow control unit of the primary media source to activate the TS encoding unit of the primary media source.
步骤 303 : 上述服务器的流控制单元发送控制信号给与上述通讯终端 4对应的通讯子单元的媒体源单元让其从与上述主媒体源 y对应的通讯 子单元的媒体缓冲单元中获取 TS流数据。 其中, 上述当前的主媒体源 4 、 、 ...、 4与上述主媒体源 ^为与 上述通讯终端 ^互不相同的通讯终端; 上述下标 x、 y及 n互不相同。 Step 303: The flow control unit of the server sends a control signal to the communication terminal. 4 The media source unit of the corresponding communication subunit obtains the TS stream data from the media buffer unit of the communication subunit corresponding to the above main media source y . The current primary media sources 4, ..., 4 and the primary media source are communication terminals different from the communication terminal; the subscripts x, y, and n are different from each other.
与现有的视频通讯系统相比, 采用本发明具有以下优点:  Compared with the existing video communication system, the invention has the following advantages:
1、 在初期建立多方视频通讯时, 通讯终端仅定时传输视频 I帧数据, 从而可以缩短建立多方视频通讯的时间;  1. When the multi-party video communication is established in the initial stage, the communication terminal only periodically transmits the video I frame data, thereby shortening the time for establishing the multi-party video communication;
2、 通讯终端可以在显示定时更新的其他通讯终端的静态图像的同时 接收当前的通讯终端关心的通讯终端的主媒体源 TS流数据, 可以在节约 带宽的同时保证获取的信息量最大;  2. The communication terminal can receive the TS stream data of the main media source of the communication terminal concerned by the current communication terminal while displaying the static image of the other communication terminal that is updated regularly, and can save the bandwidth while ensuring the maximum amount of information acquired;
3、 与其他媒体流切换技术相比, 本发明了建立独立于媒体源的 RTP/RTCP 传输通道, 在传输时不需要进行 TimeStamp 及同步源标识 (Synchronization Source identifiers) (以下简称 SSRC)的变换。 为让本发明的上述和其它目的、特征和优点能更明显易懂, 下文特举 较佳实施例, 并配合说明书附图, 作详细说明如下。 附图说明 图 1为本发明的系统构成图; 3. Compared with other media stream switching technologies, the present invention establishes an RTP/RTCP transmission channel independent of the media source, and does not need to perform TimeStamp and Synchronization Source Identifier (SSRC) conversion during transmission. The above and other objects, features and advantages of the present invention will become more <RTIgt; BRIEF DESCRIPTION OF DRAWINGS FIG. 1 is a system configuration diagram of the present invention;
图 2为本发明一较佳实施例的一种应用环境示意图; 图 3为本发明所述系统中三方通讯时的系统处理示意图;  2 is a schematic diagram of an application environment according to a preferred embodiment of the present invention; FIG. 3 is a schematic diagram of system processing during three-party communication in the system of the present invention;
图 4为本发明的通讯终端定时显示其他通讯终端的静态图像时的大致 处理流程图;  4 is a schematic flowchart of a process when a communication terminal periodically displays a still image of another communication terminal according to the present invention;
图 5为本发明的通讯终端 A确认通讯终端 B作为其主媒体源时的处 理流程图;  FIG. 5 is a flowchart of processing when the communication terminal A of the present invention confirms the communication terminal B as its main media source;
图 6为本发明的通讯终端 A将其主媒体源由通讯终端 B切换至通讯 终端 C时的处理流程图。 具体实施方式 下面结合附图对本发明的技术方案的具体实施作进一步的详细说明, 但其并不作为对本发明的限定。 FIG. 6 is a flowchart of processing when the communication terminal A of the present invention switches its main media source from the communication terminal B to the communication terminal C. DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The detailed description of the technical solutions of the present invention will be further described in detail below with reference to the accompanying drawings.
图 1为本发明的系统构成图。 如图 1所示, 本发明所述系统包括媒体 流传输控制路径、 通讯终端和服务器, 其中上述通讯终端包括: RTCP单 元、 RTP输出单元和 RTP输入单元, TS编码单元, I帧编码单元, 媒体 缓冲单元, TS解码单元, I帧解码单元及流控制单元; 上述服务器包括: I帧存储单元, 流控制单元及通讯子单元, 其中该通讯子单元包括: RTCP 传输单元、 RTP输出单元和 RTP输入单元, I帧提取单元, RTP解包单元, 媒体缓冲单元及媒体源单元。  Figure 1 is a block diagram of the system of the present invention. As shown in FIG. 1, the system of the present invention includes a media stream transmission control path, a communication terminal, and a server, wherein the communication terminal includes: an RTCP unit, an RTP output unit, and an RTP input unit, a TS coding unit, an I frame coding unit, and a media. The buffer unit, the TS decoding unit, the I frame decoding unit and the flow control unit; the server includes: an I frame storage unit, a flow control unit and a communication subunit, wherein the communication subunit comprises: an RTCP transmission unit, an RTP output unit, and an RTP input Unit, I frame extraction unit, RTP unpacking unit, media buffer unit and media source unit.
图 2为本发明一较佳实施例的一种应用环境示意图。 如图 2所示, 通 讯终端 A、 B、 C分别通过移动网络、因特网 (Internet)、企业内部网 (Intranet) 与视频通讯系统服务器相连, 当多方视频通讯开始后 (系统处理图如图 3 所示), 服务器创建 3个通讯子单元分别与上述 3个通讯终端 A、 B、 C一 一对应; 上述每一个通讯终端与上述每一个通讯子单元建立 RTCP控制链 接和 RTP输入 \输出链接来传输音视频数据, 并建立流控制链接来传送主 媒体源控制信息。上述通讯终端 A接收从上述服务器发送过来的音视频数 据, 提取其中的 I帧数据以形成其他通讯终端的定时更新的静态图像, 并 根据显示画面确定其主媒体源为上述通讯终端 B,所以上述通讯终端 B需 同时输出 TS流数据和定时的静态图像数据, 上述通讯终端 A在接收通讯 终端 B、 C的定时的静态图像数据的同时接收上述通讯终端 B的 TS流数 据。 当上述通讯终端 A选择上述通讯终端 C作为其主媒体源时, 上述服 务器能够将 TS流顺利地从上述通讯终端 B切换到上述通讯终端 C, 而不 需上述通讯终端 A作除切换动作之外的操作, 即为无缝切换。  2 is a schematic diagram of an application environment according to a preferred embodiment of the present invention. As shown in Figure 2, communication terminals A, B, and C are connected to the video communication system server through the mobile network, the Internet (Internet), and the intranet (intranet) respectively. When multi-party video communication starts (the system processing diagram is shown in Figure 3). The server creates three communication sub-units respectively corresponding to the three communication terminals A, B, and C; each of the communication terminals establishes an RTCP control link and an RTP input/output link for each of the communication sub-units to transmit Audio and video data, and a flow control link is established to transmit primary media source control information. The communication terminal A receives the audio and video data sent from the server, extracts the I frame data therein to form a static image of the timing update of the other communication terminal, and determines that the main media source is the communication terminal B according to the display screen, so the above The communication terminal B needs to simultaneously output TS stream data and timed still image data, and the communication terminal A receives the TS stream data of the communication terminal B while receiving the static image data of the timing of the communication terminals B and C. When the communication terminal A selects the communication terminal C as its main media source, the server can smoothly switch the TS stream from the communication terminal B to the communication terminal C without the communication terminal A performing the switching operation. The operation is seamless switching.
结合上述实时例, 本发明能够顺利实施后必须达到以下功能:  In combination with the above real-time example, the present invention must have the following functions after being successfully implemented:
1、 上述通讯终端 B、 C发送的 I帧数据, 上述通讯终端 A能够收到 并显示; 2、 上述通讯终端 A根据显示图像, 确定上述通讯终端 B为其主媒体 源后, 通讯终端 B开始发送其 TS流, 上述通讯终端 A能够在接收上述通 讯终端 C发送的静态图像的同时, 接收上述通讯终端 B的 TS流; 1. The I frame data sent by the communication terminals B and C, the communication terminal A can receive and display; 2. After the communication terminal A determines that the communication terminal B is its main media source according to the display image, the communication terminal B starts to transmit its TS stream, and the communication terminal A can receive the static image sent by the communication terminal C at the same time. The TS stream of the communication terminal B;
3、 若上述通讯终端 A选择上述通讯终端 C作为其主媒体源时, 上述 服务器能够在上述通讯终端 A无察觉的情况下,将 TS流由通讯终端 B切 换到通讯终端 C。  3. If the communication terminal A selects the communication terminal C as its main media source, the server can switch the TS stream from the communication terminal B to the communication terminal C without the above-mentioned communication terminal A being aware of it.
为实现上述目的, 且为节约带宽、 加快通讯建立速度, 在多方视频通 讯建立时, 参与通讯的各通讯终端仅定时传输 I帧数据, 各通讯终端能够 显示其他各通讯终端的定时更新的静态图像,此过程的大致流程如图 4所 示, 请同时参照图 2、 图 3和图 4其具体实施步骤如下- 步骤一、 多方视频通讯建立时, 服务器为各通讯终端创建与其对应的 通讯子单元, 通讯终端与通讯子单元建立 RTP/RTCP传输通道;  In order to achieve the above purpose, and in order to save bandwidth and speed up the communication establishment, when the multi-party video communication is established, each communication terminal participating in the communication only periodically transmits the I frame data, and each communication terminal can display the static image of the timing update of the other communication terminals. The general process of this process is shown in Figure 4. Please refer to Figure 2, Figure 3 and Figure 4 for the specific implementation steps as follows - Step 1: When multi-party video communication is established, the server creates a corresponding communication sub-unit for each communication terminal. , the communication terminal and the communication subunit establish an RTP/RTCP transmission channel;
步骤二、 通讯终端八、 B、 C的 I帧编码单元周期性地 (例如每隔 10 秒) 获取本通讯终端当前的静态图像, 将其编码成 I帧数据, 并传递给上 述各通讯终端的 RTP.输出单元;  Step 2: The I frame encoding unit of the communication terminal VIII, B, and C periodically (for example, every 10 seconds) acquires the current still image of the communication terminal, encodes it into I frame data, and transmits it to the above communication terminals. RTP. output unit;
步骤三、 上述通讯终端 A、 B、 C的 RTP输出单元接收到上述 I帧编 码单元的数据时, 为该数据封装 RTP包头, 其中此 RTP包头的 SSRC字 段与该通讯终端 TS流数据的 RTP包头的 SSRC—致, 其标记该数据的来 源; 同时, 该 RTP包头的 Payload字段定义为一特定的 7bit的数据, 以标 记该静态图像数据包区别于 TS流的数据;  Step 3: When receiving the data of the I frame coding unit, the RTP output unit of the communication terminals A, B, and C encapsulates an RTP header for the data, where the SSRC field of the RTP header and the RTP header of the TS stream data of the communication terminal SSRC, which marks the source of the data; meanwhile, the Payload field of the RTP header is defined as a specific 7-bit data to mark the static image data packet from the TS stream;
步骤四、上述通讯终端的 RTP输出单元对上述静态图像数据加上 RTP 包头后,立即通过上述 RTP传输通道将其发送给上述服务器中的与上述各 通讯终端 、 B、 C对应的通讯子单元 A、 B、 C的 RTP输入单元;  Step 4: After the RTP output unit of the communication terminal adds the RTP header to the static image data, immediately sends the RTP header to the communication subunit A corresponding to each communication terminal, B, C in the server through the RTP transmission channel. , B, C RTP input unit;
步骤五、 上述服务器中的与上述各通讯终端 A、 B、 C对应的通讯子 单元 A、 B、 C的 I帧提取单元周期性地从其通讯子单元的 RTP输入单元 中根据 RTP包头的 Payload字段提取静态图像数据, 并将其传送给上述服 务器的 I帧存储单元; 步骤六、上述服务器的 I帧存储单元根据各数据包的 RTP包头中不同 的 SSRC字段来区分来自不同通讯终端的静态图像数据; Step 5: The I frame extracting unit of the communication subunits A, B, and C corresponding to each of the communication terminals A, B, and C in the server periodically periodically receives the Payload of the RTP header from the RTP input unit of the communication subunit. Field extracting still image data and transmitting it to the I frame storage unit of the above server; Step 6: The I frame storage unit of the server distinguishes static image data from different communication terminals according to different SSRC fields in the RTP header of each data packet;
步骤七、 上述服务器中的与上述各通讯终端 A、 B、 C对应的通讯子 单元 A、 B、 C的媒体源单元周期性地从上述 I帧存储单元中获取其他通 讯终端的静态图像数据, 并将其立即传送给其通讯子单元的 RTP输出单 元,该 RTP输出单元立即将此类数据传输给与其通讯子单元相对应的通讯 终端的 RTP输入单元;  Step 7: The media source units of the communication subunits A, B, and C corresponding to the communication terminals A, B, and C in the server periodically acquire static image data of other communication terminals from the I frame storage unit. And immediately transmit it to the RTP output unit of its communication subunit, the RTP output unit immediately transmits such data to the RTP input unit of the communication terminal corresponding to its communication subunit;
步骤八、上述通讯终端 A、 B、 C的媒体缓冲单元从其通讯终端的 RTP 输入单元中接收数据, 根据其 RTP包的 Payload字段提取静态图像数据, 并根据上述 RTP包的 SSRC字段来区分其他不同通讯终端的数据,以组成 不同的各通讯终端的完整的 I帧数据;  Step 8. The media buffer unit of the communication terminals A, B, and C receives data from the RTP input unit of the communication terminal, extracts still image data according to the Payload field of the RTP packet, and distinguishes other data according to the SSRC field of the RTP packet. Data of different communication terminals to form complete I frame data of different communication terminals;
步骤九、上述通讯终端 A、 B、 C的 I帧解码单元对上述 I帧数据进行 解码, 并根据上述 SSRC字段来区分显示解码图像的不同来源, 这样上述 通讯通端八、 B、 C就可以看到其他通讯终端的定时更新的静态图像。  Step 9: The I frame decoding unit of the communication terminals A, B, and C decodes the I frame data, and distinguishes different sources of the decoded image according to the SSRC field, so that the communication terminals VIII, B, and C can See the static image of the timing update of other communication terminals.
对于上述通讯终端 A来说, 现在可以看到定时更新的通讯终端 B、 C 的静态图像, 并可以根据上述通讯终端的 B、 C的静态图像, 选择自己感 兴趣的主媒体源 B, 这样通讯终端 A就可以在接受上述通讯终端 B、 C的 定时更新的静态图像的同时, 接收通讯终端 B的 TS流, 此过程的大致流 程如图 5所示, 请同时参照图 2、 图 3和图 5, 此过程的具体实施步骤如 下:  For the communication terminal A, it is now possible to see the static images of the communication terminals B and C that are regularly updated, and to select the main media source B of interest according to the static images of the B and C of the communication terminal, so that the communication is performed. The terminal A can receive the TS stream of the communication terminal B while receiving the static image updated by the timing of the communication terminals B and C. The general flow of the process is shown in FIG. 5, please refer to FIG. 2, FIG. 3 and FIG. 5. The specific implementation steps of this process are as follows:
步骤一、 通讯终端 A选择通讯终端 B作为其当前的主媒体源, 通讯 终端 A 的流控制单元发送控制信号给服务器的流控制单元要求通讯终端 A当前的主媒体源为通讯终端 B;  Step 1: The communication terminal A selects the communication terminal B as its current main media source, and the flow control unit of the communication terminal A sends a control signal to the flow control unit of the server to request the current primary media source of the communication terminal A to be the communication terminal B;
步骤二、上述服务器的流控制单元接收到上述通讯终端 A确定主媒体 源为上述通讯终端 B的请求后,首先查看当前的通讯终端 B的 TS流访问 计数是否为 0, ·若上述通讯终端 B的 TS流访问计数为 0, 表明该通讯终 端 B才开始建立多方视频通讯,则上述服务器的流控制单元给上述通讯终 端 B的流控制单元发送 TS流开始请求; 否则执行步骤六; 步骤三、 上述通讯终端 B的流控制单元接收到上述服务器发送的 TS 流开始请求后, 通讯终端 B的流控制单元通知其通讯终端的 TS编码单元 开始工作, 该 TS编码单元对当前的通讯终端 B的音视频数据进行编码, 并将编码后的数据传递给上述通讯终端 B的 RTP输出单元, 该 RTP输出 单元为编码后的数据加上 RTP包头, 并按 TimeStamp发送; Step 2: After receiving the request that the primary media source is the communication terminal B, the flow control unit of the server first checks whether the TS stream access count of the current communication terminal B is 0, and if the communication terminal B The TS stream access count is 0, indicating that the communication terminal B starts to establish multi-party video communication, and the flow control unit of the server sends a TS stream start request to the flow control unit of the communication terminal B; otherwise, step 6 is performed; Step 3: After receiving the TS stream start request sent by the server, the flow control unit of the communication terminal B notifies the TS coding unit of the communication terminal to start working, and the TS coding unit is working on the current communication terminal. The audio and video data of B is encoded, and the encoded data is transmitted to the RTP output unit of the communication terminal B, and the RTP output unit adds the RTP header to the encoded data, and transmits it according to TimeStamp;
步骤四、 上述服务器中的通讯子单元 B的 RTP输入单元接收到上述 通讯终端 B发送的 TS流数据后,将其传递给该通讯子单元的 RTP解包单 元, 其中根据 RTP包的 Payload字段来区别是静态图像数据还是 TS流数 据;  Step 4: After receiving the TS stream data sent by the communication terminal B, the RTP input unit of the communication subunit B in the server transmits the TS stream data to the RTP unpacking unit of the communication subunit, where the Payload field according to the RTP packet is used. The difference is static image data or TS stream data;
步骤五、 上述服务器中的通讯子单元 B的 RTP解包单元, 去除上述 TS流数据的 RTP包头后将数据存储在该通讯子单元的媒体缓冲单元中; 步骤六、上述服务器的流控制单元通知上述通讯子单元 A的媒体源单 元从通讯子单元 B的媒体缓冲单元中取数据,且上述通讯终端 B的 TS流 访问计数加 1 ; ·在开始取数据时都是从 TS流的 I帧数据开始取, 如果是 MPEG4压缩编码, 可以通过 vop— coding— type或 group— of— vop— start— code 字段找到 I帧;  Step 5: The RTP unpacking unit of the communication subunit B in the server removes the RTP header of the TS stream data and stores the data in the media buffer unit of the communication subunit. Step 6: The flow control unit notification of the server The media source unit of the communication subunit A takes data from the media buffer unit of the communication subunit B, and the TS stream access count of the communication terminal B is incremented by one; • the I frame data of the TS stream is started when the data is started. Start fetching, if it is MPEG4 compression encoding, you can find the I frame by vop-coding_type or group-of-vop-start_code field;
步骤七、上述服务器的通讯子单元 A的媒体源单元从上述通讯子单元 B的媒体缓冲单元中取得 TS流数据, 并将其传递给该通讯子单元的 RTP 输出单元;  Step 7: The media source unit of the communication subunit A of the server obtains the TS stream data from the media buffer unit of the communication subunit B, and transmits the TS stream data to the RTP output unit of the communication subunit;
步骤八、 上述服务器的通讯子单元 A的 RTP输出单元根据当前与通 讯终端 A建立的 RTP会话属性, 为上述 TS流数据加上 RTP包头, 即此 时 RTP包头的 SSRC字段为通讯终端 A与通讯子单元 A确立 RTP会话的 SSRC, 其中 TimeStamp和序列号 (以下简称 SqucneceNumber)与数据源的 来源无关;该通讯子单元的 RTP输出单元根据 TimeStamp发送上述 TS流 数据;  Step 8: The RTP output unit of the communication subunit A of the server adds an RTP header to the TS stream data according to the RTP session attribute currently established with the communication terminal A, that is, the SSRC field of the RTP header is the communication terminal A and the communication. Subunit A establishes the SSRC of the RTP session, wherein the TimeStamp and the serial number (hereinafter referred to as SqucneceNumber) are independent of the source of the data source; the RTP output unit of the communication subunit transmits the TS stream data according to the TimeStamp;
步骤九、上述通讯终端 A的 RTP输入单元收到上述 TS流数据后将其 传送给其通讯终端的 TS解码单元, 该 TS解码单元对上述通讯终端 B的 实时音视频流数据进行解码操作, 并显示图像和播放声音。 这样上述通讯终端 A就能够在接收上述通讯终端 B、 C的定时更新的 静态图像的同时接收通讯终端 B的实时音视频数据并播放。当该通讯终端 A想将当前的主媒体源由上述通讯终端 B切换到上述通讯终端 C时, 仅 需根据静态图像选择上述通讯终端 C,上述服务器将通讯终端 A的主媒体 源由上述通讯终端 B切换到上述通讯终端(。 对于上述通讯终端 A来说, 在其做出选择不久, TS流的显示窗口将显示通讯终端 C的图像, 并播放 通讯终端 C的声音, 此过程大致处理流程如图 6所示。 请同时参照图 2、 图 3和图 6, 此过程具体的实施步骤如下: Step 9: The RTP input unit of the communication terminal A receives the TS stream data and transmits it to the TS decoding unit of the communication terminal, and the TS decoding unit decodes the real-time audio and video stream data of the communication terminal B, and Display images and play sounds. Thus, the communication terminal A can receive and play the real-time audio and video data of the communication terminal B while receiving the static image updated by the timing of the communication terminals B and C. When the communication terminal A wants to switch the current main media source from the communication terminal B to the communication terminal C, the communication terminal C only needs to be selected according to the static image, and the server uses the main media source of the communication terminal A from the communication terminal. B switches to the above communication terminal (for the above communication terminal A, shortly after it makes a selection, the display window of the TS stream will display the image of the communication terminal C, and play the sound of the communication terminal C, the process roughly processes such as Figure 6. Please refer to Figure 2, Figure 3 and Figure 6. The specific implementation steps of this process are as follows:
步骤一、通讯终端 A的流控制单元向服务器的流控制单元发送流切换 指令, 要求将其主媒体源由通讯终端 B切换到通讯终端 C;  Step 1: The flow control unit of the communication terminal A sends a flow switching instruction to the flow control unit of the server, requesting that its main media source be switched from the communication terminal B to the communication terminal C;
步骤二、上述服务器的流控制单元接收到上述通讯终端 A的流切换指 令后, 首先查看当前的通讯终端 C的 TS流访问计数, 如果上述通讯终端 C的 TS流访问计数 >0, 表明该通讯终端 C已经开始传送其 TS流, 则执 行步骤三; 如果上述通讯终端 C的 TS流访问计数为 0, 表明该通讯终端 C没有传送 TS流, 则上述服务器的流控制单元给上述通讯终端 C的流控 制单元发送 TS流开始请求, 如前面所述步骤, 上述通讯终端 C开始发送 其 TS流,且上述通讯子单元 C的媒体缓冲单元中存放该通讯终端 C的实 时音视频数据;  Step 2: After receiving the flow switching instruction of the communication terminal A, the flow control unit of the server first checks the current TS flow access count of the communication terminal C, and if the TS flow access count of the communication terminal C is > 0, the communication is indicated. If the terminal C has started to transmit its TS stream, step 3 is performed. If the TS stream access count of the communication terminal C is 0, indicating that the communication terminal C does not transmit the TS stream, the flow control unit of the server provides the communication terminal C. The flow control unit sends a TS stream start request, as described above, the communication terminal C starts to send its TS stream, and the media buffer unit of the communication subunit C stores the real-time audio and video data of the communication terminal C;
步骤三、 上述服务器的流控制单元将上述通讯终端 B的 TS流访问计 数减 1,并检查当前的通讯终端 B的 TS流访问计数是否为 0, 如果计数为 0, 表明没有通讯终端需要通讯终端 B的 TS流, 则上述服务器的流控制 单元通知上述通讯终端 B的流控制单元停止其当前的 TS流编码操作, 该 通讯终端 B的流控制单元向其通讯终端的 TS编码单元发送停止实时编码 指令, 上述通讯终端 B的 TS编码单元停止工作; 否则, 执行步骤四; 步骤四、上述服务器的流控制单元通知上述通讯子单元 A的媒体源单 元从上述通讯子单元 C的媒体缓冲单元中取数据, 上述通讯终端 C的 TS 流访问计数加 1 ;其中开始取数据时都是从 TS流的 I帧数据开始取,如果 是 MPEG4压缩编码,可以通过 vop— coding— type或 group— of— vop— start— code P T/CN2005/002152 字段找到 I帧; Step 3: The flow control unit of the server reduces the TS flow access count of the communication terminal B by one, and checks whether the current TS terminal access count of the communication terminal B is 0. If the count is 0, it indicates that no communication terminal needs the communication terminal. In the TS stream of B, the flow control unit of the server notifies the flow control unit of the communication terminal B to stop its current TS stream coding operation, and the flow control unit of the communication terminal B sends a stop real-time coding to the TS coding unit of the communication terminal. The instruction, the TS coding unit of the communication terminal B stops working; otherwise, step 4 is performed; step 4, the flow control unit of the server notifies that the media source unit of the communication subunit A is taken from the media buffer unit of the communication subunit C Data, the TS stream access count of the communication terminal C is incremented by 1; the data is started from the I frame data of the TS stream, and if it is MPEG4 compression code, it can be vopt-coding-type or group-of-vop. — start— code The PT/CN2005/002152 field finds the I frame;
步骤五、上述服务器的通讯子单元 A的媒体源单元从上述通讯子单元 C的媒体缓冲单元中取得 TS流数据并将其传送给其通讯子单元的 RTP输 出单元;  Step 5: The media source unit of the communication subunit A of the server obtains the TS stream data from the media buffer unit of the communication subunit C and transmits the TS stream data to the RTP output unit of the communication subunit;
步骤六、 上述服务器的通讯子单元 A的 RTP输出单元根据当前与上 述通讯终端 A建立的 RTP会话属性, 为上述 TS流数据加上 RTP包头, 即此时 RTP包头的 SSRC字段为上述通讯终端 A与上述通讯子单元 A确 立 RTP会话的 SSRC, TimeStam 和 SequenceNumber与数据源的来源无 关, 该通讯子单元的 RTP输出单元根据 TimeStamp发送上述 TS流数据; 步骤七、上述通讯终端 A的 RTP输入单元收到上述 TS流数据后将其 传送给其通讯终端的 TS解码单元, 该 TS解码单元对上述通讯终端 C的 实时音视频流数据进行解码操作, 并显示图像、 播放声音。  Step 6: The RTP output unit of the communication subunit A of the server adds an RTP header to the TS stream data according to the RTP session attribute currently established with the communication terminal A, that is, the SSRC field of the RTP header is the communication terminal A. The SSRC, the TimeStam and the SequenceNumber of the communication subunit A are independent of the source of the data source, and the RTP output unit of the communication subunit transmits the TS stream data according to the TimeStamp; Step 7. The RTP input unit of the communication terminal A receives After the TS stream data is transmitted to the TS decoding unit of the communication terminal, the TS decoding unit decodes the real-time audio and video stream data of the communication terminal C, and displays an image and a playback sound.
这样服务器就能顺利地将上述通讯终端 A 的主媒体源由通讯终端 B 切换到通讯终端 C;。 本发明涉及的媒体流切换方法的特点为:  Thus, the server can smoothly switch the main media source of the communication terminal A from the communication terminal B to the communication terminal C; The media stream switching method according to the present invention is characterized by:
1、 面向终端建立一独立的 RTP/UTCP传输通道, 与媒体源无关;  1. Establish an independent RTP/UTCP transmission channel for the terminal, independent of the media source;
2、 针对实时流的特点, 在音视频流切换时, 只要选择不同的数据源 即可完成切换, 而不需要对 RTP会话中的 TimeStamp、 SequenceNumber, SSRC作变换; 2. For the characteristics of real-time streaming, when the audio and video streams are switched, the switching can be completed by selecting different data sources, without changing the TimeStamp, SequenceNumber, and SSRC in the RTP session;
3、 不论是一个还是多个流切换到同一媒体源上, 对于媒体源来说都 是一样的。  3. Switching to one or more streams to the same media source is the same for media sources.
当然, 本发明还可有其他多种实施例, 例如釆用直接釆用传送控制协 议 (Transmission Control Protocol) (以下简称 TCP协议)传输静态图像数据, 或将静态图像数据和 TS流数据用不同的传输通道传输。 在不背离本发明 精神及其实质的情况下,所属技术领域的技术人员可根据本发明做出各种 相应的改变和变形,但这些相应的改变和变形都应属于本发明所附的权利 要求的保护范围。  Of course, the present invention may also have other various embodiments, such as transmitting static image data by using a Transmission Control Protocol (hereinafter referred to as TCP protocol), or using static image data and TS stream data differently. Transmission channel transmission. A person skilled in the art can make various changes and modifications in accordance with the present invention without departing from the spirit and scope of the invention, but the corresponding changes and modifications should be included in the appended claims. The scope of protection.

Claims

权 利 要 求 Rights request
1. 一种多方视频通讯媒体流控制系统,包括媒体流传输控制路径,负 责传输音视频数据和控制信号; 通讯终端, 负责发送和接收上述音视频数 据和控制信号; 和服务器, 负责响应上述控制信号, 为视频通讯终端提供 媒体流服务, 其特征在于 A multi-party video communication media stream control system, comprising a media stream transmission control path, responsible for transmitting audio and video data and control signals; a communication terminal, responsible for transmitting and receiving the above audio and video data and control signals; and a server, responsible for responding to the above control Signal, providing a media stream service for a video communication terminal, characterized in that
上述通讯终端包括: 实时传输控制协议单元、实时传输协议输出单元和实时传输协议 输入单元,其中该实时传输协议输出单元负责将本通讯终端的音视频数据 传递给上述服务器中的与本通讯终端对应的通讯子单元的实时传输协议 输入单元,该实时传输协议输入单元负责从上述服务器接收其他通讯终端 的音视频数据;  The communication terminal includes: a real-time transmission control protocol unit, a real-time transmission protocol output unit, and a real-time transmission protocol input unit, wherein the real-time transmission protocol output unit is responsible for transmitting audio and video data of the communication terminal to the server corresponding to the communication terminal. a real-time transmission protocol input unit of the communication subunit, the real-time transmission protocol input unit is responsible for receiving audio and video data of other communication terminals from the server;
实时流编码单元, 负责本通讯终端的实时流编码;  The real-time stream coding unit is responsible for real-time stream coding of the communication terminal;
帧内编码帧编码单元, 负责定时对本通讯终端的静态图像进行编 码, 形成帧内编码帧数据输出到该通讯终端的实时传输协议输出单元, 并 用上述实时传输协议包中一特定的负载类型来标记该通讯终端的帧内编 码帧数据;  The intra-coded frame coding unit is configured to periodically encode the static image of the communication terminal, form the intra-coded frame data to be output to the real-time transmission protocol output unit of the communication terminal, and mark with a specific load type in the real-time transmission protocol packet. Intra-coded frame data of the communication terminal;
媒体缓冲单元, 负责接收上述通讯终端的实时传输协议输入单元 的数据;  a media buffer unit, configured to receive data of a real-time transmission protocol input unit of the communication terminal;
实时流解码单元, 负责从上述媒体缓冲单元中提取实时流, 并将 其解码;  a real-time stream decoding unit, configured to extract a real-time stream from the media buffer unit and decode the same;
帧内编码帧解码单元, 负责从上述媒体缓冲单元中提取上述有特 定负载类型的实时传输协议包, 并对帧内编码帧数据进行解码, 形成静态 图像; 及  The intra-coded frame decoding unit is configured to extract the above-mentioned real-time transmission protocol packet with a specific payload type from the media buffer unit, and decode the intra-coded frame data to form a static image;
流控制单元, 负责与上述服务器的流控制单元通讯, 通知该服务 器本通讯终端当前要使用的主媒体源或接收该服务器的启动本通讯终端 的实时流编码单元的指令;  a flow control unit, configured to communicate with the flow control unit of the server, to notify the server of the primary media source currently used by the communication terminal or to receive an instruction of the real-time stream coding unit of the communication terminal of the server;
上述服务器包括: 帧内编码帧存储单元,负责存储上述各通讯终端的帧内编码帧数 据; The above servers include: An intra-coded frame storage unit is responsible for storing intra-coded frame data of each of the communication terminals;
流控制单元, 负责控制上述各通讯终端的音视频流的链接; 及 通讯子单元,负责与上述通讯终端建立通讯链路传递上述音视频 数据和控制信号,负责向上述服务器的帧内编码帧存储单元输出视频帧内 编码帧数据,和负责从上述服务器的帧内编码帧存储单元和上述服务器的 其他通讯子单元的实时传输协议解包单元中获取媒体源数据输入到该通 讯子单元的媒体源单元中, 其中该通讯子单元包括:  a flow control unit, configured to control a link of the audio and video streams of each of the communication terminals; and a communication subunit, configured to establish a communication link with the communication terminal to transmit the audio and video data and the control signal, and is responsible for intraframe coded frame storage to the server The unit outputs the video intraframe encoded frame data, and the media source for inputting the media source data into the communication subunit from the real-time transport protocol unpacking unit of the intra-coded frame storage unit of the server and the other communication sub-units of the server In the unit, wherein the communication subunit includes:
实时传输控制协议传输单元、实时传输协议输出单元和实时 传输协议输入单元,其中该实时传输协议输出单元负责将上述其他通讯终 端的音视频数据传递给与该通讯子单元对应的通讯终端,该实时传输协议 输入单元负责接收上述通讯终端的音视频数据;  a real-time transmission control protocol transmission unit, a real-time transmission protocol output unit, and a real-time transmission protocol input unit, wherein the real-time transmission protocol output unit is responsible for transmitting audio and video data of the other communication terminals to a communication terminal corresponding to the communication sub-unit, the real-time The transmission protocol input unit is responsible for receiving audio and video data of the communication terminal;
帧内编码帧提取单元,负责根据上述实时传输协议包的一特 定的负载类型来提取帧内编码帧数据,并将其存储到上述服务器的帧内编 码帧存储单元中;若该帧内编码帧提取单元没有找到上述类型的实时传输 协议包,则其直接从上述服务器的通讯子单元的实时传输协议解包单元中 的数据中提取帧内编码帧数据;  The intra-coded frame extracting unit is configured to extract the intra-coded frame data according to a specific load type of the real-time transport protocol packet and store it in the intra-coded frame storage unit of the server; if the intra-coded frame If the extracting unit does not find the real-time transport protocol packet of the above type, it extracts the intra-coded frame data directly from the data in the real-time transport protocol unpacking unit of the communication sub-unit of the server;
实时传输协议解包单元,负责在过滤掉上述服务器的通讯子 单元的帧内编码帧提取单元需要的数据包后,将接收到的音视频数据的实 时传输协议包头移除, 并将其输出到该通讯子单元的媒体缓冲单元中; 媒体缓冲单元,负责接收上述服务器的通讯子单元的实时传 输协议解包单元输出的数据; 及 ' 媒体源单元,负责从上述服务器的帧内编码帧存储单元和与 其他通讯终端对应的通讯子单元的媒体缓冲单元中获取与该通讯子单元 对应的通讯终端需要的音视频数据;  The real-time transmission protocol unpacking unit is configured to remove the real-time transmission protocol header of the received audio and video data after filtering out the data packet required by the intra-frame coding frame extraction unit of the communication sub-unit of the server, and output the same to the real-time transmission protocol header of the received audio and video data. a media buffer unit of the communication subunit; a media buffer unit configured to receive data output by a real-time transport protocol unpacking unit of the communication subunit of the server; and a media source unit responsible for an intra-coded frame storage unit from the server Obtaining audio and video data required by the communication terminal corresponding to the communication subunit in the media buffer unit of the communication subunit corresponding to the other communication terminal;
其中,上述通讯终端的实时传输协议输出单元分别与上述通讯终端的 实时流编码单元及帧内编码帧编码单元连接;  The real-time transmission protocol output unit of the communication terminal is respectively connected to the real-time stream coding unit and the intra-frame coding frame coding unit of the communication terminal;
上述通讯终端的实时传输协议输入单元与上述通讯终端的媒体缓冲 单元连接; Real-time transmission protocol input unit of the above communication terminal and media buffer of the above communication terminal Unit connection
上述通讯终端的媒体缓冲单元分别与上述通讯终端的实时流解码单 元及帧内编码帧解码单元连接; 及  The media buffer unit of the communication terminal is respectively connected to the real-time stream decoding unit and the intra-frame coded frame decoding unit of the communication terminal; and
上述服务器的通讯子单元的帧内编码帧提取单元分别与上述服务器 的帧内编码帧存储单元、上述服务器的通讯子单元的实时传输协议输入单 元及实时传输协议解包单元连接;  The intra-coded frame extracting unit of the communication sub-unit of the server is respectively connected to the intra-coded frame storage unit of the server, the real-time transport protocol input unit of the communication sub-unit of the server, and the real-time transport protocol unpacking unit;
上述服务器的通讯子单元的实时传输协议解包单元分别与上述服务 器的通讯子单元的实时传输协议输入单元、帧内编码帧提取单元及媒体缓 冲单元连接;  The real-time transport protocol unpacking unit of the communication subunit of the server is respectively connected to the real-time transport protocol input unit, the intra-coded frame extracting unit and the media buffer unit of the communication subunit of the server;
上述服务器的通讯子单元的媒体源单元分别与上述服务器的通讯子 单元的实时传输协议输出单元、上述服务器的帧内编码帧存储单元及流控 制单元连接。  The media source units of the communication subunits of the server are respectively connected to a real-time transmission protocol output unit of the communication subunit of the server, an intra-coded frame storage unit of the server, and a flow control unit.
2. 根据权利要求 1所述的系统, 其特征在于  2. The system of claim 1 wherein
上述通讯终端通过定义实时传输协议包中一特定的负载类型来标记 该通讯终端的帧内编码帧数据,该类型的实时传输协议包仅包含该通讯终 端的音视频数据的帧内编码帧数据;  The communication terminal marks the intra-coded frame data of the communication terminal by defining a specific load type in the real-time transmission protocol packet, and the real-time transmission protocol packet of the type only includes the intra-coded frame data of the audio and video data of the communication terminal;
上述通讯终端与服务器均为双向的音视频流通讯;  The above communication terminal and the server are two-way audio and video stream communication;
多方视频通讯建立时, 上述服务器包含多个上述通讯子单元, 且每一 个上述通讯子单元与每一个上述通讯终端相对应;  When the multi-party video communication is established, the server includes a plurality of the communication subunits, and each of the communication subunits corresponds to each of the communication terminals;
多方视频通讯建立后,上述通讯终端与服务器通过实时传输协议 /实时 传输控制协议传输通道传输音视频数据。  After the multi-party video communication is established, the above communication terminal and the server transmit the audio and video data through the real-time transmission protocol/real-time transmission control protocol transmission channel.
3. 一种多方视频通讯媒体流控制方法, 其特征在于静态图像与实时 流混合传输的方法包括以下步骤:  A multi-party video communication media stream control method, characterized in that the method for hybrid transmission of a static image and a real-time stream comprises the following steps:
步骤 101 :通讯终端的实时传输协议输出\输入单元与服务器中的与该 通讯终端对应的通讯子单元的实时传输协议输入 \输出单元建立一对数据 通讯链路, 其中上述静态图像与上述实时流都通过这对数据通讯链路传 输; 步骤 102: 上述通讯终端的帧内编码帧编码单元定时将上述静态图像 编码为帧内编码帧数据,并将其传输到上述通讯终端的实时传输协议输出 单元中; Step 101: The real-time transmission protocol output/input unit of the communication terminal establishes a pair of data communication links with the real-time transmission protocol input/output unit of the communication sub-unit corresponding to the communication terminal in the server, wherein the static image and the real-time stream are All transmitted through this pair of data communication links; Step 102: The intra-coded frame coding unit of the communication terminal periodically encodes the static image into intra-frame encoded frame data, and transmits the same to the real-time transmission protocol output unit of the communication terminal.
步骤 103 : 如果上述通讯终端的实时传输协议输出单元接收到上述帧 内编码帧编码单元中的帧内编码帧数据,则将其封装成有特定负载类型的 实时传输协议包, 并立即发送; 如果上述通讯终端的实时传输协议输出单 元接收到的是上述通讯终端的实时流编码单元的数据, 则将其封装成实时 传输协议包后, 按时间戳发送; 其中上述特定负载类型的实时传输协议包 区别于上述实时流的实时传输协议包;  Step 103: If the real-time transmission protocol output unit of the communication terminal receives the intra-coded frame data in the intra-frame coding frame coding unit, package the data into a real-time transmission protocol packet with a specific payload type, and send it immediately; The real-time transmission protocol output unit of the communication terminal receives the data of the real-time stream coding unit of the communication terminal, and then encapsulates it into a real-time transmission protocol packet, and sends it according to a time stamp; wherein the real-time transmission protocol package of the specific load type a real-time transport protocol packet that is different from the above-mentioned real-time stream;
步骤 104: 如果上述服务器中的与上述各通讯终端对应的通讯子单元 的实时传输协议输入单元接收到上述特定负载类型的实时传输协议包, 则 将上述帧内编码帧数据提取出来并将其存储到服务器中的帧内编码帧存 储单元中; 如果接收到的是实时流数据, 则经上述通讯子单元的实时传输 协议解包单元去除包头后, 将其存储在上述通讯子单元的媒体缓冲单元 内;  Step 104: If the real-time transport protocol input unit of the communication sub-unit corresponding to each communication terminal in the server receives the real-time transport protocol packet of the specific load type, extract the intra-frame encoded frame data and store the same. Go to the intra-coded frame storage unit in the server; if the real-time stream data is received, the packet header is removed by the real-time transport protocol unpacking unit of the communication subunit, and then stored in the media buffer unit of the communication subunit Inside;
步骤 105 : 上述服务器中的与上述各通讯终端对应的通讯子单元的媒 体源单元定时从上述帧内编码帧存储单元中获取其他通讯终端的帧内编 码帧数据,且从上述各通讯终端确定的主媒体源所对应的通讯子单元的媒 体缓冲单元中获取上述实时流数据;  Step 105: The media source unit timing of the communication subunit corresponding to each communication terminal in the server is used to acquire intraframe encoded frame data of other communication terminals from the intraframe encoded frame storage unit, and determined from each communication terminal. Obtaining the real-time stream data in a media buffer unit of a communication subunit corresponding to the main media source;
步骤 106: 如果上述服务器中的与上述各通讯终端对应的通讯子单元 的实时传输协议输出单元接收到上述特定负载类型的实时传输协议包,则 立即发送; 如果接收到的是上述实时流的实时传输协议包, 则按上述时间 戳发送; '  Step 106: If the real-time transmission protocol output unit of the communication sub-unit corresponding to each communication terminal in the server receives the real-time transmission protocol packet of the specific load type, immediately send; if the real-time flow is received in real time The transport protocol packet is sent according to the above time stamp; '
步骤 107: 如果上述通讯终端的实时传输协议输入模块接收到上述特 定负载类型的实时传输协议包,则将其传送给上述通讯终端的帧内编码帧 解码单元, 形成其他通讯终端的静态图像; 如果接收到的是上述实时流的 实时传输协议包, 则将其传送给上述通讯终端的实时流解码单元, 形成上 述主媒体源的实时流。 Step 107: If the real-time transmission protocol input module of the communication terminal receives the real-time transmission protocol packet of the specific payload type, transmit the same to the intra-coded frame decoding unit of the communication terminal to form a static image of another communication terminal; Receiving the real-time transport protocol packet of the real-time stream, transmitting the real-time transport protocol packet to the real-time stream decoding unit of the communication terminal to form a real-time stream of the main media source.
4. 根据权利要求 3所述的方法, 其特征在于上述步骤 101 中的数据 通讯链路为实时传输协议 /实时传输控制协议传输通道。 The method according to claim 3, characterized in that the data communication link in the above step 101 is a real-time transmission protocol/real-time transmission control protocol transmission channel.
5. 一种多方视频通讯媒体流控制方法, 其特征在于通过通讯终端接 收到的静态图像选择主媒体源的方法包括以下步骤:  A multi-party video communication media stream control method, characterized in that the method for selecting a main media source through a still image received by a communication terminal comprises the following steps:
步骤 201 : 最初开始多方视频通讯时, 通讯终端仅进行帧内编码帧编 码,帧内编码帧数据通过特定负载类型的实时传输协议包定时发送给服务 器;  Step 201: When initially starting multi-party video communication, the communication terminal only performs intra-coded frame coding, and the intra-coded frame data is periodically sent to the server through a real-time transmission protocol packet of a specific load type;
步骤 202: 上述服务器接收各通讯终端的上述帧内编码帧数据, 并将 其存储在上述服务器的帧内编码帧存储单元中;  Step 202: The server receives the intra-frame encoded frame data of each communication terminal, and stores it in an intra-framed frame storage unit of the server.
步骤 203 : 上述服务器中的与上述各通讯终端对应的通讯子单元从上 述帧内编码帧存储单元中获取其他通讯终端的帧内编码帧数据并发送给 该通讯终端;  Step 203: The communication subunit corresponding to each communication terminal in the server acquires intraframe encoded frame data of other communication terminals from the intraframe encoded frame storage unit and sends the intraframe encoded frame data to the communication terminal;
步骤 204: 上述通讯终端根据显示的上述其他通讯终端的静态图像, 选择主媒体源,之后该通讯终端的流控制单元通知上述服务器的流控制单 元该通讯终端选择的主媒体源;  Step 204: The communication terminal selects a primary media source according to the displayed static image of the other communication terminal, and then the flow control unit of the communication terminal notifies the flow control unit of the server of the primary media source selected by the communication terminal;
步骤 205 : '上述服务器检查上述通讯终端所选择的主媒体源是否在传 送实时流, 如果没有, 则通知该主媒体源的流控制单元启动实时流编码单 元;  Step 205: The server checks whether the primary media source selected by the communication terminal is transmitting a real-time stream, and if not, notifying the flow control unit of the primary media source to start the real-time stream coding unit;
步骤 206: 上述服务器的流控制单元发送控制信号给与上述通讯终端 对应的通讯子单元的媒体源单元让其从与上述主媒体源对应的通讯子单 元的媒体缓冲单元中获取实时流数据。  Step 206: The flow control unit of the server sends a control signal to the media source unit of the communication subunit corresponding to the communication terminal to obtain real-time stream data from the media buffer unit of the communication sub-unit corresponding to the main media source.
6. 根据权利要求 5 所述的方法, 其特征在于上述主媒体源为与上述 通讯终端不同的另一通讯终端。  6. The method of claim 5 wherein said primary media source is another communication terminal different from said communication terminal.
7. 一种多方视频通讯媒体流控制方法, 其特征在于一通讯终端从当 前的主媒体源切换至另一主媒体源的方法包括以下步骤:  A multi-party video communication media stream control method, characterized in that a method for a communication terminal to switch from a current primary media source to another primary media source comprises the following steps:
步骤 301: 通讯终端 4的流控制单元通知服务器的流控制单元将其当 前的主媒体源由 4 、 A2 、 . . .、 „切换至^ ; ' Step 301: The flow control unit of the communication terminal 4 notifies the flow control unit of the server to treat it as The former main media source consists of 4, A 2 , . . . , „switch to ^ ; '
步骤 302: 上述服务器检查上述主媒体源 ^是否在传送实时流, 如果 没有,则通知该主媒体源 ^的流控制单元启动该主媒体源 的实时流编码 单元;  Step 302: The server checks whether the primary media source is transmitting a real-time stream, and if not, notifying the flow control unit of the primary media source to activate a real-time stream encoding unit of the primary media source;
步骤 303 : 上述服务器的流控制单元发送控制信号给与上述通讯终端 ^对应的通讯子单元的媒体源单元让其从与上述主媒体源 ^对应的通讯 子单元的媒体缓冲单元中获取实时流数据。  Step 303: The flow control unit of the server sends a control signal to the media source unit of the communication subunit corresponding to the communication terminal, to obtain real-time stream data from the media buffer unit of the communication subunit corresponding to the main media source. .
8. 根据权利要求 7所述的方法, 其特征在于  8. The method of claim 7 wherein
上述当前的主媒体源 4 、 A2 4与上述主媒体源^为与上述通 讯终端 Ax互不相同的通讯终端; The current main media source 4, A 2 4 and the main media source are different communication terminals from the communication terminal A x ;
上述下标 x、 y及 n互不相同。  The above subscripts x, y and n are different from each other.
PCT/CN2005/002152 2005-12-12 2005-12-12 System and method for controlling the media stream of the multiparty video communication WO2007068139A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2005/002152 WO2007068139A1 (en) 2005-12-12 2005-12-12 System and method for controlling the media stream of the multiparty video communication
CN2005800516476A CN101273631B (en) 2005-12-12 2005-12-12 Multi-party video communication media flow control system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2005/002152 WO2007068139A1 (en) 2005-12-12 2005-12-12 System and method for controlling the media stream of the multiparty video communication

Publications (1)

Publication Number Publication Date
WO2007068139A1 true WO2007068139A1 (en) 2007-06-21

Family

ID=38162531

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2005/002152 WO2007068139A1 (en) 2005-12-12 2005-12-12 System and method for controlling the media stream of the multiparty video communication

Country Status (2)

Country Link
CN (1) CN101273631B (en)
WO (1) WO2007068139A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009015611A1 (en) * 2007-08-01 2009-02-05 Huawei Technologies Co., Ltd. Method, system and apparatus for quick switching media source
CN101562911B (en) * 2008-04-16 2011-06-15 北京信威通信技术股份有限公司 Wireless access system air interface RTP agent technology supporting end-to-end application

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101867764A (en) * 2010-04-21 2010-10-20 崔崧 Mobile multi-party video call system
CN106341642A (en) * 2016-08-29 2017-01-18 安徽云图信息技术有限公司 Big data multidimensional visualization system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002045398A1 (en) * 1999-06-14 2002-06-06 Ginganet Corporation Teleconferencing system
CN1510919A (en) * 2002-12-23 2004-07-07 ����ͨѶ�ɷ����޹�˾ Method and device for data processing in video conference multi-point control equipment
CN1581967A (en) * 2003-08-11 2005-02-16 英业达股份有限公司 System and method for switching communication connection in video conference

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002045398A1 (en) * 1999-06-14 2002-06-06 Ginganet Corporation Teleconferencing system
CN1510919A (en) * 2002-12-23 2004-07-07 ����ͨѶ�ɷ����޹�˾ Method and device for data processing in video conference multi-point control equipment
CN1581967A (en) * 2003-08-11 2005-02-16 英业达股份有限公司 System and method for switching communication connection in video conference

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009015611A1 (en) * 2007-08-01 2009-02-05 Huawei Technologies Co., Ltd. Method, system and apparatus for quick switching media source
CN101562911B (en) * 2008-04-16 2011-06-15 北京信威通信技术股份有限公司 Wireless access system air interface RTP agent technology supporting end-to-end application

Also Published As

Publication number Publication date
CN101273631B (en) 2010-04-21
CN101273631A (en) 2008-09-24

Similar Documents

Publication Publication Date Title
US10045052B2 (en) System and method for transferring data
KR100906687B1 (en) Multipoint Conference System and Multipoint Conference Device
CN101156374B (en) A system and method for monitoring a video call
CN104883344B (en) Negotiate the method and apparatus of media capability
CN108366044B (en) VoIP remote audio/video sharing method
US8750317B2 (en) Multimedia data monitoring system and multimedia data monitoring method
TW202423095A (en) Automatic generation of video content in response to network interruption
JPWO2008117892A1 (en) Content distribution system, conversion device, and content distribution method used therefor
WO2010124499A1 (en) Method and terminal for synchronously recording sounds and images of opposite ends based on circuit domain video telephone
JP2013042492A (en) Method and system for switching video streams in resident display type video conference
WO2007068139A1 (en) System and method for controlling the media stream of the multiparty video communication
JP2008311855A (en) Communication terminal device
JP2002152301A (en) Data communication system, data receiver, data communication method, and program storage medium
WO2009145293A1 (en) Server device, communication method, and program
CN102957893B (en) For the method and system switched between the video flowing in continuous presence conferences
JP3969155B2 (en) Multimedia communication transfer method, multimedia communication terminal, exchange, management device
JP2009044287A (en) Video distribution device, distribution video switching method, distribution video switching program, and distribution video switching program recording medium
JP4818309B2 (en) Video phone terminal
JPWO2009145294A1 (en) Server apparatus, communication method and program
CN116471373A (en) Friend sharing system and method based on E1 and IP network video conference terminal
KR20050068095A (en) Apparatus and its method of resetting call path for image frame storing and regeneration
JP2008182336A (en) Reception terminal
JP2006180371A (en) Gateway device, communication terminal, and call control method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 200580051647.6

Country of ref document: CN

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 05820622

Country of ref document: EP

Kind code of ref document: A1