WO2018155351A1 - Reproduction method, reproduction system, and reproduction apparatus - Google Patents
- Publication number
- WO2018155351A1 (PCT/JP2018/005613)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- content
- sound
- video
- playback
- reproduction
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/45—Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
- H04N21/458—Scheduling content for creating a personalised stream, e.g. by combining a locally stored advertisement with an incoming stream; Updating operations, e.g. for OS modules ; time-related management operations
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/84—Generation or processing of descriptive data, e.g. content descriptors
Definitions
- The present disclosure relates to a reproduction method, a reproduction system, and a reproduction apparatus for reproducing video content and sound content.
- Patent Document 1 discloses a moving image playback apparatus that smoothly switches a moving image provided by streaming.
- This disclosure provides a playback method that can reduce a sense of discomfort given to a user when video content and sound content are switched to different content.
- The reproduction method acquires first content composed of first video content and first sound content that are independent of each other, acquires second content composed of second video content and second sound content that are independent of each other, and reproduces the acquired second content after reproducing the acquired first content.
- the method according to the present disclosure can reduce a sense of discomfort given to the user when the video content and the sound content are switched to different content.
- FIG. 1 is a schematic diagram of a reproduction system according to an embodiment.
- FIG. 2 is a block diagram illustrating an example of a hardware configuration of the playback device.
- FIG. 3 is a block diagram illustrating an example of the hardware configuration of the server.
- FIG. 4 is a block diagram illustrating an example of a hardware configuration of the information processing apparatus.
- FIG. 5 is a block diagram illustrating an example of a functional configuration of the reproduction system according to the embodiment.
- FIG. 6 is a block diagram illustrating an example of a specific configuration of the playback unit.
- FIG. 7 is a diagram illustrating an example of processing for switching from the first content to the second content.
- FIG. 8 is a sequence diagram illustrating an example of a reproduction method by the reproduction system according to the embodiment.
- FIG. 9 is a flowchart illustrating an example of the details of the reproduction process performed by the reproduction apparatus according to the embodiment.
- FIG. 10 is a sequence diagram illustrating an example of a registration method by the reproduction system according to the embodiment.
- FIG. 11 is a block diagram illustrating an example of a functional configuration of a reproduction system according to a modification of the embodiment.
- FIG. 1 is a schematic diagram of a reproduction system according to an embodiment.
- FIG. 1 shows a playback device 100, a server 200, a communication network 300, and an information processing device 400.
- the playback system 1 includes the playback device 100 and the server 200 among these components.
- the playback system 1 may further include an information processing apparatus 400.
- a plurality of playback devices 100 may be connected to the communication network 300.
- a plurality of information processing devices 400 may be connected to the communication network 300.
- the playback system 1 is a system for providing a first user with content configured by a combination of independent video content and sound content from the server 200 to the playback device 100.
- One playback device 100 may correspond to one first user or a plurality of first users.
- When the reproduction system 1 includes a plurality of reproduction apparatuses 100, a plurality of first users may correspond to the plurality of reproduction apparatuses 100 one-to-one or one-to-many.
- Alternatively, a plurality of playback devices 100 may correspond to one first user.
- Similarly, one information processing apparatus 400 may correspond to one second user or to a plurality of second users.
- When there are a plurality of information processing apparatuses 400, a plurality of second users may correspond to them one-to-one or one-to-many. Further, a plurality of information processing apparatuses 400 may correspond to one second user. For example, video content or sound content is provided to the server 200 via the information processing apparatus 400 by a second user such as a content creator.
- the independent content is content generated on the assumption that the content itself is reproduced independently. That is, the reproduction time for reproducing the video content constituting the content once from the beginning to the end is often different from the reproduction time for reproducing the sound content once from the beginning to the end. Further, in the video content and the sound content constituting the content, the creator of the video content and the creator of the sound content are often different.
- the playback system 1 can generate a large amount of content by generating content by combining video content and sound content that are independent of each other. For this reason, it is possible to reduce the shortage of content.
- Switching the first video content to the second video content during playback is more likely to give the user a sense of discomfort than switching the first sound content to the second sound content during playback.
- The present inventor therefore reduced the discomfort given to the user by performing a reproduction process that stops the sound content at the timing when the video content ends.
- FIG. 2 is a block diagram showing an example of the hardware configuration of the playback device.
- the playback device 100 includes a CPU 101 (Central Processing Unit), a main memory 102, a storage 103, a communication IF (Interface) 104, a display 105, and a speaker 106 as hardware configurations.
- the CPU 101 is a processor that executes a control program stored in the storage 103 or the like.
- the main memory 102 is a volatile storage area used as a work area used when the CPU 101 executes a control program.
- the storage 103 is a non-volatile storage area that holds a control program, content, and the like.
- the communication IF 104 is a communication interface that communicates with the server 200 via the communication network 300.
- the communication IF 104 is, for example, a wired LAN interface.
- the communication IF 104 may be a wireless LAN interface.
- the communication IF 104 is not limited to a LAN interface, and may be any communication interface as long as it can establish a communication connection with the communication network 300.
- the display 105 is a display device that displays a processing result in the CPU 101.
- the display 105 displays, for example, video obtained by playing video content.
- the display 105 is, for example, a liquid crystal display or an organic EL display.
- The speaker 106 outputs the processing result of the CPU 101.
- the speaker 106 outputs, for example, sound or music obtained by playing sound content.
- The hardware configuration of the server 200 is described next.
- FIG. 3 is a block diagram showing an example of the hardware configuration of the server.
- the server 200 includes a CPU 201 (Central Processing Unit), a main memory 202, a storage 203, and a communication IF (Interface) 204 as hardware configurations.
- the CPU 201 is a processor that executes a control program stored in the storage 203 or the like.
- the main memory 202 is a volatile storage area used as a work area used when the CPU 201 executes a control program.
- the storage 203 is a non-volatile storage area that holds a control program, content, and the like.
- the communication IF 204 is a communication interface that communicates with the playback apparatus 100 or the information processing apparatus 400 via the communication network 300.
- the communication IF 204 is, for example, a wired LAN interface.
- the communication IF 204 may be a wireless LAN interface.
- the communication IF 204 is not limited to a LAN interface, and may be any communication interface as long as it can establish a communication connection with the communication network 300.
- FIG. 4 is a block diagram illustrating an example of a hardware configuration of the information processing apparatus.
- The information processing apparatus 400 includes, as its hardware configuration, a CPU 401 (Central Processing Unit), a main memory 402, a storage 403, a communication IF (Interface) 404, and an input IF (Interface) 405.
- the CPU 401 is a processor that executes a control program stored in the storage 403 or the like.
- the main memory 402 is a volatile storage area used as a work area used when the CPU 401 executes a control program.
- the storage 403 is a non-volatile storage area that holds a control program, content, and the like.
- the communication IF 404 is a communication interface that communicates with the server 200 via the communication network 300.
- the communication IF 404 is, for example, a wired LAN interface.
- the communication IF 404 may be a wireless LAN interface.
- the communication IF 404 is not limited to a LAN interface, and may be any communication interface as long as it can establish a communication connection with the communication network 300.
- The input IF 405 is an input device such as a numeric keypad, a keyboard, or a mouse.
- FIG. 5 is a block diagram illustrating an example of a functional configuration of the reproduction system according to the embodiment.
- the playback apparatus 100 includes a communication unit 110 and a playback unit 130.
- the playback device 100 may further include a content DB (Database) 120.
- the communication unit 110 acquires the first content from the server 200 via the communication network 300.
- the first content includes first video content and first sound content that are independent of each other.
- the communication unit 110 acquires the second content from the server 200 via the communication network 300.
- the second content includes second video content and second sound content that are independent of each other.
- the communication unit 110 is realized by the CPU 101, the main memory 102, the storage 103, and the communication IF 104, for example.
- the content DB 120 stores the first content and the second content acquired by the communication unit 110.
- the content DB 120 is realized by the storage 103, for example.
- The first content and the second content stored in the content DB 120 are not limited to content acquired by the communication unit 110 and may be content stored in advance. Content acquired by the communication unit 110 and content stored in advance may also be mixed.
- The content stored in advance is, for example, content stored in the content DB 120 before factory shipment.
- The playback unit 130 is described with reference to FIGS. 6 and 7.
- FIG. 6 is a block diagram showing an example of a specific configuration of the playback unit.
- the reproduction unit 130 reproduces the first content C10 or the second content C20 acquired by the communication unit 110.
- The playback unit 130 may perform streaming playback of the first content C10 or the second content C20 acquired by the communication unit 110, or may read the first content C10 or the second content C20 from the content DB 120 and play it back.
- the playback unit 130 includes a video playback unit 131 and a sound playback unit 132.
- the video playback unit 131 plays back video content. Specifically, the video reproduction unit 131 reproduces video content and displays the video obtained by the reproduction on the display 105.
- the sound reproduction unit 132 reproduces sound content. Specifically, the sound reproduction unit 132 reproduces sound content and causes the speaker 106 to output sound obtained by the reproduction.
- The playback unit 130 plays back the second content C20 after playing back the first content C10, for example.
- The reproduction unit 130 reproduces the first video content C11 and the first sound content C12 in the first period, and reproduces the second video content C21 and the second sound content C22 in the second period after the first period.
- the reproduction unit 130 switches from reproduction of the first video content C11 to reproduction of the second video content C21 and switches from reproduction of the first sound content C12 to reproduction of the second sound content C22 at a specified timing.
- The playback unit 130 may stop the playback of the first sound content C12 and start the playback of the second content C20 at the first timing, which is when the playback of the first video content C11 of the first content C10 ends.
- When the playback time of the first video content C11 is shorter than the playback time of the first sound content C12, the playback unit 130 stops the playback of the first sound content C12 at the first timing even if the playback of the first sound content C12 has not ended.
- The reproduction time is the time required to reproduce the content once from beginning to end at normal (1x) speed. That is, each of the first and second video contents C11 and C21 and the first and second sound contents C12 and C22 is content that has a playback time of finite length.
- Alternatively, each of the first and second sound contents C12 and C22 may be sound content that is reproduced in an infinite loop.
- the sound content to be played in an infinite loop is, for example, content including control information for causing the playback device 100 to play back the sound content from the beginning of the sound content at the timing when one playback ends.
- the sound content that is played in an infinite loop is, for example, content that is configured to be played back by seamlessly connecting the end point and the start point of the sound content.
- Being seamlessly connected and played back means, for example, that the sound at the end of the sound content and the sound at the start of the sound content are similar sounds.
- Similar sounds are sounds that both fall within a predetermined volume range and a predetermined frequency region.
- the reproduction unit 130 may repeat the reproduction of the first sound content C12 when the reproduction of the first video content C11 continues even after the reproduction of the first sound content C12 is completed.
- When the playback time of the first video content C11 is longer than the playback time of the first sound content C12, the playback unit 130 repeatedly plays back the first sound content C12 so that playback of the first sound content C12 continues throughout the first period in which the first video content C11 is played back. Further, the reproduction unit 130 may repeatedly reproduce the first sound content C12 until the reproduction of the first video content C11 is completed.
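The looping and truncation behavior described above can be sketched as follows. This is a minimal illustration, not the disclosed implementation: the function name `sound_segments` and the idea of returning a list of time segments are assumptions made for the example.

```python
from typing import List, Tuple


def sound_segments(video_duration: float, sound_duration: float) -> List[Tuple[float, float]]:
    """Return (start, end) times, on the video timeline, of each pass of the sound content.

    Hypothetical helper: the sound content restarts whenever one pass ends while
    the video content is still playing, and the last pass is cut off at the
    timing when the video content ends (the "first timing").
    """
    if video_duration <= 0 or sound_duration <= 0:
        raise ValueError("durations must be positive")
    segments = []
    start = 0.0
    while start < video_duration:
        # A pass ends either when the sound content ends or when the video ends.
        end = min(start + sound_duration, video_duration)
        segments.append((start, end))
        start = end
    return segments


# Video longer than sound: the sound loops and the third pass is truncated.
print(sound_segments(100.0, 40.0))
# Video shorter than sound: a single pass, stopped when the video ends.
print(sound_segments(30.0, 40.0))
```

With a 100-second video and a 40-second sound, the sound is started three times and the third pass is stopped partway through, which mirrors the situation described for FIG. 7.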
- The reproduction unit 130 may stop the reproduction of the first sound content C12 at the first timing by fading out. Further, when starting reproduction of the second content C20, the reproduction unit 130 may start reproduction of the second sound content C22 by fading in.
- FIG. 7 is a diagram for explaining an example of processing for switching from the first content to the second content.
- The reproduction unit 130 reproduces the first sound content C12 during the first period Δt11 in which the first video content C11 is reproduced.
- The reproduction unit 130 repeatedly reproduces the first sound content C12 during the first period Δt11.
- The first period Δt11 is at least twice as long as the reproduction time Δt21 of the first sound content C12. Therefore, the sound reproduction unit 132 of the reproduction unit 130 starts reproducing the first sound content C12 three times and, at timing t4 when the first period Δt11 ends in the middle of the third reproduction, switches to reproduction of the second sound content C22.
- the video playback unit 131 switches to playback of the next second video content C21 because the playback of the first video content C11 ends at timing t4.
- The sound reproduction unit 132 fades out the first sound content C12 and fades in the second sound content C22 at timing t4. For this purpose, the sound reproduction unit 132 starts to decrease the reproduction volume of the first sound content C12, which is being reproduced at the first volume, at timing t3, which is a fade-out period before timing t4, and reduces the volume to the second volume by timing t4.
- The sound reproduction unit 132 starts reproduction of the second sound content C22 at the third volume at timing t4, and increases the reproduction volume to the fourth volume by timing t5, which is a fade-in period after timing t4.
- the first to fourth sound volumes may be average sound volumes for a predetermined period.
- the first volume and the fourth volume may be the same volume.
- the second volume and the third volume may be the same volume.
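The fade-out and fade-in volumes at timings t3, t4, and t5 can be sketched as below. The text does not specify the shape of the volume change, so the linear ramp is an assumption, as are the function name `ramp` and the example timings and volumes.

```python
def ramp(t: float, t_start: float, t_end: float, v_start: float, v_end: float) -> float:
    """Volume at time t for a fade that runs from t_start to t_end.

    Assumption: the volume changes linearly; before t_start it stays at
    v_start and after t_end it stays at v_end.
    """
    if t_end <= t_start:
        raise ValueError("t_end must be later than t_start")
    if t <= t_start:
        return v_start
    if t >= t_end:
        return v_end
    fraction = (t - t_start) / (t_end - t_start)
    return v_start + (v_end - v_start) * fraction


# Hypothetical timings: fade-out from t3 = 8 s to t4 = 10 s, fade-in from t4 to t5 = 12 s.
t3, t4, t5 = 8.0, 10.0, 12.0
first_volume, second_volume = 1.0, 0.0   # volumes of the first sound content C12
third_volume, fourth_volume = 0.0, 1.0   # volumes of the second sound content C22

print(ramp(9.0, t3, t4, first_volume, second_volume))   # C12 halfway through the fade-out
print(ramp(11.0, t4, t5, third_volume, fourth_volume))  # C22 halfway through the fade-in
```

Here the first and fourth volumes are equal and the second and third volumes are equal, which is one of the options the text allows.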
- When credit information indicating the creator of the content is displayed during a predetermined period before the playback time of the first video content C11 ends, the playback unit 130 may use the period in which the credit information is displayed as the fade-out period. That is, the playback unit 130 may reduce the volume of the first sound content C12 from the first volume to the second volume during the period in which the credit information is displayed.
- the credit information may be included in the content related information.
- the credit information may or may not be included in the content related information of the video content.
- the credit information may or may not be included in the content related information of the sound content.
- the reproduction unit 130 is realized by, for example, the CPU 101, the main memory 102, the storage 103, the display 105, and the speaker 106.
- the server 200 includes a database 210, a comparison unit 220, a generation unit 230, and a communication unit 240.
- the database 210 includes a video content DB (Database) 211 and a sound content DB (Database) 212.
- the video content DB 211 stores a plurality of independent video contents.
- the video content DB 211 stores content related information corresponding to each of the plurality of video contents together with the plurality of video contents.
- the sound content DB 212 stores a plurality of independent sound contents.
- the sound content DB 212 stores content related information corresponding to each of the plurality of sound contents together with the plurality of sound contents.
- the video content DB 211 stores video content acquired from the information processing apparatus 400 via the communication network 300 by the communication unit 240.
- the sound content DB 212 stores sound content acquired from the information processing apparatus 400 via the communication network 300 by the communication unit 240.
- Each of the video content DB 211 and the sound content DB 212 is realized by the storage 203, for example.
- the content related information is, for example, content metadata (that is, attribute information).
- One set of metadata exists for each content and includes information on the reproduction time, the author, the ambient degree (or the video ambient degree or the sound ambient degree), and the content genre. Details of the ambient degree, the video ambient degree, and the sound ambient degree will be described later.
- the playback time is information indicating the length of time when the content is played back.
- the author is information indicating the author of the content, and includes information including the author's name and contact information.
- the ambient degree is an ambient degree associated with the content.
- the video ambient degree is the ambient degree associated with the video part included in the content.
- the sound ambient degree is an ambient degree associated with a sound part included in the content.
- the ambient degree of content and the like can be set by metadata.
- Metadata is created in a predetermined format.
- An index is obtained by analyzing the metadata according to the metadata format.
- The index is associated with the content and is expressed as a continuous value.
- An example of the index is an estimated value that indicates the degree of attention the user directs to the content being played back. More specifically, the index may take a smaller value as the degree of attention the user directs to the content being played back is greater, or it may take a larger value as that degree of attention is greater.
- The former is also referred to as an ambient degree and the latter as a conscious degree.
- The greater the degree of attention the user directs, the more suitable the content is, for example, for continuing to watch the screen on which the video is displayed from the beginning to the end of the playback time of the content and for concentrating on the output sound.
- The index may include brightness, saturation, hue, or the like, which are indices related to the color of the video included in the content being played back, or may include volume, frequency distribution, or the like, which are indices of the sound included in the content being played back. Further, the index may include an index calculated by a predetermined calculation method from a plurality of these indices.
- The ambient degree is, for example, an index expressed as a continuous value from 0 to 100.
- An ambient degree of 0 means that the degree of attention the user is estimated to direct is the largest, and an ambient degree of 100 means that it is the smallest.
- the ambient degree associated with the content can be calculated from the video ambient degree that is the ambient degree associated with the video part of the content and the sound ambient degree that is the ambient degree associated with the sound part of the content.
- the video ambient degree is an example of a video index.
- the sound ambient degree is an example of a sound index.
- the video ambient degree may be calculated based on, for example, the brightness, saturation or hue of the video of the content, or the scene change mode. More specifically, it is calculated as follows.
- the sound ambient degree may be calculated based on, for example, the volume of the sound of the content, the frequency distribution of the sound, or the change in volume. More specifically, it is calculated as follows.
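- Any method can be adopted for calculating the ambient degree of the content from the video ambient degree and the sound ambient degree; for example, an average or a weighted average can be used.
- When a weighted average is used, the weight is in the range from 0 to 1. With the weight of the video ambient degree written as α, the ambient degree of the content is expressed as (Formula 1) below.
- $$\text{(Ambient degree of content)} = \alpha \times \text{(Video ambient degree)} + (1 - \alpha) \times \text{(Sound ambient degree)} \qquad \text{(Formula 1)}$$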
- The weighting of the video ambient degree and the sound ambient degree is determined as follows, for example.
- Depending on the size of the display 105, it is effective to make the weight of the video ambient degree heavier than the weight of the sound ambient degree, that is, to make the weight α of the video ambient degree larger than 0.5.
- The threshold for the size can be, for example, a diagonal length of the display 105 of about 50 inches or 70 inches.
- The weight α may be changed by an input from the operator of the playback system 1, the provider of the content, or the user.
- the operator of the playback system 1 can flexibly change the weight of the video ambient level and the sound ambient level. As a result, there is an advantage that it is possible to specify more flexible content suitable for the user's sense.
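The weighted average of (Formula 1) can be written as a short function. This is only a sketch: the function name `content_ambient_degree` and the parameter name `alpha` (standing for the weight of the video ambient degree) are chosen for this example, and the default of 0.5 simply corresponds to a plain average.

```python
def content_ambient_degree(video_ambient: float, sound_ambient: float, alpha: float = 0.5) -> float:
    """Weighted average of the video and sound ambient degrees (Formula 1).

    alpha is the weight of the video ambient degree and must lie in [0, 1];
    the sound ambient degree receives the remaining weight (1 - alpha).
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in the range from 0 to 1")
    return alpha * video_ambient + (1.0 - alpha) * sound_ambient


# Equal weighting, then a heavier video weight (alpha larger than 0.5).
print(content_ambient_degree(80.0, 40.0))
print(content_ambient_degree(80.0, 40.0, alpha=0.75))
```

Raising `alpha` above 0.5 pulls the result toward the video ambient degree, which is the adjustment the text describes for cases where the video should dominate.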
- the video ambient level and the sound ambient level may be classified into a plurality of ranks according to the magnitude of the ambient level.
- the plurality of ranges of ambient degrees that define the plurality of ranks of the video ambient degree and the plurality of ranges of ambient degrees that define the plurality of ranks of the sound ambient degree do not have to coincide with each other.
- For example, the video ambient degree may be classified as rank A in the range of 0 to 20, whereas the sound ambient degree may be classified as rank A in the range of 0 to 30. That is, the ambient degree range that defines a given rank may be the same or different for the video ambient degree and the sound ambient degree.
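Rank classification with different ranges for video and sound can be sketched as follows. Only the rank A ranges (0 to 20 for video, 0 to 30 for sound) come from the text; the remaining boundaries, the five rank labels, and the treatment of upper bounds as inclusive are assumptions made for the example.

```python
from bisect import bisect_left
from typing import Sequence

RANKS = "ABCDE"  # hypothetical rank labels; only rank A appears in the text

# Inclusive upper bounds of ranks A to D; anything above the last bound is rank E.
VIDEO_RANK_BOUNDS = [20, 40, 60, 80]  # rank A is 0 to 20 (from the text); the rest are assumed
SOUND_RANK_BOUNDS = [30, 50, 70, 90]  # rank A is 0 to 30 (from the text); the rest are assumed


def rank(ambient_degree: float, upper_bounds: Sequence[float]) -> str:
    """Return the rank whose range contains the given ambient degree."""
    if not 0 <= ambient_degree <= 100:
        raise ValueError("ambient degree must be between 0 and 100")
    # bisect_left keeps a value equal to a bound in the lower rank.
    return RANKS[bisect_left(upper_bounds, ambient_degree)]


# The same ambient degree can fall into different ranks for video and sound.
print(rank(25, VIDEO_RANK_BOUNDS), rank(25, SOUND_RANK_BOUNDS))
```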
- the video ambient degree and the sound ambient degree may be normalized so that the minimum value and the maximum value coincide.
- The content can take various forms. For example, it may be content that forms part of the environment and is not often watched closely by the user, like a painting on a wall or part of the wallpaper, floor, or ceiling. Note that the content may also be content that is assumed to be acquired in order to obtain information on news or culture or to obtain entertainment.
- the server 200 may calculate the ambient degree using the above method using at least one of the content stored in the database 210 and the content related information.
- When the ambient degree is calculated in this way, the content related information need not include the ambient degree.
- the comparison unit 220 compares the video attribute information included in each of the plurality of video contents with the sound attribute information included in each of the plurality of sound contents. For example, when the genre of the video content matches the genre of the sound content, the comparison unit 220 determines that they are similar to each other.
- the genre may include the author of the content and the date (or month, year) when the content was created.
- the comparison unit 220 compares the video ambient degree and the sound ambient degree using a predetermined method, and determines whether or not they are similar.
- The comparison unit 220 may calculate the video ambient degree from the metadata included in the video attribute information using the above method, and may calculate the sound ambient degree from the metadata included in the sound attribute information using the above method.
- the comparison unit 220 is realized by, for example, the CPU 201, the main memory 202, and the storage 203.
- the generation unit 230 generates a plurality of contents composed of video content and sound content having attribute information similar to each other according to the comparison result by the comparison unit 220. That is, the generation unit 230 generates a plurality of contents composed of combinations of video content and sound content similar to each other.
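The comparison and generation steps can be sketched as a pairing of video and sound items with similar attribute information. The text says that matching genres count as similar and that ambient degrees are compared by a predetermined method; requiring both a genre match and an ambient degree gap within `max_gap` is an assumption of this sketch, as are all names and sample values.

```python
from dataclasses import dataclass
from itertools import product
from typing import List, Tuple


@dataclass(frozen=True)
class Item:
    """Hypothetical record for one video or sound content and its attribute information."""
    content_id: str
    genre: str
    ambient_degree: float


def is_similar(video: Item, sound: Item, max_gap: float = 10.0) -> bool:
    """Assumed similarity rule: same genre and ambient degrees within max_gap."""
    same_genre = video.genre == sound.genre
    close_ambient = abs(video.ambient_degree - sound.ambient_degree) <= max_gap
    return same_genre and close_ambient


def generate_contents(videos: List[Item], sounds: List[Item], max_gap: float = 10.0) -> List[Tuple[str, str]]:
    """Return (video id, sound id) pairs for every similar combination."""
    return [
        (v.content_id, s.content_id)
        for v, s in product(videos, sounds)
        if is_similar(v, s, max_gap)
    ]


videos = [Item("v1", "nature", 80.0), Item("v2", "city", 30.0)]
sounds = [Item("s1", "nature", 75.0), Item("s2", "nature", 40.0), Item("s3", "city", 35.0)]
print(generate_contents(videos, sounds))
```

Because every similar combination becomes a content, a modest number of independent video and sound items can yield many contents, which is the effect the text attributes to combining independent content.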
- the generation unit 230 is realized by the CPU 201, the main memory 202, and the storage 203, for example.
- the communication unit 240 transmits two or more contents among the plurality of contents generated by the generation unit 230 to the playback device 100 via the communication network 300.
- the communication unit 240 may transmit the content corresponding to the acquisition request to the playback device 100.
- the communication unit 240 is realized by the communication IF 204, for example.
- the information processing apparatus 400 includes a content DB 410, a registration unit 420, an input reception unit 430, and a communication unit 440.
- the content DB 410 stores video content or sound content.
- the video content or the sound content is, for example, content created by a second user such as a content creator. When the creator of the video content and the creator of the sound content are different, there are a plurality of second users.
- the content DB 410 is realized by the storage 403, for example.
- the registration unit 420 registers video content or sound content in the server 200 via the communication unit 440 according to information input by the second user to the input reception unit 430.
- the registration unit 420 registers content-related information such as an ID for identifying the second user, content attribute information, and content playback time in association with the content.
- the registration unit 420 causes the communication unit 440 to transmit content related information and content to the server 200 via the communication network 300.
- the registration unit 420 is realized by, for example, the CPU 401, the main memory 402, and the storage 403.
- the input reception unit 430 receives an input by the second user. Specifically, the input receiving unit 430 receives an input for the second user to register content in the server 200.
- the input receiving unit 430 is realized by the input IF 405, for example.
- FIG. 8 is a sequence diagram showing an example of a reproduction method by the reproduction system according to the embodiment.
- the server 200 transmits the first content C10 to the playback device 100 via the communication network 300 (S11).
- the playback device 100 receives the first content C10 transmitted by the server 200 via the communication network 300 (S21).
- the server 200 transmits the second content C20 to the playback device 100 via the communication network 300 (S12).
- the playback device 100 receives the second content C20 transmitted by the server 200 via the communication network 300 (S22).
- the server 200 may transmit the first content C10 and the second content C20 to the playback device 100 together. Therefore, the playback device 100 may receive the first content C10 and the second content C20 together.
- the playback device 100 plays back the received first content C10 and second content C20 (S23). Details of the reproduction processing by the reproduction apparatus 100 will be described later.
- FIG. 9 is a flowchart showing an example of details of the reproduction processing by the reproduction apparatus according to the embodiment.
- the playback unit 130 plays back the first content C10 (S31).
- The video playback unit 131 of the playback unit 130 acquires the timing at which the playback of the first video content C11 included in the first content C10 ends (S32). For example, the video playback unit 131 acquires the playback time of the first video content C11 from the content-related information included in the first video content C11. The video playback unit 131 then sets, as the timing at which the playback of the first video content C11 ends, the point at which that playback time has elapsed from the start of playback of the first content C10.
- The sound playback unit 132 of the playback unit 130 acquires the timing at which the playback of the first sound content C12 included in the first content C10 ends (S33). For example, the sound playback unit 132 acquires the playback time of the first sound content C12 from the content-related information included in the first sound content C12. The sound playback unit 132 then sets, as the timing at which the playback of the first sound content C12 ends, the point at which that playback time has elapsed from the start of playback of the first content C10.
- the playback unit 130 determines whether or not the playback of the first video content C11 ends before the playback of the first sound content C12 (S34). That is, the playback unit 130 determines whether or not the timing at which the playback of the first video content C11 ends is earlier than the timing at which the playback of the first sound content C12 ends.
- When the playback unit 130 determines that the playback of the first video content C11 ends before the playback of the first sound content C12 (Yes in S34), the playback unit 130 determines whether or not the playback of the first video content C11 has ended (S35).
- When the playback unit 130 determines that the playback of the first video content C11 has ended (Yes in S35), the playback unit 130 stops the playback of the first sound content C12 and starts the playback of the second content C20 (S36). That is, at the timing when the playback of the first video content C11 ends, the playback unit 130 switches from playback of the first video content C11 to playback of the second video content C21, and from playback of the first sound content C12 to playback of the second sound content C22.
- After that, the playback unit 130 may end the playback process, or may perform the same playback process on third content that follows the second content C20.
- When the playback unit 130 determines that the playback of the first video content C11 has not ended (No in S35), the playback unit 130 repeats step S35. That is, the playback unit 130 waits until the playback of the first video content C11 ends.
- When the playback unit 130 determines in step S34 that the playback of the first video content C11 ends after the timing at which the playback of the first sound content C12 ends (No in S34), the playback unit 130 determines whether or not the playback of the first sound content C12 has ended (S37).
- When the playback unit 130 determines that the playback of the first sound content C12 has ended (Yes in S37), the playback unit 130 repeats the playback of the first sound content C12 (S38) and returns to step S34. In this case, in the next step S34, the playback unit 130 determines whether or not the playback of the first video content C11 ends before the playback of the repeated first sound content C12 ends.
- When the playback unit 130 determines that the playback of the first sound content C12 has not ended (No in S37), the playback unit 130 repeats step S37. That is, the playback unit 130 waits until the playback of the first sound content C12 ends.
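The flow of steps S34 to S38 can be illustrated with a small timeline calculation. The sketch below is not the embodiment's implementation: it assumes that both playback times are known in seconds and, instead of polling in real time, computes the result of the loop directly. The function name `plan_first_content` and its return format are hypothetical.

```python
def plan_first_content(video_len, sound_len):
    """Compute when to switch to the second content and how the sound loops.

    Returns (switch_time, sound_passes), where sound_passes lists the
    (start, end) interval of each pass of the first sound content.
    """
    if video_len <= 0 or sound_len <= 0:
        raise ValueError("durations must be positive")
    sound_passes = []
    pass_start = 0.0
    while True:
        pass_end = pass_start + sound_len
        # S34: does the video end before the current pass of the sound ends?
        if video_len < pass_end:
            # S35/S36: wait for the video to end, then stop the sound partway.
            if video_len > pass_start:
                sound_passes.append((pass_start, float(video_len)))
            return float(video_len), sound_passes
        # S37/S38: the sound pass plays to its end and is then repeated.
        sound_passes.append((pass_start, pass_end))
        pass_start = pass_end
```

For example, with a 10-second video and a 4-second sound, the sound plays twice in full and is cut off 2 seconds into its third pass, at which point the second content starts.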
- FIG. 10 is a sequence diagram showing an example of a registration method by the reproduction system according to the embodiment.
- The registration unit 420 of the information processing device 400 selects one content from the plurality of video contents or the plurality of sound contents stored in the content DB 410 according to the input received by the input receiving unit 430 (S41).
- the input receiving unit 430 receives input of content related information of the selected content (S42). As a result, the registration unit 420 associates the selected content with the received content-related information.
- the communication unit 440 transmits the associated content related information together with the selected content to the server 200 via the communication network 300 (S43).
- the communication unit 240 receives the content related information together with the content transmitted by the information processing apparatus 400 (S51).
- the database 210 of the server 200 stores content related information together with the content received by the communication unit 240 (S52).
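The registration sequence of steps S41 to S52 can be pictured with the following sketch. It is a simplified, in-memory illustration only: the network transfer is collapsed into a function call, and `ContentRelatedInfo`, `ServerDatabase`, `register`, and the `content_id` key are hypothetical names introduced here, not part of the embodiment.

```python
from dataclasses import dataclass


@dataclass
class ContentRelatedInfo:
    creator_id: str       # ID identifying the second user
    attributes: dict      # content attribute information
    playback_time: float  # content playback time (seconds assumed)


class ServerDatabase:
    """In-memory stand-in for the database 210 (hypothetical)."""

    def __init__(self):
        self._records = {}

    def store(self, content_id, content, info):
        # S51/S52: keep the content together with its related information.
        self._records[content_id] = (content, info)

    def lookup(self, content_id):
        return self._records[content_id]


def register(db, content_id, content, info):
    """S41 to S43 collapsed into one call: send content plus info to the server."""
    db.store(content_id, content, info)
```

Keeping the playback time in the related information is what later lets the playback device compute the end timings in steps S32 and S33.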
- In the playback method, after the first content C10 composed of the first video content C11 and the first sound content C12 that are independent of each other is played back, the second content C20 composed of the second video content C21 and the second sound content C22 that are independent of each other is played back. Therefore, the first sound content C12 can be switched to the second sound content C22 at the timing of switching the first video content C11 to the second video content C21. As a result, it is possible to reduce the sense of discomfort given to the user when the video content and the sound content are switched to different content.
- each of the first content C10 and the second content C20 is composed of a combination of video content and sound content having similar attribute information. For this reason, the impression given to the user can be a unified impression for the video content and the sound content. For this reason, even when the video content and the sound content independent from each other are combined and reproduced, the uncomfortable feeling given to the user can be effectively reduced.
- In the playback, at the timing when the playback of the first video content C11 ends, the playback is switched from the first video content C11 to the second video content C21 and from the first sound content C12 to the second sound content C22. For this reason, the discomfort given to the user can be effectively reduced.
- When the playback of the first video content C11 continues even after the playback of the first sound content C12 has ended, the playback of the first sound content C12 is repeated. For this reason, the playback of the first sound content C12 can be continued while the first video content C11 is being played back. Therefore, the uncomfortable feeling given to the user can be effectively reduced.
- When the playback of the first sound content C12 has not ended at the timing when the playback of the first video content C11 ends, the playback of the first sound content C12 is stopped at that timing by fading out. For this reason, switching of playback from the first sound content C12 to the second sound content C22 can be realized more naturally. Therefore, it is possible to effectively reduce the uncomfortable feeling given to the user when the video content and the sound content are switched to different content.
- In the playback of the second content C20, the playback of the second sound content C22 is started by fading in. For this reason, switching of playback from the first sound content C12 to the second sound content C22 can be realized more naturally. Therefore, it is possible to effectively reduce the uncomfortable feeling given to the user when the video content and the sound content are switched to different content.
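The fade-out of the first sound content and the fade-in of the second sound content around the switching timing might be modeled as a pair of gain curves. The embodiment does not specify the fade shape or duration, so the linear ramps and the `fade_len` parameter below are assumptions, and `fade_gains` is a hypothetical helper.

```python
def _clamp01(x):
    """Limit a gain value to the range 0.0 to 1.0."""
    return max(0.0, min(1.0, x))


def fade_gains(t, switch_time, fade_len):
    """Return (first_sound_gain, second_sound_gain) at time t.

    The first sound fades out linearly so that it reaches silence exactly at
    switch_time; the second sound fades in linearly starting at switch_time.
    """
    if fade_len <= 0:
        raise ValueError("fade_len must be positive")
    first_gain = _clamp01((switch_time - t) / fade_len)
    second_gain = _clamp01((t - switch_time) / fade_len)
    return first_gain, second_gain
```

With `switch_time=10.0` and `fade_len=2.0`, the first sound is at full volume until 8 seconds, silent at 10 seconds, and the second sound reaches full volume at 12 seconds.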
- In the above embodiment, the server 200 includes the comparison unit 220 and the generation unit 230.
- Alternatively, the playback device may include the comparison unit and the generation unit.
- FIG. 11 is a block diagram illustrating an example of a functional configuration of a reproduction system according to a modification of the embodiment.
- A reproduction system 1A according to the modification includes a server 200A, which has a configuration that does not include the comparison unit 220 and the generation unit 230, and a playback device 100A, which further includes a comparison unit 140 corresponding to the comparison unit 220 and a generation unit 150 corresponding to the generation unit 230.
- the communication unit 240 transmits a plurality of video contents stored in the video content DB 211 and a plurality of sound contents stored in the sound content DB 212 to the playback device 100A via the communication network 300.
- the communication unit 110 receives a plurality of video contents and a plurality of sound contents transmitted by the server 200A via the communication network 300.
- the communication unit 110 stores the received plurality of video contents and the plurality of sound contents in the content DB 120.
- the comparison unit 140 compares the video attribute information included in each of the plurality of video contents with the sound attribute information included in each of the plurality of sound contents.
- the generation unit 150 generates a plurality of contents composed of video content and sound content having attribute information similar to each other according to the comparison result by the comparison unit 140, and stores the generated plurality of contents in the content DB 120.
- the reproducing unit 130 reproduces the second content C20 after reproducing the first content C10 among the plurality of contents stored in the content DB 120. Since the reproduction processing by the reproduction unit 130 is the same as that in the embodiment, description thereof is omitted.
- the reproducing apparatus 100 in the above embodiment may display an image related to the ambient degree together with the contents C10 and C20.
- the image may include at least one of an image indicating the ambient degree of the contents C10 and C20 and an image indicating the range of the ambient degree received by a receiving unit such as a remote controller (not shown).
- By displaying an image relating to the ambient degree together with the contents C10 and C20 on the display 105, the playback device 100 lets the user view the image together with the reproduced contents C10 and C20. By viewing an image indicating the ambient degree, the user can recognize the ambient degree of the contents C10 and C20 that are currently being reproduced. Further, by viewing an image indicating the range of the ambient degree, the user can recognize the range of the ambient degree that the user has designated. Having recognized these, the user can, for example, instruct the playback device 100 through the reception unit to change the designated ambient degree to be higher or lower than the current degree.
- A sound relating to the ambient degree may instead be output by the speaker 106, in which case the same effect as described above can be obtained.
- With the playback device, the user can specify the content to be played back by specifying, as a range, the index associated with the content. At that time, the user need not recall a search key. The user can specify the content to be played back by the playback device simply by specifying a rough value of the index associated with the content as a range. In this way, the playback device enables more flexible content specification. Also, because flexible content specification is possible, the increase in processing load and power consumption that the playback device would incur when it fails to determine content reflecting the user's intention can be avoided.
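Specifying content by an index range rather than by a search key could look like the following sketch. It assumes that each content carries one numeric index and that the user's designation is a closed interval; `select_by_index_range` and the dictionary layout are hypothetical.

```python
def select_by_index_range(indexed_contents, low, high):
    """Return the IDs of contents whose index lies within [low, high].

    indexed_contents maps a content ID to its numeric index (assumption).
    """
    if low > high:
        raise ValueError("low must not exceed high")
    return [
        content_id
        for content_id, index in indexed_contents.items()
        if low <= index <= high
    ]
```

Because the user only supplies a rough range, any content whose index falls inside it is a candidate for playback, and no exact title or keyword is needed.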
- the playback device enables more flexible content specification by using, as a specific index, an estimated index that indicates the degree of attention that the user directs to the content being played back.
- the playback device, server, or information processing device calculates an index associated with the content based on the degree of attention that the user has directed to each of the video and sound included in the content.
- the content index can be calculated in consideration of the video and sound included in the content.
- The playback device, server, or information processing device calculates the index associated with the content as a weighted average of the video index and the sound index in which the sound index is given the larger weight.
- The playback device, server, or information processing device calculates the index associated with the content as a weighted average of the video index and the sound index in which the video index is given the larger weight.
- By relatively increasing the contribution of the degree of attention that a person directs to the video, the index used for specifying the content can be made an index that matches the user's own sense of the degree of attention that the user directs to the content.
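The weighted average described above can be written as a one-line calculation. The sketch below assumes that both indices are plain numbers and that the two weights sum to one; the function name `content_index` and the parameter `video_weight` are hypothetical, and the text does not give concrete weight values.

```python
def content_index(video_index, sound_index, video_weight=0.75):
    """Weighted average of the video index and the sound index.

    video_weight above 0.5 emphasizes the video index; below 0.5 emphasizes
    the sound index. The default of 0.75 is an arbitrary illustrative value.
    """
    if not 0.0 <= video_weight <= 1.0:
        raise ValueError("video_weight must be between 0 and 1")
    return video_weight * video_index + (1.0 - video_weight) * sound_index
```

Passing a `video_weight` greater than 0.5 corresponds to the variant that weights the video index more heavily, and a value less than 0.5 to the variant that weights the sound index more heavily.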
- the playback device, server, or information processing device can calculate the video index by specifically using the brightness, saturation, hue, or scene change mode of the video included in the content.
- The playback device, server, or information processing device can calculate the sound index by specifically using the volume, frequency distribution, or volume change mode of the sound included in the content.
- the playback device, server, or information processing device can cause the user to recognize the index of the content by presenting the index associated with the content along with the content being played back to the user. Then, it is possible to cause the user to make a determination as to whether or not the content that the user wants to present on the playback apparatus is compatible with the index range designated by the user.
- Both the index of the video content and the index of the sound content to be played back can be made to fall within the range specified by the user.
- The user can have the playback device play both video content and sound content that are estimated to attract the same level of attention.
- the playback device can cause the content provider to recognize the index associated with the content by presenting the index when the content is stored in the server in advance.
- the playback device can make the content provider recognize the adjusted content index after adjusting the content.
- The content provider can recognize the index of the adjusted content, confirm the result of the adjustment made to the content that the provider itself supplies, and, based on that result, take action such as deciding whether to store the content in the server.
- each component is realized by executing a software program suitable for each component, but may be configured by dedicated hardware.
- Each component may be realized by a program execution unit such as a CPU or a processor reading and executing a software program recorded on a recording medium such as a hard disk or a semiconductor memory.
- Here, the software that realizes the reproduction method of each of the above embodiments is the following program.
- That is, this program causes a computer to execute a reproduction method of acquiring first content composed of first video content and first sound content that are independent of each other, acquiring second content composed of second video content and second sound content that are independent of each other, and reproducing the acquired first content and then reproducing the acquired second content.
- The playback method, playback system, and playback device according to one or more aspects of the present invention have been described based on the embodiment, but the present invention is not limited to this embodiment. As long as they do not depart from the gist of the present invention, forms obtained by applying various modifications conceivable by those skilled in the art to the embodiment, and forms constructed by combining components of different embodiments, may also be included within the scope of one or more aspects of the present invention.
- In the above embodiment, the playback unit 130 stops the playback of the first sound content C12 and starts the reproduction of the second content C20 at the first timing, at which the playback of the first video content C11 of the first content C10 ends, but the present invention is not limited to this.
- When the ambient degree of the video content is larger than a predetermined value and the ambient degree of the sound content is smaller than the predetermined value, the uncomfortable feeling given to the user is small even if the first video content C11 is switched partway through to the second video content C21 at the timing when the reproduction of the first sound content C12 ends.
- Therefore, the playback unit 130 may determine whether the video ambient degree is larger than a predetermined value (or a predetermined rank) and whether the sound ambient degree is smaller than a predetermined value (or a predetermined rank). When the determination shows that the video ambient degree is larger than the predetermined value (or predetermined rank) and the sound ambient degree is smaller than the predetermined value (or predetermined rank), the playback unit 130 may stop the reproduction of the first video content C11 and start the reproduction of the second content C20 at the timing when the reproduction of the first sound content C12 of the first content C10 ends.
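The alternative switching rule just described can be summarized as a small decision function. This is a sketch under assumptions: it uses a single shared numeric threshold for both ambient degrees (the text also allows ranks and does not fix a value), and `choose_switch_time` with its parameters is hypothetical.

```python
def choose_switch_time(video_ambient, sound_ambient, threshold,
                       video_end, sound_end):
    """Pick the timing at which playback switches to the second content.

    If the video is highly ambient and the sound is not, switch when the
    first sound content ends; otherwise switch when the first video ends,
    as in the main embodiment.
    """
    if video_ambient > threshold and sound_ambient < threshold:
        return sound_end
    return video_end
```

For a background-like video paired with attention-drawing music, this keeps the music intact and cuts the video instead, which the text argues causes little discomfort.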
- In the above embodiment, the sound ambient degree is described as being calculated based on the volume of the sound of the content, the frequency distribution of the sound, or the change of the volume, but the present invention is not limited to this.
- For example, the sound ambient degree may be calculated based on the frequency characteristics of the sound, how closely the sound approximates the so-called "1/f fluctuation" characteristic, the number of overtone components, the regularity of the timbre waveform (in the frequency region of several Hz or less), and the like.
- The sound ambient degree is an index that is still at the research stage compared with the video ambient degree, but it is known that mid-range sound around 200 Hz corresponds to vocals and human speech and is easily heard by humans. Such sound is therefore considered to increase the degree of attention directed by the user, so that the degree of consciousness increases (the ambient degree decreases).
- The human brain tries to understand what differs from nature by unconsciously complementing it, so listening to sounds that differ from those of the natural world is thought to consume brain resources and to increase the degree of consciousness (decrease the ambient degree). Therefore, not only does music composed to attract the user's attention have a high degree of consciousness (low ambient degree), but even sounds that exist in the natural world, such as the murmur of a river, may have a reduced ambient degree depending on the recording environment (such as the performance of the microphone or the recording device).
- This disclosure can be applied to a playback method or the like that can reduce a sense of discomfort given to a user when video content and sound content are switched to different content.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Databases & Information Systems (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
The invention relates to a reproduction method comprising: acquiring (S21) first content (C10) composed of first video content (C11) and first sound content (C12) that are independent of each other; acquiring (S22) second content (C20) composed of second video content (C21) and second sound content (C22) that are independent of each other; and reproducing (S23) the acquired first content (C10), then reproducing the acquired second content (C20).
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201762461432P | 2017-02-21 | 2017-02-21 | |
US62/461432 | 2017-02-21 | ||
JP2017190030A JP2020065099A (ja) | 2017-02-21 | 2017-09-29 | 再生方法、再生システム、および、再生装置 |
JP2017-190030 | 2017-09-29 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2018155351A1 (fr) | 2018-08-30 |
Family
ID=63252596
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2018/005613 WO2018155351A1 (fr) | Reproduction method, reproduction system, and reproduction apparatus | 2017-02-21 | 2018-02-19 |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2018155351A1 (fr) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2005184617A (ja) * | 2003-12-22 | 2005-07-07 | Casio Comput Co Ltd | 動画再生装置、撮像装置及びそのプログラム |
JP2006014084A (ja) * | 2004-06-28 | 2006-01-12 | Hiroshima Univ | 映像編集装置、映像編集プログラム、記録媒体、および映像編集方法 |
JP2011216178A (ja) * | 2010-03-18 | 2011-10-27 | Panasonic Corp | 再生装置、再生システム及びサーバ |
-
2018
- 2018-02-19 WO PCT/JP2018/005613 patent/WO2018155351A1/fr active Application Filing
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 18758192 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 18758192 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: JP |