[go: up one dir, main page]

CN110769286B - A channel-based recommendation method and device, and storage medium - Google Patents

A channel-based recommendation method and device, and storage medium Download PDF

Info

Publication number
CN110769286B
CN110769286B CN201911074399.7A CN201911074399A CN110769286B CN 110769286 B CN110769286 B CN 110769286B CN 201911074399 A CN201911074399 A CN 201911074399A CN 110769286 B CN110769286 B CN 110769286B
Authority
CN
China
Prior art keywords
video
channel
target user
periodic
watching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911074399.7A
Other languages
Chinese (zh)
Other versions
CN110769286A (en
Inventor
倪维健
刘彤
曾庆田
邵文倩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong University of Science and Technology
Original Assignee
Shandong University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong University of Science and Technology filed Critical Shandong University of Science and Technology
Priority to CN201911074399.7A priority Critical patent/CN110769286B/en
Publication of CN110769286A publication Critical patent/CN110769286A/en
Application granted granted Critical
Publication of CN110769286B publication Critical patent/CN110769286B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/258Client or end-user data management, e.g. managing client capabilities, user preferences or demographics, processing of multiple end-users preferences to derive collaborative data
    • H04N21/25866Management of end-user data
    • H04N21/25891Management of end-user data being end-user preferences
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/251Learning process for intelligent management, e.g. learning user preferences for recommending movies
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/466Learning process for intelligent management, e.g. learning user preferences for recommending movies
    • H04N21/4668Learning process for intelligent management, e.g. learning user preferences for recommending movies for recommending content, e.g. movies

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Graphics (AREA)
  • Computing Systems (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本申请实施例公开了一种基于频道的推荐方法及装置、存储介质,所述方法包括:基于获取到的在至少一个预设统计周期内针对频道的播放历史数据,得到针对至少一个频道的周期性播放视频特征模型;基于获取到的在至少一个预设统计周期内针对目标用户的观看历史数据,得到针对目标用户的周期性观看视频特征模型;将针对至少一个频道的周期性播放视频特征模型的视频特征分布数据与所述周期性观看视频特征模型的视频特征分布数据进行匹配,基于匹配结果得到针对目标用户的推荐数据。如此,在充分考虑频道播放视频节目和用户观看视频节目的周期性特征的基础上,实现针对用户的个性化推荐。

Figure 201911074399

The embodiment of the present application discloses a channel-based recommendation method, device, and storage medium. The method includes: obtaining a period for at least one channel based on acquired playback history data for a channel within at least one preset statistical period based on the acquired viewing history data for the target user within at least one preset statistical period, obtain the periodic viewing video feature model for the target user; the periodic playback video feature model for at least one channel Match the video feature distribution data of the periodic viewing video feature model with the video feature distribution data of the periodic viewing video feature model, and obtain recommendation data for the target user based on the matching result. In this way, on the basis of fully considering the periodic characteristics of the channel playing video programs and the user watching the video programs, the personalized recommendation for the user is realized.

Figure 201911074399

Description

一种基于频道的推荐方法及装置、存储介质A channel-based recommendation method and device, and storage medium

技术领域technical field

本发明涉及大数据处理领域,更具体地,涉及一种基于频道的推荐方法及装置、存储介质。The present invention relates to the field of big data processing, and more particularly, to a channel-based recommendation method and device, and a storage medium.

背景技术Background technique

频道推荐指向用户推荐感兴趣的电视频道;但电视频道在不同时段会播放不同的视频节目,节目内容分布更为广泛,因此频道推荐比节目内容推荐具有更大的难度,且针对电视节目的推荐方法也并不适用于针对电视频道的推荐场景。Channel recommendation refers to recommending TV channels that users are interested in; however, TV channels will play different video programs in different time periods, and the program content is more widely distributed, so channel recommendation is more difficult than program content recommendation, and the recommendation for TV programs is more difficult. The method is also not suitable for recommendation scenarios for TV channels.

发明内容SUMMARY OF THE INVENTION

鉴于上述问题,本发明提出了一种基于频道的推荐方法及装置、存储介质,如此,在充分考虑频道播放视频节目和用户观看视频节目的周期性特征的基础上,实现针对用户的个性化推荐。In view of the above problems, the present invention proposes a channel-based recommendation method, device, and storage medium. In this way, on the basis of fully considering the periodic characteristics of channel playing video programs and users watching video programs, personalized recommendation for users is realized. .

本申请实施例的技术方案是这样实现的:The technical solutions of the embodiments of the present application are implemented as follows:

第一方面,本申请实施例提供了一种基于频道的推荐方法,包括:In a first aspect, an embodiment of the present application provides a channel-based recommendation method, including:

基于获取到的在至少一个预设统计周期内针对频道的播放历史数据,得到针对至少一个频道的周期性播放视频特征模型,所述周期性播放视频特征模型能够表征频道周期性规律的视频特征分布数据;Based on the acquired playback history data for a channel within at least one preset statistical period, a periodic playback video feature model for at least one channel is obtained, where the periodic playback video feature model can characterize the periodicity of the channel's regular video feature distribution data;

基于获取到的在至少一个预设统计周期内针对目标用户的观看历史数据,得到针对目标用户的周期性观看视频特征模型,所述周期性观看视频特征模型能够表征目标用户周期性规律的视频特征分布数据;Based on the obtained viewing history data for the target user in at least one preset statistical period, a periodic video viewing feature model for the target user is obtained, and the periodic video viewing feature model can represent the periodic regular video features of the target user distribution data;

将针对至少一个频道的周期性播放视频特征模型的视频特征分布数据与所述周期性观看视频特征模型的视频特征分布数据进行匹配,基于匹配结果得到针对目标用户的推荐数据。Matching the video feature distribution data of the periodically playing video feature model for at least one channel with the video feature distribution data of the periodically viewing video feature model, and obtaining recommendation data for the target user based on the matching result.

在一具体示例中,所述基于匹配结果得到针对目标用户的推荐数据,包括:In a specific example, the obtaining recommendation data for the target user based on the matching result includes:

基于匹配结果得到目标用户在至少一个频道中目标频道上的观看时段推荐数据;和/或,Obtaining viewing period recommendation data of the target user on the target channel in at least one channel based on the matching result; and/or,

基于匹配结果得到目标用户在所选定观看时段的频道推荐数据。Channel recommendation data of the target user in the selected viewing period is obtained based on the matching result.

在一具体示例中,所述方法还包括:In a specific example, the method further includes:

获取在至少一个预设统计周期内频道所播放的视频节目的播放特征,以及视频节目的视频特征,作为针对频道的播放历史数据。The playback characteristics of the video programs played by the channel in at least one preset statistical period and the video characteristics of the video programs are acquired as the playback history data for the channel.

在一具体示例中,得到针对频道的周期性播放视频特征模型的步骤包括:In a specific example, the step of obtaining the periodic playback video feature model for the channel includes:

基于针对频道的播放历史数据,确定出频道在统计周期内所播放的视频节目的视频特征分布数据,以及频道在统计周期内所播放的视频节目的概率分布数据;Based on the playback history data for the channel, determine the video feature distribution data of the video programs played by the channel in the statistical period, and the probability distribution data of the video programs played by the channel in the statistical period;

基于频道在统计周期内所播放的视频节目的视频特征分布数据,以及频道在统计周期内所播放的视频节目的概率分布数据,得到针对频道的周期性播放视频特征模型。Based on the video feature distribution data of the video programs played by the channel in the statistical period, and the probability distribution data of the video programs played by the channel in the statistical period, a periodically played video feature model for the channel is obtained.

在一具体示例中,所述方法还包括:In a specific example, the method further includes:

获取在至少一个预设统计周期内目标用户所观看的视频节目的观看特征以及视频特征,作为针对目标用户的观看历史数据;Acquiring viewing characteristics and video characteristics of video programs watched by the target user in at least one preset statistical period, as viewing history data for the target user;

或者,or,

获取在至少一个预设统计周期内目标用户所观看的视频节目的观看特征以及视频特征,作为针对目标用户的第一子观看历史数据;Acquiring viewing characteristics and video characteristics of video programs watched by the target user in at least one preset statistical period, as the first sub-viewing history data for the target user;

获取确定出的邻域时间内目标用户所观看的视频节目的观看特征以及视频特征,作为针对目标用户的第二子观看历史数据;其中,所述邻域时间为与所述预设统计周期相关联的时间;Obtain the viewing characteristics and video characteristics of the video programs watched by the target user within the determined neighborhood time as the second sub-viewing history data for the target user; wherein the neighborhood time is related to the preset statistical period time of connection;

将所述第一子观看历史数据与第二子观看历史数据作为针对目标用户的观看历史数据。The first sub-viewing history data and the second sub-viewing history data are used as viewing history data for the target user.

在一具体示例中,得到针对目标用户的周期性观看视频特征模型的步骤包括:In a specific example, the step of obtaining the feature model of periodically watching videos for the target user includes:

基于针对目标用户的观看历史数据,确定出目标用户至少在统计周期内所观看的视频节目的视频特征分布数据,以及目标用户至少在统计周期内所观看的视频节目的概率分布数据;Based on the viewing history data for the target user, determine the video feature distribution data of the video programs watched by the target user at least in the statistical period, and the probability distribution data of the video programs watched by the target user at least in the statistical period;

基于目标用户至少在统计周期内所观看的视频节目的视频特征分布数据,以及目标用户至少在统计周期内所观看的视频节目的概率分布数据,得到针对目标用户的周期性观看视频特征模型。Based on the video feature distribution data of the video programs watched by the target user at least in the statistical period, and the probability distribution data of the video programs watched by the target user at least in the statistical period, a periodic viewing video feature model for the target user is obtained.

在一具体示例中,所述将针对至少一个频道的周期性播放视频特征模型的视频特征分布数据与所述周期性观看视频特征模型的视频特征分布数据进行匹配,基于匹配结果得到针对目标用户的推荐数据,包括:In a specific example, the video feature distribution data of the periodic playing video feature model for at least one channel is matched with the video feature distribution data of the periodic viewing video feature model, and based on the matching result, the target user is obtained. Recommended data, including:

确定表征频道周期性规律的视频特征分布数据与表征目标用户周期性规律的视频特征分布数据之间的特征差值;determining the feature difference between the video feature distribution data representing the periodicity of the channel and the video feature distribution data representing the periodicity of the target user;

对特征差值进行排序处理,将排序结果满足预设规则的数据作为推荐数据。The feature difference value is sorted, and the data whose sorting result satisfies the preset rule is used as the recommended data.

第二方面,本申请实施例提供了一种基于频道的推荐装置,包括:In a second aspect, an embodiment of the present application provides a channel-based recommendation device, including:

模型处理单元,用于基于获取到的在至少一个预设统计周期内针对频道的播放历史数据,得到针对至少一个频道的周期性播放视频特征模型,所述周期性播放视频特征模型能够表征频道周期性规律的视频特征分布数据;基于获取到的在至少一个预设统计周期内针对目标用户的观看历史数据,得到针对目标用户的周期性观看视频特征模型,所述周期性观看视频特征模型能够表征目标用户周期性规律的视频特征分布数据;A model processing unit, configured to obtain a periodic play video feature model for at least one channel based on the acquired play history data for the channel within at least one preset statistical period, where the periodic play video feature model can characterize the channel period Regular video feature distribution data; based on the acquired viewing history data for the target user within at least one preset statistical period, obtain a periodic viewing video feature model for the target user, and the periodic viewing video feature model can represent Periodic and regular video feature distribution data of target users;

推荐单元,用于将针对至少一个频道的周期性播放视频特征模型的视频特征分布数据与所述周期性观看视频特征模型的视频特征分布数据进行匹配,基于匹配结果得到针对目标用户的推荐数据。The recommending unit is configured to match the video feature distribution data of the periodically playing video feature model for at least one channel with the video feature distribution data of the periodically viewing video feature model, and obtain recommendation data for the target user based on the matching result.

在一具体示例中,推荐单元,还用于基于匹配结果得到目标用户在至少一个频道中目标频道上的观看时段推荐数据;和/或,基于匹配结果得到目标用户在所选定观看时段的频道推荐数据。In a specific example, the recommending unit is further configured to obtain, based on the matching result, the viewing period recommendation data of the target user on the target channel in at least one channel; and/or, obtain the channel of the target user in the selected viewing period based on the matching result Recommended data.

在一具体示例中,模型处理单元,还用于获取在至少一个预设统计周期内频道所播放的视频节目的播放特征,以及视频节目的视频特征,作为针对频道的播放历史数据。In a specific example, the model processing unit is further configured to acquire playback characteristics of video programs played by the channel in at least one preset statistical period, and video characteristics of the video programs, as playback history data for the channel.

在一具体示例中,模型处理单元,还用于:In a specific example, the model processing unit is also used to:

基于针对频道的播放历史数据,确定出频道在统计周期内所播放的视频节目的视频特征分布数据,以及频道在统计周期内所播放的视频节目的概率分布数据;Based on the playback history data for the channel, determine the video feature distribution data of the video programs played by the channel in the statistical period, and the probability distribution data of the video programs played by the channel in the statistical period;

基于频道在统计周期内所播放的视频节目的视频特征分布数据,以及频道在统计周期内所播放的视频节目的概率分布数据,得到针对频道的周期性播放视频特征模型。Based on the video feature distribution data of the video programs played by the channel in the statistical period, and the probability distribution data of the video programs played by the channel in the statistical period, a periodically played video feature model for the channel is obtained.

在一具体示例中,模型处理单元,还用于:In a specific example, the model processing unit is also used to:

获取在至少一个预设统计周期内目标用户所观看的视频节目的观看特征以及视频特征,作为针对目标用户的观看历史数据;Acquiring viewing characteristics and video characteristics of video programs watched by the target user in at least one preset statistical period, as viewing history data for the target user;

或者,or,

获取在至少一个预设统计周期内目标用户所观看的视频节目的观看特征以及视频特征,作为针对目标用户的第一子观看历史数据;Acquiring viewing characteristics and video characteristics of video programs watched by the target user in at least one preset statistical period, as the first sub-viewing history data for the target user;

获取确定出的邻域时间内目标用户所观看的视频节目的观看特征以及视频特征,作为针对目标用户的第二子观看历史数据;其中,所述邻域时间为与所述预设统计周期相关联的时间;Obtain the viewing characteristics and video characteristics of the video programs watched by the target user within the determined neighborhood time as the second sub-viewing history data for the target user; wherein the neighborhood time is related to the preset statistical period time of connection;

将所述第一子观看历史数据与第二子观看历史数据作为针对目标用户的观看历史数据。The first sub-viewing history data and the second sub-viewing history data are used as viewing history data for the target user.

在一具体示例中,模型处理单元,还用于:In a specific example, the model processing unit is also used to:

基于针对目标用户的观看历史数据,确定出目标用户至少在统计周期内所观看的视频节目的视频特征分布数据,以及目标用户至少在统计周期内所观看的视频节目的概率分布数据;Based on the viewing history data for the target user, determine the video feature distribution data of the video programs watched by the target user at least in the statistical period, and the probability distribution data of the video programs watched by the target user at least in the statistical period;

基于目标用户至少在统计周期内所观看的视频节目的视频特征分布数据,以及目标用户至少在统计周期内所观看的视频节目的概率分布数据,得到针对目标用户的周期性观看视频特征模型。Based on the video feature distribution data of the video programs watched by the target user at least in the statistical period, and the probability distribution data of the video programs watched by the target user at least in the statistical period, a periodic viewing video feature model for the target user is obtained.

在一具体示例中,推荐单元,还用于确定表征频道周期性规律的视频特征分布数据与表征目标用户周期性规律的视频特征分布数据之间的特征差值;对特征差值进行排序处理,将排序结果满足预设规则的数据作为推荐数据。In a specific example, the recommending unit is further configured to determine the feature difference between the video feature distribution data representing the periodicity of the channel and the video feature distribution data representing the periodicity of the target user; perform sorting processing on the feature difference, The data whose sorting result satisfies the preset rules is used as the recommended data.

第三方面,本申请实施例还提供了一种基于频道的推荐装置,包括:In a third aspect, an embodiment of the present application further provides a channel-based recommendation device, including:

一个或多个处理器;one or more processors;

与所述一个或多个处理器通信连接的存储器;a memory communicatively coupled to the one or more processors;

一个或多个应用程序,其中所述一个或多个应用程序被存储在所述存储器中并被配置为由所述一个或多个处理器执行,所述一个或多个程序被配置为执行以上所述的方法。one or more application programs, wherein the one or more application programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs are configured to perform the above the method described.

第四方面,本申请实施例提供了一种计算机可读存储介质,其存储有计算机程序,该程序被处理器执行时实现以上所述的方法。In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium, which stores a computer program, and when the program is executed by a processor, implements the above-mentioned method.

本申请实施例充分考虑了频道播放的视频节目和用户观看的视频节目的周期性规律,并基于周期性规律进行个性化推荐,因此,本申请实施例更加符合频道播放视频节目的规律和用户观看视频节目的特点,且在实现个性化推荐的基础上,提升了用户体验。The embodiment of the present application fully considers the periodicity of the video programs played by the channel and the video programs watched by the user, and makes personalized recommendations based on the periodicity. Therefore, the embodiment of the present application is more in line with the law of the video program played by the channel and the viewing by the user. The characteristics of video programs, and on the basis of realizing personalized recommendation, the user experience is improved.

附图说明Description of drawings

为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions in the embodiments of the present application more clearly, the following briefly introduces the accompanying drawings used in the description of the embodiments. Obviously, the accompanying drawings in the following description are only some embodiments of the present invention. For those skilled in the art, other drawings can also be obtained from these drawings without creative effort.

图1示出了根据本申请实施例的基于频道的推荐方法流程示意图;FIG. 1 shows a schematic flowchart of a channel-based recommendation method according to an embodiment of the present application;

图2示出了根据本申请实施例在一具体应用场景中的预设统计周期的示意图;2 shows a schematic diagram of a preset statistical period in a specific application scenario according to an embodiment of the present application;

图3示出了根据本申请实施例的基于频道的推荐装置的逻辑单元结构示意图。FIG. 3 shows a schematic structural diagram of a logical unit of a channel-based recommendation apparatus according to an embodiment of the present application.

图4示出了根据本申请实施例的基于频道的推荐装置的硬件结构示意图。FIG. 4 shows a schematic diagram of a hardware structure of a channel-based recommendation apparatus according to an embodiment of the present application.

具体实施方式Detailed ways

为了使本技术领域的人员更好地理解本发明方案,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述。In order to enable those skilled in the art to better understand the solutions of the present invention, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application.

在本发明的说明书和权利要求书及上述附图中的描述的一些流程中,包含了按照特定顺序出现的多个操作,但是应该清楚了解,这些操作可以不按照其在本文中出现的顺序来执行或并行执行,操作的序号如101、102等,仅仅是用于区分开各个不同的操作,序号本身不代表任何的执行顺序。另外,这些流程可以包括更多或更少的操作,并且这些操作可以按顺序执行或并行执行。需要说明的是,本文中的“第一”、“第二”等描述,是用于区分不同的消息、设备、模块等,不代表先后顺序,也不限定“第一”和“第二”是不同的类型。In some of the processes described in the description and claims of the present invention and the above-mentioned drawings, various operations are included in a specific order, but it should be clearly understood that these operations may not be in accordance with the order in which they appear herein. For execution or parallel execution, the sequence numbers of the operations, such as 101, 102, etc., are only used to distinguish different operations, and the sequence numbers themselves do not represent any execution order. Additionally, these flows may include more or fewer operations, and these operations may be performed sequentially or in parallel. It should be noted that the descriptions such as "first" and "second" in this document are used to distinguish different messages, devices, modules, etc., and do not represent a sequence, nor do they limit "first" and "second" are different types.

下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without creative efforts shall fall within the protection scope of the present invention.

电视频道的视频节目具有很强的周期性特征,主要体现在两个方面,一方面,电视频道在编排节目时具有周期性,比如每周一至周四的晚间时段会安排播放电视剧,中午时段会安排播放新闻节目等;另一方面,用户的观看兴趣具有周期性演进的特点,比如在工作日主要以观看新闻节目为主,在周末则更倾向于观看娱乐性更强的节目。因此,基于上述周期性特征,本申请实施例提供了一种基于频道的推荐方法及装置、存储介质,对频道播放的视频节目和用户观看的视频节目在周期内的演进特征进行深入挖掘,进而将挖掘结果(如周期性的视频特征)作为电视频道推荐的依据,如此,完成针对用户的频道推荐,满足不用用户的观看需求,并为提升用户体验奠定了基础。The video programs of TV channels have strong periodic characteristics, which are mainly reflected in two aspects. On the one hand, TV channels are cyclical in programming programs. Arrange to broadcast news programs; on the other hand, users' viewing interests are characterized by periodic evolution. For example, they mainly watch news programs on weekdays, and tend to watch more entertaining programs on weekends. Therefore, based on the above periodic characteristics, the embodiments of the present application provide a channel-based recommendation method and device, and a storage medium, which deeply digs the evolution characteristics of the video programs played on the channel and the video programs watched by the user during the period, and then The mining results (such as periodic video features) are used as the basis for TV channel recommendation. In this way, channel recommendation for users is completed, which satisfies the viewing needs of different users and lays a foundation for improving user experience.

具体地,图1示出了根据本申请实施例的基于频道的推荐方法流程示意图;如图1所示,所述方法包括:Specifically, FIG. 1 shows a schematic flowchart of a channel-based recommendation method according to an embodiment of the present application; as shown in FIG. 1 , the method includes:

步骤101:基于获取到的在至少一个预设统计周期内针对频道的播放历史数据,得到针对至少一个频道的周期性播放视频特征模型,所述周期性播放视频特征模型能够表征频道周期性规律的视频特征分布数据。Step 101: Based on the acquired playback history data for channels within at least one preset statistical period, obtain a periodic playback video feature model for at least one channel, where the periodic playback video feature model can characterize the periodicity of the channel. Video feature distribution data.

实际应用中,本申请实施例所述的预设统计周期可以根据实际场景而设定,比如,预设统计周期为一个月,此时,获取的播放历史数据可以具体为最近1个月,最近两个月,或者最近多个月的播放历史数据,如此,便于基于获取到的播放历史数据来挖掘出周期性规律。In practical applications, the preset statistical period described in the embodiments of the present application may be set according to actual scenarios. For example, the preset statistical period is one month. Two months, or the playback history data of the last several months, in this way, it is convenient to dig out the periodicity rule based on the obtained playback history data.

这里,所述预设统计周期还可以包括多个时间维度,比如,包括三个时间维度,分别为月、周、日,如此,即便只统计一个预设统计周期内的数据,也能挖掘出频道播放视频节目的周期性规律,为基于周期性规律有针对性的进行频道推荐奠定了基础。Here, the preset statistical period may also include multiple time dimensions, for example, three time dimensions, namely month, week, and day, so that even if only the data in one preset statistical period is counted, the The periodicity of video programs played by the channel lays the foundation for targeted channel recommendation based on the periodicity.

需要说明的是,本申请实施例中,周期性播放视频特征模型为与频道对应的模型,换言之,当预设统计周期确定后,每个频道对应一个周期性播放视频特征模型。因此,实际场景中,为提高频道推荐的准确性,需要确定出多个不同频道的周期性播放视频特征模型。It should be noted that, in the embodiment of the present application, the periodic playing video feature model is a model corresponding to a channel. In other words, after the preset statistical period is determined, each channel corresponds to a periodic playing video feature model. Therefore, in an actual scenario, in order to improve the accuracy of channel recommendation, it is necessary to determine the periodic play video feature models of multiple different channels.

在一具体示例中,针对频道的播放历史数据至少包括:在至少一个预设统计周期内频道所播放的视频节目的播放特征以及视频特征;即步骤101之前,本申请实施例所述方法还包括:获取在至少一个预设统计周期内频道所播放的视频节目的播放特征,以及视频节目的视频特征,作为针对频道的播放历史数据。In a specific example, the playback history data for the channel at least includes: playback characteristics and video characteristics of the video programs played by the channel in at least one preset statistical period; that is, before step 101, the method described in this embodiment of the present application further includes: : Acquire the playback characteristics of the video programs played by the channel in at least one preset statistical period, and the video characteristics of the video programs, as the playback history data for the channel.

在一具体示例中,得到针对频道的周期性播放视频特征模型的步骤具体包括:基于针对频道的播放历史数据,确定出频道在统计周期内所播放的视频节目的视频特征分布数据,以及频道在统计周期内所播放的视频节目的概率分布数据;基于频道在统计周期内所播放的视频节目的视频特征分布数据,以及频道在统计周期内所播放的视频节目的概率分布数据,得到针对频道的周期性播放视频特征模型。实际应用中,存在多个频道时,每个频道的周期性播放视频特征模型均可以基于该示例根据每个频道各自的播放历史数据确定出周期性播放视频特征模型。In a specific example, the step of obtaining the periodic playback video feature model for the channel specifically includes: based on the playback history data for the channel, determining the video feature distribution data of the video programs played by the channel in the statistical period, and The probability distribution data of the video programs played in the statistical period; based on the video feature distribution data of the video programs played by the channel in the statistical period, and the probability distribution data of the video programs played by the channel in the statistical period, get the channel-specific data. Play the video feature model periodically. In practical applications, when there are multiple channels, the periodically playing video feature model of each channel can be based on this example to determine the periodically playing video feature model according to the respective playing history data of each channel.

实际场景中,本申请实施例所述的视频特征可以具体包括主题标签,比如,视频节目为新闻联播时,主题标签则可为“新闻”、“时政”,此时,视频特征具体包括:“新闻”、“时政”,如此,利用主题标签即可得到视频节目所呈现的内容特征。当然,实际场景中,视频节目可对应一个或至少两个主题标签,不同视频节目的主题标签可以相同,也可以不相同。In actual scenarios, the video features described in the embodiments of the present application may specifically include topic tags. For example, when the video program is a news broadcast, the topic tags may be "news" and "current affairs". At this time, the video features specifically include: " News", "Current Affairs", in this way, the content characteristics presented by the video program can be obtained by using the subject tag. Of course, in an actual scenario, a video program may correspond to one or at least two topic tags, and the topic tags of different video programs may be the same or different.

步骤102:基于获取到的在至少一个预设统计周期内针对目标用户的观看历史数据,得到针对目标用户的周期性观看视频特征模型,所述周期性观看视频特征模型能够表征目标用户周期性规律的视频特征分布数据。Step 102: Based on the acquired viewing history data for the target user within at least one preset statistical period, obtain a periodic video viewing feature model for the target user, where the periodic video viewing feature model can represent the periodicity of the target user video feature distribution data.

在一具体示例中,针对目标用户的观看历史数据包括:在至少一个预设统计周期内目标用户所观看的视频节目的观看特征以及视频特征,即步骤102之前,本申请实施例所述方法还包括获取在至少一个预设统计周期内目标用户所观看的视频节目的观看特征以及视频特征,作为针对目标用户的观看历史数据。In a specific example, the viewing history data for the target user includes: viewing characteristics and video characteristics of the video programs watched by the target user in at least one preset statistical period, that is, before step 102, the method described in this embodiment of the present application further: The method includes acquiring the viewing characteristics and video characteristics of the video programs watched by the target user in at least one preset statistical period, as viewing history data for the target user.

在另一具体示例中,考虑到目标用户观看行为可能会具有很强的稀疏性,导致无法得到针对目标用户的周期性观看视频特征模型,本示例中采用概率平滑技术,基本思想是利用目标用户在预设统计周期的邻域时间内的观看记录来补充目标用户在该预设统计周期的观看记录,如此,避免获取到的观看记录存在稀疏性问题而导致无法确定出周期性观看视频特征模型;基于此,针对目标用户的观看历史数据包括:在至少一个预设统计周期内目标用户所观看的视频节目的观看特征以及视频特征,以及邻域时间内目标用户所观看的视频节目的观看特征以及视频特征。即步骤102之前,本申请实施例所述方法还包括:In another specific example, considering that the viewing behavior of the target user may have strong sparseness, it is impossible to obtain a periodic video viewing feature model for the target user. In this example, the probability smoothing technology is used. The basic idea is to use the target user. The viewing records in the neighborhood time of the preset statistical period are used to supplement the viewing records of the target users in the preset statistical period. In this way, it is avoided that there is a sparsity problem in the acquired viewing records, which makes it impossible to determine the periodic viewing video feature model. Based on this, the viewing history data for the target user includes: the viewing characteristics and video characteristics of the video programs watched by the target user in at least one preset statistical period, and the viewing characteristics of the video programs watched by the target user within the neighborhood time. and video features. That is, before step 102, the method described in this embodiment of the present application further includes:

获取在至少一个预设统计周期内目标用户所观看的视频节目的观看特征以及视频特征,作为针对目标用户的第一子观看历史数据;获取确定出的邻域时间内目标用户所观看的视频节目的观看特征以及视频特征,作为针对目标用户的第二子观看历史数据;其中,所述邻域时间为与所述预设统计周期相关联的时间;将所述第一子观看历史数据与第二子观看历史数据作为针对目标用户的观看历史数据。Obtain the viewing characteristics and video characteristics of the video programs watched by the target user in at least one preset statistical period, as the first sub-viewing historical data for the target user; obtain the video programs watched by the target user within the determined neighborhood time The viewing characteristics and video characteristics of the target user are used as the second sub-viewing historical data for the target user; wherein, the neighborhood time is the time associated with the preset statistical period; the first sub-viewing historical data and the first sub-viewing historical data are Erzi viewing history data is used as viewing history data for target users.

这里,需要说明的是,邻域时间可以为一特定时间段;该特定时间段是基于预设统计周期而确定出的,比如,当预设统计周期对应第一时间维度,该第一时间维度包括一个或至少两个第一维度,此时,该邻域时间对应的第二时间维度小于第一时间维度,且第二时间维度所包括的第二维度为从第一时间维度包含的第一维度中选出的;如此,利用邻域时间即可丰富观看记录的数量,进而为避免出现数据稀疏性问题奠定了基础。Here, it should be noted that the neighborhood time may be a specific time period; the specific time period is determined based on the preset statistical period, for example, when the preset statistical period corresponds to the first time dimension, the first time dimension One or at least two first dimensions are included. In this case, the second time dimension corresponding to the neighborhood time is smaller than the first time dimension, and the second dimension included in the second time dimension is the first time dimension included in the first time dimension. In this way, the number of viewing records can be enriched by using the neighborhood time, thereby laying a foundation for avoiding the problem of data sparsity.

或者,邻域时间为除预设统计周期外的所有时段,此时,获取到的第二子观看历史数据即为目标用户在除预设统计周期外所观看的所有视频节目的观看特征及视频特征。Or, the neighborhood time is all time periods except the preset statistical period, and at this time, the acquired second sub-viewing history data is the viewing characteristics and videos of all video programs watched by the target user except for the preset statistical period. feature.

举例来说,预设统计周期对应的第一时间维度为3,即包括月、周、日三个第一维度,此时,邻域时间对应的第二时间维度可以为2,1或者0少于第一时间维度;进一步地,若第二时间维度为2,则可具体包括月、周两个第二维度,或者,月、日两个第二维度,或者周、日两个第二维度;若第二时间维度为1,则第二维度为月或周或日。或者,第二时间维度为0,此时,邻域时间即为除预设统计周期之外的所有时段,相应地,基于邻域时间统计得到的第二子观看历史数据即为目标用户在除预设统计周期外所观看的所有视频节目的观看特征及视频特征。For example, the first time dimension corresponding to the preset statistical period is 3, that is, it includes three first dimensions of month, week, and day. In this case, the second time dimension corresponding to the neighborhood time may be 2, 1, or less than 0. In the first time dimension; further, if the second time dimension is 2, it may specifically include two second dimensions of month and week, or two second dimensions of month and day, or two second dimensions of week and day ; If the second time dimension is 1, the second dimension is month or week or day. Or, the second time dimension is 0. At this time, the neighborhood time is all time periods except the preset statistical period. Correspondingly, the second sub-viewing historical data obtained based on the neighborhood time statistics is the target user Viewing characteristics and video characteristics of all video programs watched outside the preset statistical period.

需要说明的是,以上所述的月、周、日可以对应为自然月,自然周及自然日。It should be noted that the above-mentioned month, week, and day may correspond to a natural month, a natural week, and a natural day.

在一具体示例中,得到针对目标用户的周期性观看视频特征模型的步骤包括:基于针对目标用户的观看历史数据,确定出目标用户至少在统计周期内所观看的视频节目的视频特征分布数据,以及目标用户至少在统计周期内所观看的视频节目的概率分布数据;基于目标用户至少在统计周期内所观看的视频节目的视频特征分布数据,以及目标用户至少在统计周期内所观看的视频节目的概率分布数据,得到针对目标用户的周期性观看视频特征模型。In a specific example, the step of obtaining the periodic viewing video feature model for the target user includes: based on the viewing history data for the target user, determining the video feature distribution data of the video programs watched by the target user at least in the statistical period, And the probability distribution data of the video programs watched by the target user at least in the statistical period; based on the video feature distribution data of the video programs watched by the target user at least in the statistical period, and the video programs watched by the target user at least in the statistical period The probability distribution data of , obtain the periodic watching video feature model for target users.

在另一具体示例中,当观看历史数据包括第一子观看历史数据和第二子观看历史数据,此时,得到针对目标用户的周期性观看视频特征模型的步骤包括:基于针对目标用户的观看历史数据,确定出目标用户在统计周期内以及邻域时间内所观看的视频节目的视频特征分布数据,以及目标用户在统计周期内以及邻域时间内所观看的视频节目的概率分布数据;基于目标用户在统计周期内以及邻域时间内所观看的视频节目的视频特征分布数据,以及目标用户在统计周期内以及邻域时间内所观看的视频节目的概率分布数据,得到针对目标用户的周期性观看视频特征模型。In another specific example, when the viewing history data includes the first sub-viewing history data and the second sub-viewing history data, at this time, the step of obtaining the periodic viewing video feature model for the target user includes: based on the viewing of the target user Historical data, determine the video feature distribution data of the video programs watched by the target users in the statistical period and the neighborhood time, and the probability distribution data of the video programs watched by the target users in the statistical period and the neighborhood time; based on The video feature distribution data of the video programs watched by the target users in the statistical period and the neighborhood time, as well as the probability distribution data of the video programs watched by the target users in the statistical period and the neighborhood time, to obtain the period for the target user Sexual viewing video feature model.

步骤103:将针对至少一个频道的周期性播放视频特征模型的视频特征分布数据与所述周期性观看视频特征模型的视频特征分布数据进行匹配,基于匹配结果得到针对目标用户的推荐数据。Step 103 : Match the video feature distribution data of the periodically playing video feature model for at least one channel with the video feature distribution data of the periodically viewing video feature model, and obtain recommendation data for the target user based on the matching result.

在一具体示例中,所述基于匹配结果得到针对目标用户的推荐数据,包括两种类型,分别为:In a specific example, the recommendation data obtained for the target user based on the matching result includes two types, respectively:

类型一:基于匹配结果得到目标用户在至少一个频道中目标频道上的观看时段推荐数据;Type 1: Based on the matching result, the target user's viewing period recommendation data on the target channel in at least one channel is obtained;

类型二:基于匹配结果得到目标用户在所选定观看时段的频道推荐数据。Type 2: Obtain the channel recommendation data of the target user during the selected viewing period based on the matching result.

也就是说,针对目标用户的频道推荐包含两种形式,即在目标频道上的观看时间推荐,或者在特定观看时间的频道推荐,如此,丰富了应用场景。实际应用中,可以基于实际场景择一则执行。That is to say, the channel recommendation for the target user includes two forms, that is, the viewing time recommendation on the target channel, or the channel recommendation at a specific viewing time, thus enriching the application scenarios. In practical applications, one can be selected and executed based on the actual scenario.

在另一具体示例中,可以采用如下匹配方式确定出推荐数据,具体地,确定表征频道周期性规律的视频特征分布数据与表征目标用户周期性规律的视频特征分布数据之间的特征差值;对特征差值进行排序处理,将排序结果满足预设规则的数据作为推荐数据。比如,确定两个视频特征分布数据之间的相对熵,然后基于相对熵来衡量模型之间的匹配程度,进而基于匹配程度得到推荐数据。In another specific example, the following matching methods may be used to determine the recommended data, specifically, determining the feature difference between the video feature distribution data representing the periodicity of the channel and the video feature distribution data representing the periodicity of the target user; The feature difference value is sorted, and the data whose sorting result satisfies the preset rule is used as the recommended data. For example, determine the relative entropy between two video feature distribution data, and then measure the matching degree between the models based on the relative entropy, and then obtain recommendation data based on the matching degree.

本申请实施例充分考虑了频道播放的视频节目和用户观看的视频节目的周期性规律,并基于周期性规律进行个性化推荐,因此,本申请实施例基于频道的推荐方法更加符合频道播放视频节目的规律和用户观看视频节目的特点,在实现个性化推荐的基础上,提升了用户体验。The embodiment of the present application fully considers the periodicity of the video programs played by the channel and the video programs watched by the user, and performs personalized recommendation based on the periodicity. Therefore, the channel-based recommendation method in the embodiment of the present application is more in line with the video program played by the channel. On the basis of personalized recommendation, the user experience is improved.

以下结合具体应用场景对本申请实施例做进一步详细说明;具体地,The embodiments of the present application will be described in further detail below in conjunction with specific application scenarios; specifically,

本申请实施例使用多个维度来表征预设统计周期,如此,利用预设统计周期统计得到的历史数据(播放历史数据和观看历史数据)即可得到体现电视频道播放的视频节目编排的周期性和用户观看的周期性,举例来说,图2示出了使用周维度和日维度来表征预设统计周期的示意图,如图2所示,周维度划分为周一至周日,日维度按时段划分为上午、中午、下午、傍晚、黄金以及深夜等,进而便可将任意时刻表示为二维矩阵中的元素,比如,目标用户在2019年10月12日19:14观看了特定频道,则目标用户的观看行为的周期性特征表示为(周六,傍晚),同理,可以将预设统计周期内获取到的观看历史数据或播放历史数据均通过周期性特征结构表示方法进行表示。In the embodiment of the present application, multiple dimensions are used to represent the preset statistical period. In this way, the historical data (playing historical data and viewing historical data) obtained by statistics in the preset statistical period can be used to obtain the periodicity of the arrangement of video programs played by TV channels. and the periodicity of user viewing. For example, Figure 2 shows a schematic diagram of using the weekly dimension and the daily dimension to represent the preset statistical period. As shown in Figure 2, the weekly dimension is divided into Monday to Sunday, and the daily dimension is divided by time period. It is divided into morning, noon, afternoon, evening, gold, and late night, etc., and then any moment can be represented as an element in a two-dimensional matrix. For example, if the target user watched a specific channel at 19:14 on October 12, 2019, then The periodic feature of the target user's viewing behavior is represented as (Saturday, evening). Similarly, the viewing history data or playback history data obtained within the preset statistical period can be represented by the periodic feature structure representation method.

需要说明的是,通过上述说明即可看出,周期性特征结构是基于预设统计周期的维度特征确定出的。进一步地,图2仅为一示例,实际应用中还可以使用更多的维度(比如月、周、日等)来表示预设统计周期,以及表示周期性特征结构,而且,对每个维度也可以有更多灵活的划分方式,比如周维度可以划分为工作日、周末,日维度可以按照小时划分为24个时段等,本申请实施例对此不作限制。It should be noted that, it can be seen from the above description that the periodic feature structure is determined based on the dimensional features of the preset statistical period. Further, FIG. 2 is only an example, and in practical applications, more dimensions (such as month, week, day, etc.) can be used to represent the preset statistical period and the periodic feature structure. There may be more flexible division methods, for example, the weekly dimension may be divided into weekdays and weekends, and the daily dimension may be divided into 24 time periods according to hours, etc., which is not limited in this embodiment of the present application.

基于上述周期性特征结构表示方法,对本申请实施例方法做进一步说明,首先,需要说明的是,本示例利用主题标签来表征视频特征;进一步地,本示例以下所述的Pr(·|·)表示条件概率,Pr(·)表示概率,c表示频道(比如数字电视频道),l表示视频节目的视频特征(如主题标签),p表示视频节目,u表示用户,t表示使用前述周期性特征结构所表示的时间。具体地,该方法包括:Based on the above-mentioned periodic feature structure representation method, the method of the embodiment of the present application will be further described. First of all, it should be noted that this example uses topic tags to represent video features; further, the following Pr(·|·) represents the conditional probability, Pr( ) represents the probability, c represents the channel (such as a digital TV channel), l represents the video feature of the video program (such as a hashtag), p represents the video program, u represents the user, and t represents the use of the aforementioned periodic features The time represented by the structure. Specifically, the method includes:

步骤一:构建针对频道的周期性播放视频特征模型。Step 1: Build a periodic video feature model for the channel.

这里,需要说明的是,实际应用中每个频道对应一个周期性播放视频特征模型,每个周期性播放视频特征模型均为可以采用下述方式实现。Here, it should be noted that, in practical applications, each channel corresponds to a periodic playing video feature model, and each periodic playing video feature model can be implemented in the following manner.

首先,选中电视频道,收集一定时间段(比如,一个或至少两个预设统计周期)内该电视频道所播放的视频节目的播放历史数据。这里,当预设统计周期采用上述周期性特征结构表示时,收集的时间段则为表征预设统计周期的最大粒度维度的整数倍,也就是说,统计时长是预设统计周期的最大粒度维度的整数倍,举例来说,采用“月、周、日”三个维度来表征预设统计周期,此时,数据收集的时长(也即预设统计周期的统计时长)要求为最大粒度“月”的整数倍,比如N个月,这里,N为正整数。First, a TV channel is selected, and the playing history data of the video programs played by the TV channel within a certain period of time (for example, one or at least two preset statistical periods) are collected. Here, when the preset statistical period is represented by the above periodic feature structure, the collected time period is an integer multiple of the maximum granularity dimension representing the preset statistical period, that is, the statistical duration is the maximum granularity dimension of the preset statistical period For example, three dimensions of "month, week, and day" are used to represent the preset statistical period. At this time, the duration of data collection (that is, the statistical duration of the preset statistical period) is required to be the maximum granularity "monthly" ”, such as N months, where N is a positive integer.

这里,所述播放历史数据中记录了该电视频道所播放的每个视频节目的起止时间,以及每个视频节目的主题标签。比如,央视1套播放新闻联播的时间是2019年10月12日19:00至19:30,新闻联播的主题标签是“新闻”、“时政”等。实际应用中,每个视频节目至少对应一个主题标签。当然,为精确表示视频节目的特征,还可以使用多个主题标签对视频节目进行打标,即一个视频节目还可对应多个主题标签。Here, the play history data records the start and end time of each video program played by the TV channel, and the subject tag of each video program. For example, CCTV 1 broadcasts the news broadcast from 19:00 to 19:30 on October 12, 2019, and the topic tags of the news broadcast are "news", "current affairs", etc. In practical applications, each video program corresponds to at least one topic tag. Of course, in order to accurately represent the characteristics of the video program, multiple topic tags may also be used to mark the video program, that is, one video program may also correspond to multiple topic tags.

其次,基于播放历史数据,使用频道主题标签的概率分布特征,构建得到针对频道的周期性播放视频特征模型,该模型表征如下:Secondly, based on the playback history data, using the probability distribution feature of the channel's hashtags, a periodic playback video feature model for the channel is constructed, and the model is characterized as follows:

Figure BDA0002261950340000121
Figure BDA0002261950340000121

这里,式(1)是对Pr(l|c;t)进行贝叶斯变换所得到的;式(2)是使用频道c在时间t上播放的电视频目集合Prog(c,t)来表征频道,并使用概率加法公式进行展开后得到的;式(3)是对式(2)中的Pr(p|l)进行贝叶斯变换所得到的;式(4)是对式(3)中分子和分母中的相同项目进行约减,并增加归一化因子Zc后得到的,这里,增加归一化因子Zc后能够使最终计算得到的值符合概率分布的要求,Zc的具体计算方式为:Zc=∑l′∈Lp∈Prog(c,t)Pr(l|p)·Pr(p),其中L为主题标签全集。Here, Equation (1) is obtained by Bayesian transformation of Pr(l|c; t); Equation (2) is obtained by using the TV program set Prog(c, t) broadcast by channel c at time t It is obtained by characterizing the channel and expanding it using the probability addition formula; Equation (3) is obtained by Bayesian transformation of Pr(p|l) in Equation (2); Equation (4) is obtained from Equation (3) ) is obtained by reducing the same items in the numerator and denominator, and adding the normalization factor Z c . Here, after increasing the normalization factor Z c , the final calculated value can meet the requirements of the probability distribution, Z c The specific calculation method is: Z c =∑ l′ ∈L ∑ p∈Prog(c,t) Pr(l|p)·Pr(p), where L is the complete set of topic tags.

通过上述周期性播放视频特征模型可以看出,电视频道在特定时间段内的主题标签分布取决于该电视频道在该特定时间段内所播放的视频节目的主题标签分布Pr(l|p),以及各视频节目的概率分布。前者在电视频道的播放历史数据中可直接得到;后者可根据电视频道在预设统计周期的周期性特征计算得到,比如根据预设统计周期的内播放该视频节目的时长比例计算得到,具体地,假设预设统计周期的统计时长为120分钟,则该视频节目播放了60分钟,那么Pr(p)=60/120=0.5。It can be seen from the above-mentioned periodic playing video feature model that the distribution of topic tags of a TV channel in a specific time period depends on the distribution of topic tags Pr(l|p) of the video programs played by the TV channel in the specific time period, And the probability distribution of each video program. The former can be directly obtained from the broadcast history data of the TV channel; the latter can be calculated according to the periodic characteristics of the TV channel in the preset statistical period, for example, according to the proportion of the duration of the video program played in the preset statistical period. Assuming that the statistical duration of the preset statistical period is 120 minutes, and the video program is played for 60 minutes, then Pr(p)=60/120=0.5.

步骤二:构建针对目标用户的周期性观看视频特征模型。Step 2: Build a periodic video feature model for target users.

首先,选中目标用户,收集一定时间段内该目标用户观看的观看电视历史数据。具体收集策略类似于收集电视频道的播放历史数据,即收集的时间段为表征预设统计周期的最大粒度维度的整数倍,举例来说,采用“周、日”两个维度来表征预设统计周期,此时,数据收集的时长(也即预设统计周期的统计时长)要求为最大粒度“周”的整数倍,比如M个周,这里,M为正整数。First, select the target user, and collect the TV viewing history data watched by the target user within a certain period of time. The specific collection strategy is similar to collecting the broadcast history data of TV channels, that is, the collection time period is an integer multiple of the maximum granularity dimension representing the preset statistical period. For example, two dimensions of "week and day" are used to represent the preset statistics Period, at this time, the duration of data collection (that is, the statistical duration of the preset statistical period) is required to be an integer multiple of the maximum granularity "week", such as M weeks, where M is a positive integer.

这里,所述观看历史数据中记录了该目标用户观看视频节目的起止时间,以及每个视频节目的主题标签,如此,利用观看历史数据即可确定出目标用户观看电视的兴趣特征。Here, the viewing history data records the start and end time of the target user's viewing of the video program and the subject tag of each video program, so that the viewing history data can be used to determine the target user's interest in watching TV.

其次,基于观看历史数据,使用目标用户的主题标签的概率分布特征,构建得到周期性观看视频特征模型。这里,采用与周期性播放视频特征模型类似的推导方式,得到周期性观看视频特征模型,该模型表征如下:Secondly, based on the viewing history data, using the probability distribution feature of the target user's hashtags, a feature model of periodically watching videos is constructed. Here, a derivation method similar to the periodic video feature model is used to obtain the periodic video feature model, which is characterized as follows:

Figure BDA0002261950340000131
Figure BDA0002261950340000131

这里,式(5)中Prog(u,t)表示目标用户u在时间t内观看的视频节目列表。Pr(l|p)、Pr(p)和Zu采用与周期性播放视频特征模型相同的计算方式。Here, Prog(u, t) in Equation (5) represents the list of video programs watched by the target user u within time t. Pr(l|p), Pr(p) and Z u are calculated in the same way as the periodic playback video feature model.

通过上述周期性观看视频特征模型可以看出,目标用户在特定时间段内的主题标签分布取决于该目标用户在该特定时间段内所观看的视频节目的主题标签分布Pr(l|p),以及各视频节目的概率分布。前者在观看历史数据中可直接得到;后者可根据目标用户观看的视频节目在预设统计周期的周期性特征计算得到,比如根据预设统计周期的内观看该视频节目的时长比例计算得到,具体地,假设预设统计周期的统计时长为120分钟,则观看的该视频节目播放了60分钟,那么Pr(p)=60/120=0.5。It can be seen from the above-mentioned periodic watching video feature model that the distribution of hashtags of the target user in a specific time period depends on the distribution of hashtags Pr(l|p) of the video programs watched by the target user in the specific time period, And the probability distribution of each video program. The former can be obtained directly in the viewing history data; the latter can be calculated according to the periodic characteristics of the video program watched by the target user in the preset statistical period, for example, calculated according to the time-length ratio of the video program watched within the preset statistical period, Specifically, assuming that the statistical duration of the preset statistical period is 120 minutes, and the watched video program is played for 60 minutes, then Pr(p)=60/120=0.5.

实际场景中,用户观看行为通常具有很强的稀疏性,即用户在多个预设统计周期内没有观看历史记录,此时,Prog(u,t)会存在为空集的情况,这将导致无法有效计算出Pr(l|u;t)。因此,为解决这一问题,本申请实施例采用一种概率平滑技术,基本思想是使用用户在预设统计周期的邻域时间内的观看记录来补充用户在该预设统计周期的观看记录。In actual scenarios, user viewing behavior usually has strong sparseness, that is, users have no viewing history in multiple preset statistical periods. At this time, Prog(u,t) will be an empty set, which will lead to Pr(l|u;t) cannot be computed efficiently. Therefore, to solve this problem, the embodiment of the present application adopts a probabilistic smoothing technique, and the basic idea is to use the user's viewing records in the neighborhood time of the preset statistical period to supplement the user's viewing records in the preset statistical period.

举例来说,在一具体示例中,平滑后的周期性观看视频特征模型为:For example, in a specific example, the smoothed periodic viewing video feature model is:

Figure BDA0002261950340000132
Figure BDA0002261950340000132

式(6)中的Prog(u,t′)表示目标用户u在预设统计周期的时间t的邻域时间t′内观看的视频节目列表,Prog(u)表示目标用户u所观看的所有视频节目。Neig(t)为预设统计周期的时间t的所有邻域时间集合。w1、w2和w3为权重,可为经验值,满足条件:Prog(u, t') in formula (6) represents the list of video programs watched by the target user u in the neighborhood time t' of the time t in the preset statistical period, and Prog(u) represents all the videos watched by the target user u. video program. Neig(t) is the set of all neighborhood times at time t of the preset statistical period. w 1 , w 2 and w 3 are weights, which can be empirical values, satisfying the conditions:

0<w1<1,0<w2<1,0<w3<1,w1+w2+w3=1。0<w 1 <1, 0<w 2 <1, 0<w 3 <1, w 1 +w 2 +w 3 =1.

这里,邻域时间的设置方式为:Here, the neighborhood time is set as:

利用d个维度表示的预设统计周期的维度,此时,预设统计周期可以表示为:t=(t1,t2…,td-1,td);则邻域时间集合为:Using the dimension of the preset statistical period represented by d dimensions, at this time, the preset statistical period can be expressed as: t=(t 1 , t 2 . . . , t d-1 , t d ); then the neighborhood time set is:

Neig(t)={(t1,…,td-1),(t1,…,td-2,td),…,(t2,…,td)};Neig(t)={(t 1 ,...,t d-1 ),(t 1 ,...,t d-2 ,t d ),...,(t 2 ,...,t d )};

即领域时间的维度为t的任意d-1个维度。换言之,邻域时间的维度低于预设统计周期的维度;这里,d为正整数。举例来说,对于一个2维的预设统计周期t=(周三,傍晚),则其邻域时间集合Neig(t)={(周三),(傍晚)}。That is, the dimension of the domain time is any d-1 dimension of t. In other words, the dimension of the neighborhood time is lower than the dimension of the preset statistical period; here, d is a positive integer. For example, for a 2-dimensional preset statistical period t=(Wednesday, evening), its neighborhood time set Neig(t)={(Wednesday), (evening)}.

当然,实际应用中,邻域时间的维度还可以为d-1,d-2,…,1,甚至邻域时间的维度为0,此时,邻域时间统计的是用户除预设统计周期外所观看的所有视频节目的观看历史数据;需要说明的是,本申请实施例对邻域时间的维度不作过多限制,只要邻域时间的维度小于预设统计周期的维度,能够实现补充观看历史数据,解决稀疏性问题即可。Of course, in practical applications, the dimension of the neighborhood time can also be d-1, d-2, ..., 1, or even the dimension of the neighborhood time is 0. In this case, the neighborhood time statistics are calculated by the user except the preset statistical period. The viewing history data of all video programs watched outside; it should be noted that the embodiment of the present application does not impose too many restrictions on the dimension of the neighborhood time, as long as the dimension of the neighborhood time is less than the dimension of the preset statistical period, supplementary viewing can be realized Historical data can solve the sparsity problem.

这里,由于领域时间的范围更大,所以通常比原有预设统计周期具有更多的观看历史数据,即稀疏性更低,因此,能够在一定程度上克服用户观看行为的稀疏性问题。Here, because the scope of the domain time is larger, it usually has more viewing history data than the original preset statistical period, that is, the sparsity is lower. Therefore, the sparsity problem of the user's viewing behavior can be overcome to a certain extent.

而且,本示例进一步给出了邻域时间为0的情况,即当邻域时间的维度为0时,即可统计得到用户所有的观看历史数据Prog(u),因此,对于任何具有历史观看记录的用户而言,Prog(u)均为非空集合,为有效计算平滑处理后的周期性观看视频特征模型Pr′(l|u;t)奠定了基础。Moreover, this example further provides the case where the neighborhood time is 0, that is, when the dimension of the neighborhood time is 0, all the viewing history data Prog(u) of the user can be obtained by statistics. For users of , Prog(u) is a non-empty set, which lays a foundation for effectively calculating the smoothed periodic viewing video feature model Pr′(l|u;t).

步骤三:将周期性播放视频特征模型与周期性观看视频特征模型进行匹配,得到针对目标用户的推荐数据。Step 3: Matching the periodic playing video feature model and the periodic viewing video feature model to obtain recommendation data for the target user.

具体地,本申请实施例,利用电视频道的主题标签概率分布与目标用户的主题标签概率分布之间的相对熵来度量周期性播放视频特征模型与周期性观看视频特征模型之间的匹配程度,具体计算方式为:Specifically, in this embodiment of the present application, the relative entropy between the hashtag probability distribution of the TV channel and the target user's hashtag probability distribution is used to measure the matching degree between the periodic playing video feature model and the periodic viewing video feature model, The specific calculation method is:

Figure BDA0002261950340000141
Figure BDA0002261950340000141

如此,基于式(7)即可计算得到针对目标用户的所有候选频道c的Dist(c,u;t),最后取Dist(c,u;t)值中最大的前M个频道推荐给用户。In this way, based on formula (7), the Dist(c, u; t) of all candidate channels c for the target user can be calculated, and finally the top M channels with the largest Dist(c, u; t) values are taken and recommended to the user .

这里,首先,本申请实施例建立了一种周期性特征表示方法,利用该方法能够将任意时间表示为一个多维周期时间,进而为针对频道的推荐奠定了基础。而且,该周期性特征表示方法良好的扩展性,对维度没有明确的约束,对每个维度内时间的具体划分方式也没有明确的约束,能够反映时间的任意周期特征,为工程化应用奠定了基础。Here, first, the embodiment of the present application establishes a periodic feature representation method, by which any time can be represented as a multi-dimensional periodic time, thereby laying a foundation for channel recommendation. Moreover, the periodic feature representation method has good scalability, has no explicit constraints on dimensions, and has no explicit constraints on the specific division of time in each dimension. It can reflect the arbitrary periodic characteristics of time, laying a solid foundation for engineering applications Base.

其次,本申请实施例基于频道的视频特征概率分布数据来构建针对频道的周期性播放视频特征模型,以及基于用户的视频特征概率分布数据来构建针对用户的周期性观看视频特征模型,并通过概率推导给出了具体的计算方式,在数学上能够保证该计算方式的正确性,而且,整个推导流程简单、可解释性强。Secondly, the embodiment of the present application constructs a periodic playing video feature model for the channel based on the video feature probability distribution data of the channel, and constructs a periodic viewing video feature model for the user based on the video feature probability distribution data of the user. The derivation gives a specific calculation method, which can guarantee the correctness of the calculation method in mathematics, and the whole derivation process is simple and interpretable.

最后,本申请实施例采用了概率平滑技术来保证对于任何具有电视观看历史记录的用户均能够有效计算其分布,进而得到频道推荐数据,如此,提升了用户体验。Finally, the embodiment of the present application adopts a probability smoothing technology to ensure that any user with a TV viewing history can effectively calculate its distribution, thereby obtaining channel recommendation data, thus improving user experience.

本申请实施例还提供了一种基于频道的推荐装置,如图3所示,所述装置包括:The embodiment of the present application also provides a channel-based recommendation device, as shown in FIG. 3 , the device includes:

模型处理单元31,用于基于获取到的在至少一个预设统计周期内针对频道的播放历史数据,得到针对至少一个频道的周期性播放视频特征模型,所述周期性播放视频特征模型能够表征频道周期性规律的视频特征分布数据;基于获取到的在至少一个预设统计周期内针对目标用户的观看历史数据,得到针对目标用户的周期性观看视频特征模型,所述周期性观看视频特征模型能够表征目标用户周期性规律的视频特征分布数据;The model processing unit 31 is configured to obtain a periodic play video feature model for at least one channel based on the acquired play history data for the channel within at least one preset statistical period, where the periodic play video feature model can characterize the channel Periodic and regular video feature distribution data; based on the obtained viewing history data for the target user within at least one preset statistical period, obtain a periodic viewing video feature model for the target user, where the periodic viewing video feature model can Video feature distribution data representing the periodicity of target users;

推荐单元32,用于将针对至少一个频道的周期性播放视频特征模型的视频特征分布数据与所述周期性观看视频特征模型的视频特征分布数据进行匹配,基于匹配结果得到针对目标用户的推荐数据。The recommending unit 32 is configured to match the video feature distribution data of the periodic playback video feature model for at least one channel with the video feature distribution data of the periodic viewing video feature model, and obtain recommendation data for the target user based on the matching result .

在一具体示例中,推荐单元32,还用于基于匹配结果得到目标用户在至少一个频道中目标频道上的观看时段推荐数据;和/或,基于匹配结果得到目标用户在所选定观看时段的频道推荐数据。In a specific example, the recommending unit 32 is further configured to obtain, based on the matching result, the viewing period recommendation data of the target user on the target channel in at least one channel; and/or, based on the matching result, obtain the target user's viewing period of the selected viewing period. Channel recommendation data.

在一具体示例中,模型处理单元31,还用于获取在至少一个预设统计周期内频道所播放的视频节目的播放特征,以及视频节目的视频特征,作为针对频道的播放历史数据。In a specific example, the model processing unit 31 is further configured to acquire the playback characteristics of the video programs played by the channel in at least one preset statistical period, and the video characteristics of the video programs, as the playback history data for the channel.

在一具体示例中,模型处理单元31,还用于:In a specific example, the model processing unit 31 is further configured to:

基于针对频道的播放历史数据,确定出频道在统计周期内所播放的视频节目的视频特征分布数据,以及频道在统计周期内所播放的视频节目的概率分布数据;Based on the playback history data for the channel, determine the video feature distribution data of the video programs played by the channel in the statistical period, and the probability distribution data of the video programs played by the channel in the statistical period;

基于频道在统计周期内所播放的视频节目的视频特征分布数据,以及频道在统计周期内所播放的视频节目的概率分布数据,得到针对频道的周期性播放视频特征模型。Based on the video feature distribution data of the video programs played by the channel in the statistical period, and the probability distribution data of the video programs played by the channel in the statistical period, a periodically played video feature model for the channel is obtained.

在一具体示例中,模型处理单元31,还用于:In a specific example, the model processing unit 31 is further configured to:

获取在至少一个预设统计周期内目标用户所观看的视频节目的观看特征以及视频特征,作为针对目标用户的观看历史数据;Acquiring viewing characteristics and video characteristics of video programs watched by the target user in at least one preset statistical period, as viewing history data for the target user;

或者,or,

获取在至少一个预设统计周期内目标用户所观看的视频节目的观看特征以及视频特征,作为针对目标用户的第一子观看历史数据;Acquiring viewing characteristics and video characteristics of video programs watched by the target user in at least one preset statistical period, as the first sub-viewing history data for the target user;

获取确定出的邻域时间内目标用户所观看的视频节目的观看特征以及视频特征,作为针对目标用户的第二子观看历史数据;其中,所述邻域时间为与所述预设统计周期相关联的时间;Obtain the viewing characteristics and video characteristics of the video programs watched by the target user within the determined neighborhood time as the second sub-viewing history data for the target user; wherein the neighborhood time is related to the preset statistical period time of connection;

将所述第一子观看历史数据与第二子观看历史数据作为针对目标用户的观看历史数据。The first sub-viewing history data and the second sub-viewing history data are used as viewing history data for the target user.

在一具体示例中,模型处理单元31,还用于:In a specific example, the model processing unit 31 is further configured to:

基于针对目标用户的观看历史数据,确定出目标用户至少在统计周期内所观看的视频节目的视频特征分布数据,以及目标用户至少在统计周期内所观看的视频节目的概率分布数据;Based on the viewing history data for the target user, determine the video feature distribution data of the video programs watched by the target user at least in the statistical period, and the probability distribution data of the video programs watched by the target user at least in the statistical period;

基于目标用户至少在统计周期内所观看的视频节目的视频特征分布数据,以及目标用户至少在统计周期内所观看的视频节目的概率分布数据,得到针对目标用户的周期性观看视频特征模型。Based on the video feature distribution data of the video programs watched by the target user at least in the statistical period, and the probability distribution data of the video programs watched by the target user at least in the statistical period, a periodic viewing video feature model for the target user is obtained.

在一具体示例中,推荐单元32,还用于确定表征频道周期性规律的视频特征分布数据与表征目标用户周期性规律的视频特征分布数据之间的特征差值;对特征差值进行排序处理,将排序结果满足预设规则的数据作为推荐数据。In a specific example, the recommending unit 32 is further configured to determine the feature difference between the video feature distribution data representing the periodicity of the channel and the video feature distribution data representing the periodicity of the target user; perform sorting processing on the feature difference. , and the data whose sorting result satisfies the preset rules is used as the recommended data.

这里需要指出的是:以上装置实施例项的描述,与上述方法描述是类似的,具有同方法实施例相同的有益效果,因此不做赘述。对于本发明装置实施例中未披露的技术细节,本领域的技术人员请参照本发明方法实施例的描述而理解,为节约篇幅,这里不再赘述。It should be pointed out here that the descriptions of the above device embodiment items are similar to the above method descriptions, and have the same beneficial effects as the method embodiments, so they will not be repeated. For the technical details not disclosed in the device embodiments of the present invention, those skilled in the art should refer to the descriptions of the method embodiments of the present invention to understand them, and to save space, they will not be repeated here.

第三方面,本申请实施例提供了一种基于频道的推荐装置,包括:In a third aspect, an embodiment of the present application provides a channel-based recommendation device, including:

一个或多个处理器;one or more processors;

与所述一个或多个处理器通信连接的存储器;a memory communicatively coupled to the one or more processors;

一个或多个应用程序,其中所述一个或多个应用程序被存储在所述存储器中并被配置为由所述一个或多个处理器执行,所述一个或多个程序被配置为执行以上所述的方法。one or more application programs, wherein the one or more application programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs are configured to perform the above the method described.

在一具体示例中,本申请实施例所述的基于频道的推荐装置可具体为如图4所示的结构,所述装置包括处理器41、存储介质42以及至少一个外部通信接口43;所述处理器41、存储介质42以及外部通信接口43均通过总线44连接。所述处理器41可为微处理器、中央处理器、数字信号处理器或可编程逻辑阵列等具有处理功能的电子元器件。所述存储介质中存储有计算机可执行代码,所述计算机可执行代码能够执行以上任一实施例所述的图像处理方法。在实际应用中,所述获取单元41、初筛单元42以及文本匹配单元43均可以通过所述处理器41实现。In a specific example, the channel-based recommendation device described in this embodiment of the present application may be specifically structured as shown in FIG. 4 , the device includes a processor 41, a storage medium 42, and at least one external communication interface 43; the The processor 41 , the storage medium 42 and the external communication interface 43 are all connected through the bus 44 . The processor 41 may be an electronic component with processing functions, such as a microprocessor, a central processing unit, a digital signal processor, or a programmable logic array. Computer-executable codes are stored in the storage medium, and the computer-executable codes can execute the image processing method described in any one of the above embodiments. In practical applications, the obtaining unit 41 , the preliminary screening unit 42 and the text matching unit 43 can all be implemented by the processor 41 .

这里需要指出的是:以上装置实施例项的描述,与上述方法描述是类似的,具有同方法实施例相同的有益效果,因此不做赘述。对于本发明装置实施例中未披露的技术细节,本领域的技术人员请参照本发明方法实施例的描述而理解,为节约篇幅,这里不再赘述。It should be pointed out here that the descriptions of the above device embodiment items are similar to the above method descriptions, and have the same beneficial effects as the method embodiments, so they will not be repeated. For the technical details not disclosed in the device embodiments of the present invention, those skilled in the art should refer to the descriptions of the method embodiments of the present invention to understand them, and to save space, they will not be repeated here.

本申请实施例提供了一种计算机可读存储介质,其存储有计算机程序,该程序被处理器执行时实现以下步骤:Embodiments of the present application provide a computer-readable storage medium, which stores a computer program, and when the program is executed by a processor, implements the following steps:

基于获取到的在至少一个预设统计周期内针对频道的播放历史数据,得到针对至少一个频道的周期性播放视频特征模型,所述周期性播放视频特征模型能够表征频道周期性规律的视频特征分布数据;Based on the acquired playback history data for a channel within at least one preset statistical period, a periodic playback video feature model for at least one channel is obtained, where the periodic playback video feature model can characterize the periodicity of the channel's regular video feature distribution data;

基于获取到的在至少一个预设统计周期内针对目标用户的观看历史数据,得到针对目标用户的周期性观看视频特征模型,所述周期性观看视频特征模型能够表征目标用户周期性规律的视频特征分布数据;Based on the obtained viewing history data for the target user in at least one preset statistical period, a periodic video viewing feature model for the target user is obtained, and the periodic video viewing feature model can represent the periodic regular video features of the target user distribution data;

将针对至少一个频道的周期性播放视频特征模型的视频特征分布数据与所述周期性观看视频特征模型的视频特征分布数据进行匹配,基于匹配结果得到针对目标用户的推荐数据。Matching the video feature distribution data of the periodically playing video feature model for at least one channel with the video feature distribution data of the periodically viewing video feature model, and obtaining recommendation data for the target user based on the matching result.

这里需要指出的是:以上存储介质实施例项的描述,与上述方法描述是类似的,具有同方法实施例相同的有益效果,因此不做赘述。对于本发明存储介质实施例中未披露的技术细节,本领域的技术人员请参照本发明方法实施例的描述而理解,为节约篇幅,这里不再赘述。It should be pointed out here that the descriptions of the above storage medium embodiment items are similar to the descriptions of the above-mentioned methods, and have the same beneficial effects as the method embodiments, so they will not be repeated. For technical details not disclosed in the embodiments of the storage medium of the present invention, those skilled in the art should refer to the descriptions of the method embodiments of the present invention to understand them. To save space, they will not be repeated here.

在本申请所提供的几个实施例中,应该理解到,所揭露的系统,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. For example, the apparatus embodiments described above are only illustrative. For example, the division of the units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored, or not implemented. On the other hand, the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.

所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.

另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit. The above-mentioned integrated units may be implemented in the form of hardware, or may be implemented in the form of software functional units.

本领域普通技术人员可以理解上述实施例的各种方法中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,该程序可以存储于一计算机可读存储介质中,存储介质可以包括:只读存储器(ROM,Read Only Memory)、随机存取存储器(RAM,RandomAccess Memory)、磁盘或光盘等。Those of ordinary skill in the art can understand that all or part of the steps in the various methods of the above embodiments can be completed by instructing relevant hardware through a program, and the program can be stored in a computer-readable storage medium, and the storage medium can include: Read Only Memory (ROM, Read Only Memory), Random Access Memory (RAM, Random Access Memory), magnetic disk or optical disk, etc.

本领域普通技术人员可以理解实现上述实施例方法中的全部或部分步骤是可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,上述提到的存储介质可以是只读存储器,磁盘或光盘等。Those of ordinary skill in the art can understand that all or part of the steps in the method of the above-mentioned embodiments can be completed by instructing the relevant hardware through a program, and the program can be stored in a computer-readable storage medium. The medium can be read-only memory, magnetic disk or optical disk, etc.

以上对本发明所提供的一种基于频道的推荐方法和装置、存储介质进行了详细介绍,对于本领域的一般技术人员,依据本申请实施例的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本发明的限制。A channel-based recommendation method, device, and storage medium provided by the present invention have been described in detail above. For those skilled in the art, based on the ideas of the embodiments of the present application, there will be specific implementation methods and application scopes. For changes, in summary, the contents of this specification should not be construed as limiting the present invention.

Claims (8)

1. A channel-based recommendation method, the method comprising:
the method comprises the steps of obtaining a periodic playing video feature model for at least one channel based on obtained playing history data for the channel in at least one preset statistical period, wherein the periodic playing video feature model can represent video feature distribution data of the periodic regularity of the channel, and is a model corresponding to the channel, and the method for determining the playing history data of the channel comprises the following steps: acquiring playing characteristics of a video program played by a channel in at least one preset statistical period and video characteristics of the video program as playing history data aiming at the channel;
obtaining a periodic watching video feature model for the target user based on the obtained watching history data for the target user in at least one preset statistical period, wherein the periodic watching video feature model can represent video feature distribution data of the periodic regularity of the target user, and the watching history data for the target user comprises: the method comprises the following steps of obtaining watching characteristics and video characteristics of a video program watched by a target user in at least one preset statistical period, and watching characteristics and video characteristics of a video program watched by the target user in neighborhood time, wherein the method further comprises the following steps: acquiring the watching characteristics and video characteristics of a video program watched by a target user in at least one preset statistical period as first sub-watching history data aiming at the target user; acquiring the watching characteristics and the video characteristics of the video program watched by the target user in the determined neighborhood time as second sub-watching history data aiming at the target user; wherein the neighborhood time is a time associated with the preset statistical period; taking the first sub-viewing history data and the second sub-viewing history data as viewing history data for a target user;
matching video feature distribution data of a periodically played video feature model for at least one channel with video feature distribution data of a periodically watched video feature model, and obtaining recommendation data for a target user based on a matching result;
the method comprises the following steps of obtaining viewing history data or playing history data in a preset statistical period, representing the viewing history data or the playing history data by a periodic feature structure representation method, determining a periodic feature structure based on the dimensional features of the preset statistical period, representing video features by using a theme label based on the periodic feature structure representation method, and the periodic feature structure representation method comprises the following steps:
the method comprises the following steps: constructing a periodic playing video characteristic model aiming at a channel;
step two: constructing a periodic watching video feature model for a target user;
further, each channel in the step one corresponds to one periodically playing video feature model, and each periodically playing video feature model is realized based on the following method:
selecting a television channel, and collecting play history data of a video program played by the television channel in a certain time period; based on the play history data, a periodic play video feature model for the channel is constructed and obtained by using the probability distribution feature of the channel topic label, and the periodic play video feature model of the channel is characterized as follows:
Figure FDA0002973141620000021
the periodic broadcast video characteristic model of the channel is characterized in that Pr (· c) represents conditional probability, Pr (·) represents probability, c represents channel, l represents video characteristic of video program, p represents video program, u represents user, t represents time represented by the periodic characteristic structure, and the formula (1) is obtained by carrying out Bayesian transformation on Pr (l | c; t); the formula (2) is obtained by representing the channel by using a television program set Prog (c, t) played by the channel c at time t and expanding the channel by using a probability addition formula; the formula (3) is obtained by performing Bayesian transformation on Pr (p | l) in the formula (2); the formula (4) is obtained by reducing the same items in the numerator and denominator in the formula (3) and adding a normalization factor Zc, which is specifically calculated as follows: zc ═ Σ L' ∈ L Σ p ∈ Prog (c, t) Pr (L | p) · Pr (p), where L is the topic label corpus;
further, the second step further comprises the following steps: selecting a target user, and collecting historical data of watching television watched by the target user within a certain time period; based on the watching history data, a periodic watching video feature model is constructed and obtained by using the probability distribution feature of the subject label of the target user, and the periodic watching video feature model is obtained by adopting a derivation mode similar to that of the periodic playing video feature model, and is characterized as follows:
Figure FDA0002973141620000022
prog (u, t) in formula (5) represents a video program list watched by the target user u within the time t, and Pr (l | p), Pr (p) and Zu adopt the same calculation mode as the feature model of the periodically played video;
carrying out balance processing on the characteristic model of the periodically watched video, wherein the smoothed characteristic model of the periodically watched video is as follows:
Figure FDA0002973141620000031
prog (u, t ') in equation (6) represents a list of video programs watched by the target user u in a time t' adjacent to the time t of the preset statistical period, and Prog (u) represents all the video programs watched by the target user u; neig (t) is a set of all neighborhood times of time t of a preset statistical period;
the method comprises the following steps of measuring the matching degree between a periodically played video feature model and a periodically watched video feature model by using the relative entropy between the probability distribution of the theme label of the television channel and the probability distribution of the theme label of a target user, wherein the specific calculation mode is as follows:
Figure FDA0002973141620000032
dist (c, u; t) of all candidate channels c for the target user can be obtained through calculation based on the formula (7), and finally, the first M channels with the maximum Dist (c, u; t) values are selected and recommended to the user.
2. The method of claim 1, wherein obtaining recommendation data for the target user based on the matching result comprises:
obtaining viewing period recommendation data of the target user on a target channel in at least one channel based on the matching result; and/or the presence of a gas in the gas,
and obtaining the channel recommendation data of the target user in the selected watching time period based on the matching result.
3. The method of claim 1, wherein the step of obtaining a periodically playing video feature model for a channel comprises:
based on the play history data aiming at the channel, determining video characteristic distribution data of the video programs played by the channel in the statistical period and probability distribution data of the video programs played by the channel in the statistical period;
and obtaining a periodic playing video feature model aiming at the channel based on the video feature distribution data of the video program played by the channel in the statistical period and the probability distribution data of the video program played by the channel in the statistical period.
4. The method of claim 1, wherein the step of deriving a model of periodically viewed video features for the target user comprises:
based on the watching history data aiming at the target user, determining video characteristic distribution data of the video program watched by the target user at least in the counting period and probability distribution data of the video program watched by the target user at least in the counting period;
and obtaining a periodic watching video feature model aiming at the target user based on the video feature distribution data of the video program watched by the target user at least in the statistical period and the probability distribution data of the video program watched by the target user at least in the statistical period.
5. The method of claim 1, wherein matching video feature distribution data of a periodically playing video feature model for at least one channel with video feature distribution data of a periodically watching video feature model, and obtaining recommendation data for a target user based on the matching result comprises:
determining a characteristic difference value between video characteristic distribution data representing the periodic regularity of a channel and video characteristic distribution data representing the periodic regularity of a target user;
and sequencing the characteristic difference values, and taking data with a sequencing result meeting a preset rule as recommended data.
6. An apparatus for channel-based recommendation, the apparatus comprising:
the model processing unit is used for obtaining a periodic playing video feature model for at least one channel based on the obtained playing history data for the channel in at least one preset statistical period, wherein the periodic playing video feature model can represent video feature distribution data of the periodic regularity of the channel, and is a model corresponding to the channel, and the method for determining the playing history data of the channel comprises the following steps: acquiring playing characteristics of a video program played by a channel in at least one preset statistical period and video characteristics of the video program as playing history data aiming at the channel;
obtaining a periodic watching video feature model for the target user based on the obtained watching history data for the target user in at least one preset statistical period, wherein the periodic watching video feature model can represent video feature distribution data of the periodic regularity of the target user, and the watching history data for the target user comprises: the method comprises the following steps of obtaining watching characteristics and video characteristics of a video program watched by a target user in at least one preset statistical period, and watching characteristics and video characteristics of a video program watched by the target user in neighborhood time, wherein the method further comprises the following steps: acquiring the watching characteristics and video characteristics of a video program watched by a target user in at least one preset statistical period as first sub-watching history data aiming at the target user; acquiring the watching characteristics and the video characteristics of the video program watched by the target user in the determined neighborhood time as second sub-watching history data aiming at the target user; wherein the neighborhood time is a time associated with the preset statistical period; taking the first sub-viewing history data and the second sub-viewing history data as viewing history data for a target user;
viewing history data or playing history data acquired in a preset statistical period are represented by a periodic feature structure representation method, the periodic feature structure is determined based on the dimensional features of the preset statistical period, video features are represented by using a theme label based on the periodic feature structure representation method, and the periodic feature structure representation method comprises the following steps:
the method comprises the following steps: constructing a periodic playing video characteristic model aiming at a channel;
step two: constructing a periodic watching video feature model for a target user;
further, each channel in the step one corresponds to one periodically playing video feature model, and each periodically playing video feature model is realized based on the following method:
selecting a television channel, and collecting play history data of a video program played by the television channel in a certain time period; based on the play history data, a periodic play video feature model for the channel is constructed and obtained by using the probability distribution feature of the channel topic label, and the periodic play video feature model of the channel is characterized as follows:
Figure FDA0002973141620000051
the periodic broadcast video characteristic model of the channel is characterized in that Pr (· c) represents conditional probability, Pr (·) represents probability, c represents channel, l represents video characteristic of video program, p represents video program, u represents user, t represents time represented by the periodic characteristic structure, and the formula (1) is obtained by carrying out Bayesian transformation on Pr (l | c; t); the formula (2) is obtained by representing the channel by using a television program set Prog (c, t) played by the channel c at time t and expanding the channel by using a probability addition formula; the formula (3) is obtained by performing Bayesian transformation on Pr (p | l) in the formula (2); the formula (4) is obtained by reducing the same items in the numerator and denominator in the formula (3) and adding a normalization factor Zc, which is specifically calculated as follows: zc ═ Σ L' ∈ L Σ p ∈ Prog (c, t) Pr (L | p) · Pr (p), where L is the topic label corpus;
further, the second step further comprises the following steps: selecting a target user, and collecting historical data of watching television watched by the target user within a certain time period; based on the watching history data, a periodic watching video feature model is constructed and obtained by using the probability distribution feature of the subject label of the target user, and the periodic watching video feature model is obtained by adopting a derivation mode similar to that of the periodic playing video feature model, and is characterized as follows:
Figure FDA0002973141620000061
prog (u, t) in formula (5) represents a video program list watched by the target user u within the time t, and Pr (l | p), Pr (p) and Zu adopt the same calculation mode as the feature model of the periodically played video;
carrying out balance processing on the characteristic model of the periodically watched video, wherein the smoothed characteristic model of the periodically watched video is as follows:
Figure FDA0002973141620000062
prog (u, t ') in equation (6) represents a list of video programs watched by the target user u in a time t' adjacent to the time t of the preset statistical period, and Prog (u) represents all the video programs watched by the target user u; neig (t) is a set of all neighborhood times of time t of a preset statistical period;
a recommending unit, configured to match video feature distribution data of a periodically playing video feature model for at least one channel with video feature distribution data of the periodically watching video feature model, and obtain recommended data for a target user based on a matching result, where a relative entropy between a topic tag probability distribution of a television channel and a topic tag probability distribution of the target user is used to measure a matching degree between the periodically playing video feature model and the periodically watching video feature model, and a specific calculation manner is as follows:
Figure FDA0002973141620000063
dist (c, u; t) of all candidate channels c for the target user can be obtained through calculation based on the formula (7), and finally, the first M channels with the maximum Dist (c, u; t) values are selected and recommended to the user.
7. A digital television channel-based recommender, comprising:
one or more processors;
a memory communicatively coupled to the one or more processors;
one or more applications, wherein the one or more applications are stored in the memory and configured to be executed by the one or more processors, the one or more programs configured to perform the method of any of claims 1-5.
8. A computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, carries out the method according to any one of claims 1 to 5.
CN201911074399.7A 2019-11-06 2019-11-06 A channel-based recommendation method and device, and storage medium Active CN110769286B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911074399.7A CN110769286B (en) 2019-11-06 2019-11-06 A channel-based recommendation method and device, and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911074399.7A CN110769286B (en) 2019-11-06 2019-11-06 A channel-based recommendation method and device, and storage medium

Publications (2)

Publication Number Publication Date
CN110769286A CN110769286A (en) 2020-02-07
CN110769286B true CN110769286B (en) 2021-04-27

Family

ID=69336286

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911074399.7A Active CN110769286B (en) 2019-11-06 2019-11-06 A channel-based recommendation method and device, and storage medium

Country Status (1)

Country Link
CN (1) CN110769286B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113254794B (en) * 2021-07-15 2021-10-01 中国传媒大学 Modeling-based program data recommendation method and system

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102957969A (en) * 2012-05-18 2013-03-06 华东师范大学 Device and method for recommending program to IPTV (Internet protocol television) terminal user
CN103533393A (en) * 2013-09-17 2014-01-22 上海交通大学 Family analyzing and program recommending method based on family watching records
CN103546773A (en) * 2013-08-15 2014-01-29 Tcl集团股份有限公司 Television program recommendation method and system
CN103260061B (en) * 2013-05-24 2015-11-18 华东师范大学 A kind of IPTV program commending method of context-aware
CN105516810A (en) * 2015-12-04 2016-04-20 山东大学 Television user family member analysis method based on LDA (Latent Dirichlet Allocation) model
CN106028071A (en) * 2016-05-17 2016-10-12 Tcl集团股份有限公司 Video recommendation method and system
CN108322827A (en) * 2018-01-31 2018-07-24 武汉斗鱼网络科技有限公司 Method, system, and computer-readable storage medium for measuring user's video preference
CN108471544A (en) * 2018-03-28 2018-08-31 北京奇艺世纪科技有限公司 A kind of structure video user portrait method and device
CN108632670A (en) * 2018-03-15 2018-10-09 北京奇艺世纪科技有限公司 A kind of video satisfaction determines method and device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9380325B1 (en) * 2014-09-12 2016-06-28 Sorenson Media, Inc. Overlay content and aggregation of viewing data

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102957969A (en) * 2012-05-18 2013-03-06 华东师范大学 Device and method for recommending program to IPTV (Internet protocol television) terminal user
CN103260061B (en) * 2013-05-24 2015-11-18 华东师范大学 A kind of IPTV program commending method of context-aware
CN103546773A (en) * 2013-08-15 2014-01-29 Tcl集团股份有限公司 Television program recommendation method and system
CN103533393A (en) * 2013-09-17 2014-01-22 上海交通大学 Family analyzing and program recommending method based on family watching records
CN105516810A (en) * 2015-12-04 2016-04-20 山东大学 Television user family member analysis method based on LDA (Latent Dirichlet Allocation) model
CN106028071A (en) * 2016-05-17 2016-10-12 Tcl集团股份有限公司 Video recommendation method and system
CN108322827A (en) * 2018-01-31 2018-07-24 武汉斗鱼网络科技有限公司 Method, system, and computer-readable storage medium for measuring user's video preference
CN108632670A (en) * 2018-03-15 2018-10-09 北京奇艺世纪科技有限公司 A kind of video satisfaction determines method and device
CN108471544A (en) * 2018-03-28 2018-08-31 北京奇艺世纪科技有限公司 A kind of structure video user portrait method and device

Also Published As

Publication number Publication date
CN110769286A (en) 2020-02-07

Similar Documents

Publication Publication Date Title
CN108062409B (en) Live video abstract generation method and device and electronic equipment
CN108322829B (en) Personalized anchor recommendation method and device and electronic equipment
CN108540826B (en) Barrage push method, device, electronic device and storage medium
US9553922B1 (en) Media event based social networking interfaces
CN108040294B (en) Method, system, and computer readable medium for recommending videos
CN111767430B (en) Video resource pushing method, video resource pushing device and storage medium
CN109040795B (en) Video recommendation method and system
CN109429103B (en) Method and device for recommending information, computer readable storage medium and terminal equipment
JP2008542870A (en) Method and apparatus for estimating the overall interest of a group of users for content
US20140259038A1 (en) Demographic prediction channel
CN108235045B (en) A kind of direct broadcasting room recommended method, electronic equipment and readable storage medium storing program for executing
CN110248246B (en) Data analysis method and device, computer equipment and computer readable storage medium
CN103686382A (en) A program recommendation method
CN103686236A (en) Method and system for recommending video resources
CN110598047A (en) Movie and television information recommendation method and device, electronic equipment and storage medium
CN105843876B (en) Quality evaluation method and device for multimedia resources
CN111385606A (en) Video preview method and device and intelligent terminal
CN105208411B (en) A kind of method and device for realizing DTV target audience statistics
WO2018133566A1 (en) Method and device for determining freshness of promotional information, electronic device, and storage medium
CN105338408A (en) Video recommending method based on time factor
CN109359217A (en) A kind of user interest degree calculation method, server and readable storage medium storing program for executing
CN104933135A (en) Method and device for clustering multimedia data
Xu et al. Catch-up TV recommendations: show old favourites and find new ones
US20250310583A1 (en) Predictive Measurement of End-User Activities at Specified Times
CN110769286B (en) A channel-based recommendation method and device, and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant