[go: up one dir, main page]

CN107977383A - To the method and device of user's recommending digital content - Google Patents

To the method and device of user's recommending digital content Download PDF

Info

Publication number
CN107977383A
CN107977383A CN201610940103.5A CN201610940103A CN107977383A CN 107977383 A CN107977383 A CN 107977383A CN 201610940103 A CN201610940103 A CN 201610940103A CN 107977383 A CN107977383 A CN 107977383A
Authority
CN
China
Prior art keywords
user
vector
content
user preference
digital content
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610940103.5A
Other languages
Chinese (zh)
Inventor
叶志辉
赵慧慧
邹剑波
杨静
姚璇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
MIGU Interactive Entertainment Co Ltd
Original Assignee
MIGU Interactive Entertainment Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by MIGU Interactive Entertainment Co Ltd filed Critical MIGU Interactive Entertainment Co Ltd
Priority to CN201610940103.5A priority Critical patent/CN107977383A/en
Publication of CN107977383A publication Critical patent/CN107977383A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides a kind of method and device to user's recommending digital content, one aspect of the present invention obtains user preference vector according to the access behavior of user, on the other hand content adaptation vector can be quantified in turn according to the user characteristics of digital content, when user reuses, required digital content can be recommended to user according to the matching degree between content adaptation vector sum user preference vector, substantially increase the accuracy of digital content recommending.

Description

向用户推荐数字内容的方法及装置Method and device for recommending digital content to users

技术领域technical field

本发明涉及内容推荐领域,尤其涉及向用户推荐数字内容的方法及装置。The invention relates to the field of content recommendation, in particular to a method and device for recommending digital content to users.

背景技术Background technique

数字内容是指在网络中发布的文章、图片、声音、影像等资讯内容。随着互联网技术的迅速发展,数字内容的种类越来越繁多,如何将待推荐数字内容推荐给感兴趣的用户已经成为多媒体领域中的一个重要问题。传统的数字内容推荐方式一般是根据人工输入的用户标签的属性向用户进行数字内容推荐,在推荐时不会考虑数字内容的适应用户,存在推荐精确度不高的问题。Digital content refers to information content such as articles, pictures, sounds, and images published on the Internet. With the rapid development of Internet technology, there are more and more types of digital content, how to recommend the digital content to be recommended to interested users has become an important problem in the multimedia field. The traditional digital content recommendation method generally recommends digital content to users based on the attributes of user tags entered manually, and does not consider the adaptation of digital content to users when recommending, and there is a problem of low recommendation accuracy.

发明内容Contents of the invention

本发明实施例提出了向用户推荐数字内容的方法及装置,用以解决现有数字内容推荐方法推荐精确度不高的问题。Embodiments of the present invention propose a method and device for recommending digital content to users, so as to solve the problem of low recommendation accuracy in existing digital content recommendation methods.

在一个方面,本发明实施例提供了一种向用户推荐数字内容的方法,包括:In one aspect, an embodiment of the present invention provides a method for recommending digital content to a user, including:

根据用户偏好量化获得用户偏好向量;Obtain a user preference vector according to user preference quantification;

根据数字内容的用户特征量化得到内容适配向量;Quantify the content adaptation vector according to the user characteristics of the digital content;

根据用户偏好向量和内容适配向量之间的相关程度向用户推荐数字内容。Digital content is recommended to the user according to the degree of correlation between the user preference vector and the content adaptation vector.

在另一个方面,本发明实施例提供了一种向用户推荐数字内容的装置,包括:In another aspect, an embodiment of the present invention provides an apparatus for recommending digital content to a user, including:

用户偏好向量确定模块,用于用户偏好量化获得用户偏好向量;A user preference vector determination module, used for user preference quantification to obtain a user preference vector;

内容适配向量确定模块,用于根据数字内容的用户特征量化得到内容适配向量;A content adaptation vector determination module, configured to quantify and obtain a content adaptation vector according to user characteristics of the digital content;

内容推荐模块,用于根据用户偏好向量和内容适配向量之间的相关程度向用户推荐数字内容。The content recommendation module is used for recommending digital content to the user according to the degree of correlation between the user preference vector and the content adaptation vector.

当前数字内容推荐方法存在着推荐不够准确,精确度不高的问题,本发明实施例一方面根据用户的访问行为来获得用户偏好向量,另一方面可以根据数字内容的用户特征反过来量化内容适配向量,当用户再次使用时,可根据内容适配向量和用户偏好向量之间的匹配程度向用户推荐所需的数字内容,大大提高了数字内容推荐的精确性。The current digital content recommendation method has the problem that the recommendation is not accurate enough and the accuracy is not high. On the one hand, the embodiment of the present invention obtains the user preference vector according to the user's access behavior; Matching vector, when the user uses it again, the required digital content can be recommended to the user according to the matching degree between the content adaptation vector and the user preference vector, which greatly improves the accuracy of digital content recommendation.

附图说明Description of drawings

下面将参照附图描述本发明的具体实施例,其中:Specific embodiments of the present invention will be described below with reference to the accompanying drawings, wherein:

图1示出了本发明实施例中向用户推荐数字内容的方法的流程示意图;FIG. 1 shows a schematic flowchart of a method for recommending digital content to a user in an embodiment of the present invention;

图2示出了本实施例中对UPV和CAV进行修正的示意图;Fig. 2 shows the schematic diagram of correcting UPV and CAV in this embodiment;

图3A及图3B示出了本发明实施例中采用的艾宾霍斯遗忘曲线和遗忘因子;Fig. 3 A and Fig. 3 B have shown the Ebbinghaus forgetting curve and forgetting factor adopted in the embodiment of the present invention;

图4示出了本发明实施例中向用户推荐数字内容的装置的结构示意图。Fig. 4 shows a schematic structural diagram of an apparatus for recommending digital content to a user in an embodiment of the present invention.

具体实施方式Detailed ways

为了使本发明的技术方案及优点更加清楚明白,以下结合附图对本发明的示例性实施例进行进一步详细的说明,显然,所描述的实施例仅是本发明的一部分实施例,而不是所有实施例的穷举。并且在不冲突的情况下,本说明中的实施例及实施例中的特征可以互相结合。In order to make the technical solutions and advantages of the present invention clearer, the exemplary embodiments of the present invention will be further described in detail below in conjunction with the accompanying drawings. Obviously, the described embodiments are only part of the embodiments of the present invention, not all implementations. Exhaustive list of examples. And in the case of no conflict, the embodiments in this description and the features in the embodiments can be combined with each other.

发明人在发明过程中注意到:随着互联网技术的迅速发展,数字内容的种类越来越繁多,如何将待推荐数字内容推荐给感兴趣的用户已经成为多媒体领域中的一个重要问题。传统的数字内容推荐方式一般是根据人工输入的用户标签的属性向用户进行数字内容推荐,在推荐时不会考虑数字内容的适应用户,存在推荐精确度不高的问题。The inventor noticed during the invention process that with the rapid development of Internet technology, there are more and more types of digital content, how to recommend digital content to be recommended to interested users has become an important issue in the multimedia field. The traditional digital content recommendation method generally recommends digital content to users based on the attributes of user tags entered manually, and does not consider the adaptation of digital content to users when recommending, and there is a problem of low recommendation accuracy.

针对上述不足,本发明实施例提出了一种向用户推荐数字内容的方法,下面进行说明。In view of the above shortcomings, an embodiment of the present invention proposes a method for recommending digital content to a user, which will be described below.

本发明的实施建立在以下基础之上:The implementation of the present invention is based on the following foundations:

(1)大部分用户都能真实提供自身偏好信息。(1) Most users can truly provide their own preference information.

(2)用户的内容选择偏好与多维度有关(年龄,性别,职业,学历,爱好等)(2) The user's content selection preference is related to multiple dimensions (age, gender, occupation, education, hobbies, etc.)

(3)用户网络访问行为是基于用户偏好的一种客观行动。(3) User network access behavior is an objective action based on user preference.

(4)在样本足够多的情况下,内容也能表现出偏向性,偏向性反映了内容适合哪类用户。(4) When there are enough samples, the content can also show bias, which reflects which type of users the content is suitable for.

图1示出了本发明实施例中向用户推荐数字内容的方法的流程示意图,如图所示,包括:Fig. 1 shows a schematic flowchart of a method for recommending digital content to a user in an embodiment of the present invention, as shown in the figure, including:

步骤101,根据用户偏好量化获得用户偏好向量(UPV,User PreferenceVectors);Step 101, obtain user preference vectors (UPV, User PreferenceVectors) according to user preference quantification;

步骤102,根据数字内容的用户特征量化得到内容适配向量(CAV,ContentAdaptation Vectors);Step 102, obtain content adaptation vectors (CAV, ContentAdaptation Vectors) according to the user characteristic quantization of digital content;

步骤103,根据UPV和CAV之间的相关程度向用户推荐数字内容。Step 103, recommending digital content to the user according to the degree of correlation between the UPV and the CAV.

本实施例一方面根据用户的访问行为(如:看电影、下载游戏)来获得UPV,另一方面可以根据数字内容的用户特征反过来量化CAV,当用户再次使用时,可根据CAV和UPV之间的匹配程度向用户推荐所需的数字内容,大大提高了数字内容推荐的精确性。In this embodiment, on the one hand, the UPV is obtained according to the user's access behavior (such as: watching movies, downloading games), and on the other hand, the CAV can be reversely quantified according to the user characteristics of the digital content. According to the matching degree between them, the required digital content is recommended to users, which greatly improves the accuracy of digital content recommendation.

具体实施时,在步骤101中,用户在首次登录时,需填写用户偏好,存入用户偏好数据库,填写的用户偏好包括以下维度中的至少一种:During specific implementation, in step 101, when the user logs in for the first time, the user needs to fill in the user preference and store it in the user preference database. The filled user preference includes at least one of the following dimensions:

(1)年龄(20岁以下,20-30岁,30-40岁,40-50岁,50-60岁,60岁以上);(1) Age (under 20 years old, 20-30 years old, 30-40 years old, 40-50 years old, 50-60 years old, over 60 years old);

(2)性别(男,女);(2) Gender (male, female);

(3)职业(工人,农民、私企雇主、军人,学生,政企单位,退休,服务人员);(3) Occupation (workers, farmers, private employers, soldiers, students, government and enterprise units, retirees, service personnel);

(4)学历(小学和初中,高中和中专,本科和专科,研究生);(4) Educational background (primary and junior high school, high school and technical secondary school, undergraduate and junior college, postgraduate);

(5)爱好(文学、体育、阅读、旅游等)。(5) Hobbies (literature, sports, reading, travel, etc.).

根据用户首次登录时填写的用户偏好确定用户初始的UPV,用户填写的用户偏好的各维度与下述UPV各个分量一一对应,即,若用户首次登录时填写的用户偏好包括5个维度,那对应的UPV包括5个分量,分别为:The initial UPV of the user is determined according to the user preference filled in when the user logs in for the first time. Each dimension of the user preference filled in by the user corresponds to each component of the following UPV. That is, if the user preference filled in by the user when logging in for the first time includes 5 dimensions, then The corresponding UPV includes 5 components, namely:

(1)A(a,b,c,d,e,f)(1)A(a,b,c,d,e,f)

(2)B(a,b)(2)B(a,b)

(3)C(a,b,c,d,e,f,g,h)(3)C(a,b,c,d,e,f,g,h)

(4)D(a,b,c,d)(4)D(a,b,c,d)

(5)E(a,b,c,d)(5)E(a,b,c,d)

具体实施时,在步骤102中,内容第一次提供时,需要填写内容适配的用户特征,格式与上述步骤101中用户填写的用户偏好相同,包括以下维度中的至少一种:During specific implementation, in step 102, when the content is provided for the first time, user characteristics for content adaptation need to be filled in, and the format is the same as the user preference filled in by the user in step 101 above, including at least one of the following dimensions:

(1)年龄(20岁以下,20-30岁,30-40岁,40-50岁,50-60岁,60岁以上);(1) Age (under 20 years old, 20-30 years old, 30-40 years old, 40-50 years old, 50-60 years old, over 60 years old);

(2)性别(男,女);(2) Gender (male, female);

(3)职业(工人,农民、私企雇主、军人,学生,政企单位,退休,服务人员);(3) Occupation (workers, farmers, private employers, soldiers, students, government and enterprise units, retirees, service personnel);

(4)学历(小学和初中,高中和中专,本科和专科,研究生);(4) Educational background (primary and junior high school, high school and technical secondary school, undergraduate and junior college, postgraduate);

(5)爱好(文学、体育、阅读、旅游等)。(5) Hobbies (literature, sports, reading, travel, etc.).

根据内容第一次提供时填写的内容适配的用户特征,确定初始的CAV,内容适配的用户特征的各维度与下述CAV各个分量一一对应,即,若填写的内容适配的用户特征包括5个维度,那对应的CAV也包括5个分量,分别为:The initial CAV is determined according to the content-adapted user characteristics filled in when the content is provided for the first time, and each dimension of the content-adapted user characteristics corresponds to each component of the following CAV one by one, that is, if the filled-in content matches the user The feature includes 5 dimensions, and the corresponding CAV also includes 5 components, which are:

(1)A(a,b,c,d,e,f)(1)A(a,b,c,d,e,f)

(2)B(a,b)(2)B(a,b)

(3)C(a,b,c,d,e,f,g,h)(3)C(a,b,c,d,e,f,g,h)

(4)D(a,b,c,d)(4)D(a,b,c,d)

(5)E(a,b,c,d)(5)E(a,b,c,d)

具体实施时,步骤103中根据UPV和CAV之间的相关程度向用户推荐数字内容的过程如下:During specific implementation, the process of recommending digital content to the user according to the degree of correlation between UPV and CAV in step 103 is as follows:

步骤1031,通过相关系数匹配算法,如皮尔逊相关系数算法计算UPV和CAV之间对应维度的相关系数,得到各数字内容适配用户的相关系数向量|R|(ρ1,ρ2,ρ3,ρ4,ρ5),|R|大于等于0.8时认为是高度相关;|R|大于等于0.5小于0.8时认为是中度相关;|R|大于等于0.3小于0.5时认为是低度相关,小于0.3认为是不相关;Step 1031, calculate the correlation coefficients of the corresponding dimensions between UPV and CAV through a correlation coefficient matching algorithm, such as the Pearson correlation coefficient algorithm, and obtain the correlation coefficient vector |R|(ρ1, ρ2, ρ3, ρ4, ρ5), when |R| is greater than or equal to 0.8, it is considered to be highly correlated; when |R| is greater than or equal to 0.5 and less than 0.8, it is considered to be moderately correlated; relevant;

步骤1032,按照下表,确定各数字内容适配用户的相关系数向量各分量的积分,对各数字内容适配用户的相关系数向量各分量对应的积分求和的结果进行排序;Step 1032, according to the following table, determine the integral of each component of the correlation coefficient vector of each digital content adaptation user, and sort the results of summing the integrals corresponding to each component of the correlation coefficient vector of each digital content adaptation user;

ρρ 相关度relativity 积分integral 0.8≤ρ<=1.00.8≤ρ<=1.0 高度相关Highly correlated 55 0.5≤ρ<0.80.5≤ρ<0.8 中度相关Moderately relevant 44 0.3≤ρ<0.50.3≤ρ<0.5 低度相关low correlation 22 0≤ρ<0.30≤ρ<0.3 不相关irrelevant 11 -1.0≤ρ≤-0.8-1.0≤ρ≤-0.8 高度负相关highly negative correlation -5-5 -0.8≤ρ<-0.5-0.8≤ρ<-0.5 中度负相关moderate negative correlation -4-4 -0.5≤ρ<-0.3-0.5≤ρ<-0.3 低度负相关low negative correlation -2-2 -0.3≤ρ<0-0.3≤ρ<0 不相关irrelevant -1-1

步骤1033,根据步骤1032的排序结果获得数字内容的推荐顺序,按照该顺序向用户呈现数字内容,也可根据经营需要,通过人为增加积分方式,穿插需要推广的数字内容。Step 1033, obtain the recommended order of digital content according to the sorting result of step 1032, and present digital content to the user according to the order, or intersperse digital content to be promoted by artificially increasing points according to business needs.

图2示出了本实施例中对UPV和CAV进行修正的示意图。Fig. 2 shows a schematic diagram of correcting UPV and CAV in this embodiment.

进一步地,本实施例中,会根据系统运算能力,定期(如:10分钟,由于人的喜好多变,需要确保刷新周期小)根据用户历史访问内容(注意要剔除用户短时间浏览的内容,即阅读时长低于预设的阈值,如60秒)对UPV进行修正,修正时需考虑根据艾宾霍斯遗忘模型确定出的遗忘因子,图3A及图3B示出了本发明实施例中采用的艾宾霍斯遗忘曲线和遗忘因子,因为用户最近的访问行为更能体现出用户的偏好。Furthermore, in this embodiment, according to the computing power of the system, the content will be accessed regularly (for example: 10 minutes, because people's preferences are changeable, it is necessary to ensure that the refresh cycle is small) according to the user's history (note that the content browsed by the user for a short time should be removed, That is, the reading time is lower than the preset threshold, such as 60 seconds), and the UPV is corrected, and the forgetting factor determined according to the Ebbinghaus forgetting model needs to be considered when correcting. Figure 3A and Figure 3B show the The Ebbinghaus forgetting curve and forgetting factor, because the user's recent access behavior can better reflect the user's preference.

根据用户的历史访问内容的CAV对所述UPV进行修正的过程为:The process of correcting the UPV according to the CAV of the user's historical access content is as follows:

根据用户的历史访问内容计算CAV,将CAV的每个分量乘以遗忘因子修正后取算术平均值,即:CAV的每个分量值:Calculate the CAV based on the user's historical access content, and multiply each component of the CAV by the forgetting factor to obtain the arithmetic mean value, that is, the value of each component of the CAV:

其中:fn为用户浏览第n项内容,参照艾宾霍斯遗忘曲线得到的遗忘因子;an为用户浏览第n项内容的CAV的第一个分量,其他分量计算方式与之相同。Among them: f n is the forgetting factor obtained by referring to the Ebbinghaus forgetting curve when the user browses the nth item of content; a n is the first component of the CAV of the user browsing the nth item of content, and the other components are calculated in the same way.

将CAV各分量的算术平均值组合得到CAV′,进行归一化处理后得到CAV″,将归一化处理后得到的CAV″和UPV直接向量相加,然后得到的每个分量做归一化处理,最终得到修正后的UPV。Combine the arithmetic mean value of each component of CAV to obtain CAV', and then obtain CAV" after normalization processing, add the CAV" obtained after normalization processing and the direct vector of UPV, and then normalize each component obtained processing, and finally get the corrected UPV.

进一步地,本实施例中,会根据系统运算能力,定期(如:每天)根据用户历史访问内容确定出的UPV(需要剔除用户短时间浏览的内容,即阅读时长低于预设的阈值,如60秒),对CAV进行修正,具体过程如下:Further, in this embodiment, according to the computing power of the system, the UPV (for example, every day) determined based on the user's historical access content (need to exclude the content that the user browses for a short time, that is, the reading time is lower than the preset threshold, such as 60 seconds), correct the CAV, the specific process is as follows:

根据用户的历史访问内容计算UPV,将UPV的每个分量取算术平均值,即:Calculate the UPV based on the user's historical access content, and take the arithmetic mean of each component of the UPV, namely:

其中:an为用户浏览第n个用户的UPV的第一个分量,其他分量计算方式与之相同。Among them: a n is the first component of the UPV of the nth user browsed by the user, and the calculation method of other components is the same.

将UPV各分量的算术平均值组合得到UPV′,进行归一化处理后得到UPV″,将归一化处理后得到的UPV″和CAV直接向量相加,然后得到的每个分量做归一化处理,最终得到修正后的CAV。Combine the arithmetic mean value of each component of UPV to obtain UPV', and then obtain UPV" after normalization processing, add the UPV" obtained after normalization processing and the direct vector of CAV, and then normalize each component obtained processing, and finally get the corrected CAV.

本实施例中,根据用户的历史访问内容会对UPV和CAV进行定期修正,从而可以获得最为准确的推荐效果。In this embodiment, the UPV and CAV are regularly corrected according to the user's historical access content, so that the most accurate recommendation effect can be obtained.

基于同一发明构思,本发明实施例中还提供了一种向用户推荐数字内容的装置,由于本实施例所述装置解决问题的原理与上述一种向用户推荐数字内容的方法相似,因此本实施例的实施可以参见方法的实施,重复之处不再赘述。Based on the same inventive concept, an embodiment of the present invention also provides a device for recommending digital content to users. Since the problem-solving principle of the device in this embodiment is similar to the above-mentioned method for recommending digital content to users, this embodiment For the implementation of the example, refer to the implementation of the method, and the repetition will not be repeated.

图4示出了本发明实施例中向用户推荐数字内容的装置的结构示意图,如图所示,装置可以包括:Fig. 4 shows a schematic structural diagram of a device for recommending digital content to users in an embodiment of the present invention. As shown in the figure, the device may include:

用户偏好向量确定模块401,根据用户偏好量化获得UPV;The user preference vector determination module 401 obtains the UPV according to the user preference quantization;

内容适配向量确定模块402,用于根据数字内容的用户特征量化得到CAV;A content adaptation vector determination module 402, configured to quantify and obtain CAV according to user characteristics of digital content;

内容推荐模块403,用于根据UPV和CAV之间的相关程度向用户推荐数字内容。The content recommendation module 403 is configured to recommend digital content to the user according to the degree of correlation between the UPV and the CAV.

UPV的维度包括年龄、性别、职业、学历、爱好中的至少一种;The dimensions of UPV include at least one of age, gender, occupation, education and hobbies;

CAV的维度包括年龄、性别、职业、学历、爱好中的至少一种。The dimensions of CAV include at least one of age, gender, occupation, education and hobbies.

内容推荐模块403通过相关系数匹配算法计算UPV和CAV之间对应维度的相关系数,得到各数字内容适配用户的相关系数向量,根据各数字内容适配用户的相关系数向量各维度的积分和的排序,向用户进行数字内容推荐。The content recommendation module 403 calculates the correlation coefficient of the corresponding dimension between UPV and CAV through the correlation coefficient matching algorithm, obtains the correlation coefficient vector of each digital content adaptation user, and according to the integral sum of each dimension of the correlation coefficient vector of each digital content adaptation user Sort and recommend digital content to users.

进一步地,用户偏好向量确定模块401还用于根据用户的历史访问内容的CAV,对UPV进行定期修正,修正的过程为:Further, the user preference vector determination module 401 is also used to periodically revise the UPV according to the CAV of the user's historical access content, and the revision process is as follows:

根据用户的历史访问内容计算CAV,将CAV的每个分量乘以遗忘因子修正后取算术平均值,遗忘因子为根据艾宾霍斯遗忘曲线得到的;Calculate the CAV based on the user's historical access content, multiply each component of the CAV by the forgetting factor and correct it to take the arithmetic mean value, and the forgetting factor is obtained according to the Ebbinghaus forgetting curve;

将CAV各分量的算术平均值组合得到CAV′,进行归一化处理后得到CAV″,将归一化处理后得到的CAV″和UPV直接向量相加,然后得到的每个分量做归一化处理,最终得到修正后的UPV。Combine the arithmetic mean value of each component of CAV to obtain CAV', and then obtain CAV" after normalization processing, add the CAV" obtained after normalization processing and the direct vector of UPV, and then normalize each component obtained processing, and finally get the corrected UPV.

进一步地,内容适配向量确定模块402还用于根据用户的历史访问内容确定出的UPV,对CAV进行定期修正,修正的过程为:Further, the content adaptation vector determination module 402 is also used to periodically correct the CAV according to the UPV determined by the user's historical access content, and the correction process is as follows:

根据用户的历史访问内容计算UPV,将UPV的每个分量取算术平均值;Calculate the UPV based on the user's historical access content, and take the arithmetic mean of each component of the UPV;

将UPV各分量的算术平均值组合得到UPV′,进行归一化处理后得到UPV″,将归一化处理后得到的UPV″和CAV直接向量相加,然后得到的每个分量做归一化处理,最终得到修正后的CAV。Combine the arithmetic mean value of each component of UPV to obtain UPV', and then obtain UPV" after normalization processing, add the UPV" obtained after normalization processing and the direct vector of CAV, and then normalize each component obtained processing, and finally get the corrected CAV.

为了描述的方便,以上所述装置的各部分以功能分为各种模块或单元分别描述。当然,在实施本发明时可以把各模块或单元的功能在同一个或多个软件或硬件中实现。For the convenience of description, each part of the device described above is divided into various modules or units by function and described separately. Of course, when implementing the present invention, the functions of each module or unit can be implemented in one or more pieces of software or hardware.

本领域内的技术人员应明白,本发明的实施例可提供为方法、系统、或计算机程序产品。因此,本发明可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本发明可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art should understand that the embodiments of the present invention may be provided as methods, systems, or computer program products. Accordingly, the present invention can take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.

本发明是参照根据本发明实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It should be understood that each procedure and/or block in the flowchart and/or block diagram, and a combination of procedures and/or blocks in the flowchart and/or block diagram can be realized by computer program instructions. These computer program instructions may be provided to a general purpose computer, special purpose computer, embedded processor, or processor of other programmable data processing equipment to produce a machine such that the instructions executed by the processor of the computer or other programmable data processing equipment produce a An apparatus for realizing the functions specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.

这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing apparatus to operate in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture comprising instruction means, the instructions The device realizes the function specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.

这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded onto a computer or other programmable data processing device, causing a series of operational steps to be performed on the computer or other programmable device to produce a computer-implemented process, thereby The instructions provide steps for implementing the functions specified in the flow chart or blocks of the flowchart and/or the block or blocks of the block diagrams.

尽管已描述了本发明的优选实施例,但本领域内的技术人员一旦得知了基本创造性概念,则可对这些实施例作出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本发明范围的所有变更和修改。While preferred embodiments of the invention have been described, additional changes and modifications to these embodiments can be made by those skilled in the art once the basic inventive concept is appreciated. Therefore, it is intended that the appended claims be construed to cover the preferred embodiment as well as all changes and modifications which fall within the scope of the invention.

Claims (11)

1.一种向用户推荐数字内容的方法,其特征在于,包括:1. A method for recommending digital content to users, comprising: 根据用户偏好量化获得用户偏好向量;Obtain a user preference vector according to user preference quantification; 根据数字内容的用户特征量化得到内容适配向量;Quantify the content adaptation vector according to the user characteristics of the digital content; 根据用户偏好向量和内容适配向量之间的相关程度向用户推荐数字内容。Digital content is recommended to the user according to the degree of correlation between the user preference vector and the content adaptation vector. 2.如权利要求1所述的方法,其特征在于,所述用户偏好向量的维度包括年龄、性别、职业、学历、爱好中的至少一种;2. The method according to claim 1, wherein the dimension of the user preference vector includes at least one of age, gender, occupation, education and hobbies; 所述内容适配向量的维度包括年龄、性别、职业、学历、爱好中的至少一种。The dimensions of the content adaptation vector include at least one of age, gender, occupation, education and hobbies. 3.如权利要求2所述的方法,其特征在于,所述根据用户偏好向量和内容适配向量之间的相关程度向用户推荐数字内容包括:3. The method according to claim 2, wherein said recommending digital content to the user according to the degree of correlation between the user preference vector and the content adaptation vector comprises: 通过相关系数匹配算法计算内容适配向量和用户偏好向量之间对应维度的相关系数,得到各数字内容适配用户的相关系数向量;Calculate the correlation coefficient of the corresponding dimension between the content adaptation vector and the user preference vector through the correlation coefficient matching algorithm, and obtain the correlation coefficient vector of each digital content adaptation user; 确定所述各数字内容适配用户的相关系数向量各分量的积分和的排序,向用户进行数字内容推荐。Determine the ranking of the integral sum of each component of the correlation coefficient vector for each digital content adapted to the user, and recommend digital content to the user. 4.如权利要求1或2所述的方法,其特征在于,还包括根据用户的历史访问内容的内容适配向量,对所述用户偏好向量进行定期修正的步骤。4. The method according to claim 1 or 2, further comprising the step of regularly revising the user preference vector according to the content adaptation vector of the user's historical access content. 5.如权利要求4所述的方法,其特征在于,根据用户的历史访问内容的内容适配向量对所述用户偏好向量进行修正的过程为:5. The method according to claim 4, wherein the process of modifying the user preference vector according to the content adaptation vector of the user's historical access content is as follows: 根据用户的历史访问内容计算内容适配向量,将内容适配向量的每个分量乘以遗忘因子修正后取算术平均值,所述遗忘因子为根据艾宾霍斯遗忘曲线得到的;Calculate the content adaptation vector according to the user's historical access content, multiply each component of the content adaptation vector by the forgetting factor and correct and take the arithmetic mean value, and the forgetting factor is obtained according to the Ebbinghaus forgetting curve; 将内容适配向量各分量的算术平均值组合并进行归一化处理,利用归一化处理后的内容适配向量对所述用户偏好向量进行修正。The arithmetic mean value of each component of the content adaptation vector is combined and normalized, and the user preference vector is corrected by using the normalized content adaptation vector. 6.如权利要求1或2所述的方法,其特征在于,还包括根据用户的历史访问内容确定出的用户偏好向量,对所述内容适配向量进行定期修正的步骤。6. The method according to claim 1 or 2, further comprising the step of regularly revising the content adaptation vector based on the user preference vector determined from the user's historical content access. 7.如权利要求6所述的方法,其特征在于,根据用户的历史访问内容确定出的用户偏好向量,对所述内容适配向量进行修正的过程为:7. The method according to claim 6, wherein the process of modifying the content adaptation vector is as follows: 根据用户的历史访问内容计算用户偏好向量,将用户偏好向量的每个分量取算术平均值;Calculate the user preference vector according to the user's historical access content, and take the arithmetic mean value of each component of the user preference vector; 将用户偏好向量各分量的算术平均值组合并进行归一化处理,利用归一化处理后的用户偏好向量向量对所述内容适配向量进行修正。The arithmetic mean value of each component of the user preference vector is combined and normalized, and the content adaptation vector is corrected by using the normalized user preference vector. 8.一种向用户推荐数字内容的装置,其特征在于,包括:8. A device for recommending digital content to users, comprising: 用户偏好向量确定模块,用于用户偏好量化获得用户偏好向量;A user preference vector determination module, used for user preference quantification to obtain a user preference vector; 内容适配向量确定模块,用于根据数字内容的用户特征量化得到内容适配向量;A content adaptation vector determination module, configured to quantify and obtain a content adaptation vector according to user characteristics of the digital content; 内容推荐模块,用于根据用户偏好向量和内容适配向量之间的相关程度向用户推荐数字内容。The content recommendation module is used for recommending digital content to the user according to the degree of correlation between the user preference vector and the content adaptation vector. 9.如权利要求8所述的装置,其特征在于,所述用户偏好向量的维度包括年龄、性别、职业、学历、爱好中的至少一种;9. The device according to claim 8, wherein the dimension of the user preference vector includes at least one of age, gender, occupation, education and hobbies; 所述内容适配向量的维度包括年龄、性别、职业、学历、爱好中的至少一种。The dimensions of the content adaptation vector include at least one of age, gender, occupation, education and hobbies. 10.如权利要求8所述的装置,其特征在于,所述内容推荐模块通过相关系数匹配算法计算内容适配向量和用户偏好向量之间对应维度的相关系数,得到各数字内容适配用户的相关系数向量,根据所述各数字内容适配用户的相关系数向量各分量的积分和的排序,向用户进行数字内容推荐。10. The device according to claim 8, wherein the content recommendation module calculates the correlation coefficient of the corresponding dimension between the content adaptation vector and the user preference vector through a correlation coefficient matching algorithm, and obtains each digital content adaptation user's The correlation coefficient vector is used to recommend the digital content to the user according to the ranking of the integral sum of each component of the correlation coefficient vector of the user for each digital content. 11.如权利要求8所述的装置,其特征在于,所述用户偏好向量确定模块还用于根据用户的历史访问内容的内容适配向量,对所述用户偏好向量进行定期修正;11. The device according to claim 8, wherein the user preference vector determining module is further configured to periodically revise the user preference vector according to the content adaptation vector of the user's historical access content; 所述内容适配向量确定模块还用于根据用户的历史访问内容确定出的用户偏好向量,对所述内容适配向量进行定期修正。The content adaptation vector determination module is further configured to periodically revise the content adaptation vector based on the user preference vector determined from the user's historical access to content.
CN201610940103.5A 2016-10-25 2016-10-25 To the method and device of user's recommending digital content Pending CN107977383A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610940103.5A CN107977383A (en) 2016-10-25 2016-10-25 To the method and device of user's recommending digital content

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610940103.5A CN107977383A (en) 2016-10-25 2016-10-25 To the method and device of user's recommending digital content

Publications (1)

Publication Number Publication Date
CN107977383A true CN107977383A (en) 2018-05-01

Family

ID=62005017

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610940103.5A Pending CN107977383A (en) 2016-10-25 2016-10-25 To the method and device of user's recommending digital content

Country Status (1)

Country Link
CN (1) CN107977383A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113961798A (en) * 2021-09-26 2022-01-21 深圳市铠硕达科技有限公司 Cloud platform data management method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070043817A1 (en) * 1999-07-27 2007-02-22 MailFrontier, Inc. a wholly owned subsidiary of Personalized electronic-mail delivery
CN103064924A (en) * 2012-12-17 2013-04-24 浙江鸿程计算机系统有限公司 Travel destination situation recommendation method based on geotagged photo excavation
CN103544623A (en) * 2013-11-06 2014-01-29 武汉大学 Web service recommendation method based on user preference feature modeling
CN105430444A (en) * 2015-11-17 2016-03-23 深圳创维数字技术有限公司 A program recommendation method and digital TV receiving terminal
CN105488662A (en) * 2016-01-07 2016-04-13 北京歌利沃夫企业管理有限公司 Bi-directional recommendation-based online recruitment system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070043817A1 (en) * 1999-07-27 2007-02-22 MailFrontier, Inc. a wholly owned subsidiary of Personalized electronic-mail delivery
CN103064924A (en) * 2012-12-17 2013-04-24 浙江鸿程计算机系统有限公司 Travel destination situation recommendation method based on geotagged photo excavation
CN103544623A (en) * 2013-11-06 2014-01-29 武汉大学 Web service recommendation method based on user preference feature modeling
CN105430444A (en) * 2015-11-17 2016-03-23 深圳创维数字技术有限公司 A program recommendation method and digital TV receiving terminal
CN105488662A (en) * 2016-01-07 2016-04-13 北京歌利沃夫企业管理有限公司 Bi-directional recommendation-based online recruitment system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113961798A (en) * 2021-09-26 2022-01-21 深圳市铠硕达科技有限公司 Cloud platform data management method and system
CN113961798B (en) * 2021-09-26 2022-09-16 深圳市铠硕达科技有限公司 Cloud platform data management method and system

Similar Documents

Publication Publication Date Title
US10699321B2 (en) Global vector recommendations based on implicit interaction and profile data
US10339171B2 (en) Dynamic feedback in a recommendation system
CN106503022B (en) Method and device for pushing recommended information
EP2950226A1 (en) New heuristic for optimizing non-convex function for learning to rank
US9760610B2 (en) Personalized search using searcher features
CN107330715B (en) Method and device for selecting picture advertisement material
CN103678329B (en) Recommend method and device
US9679018B1 (en) Document ranking based on entity frequency
JP2021103542A (en) Information providing device, information providing method, and program
US10089675B1 (en) Probabilistic matrix factorization system based on personas
CN112435070B (en) Methods, devices, equipment and storage media for determining user age
US11651255B2 (en) Method and apparatus for object preference prediction, and computer readable medium
WO2018133566A1 (en) Method and device for determining freshness of promotional information, electronic device, and storage medium
WO2020047654A1 (en) Noise contrastive estimation for collaborative filtering
CN114330752A (en) Sorting model training method and sorting method
CN105654198A (en) Brand advertisement effect optimization method capable of realizing optimal threshold value selection
US20170359437A1 (en) Generating job recommendations based on job postings with similar positions
CN110058992B (en) Text template effect feedback method and device and electronic equipment
CN113641915B (en) Object recommendation method, device, equipment, storage medium and program product
CN119226626A (en) An intelligent recommendation method, system and storage medium based on big data mining
WO2015158224A1 (en) Message recommendation method and apparatus
CN112000795B (en) Official document recommendation method and device
CN107977383A (en) To the method and device of user&#39;s recommending digital content
CN108550019A (en) A kind of resume selection method and device
CN109063137A (en) A kind of recommended determines method, apparatus, equipment and readable storage medium storing program for executing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180501

RJ01 Rejection of invention patent application after publication