[go: up one dir, main page]

CN112000795A - Official document recommendation method and device - Google Patents

Official document recommendation method and device Download PDF

Info

Publication number
CN112000795A
CN112000795A CN202010773625.7A CN202010773625A CN112000795A CN 112000795 A CN112000795 A CN 112000795A CN 202010773625 A CN202010773625 A CN 202010773625A CN 112000795 A CN112000795 A CN 112000795A
Authority
CN
China
Prior art keywords
document
target user
official
official document
potential
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010773625.7A
Other languages
Chinese (zh)
Other versions
CN112000795B (en
Inventor
聂砂
白彧斐
贾国琛
郑江
罗奕康
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Construction Bank Corp
Original Assignee
China Construction Bank Corp
CCB Finetech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Construction Bank Corp, CCB Finetech Co Ltd filed Critical China Construction Bank Corp
Priority to CN202010773625.7A priority Critical patent/CN112000795B/en
Publication of CN112000795A publication Critical patent/CN112000795A/en
Application granted granted Critical
Publication of CN112000795B publication Critical patent/CN112000795B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/335Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method and a device for recommending official documents, and relates to the technical field of computers. One embodiment of the method comprises: acquiring an incidence relation among a plurality of documents; determining the official documents to be recommended in the multiple official documents according to the incidence relation and the characteristic data of the target user; and recommending the document to be recommended to the target user. According to the embodiment, the official documents can be recommended to the target user according to the incidence relation among the official documents, so that higher recommendation accuracy is achieved.

Description

公文推荐方法和装置Official document recommendation method and device

技术领域technical field

本发明涉及计算机技术领域,尤其涉及一种公文推荐方法和装置。The present invention relates to the field of computer technology, and in particular, to a method and device for recommending official documents.

背景技术Background technique

公文与新闻、资讯等内容具有明显的不同,其特点是内容较为固定、要素较为齐全、结构较为清晰。在目前的公文推荐技术中,主要采用传统的推荐方法,即基于推荐对象之间的相似度或者用户之间的相似度进行推荐,但事实上,不同的公文之间会基于内容呈现出明显的逻辑关联关系,上述推荐方法不考虑这种关联关系,因此准确性较低。Official documents are obviously different from news, information and other content, and are characterized by relatively fixed content, relatively complete elements, and relatively clear structure. In the current document recommendation technology, the traditional recommendation method is mainly used, that is, the recommendation is made based on the similarity between the recommended objects or the similarity between users, but in fact, different documents will show obvious differences based on the content. Logical association, the above recommendation method does not consider this association, so the accuracy is low.

发明内容SUMMARY OF THE INVENTION

有鉴于此,本发明实施例提供一种公文推荐方法和装置,能够根据公文之间的关联关系向目标用户推荐公文,从而具有较高的推荐准确性。In view of this, the embodiments of the present invention provide a method and apparatus for recommending official documents, which can recommend official documents to target users according to the association relationship between the official documents, thereby having high recommendation accuracy.

为实现上述目的,根据本发明的一个方面,提供了一种公文推荐方法。To achieve the above object, according to an aspect of the present invention, a method for recommending official documents is provided.

本发明实施例的公文推荐方法包括:获取多个公文之间的关联关系;根据所述关联关系和目标用户的特征数据确定所述多个公文中的待推荐公文;将待推荐公文向目标用户推荐。The method for recommending official documents according to the embodiment of the present invention includes: acquiring an association relationship between a plurality of official documents; determining an official document to be recommended among the plurality of official documents according to the association relationship and the feature data of the target user; sending the official document to be recommended to the target user recommend.

可选地,所述多个公文之间的关联关系包括:与所述多个公文中的任一公文具有关联关系的至少一个公文以及该关联关系的类型;以及,所述根据所述关联关系和目标用户的特征数据确定所述多个公文中的待推荐公文,包括:对于所述多个公文中的任一公文:根据与该任一公文具有关联关系的公文以及该关联关系的类型确定该任一公文的特征向量;依据该任一公文的特征向量以及目标用户的特征数据确定目标用户对该任一公文的第一潜在关注度;基于所述第一潜在关注度获取目标用户对该任一公文的综合潜在关注度;在所述综合潜在关注度符合预设的推荐条件时,将该任一公文确定为待推荐公文。Optionally, the association relationship between the multiple official documents includes: at least one official document that has an association relationship with any official document in the multiple official documents, and the type of the association relationship; and, according to the association relationship and the feature data of the target user to determine the official document to be recommended among the multiple official documents, including: for any official document in the multiple official documents: determining according to the official document that has an associated relationship with the any official document and the type of the association relationship The feature vector of the any official document; according to the feature vector of the any official document and the feature data of the target user, determine the first potential degree of attention of the target user to the any official document; The comprehensive potential attention degree of any official document; when the comprehensive potential attention degree meets the preset recommendation conditions, any official document is determined as the official document to be recommended.

可选地,所述根据与该任一公文具有关联关系的公文以及该关联关系的类型确定该任一公文的特征向量,包括:对于所述多个公文中的任一公文:将该任一公文的原始特征、与该任一公文具有关联关系的每一公文的特征向量、以及该关联关系的类型对应的预设权重值输入预设函数进行计算,得到该任一公文的特征向量。Optionally, the determining the feature vector of the any official document according to the official document having an associated relationship with the any official document and the type of the association relationship includes: for any official document in the plurality of official documents: the any official document The original feature of the official document, the feature vector of each official document that has an association relationship with the official document, and the preset weight value corresponding to the type of the association relationship are input into a preset function for calculation to obtain the feature vector of any official document.

可选地,所述根据与该任一公文具有关联关系的公文以及该关联关系的类型确定该任一公文的特征向量,包括:将所述多个公文的原始特征和所述多个公文之间的关联关系数据输入预先训练的特征提取模型,得到所述多个公文中任一公文的特征向量。Optionally, the determining the feature vector of any official document according to the official document having an associated relationship with the any official document and the type of the association relationship includes: combining the original features of the multiple official documents and the difference between the multiple official documents. The correlation data between the two is input into a pre-trained feature extraction model, and a feature vector of any one of the multiple official documents is obtained.

可选地,目标用户的特征数据包括目标用户的当前特征向量;以及,所述方法进一步包括:获取目标用户的初始特征向量;根据目标用户的初始特征向量和目标用户针对公文的历史行为数据确定目标用户的当前特征向量。Optionally, the feature data of the target user includes the current feature vector of the target user; and, the method further includes: obtaining the initial feature vector of the target user; determining according to the initial feature vector of the target user and the historical behavior data of the target user for the official document The current feature vector of the target user.

可选地,所述获取目标用户的初始特征向量,包括:根据目标用户提供的偏好公文类别确定目标用户的初始特征向量。Optionally, the obtaining the initial feature vector of the target user includes: determining the initial feature vector of the target user according to the preferred document category provided by the target user.

可选地,所述根据目标用户的初始特征向量和目标用户针对公文的历史行为数据确定目标用户的当前特征向量,包括:获取所述历史行为数据对应的每一公文的特征向量;将任一公文的特征向量与该公文的影响分数相乘得到该公文的影响向量;其中,所述影响分数根据目标用户针对该公文的历史行为的类型确定;根据每一公文的影响向量与目标用户的初始特征向量确定目标用户的当前特征向量。Optionally, determining the current feature vector of the target user according to the initial feature vector of the target user and the historical behavior data of the target user for the official document includes: acquiring the feature vector of each official document corresponding to the historical behavior data; The feature vector of the official document is multiplied by the impact score of the official document to obtain the impact vector of the official document; wherein, the impact score is determined according to the type of historical behavior of the target user with respect to the official document; according to the impact vector of each official document and the target user's initial The feature vector determines the current feature vector of the target user.

可选地,所述依据该任一公文的特征向量以及目标用户的特征数据确定目标用户对该任一公文的第一潜在关注度,包括:对于所述多个公文中的任一公文,计算该任一公文的特征向量与目标用户的当前特征向量的相似度;依据该相似度确定目标用户对该任一公文的第一潜在关注度。Optionally, the determining the first potential degree of attention of the target user to the any official document according to the feature vector of the any official document and the characteristic data of the target user includes: for any official document in the plurality of official documents, calculating The similarity between the feature vector of any official document and the current feature vector of the target user; the first potential degree of attention of the target user to the any official document is determined according to the similarity.

可选地,所述依据该任一公文的特征向量以及目标用户的特征数据确定目标用户对该任一公文的第一潜在关注度,包括:将所述多个公文中任一公文的特征向量和目标用户的当前特征向量输入预先训练的意图识别模型,得到目标用户对该任一公文的关注概率;依据该关注概率确定目标用户对该任一公文的第一潜在关注度。Optionally, the determining the first potential degree of attention of the target user to the any official document according to the feature vector of the any official document and the characteristic data of the target user includes: converting the feature vector of any official document in the plurality of official documents and the current feature vector of the target user into the pre-trained intent recognition model to obtain the attention probability of the target user to any official document; determine the first potential degree of attention of the target user to any official document according to the attention probability.

可选地,所述基于所述第一潜在关注度获取目标用户对该任一公文的综合潜在关注度,包括:将目标用户对所述多个公文中任一公文的第一潜在关注度确定为目标用户对该任一公文的综合潜在关注度。Optionally, the acquiring the comprehensive potential degree of attention of the target user to any official document based on the first potential degree of interest includes: determining the first potential degree of interest of the target user to any official document among the plurality of official documents. It is the comprehensive potential attention of the target user to any official document.

可选地,所述方法进一步包括:对于所述多个公文中的任一公文,计算该任一公文的原始特征与目标用户的特征数据的相似度;依据该相似度确定目标用户对该任一公文的第二潜在关注度;以及,所述基于所述第一潜在关注度获取目标用户对该任一公文的综合潜在关注度,包括:结合所述第一潜在关注度和所述第二潜在关注度确定目标用户对该任一公文的综合潜在关注度。Optionally, the method further includes: for any official document in the plurality of official documents, calculating the similarity between the original feature of the any official document and the feature data of the target user; determining the similarity between the target user and the target user according to the similarity. The second potential degree of attention of an official document; and the acquiring, based on the first potential degree of attention, the comprehensive potential degree of attention of the target user to any official document includes: combining the first potential degree of attention and the second degree of potential attention The potential attention degree determines the comprehensive potential attention degree of the target user to any official document.

可选地,所述方法进一步包括:将所述多个公文中任一公文的原始特征和目标用户的综合特征输入预先训练的意图判别模型,得到目标用户对该任一公文的关注概率,依据该关注概率确定目标用户对该任一公文的第二潜在关注度;其中,目标用户的综合特征由目标用户针对公文的历史行为数据和目标用户的属性特征确定;以及,所述基于所述第一潜在关注度获取目标用户对该任一公文的综合潜在关注度,包括:结合所述第一潜在关注度和所述第二潜在关注度确定目标用户对该任一公文的综合潜在关注度。Optionally, the method further includes: inputting the original features of any official document among the multiple official documents and the comprehensive characteristics of the target user into a pre-trained intent discrimination model, to obtain the attention probability of the target user to any official document, according to The attention probability determines the second potential degree of attention of the target user to any official document; wherein, the comprehensive characteristics of the target user are determined by the historical behavior data of the target user on the official document and the attribute characteristics of the target user; Obtaining the comprehensive potential attention degree of the target user to any official document by a potential degree of attention includes: combining the first potential attention degree and the second potential attention degree to determine the comprehensive potential attention degree of the target user to any official document.

可选地,所述方法进一步包括:对于所述多个公文中的任一公文,依据该任一公文发布时刻与当前时刻之间的时长确定第一时间惩罚项,使用第一时间惩罚项确定目标用户对该任一公文的第三潜在关注度;以及,所述基于所述第一潜在关注度获取目标用户对该任一公文的综合潜在关注度,包括:结合所述第一潜在关注度、所述第二潜在关注度和所述第三潜在关注度确定目标用户对该任一公文的综合潜在关注度。Optionally, the method further includes: for any official document in the plurality of official documents, determining a first time penalty item according to the time length between the release moment of the any official document and the current moment, and using the first time penalty item to determine. the third potential degree of attention of the target user to any official document; and the acquiring, based on the first potential degree of interest, the comprehensive potential degree of interest of the target user to the any official document includes: combining the first potential degree of interest , the second potential attention degree and the third potential attention degree determine the comprehensive potential attention degree of the target user to any official document.

可选地,所述方法进一步包括:对于所述多个公文中的任一公文:当目标用户在历史时刻浏览过该任一公文时,依据目标用户最近一次浏览该任一公文的时刻与当前时刻之间的时长确定第二时间惩罚项;以及,所述使用第一时间惩罚项确定目标用户对该任一公文的第三潜在关注度,包括:使用第一时间惩罚项和第二时间惩罚项确定目标用户对该任一公文的第三潜在关注度。Optionally, the method further includes: for any official document in the plurality of official documents: when the target user has browsed the any official document at a historical moment, according to the time when the target user browsed the any official document last time and the current The time length between the moments determines the second time penalty item; and, the use of the first time penalty item to determine the third potential degree of attention of the target user to any official document includes: using the first time penalty item and the second time penalty item determines the target user's third potential degree of attention to any official document.

可选地,所述方法进一步包括:对于所述多个公文中的任一公文,根据该任一公文在至少一个历史时间间隔的浏览数量确定该任一公文的第四潜在关注度;以及,所述基于所述第一潜在关注度获取目标用户对该任一公文的综合潜在关注度,包括:结合所述第一潜在关注度、所述第二潜在关注度、所述第三潜在关注度和所述第四潜在关注度确定目标用户对该任一公文的综合潜在关注度。Optionally, the method further includes: for any official document in the plurality of official documents, determining a fourth potential degree of attention of the any official document according to the number of views of the any official document in at least one historical time interval; and, The acquiring, based on the first potential degree of attention, the target user's comprehensive potential degree of attention to any official document includes: combining the first potential degree of attention, the second potential degree of attention, and the third potential degree of attention and the fourth potential degree of attention to determine the comprehensive potential degree of attention of the target user to any official document.

可选地,所述方法进一步包括:当所述多个公文中的任一公文与预先确定的热点事件相关时,按照预设策略提高该任一公文的第四潜在关注度。Optionally, the method further includes: when any official document in the plurality of official documents is related to a predetermined hot event, increasing the fourth potential degree of attention of the any official document according to a preset strategy.

可选地,所述方法进一步包括:当所述多个公文中的任一公文和目标用户满足至少一条预设规则时,根据所述预设规则中设置的推荐权重确定该任一公文的第五潜在关注度;所述基于所述第一潜在关注度获取目标用户对该任一公文的综合潜在关注度,包括:结合所述第一潜在关注度、所述第二潜在关注度、所述第三潜在关注度、所述第四潜在关注度和第五潜在关注度确定目标用户对该任一公文的综合潜在关注度。Optionally, the method further includes: when any official document in the plurality of official documents and the target user satisfy at least one preset rule, determining the No. 1 position of the any official document according to the recommendation weight set in the preset rule. Five potential degrees of attention; the acquiring, based on the first potential degree of attention, the target user's comprehensive potential degree of attention to any official document includes: combining the first potential degree of attention, the second potential degree of attention, the The third potential interest degree, the fourth potential interest degree and the fifth potential interest degree determine the comprehensive potential interest degree of the target user on any official document.

可选地,所述根据所述预设规则中设置的推荐权重确定该任一公文的第五潜在关注度,包括:当所述预设规则为一条时,将该预设规则中设置的推荐权重确定为该任一公文的第五潜在关注度;当所述预设规则为多条时,将多条预设规则中推荐权重的最大值确定为该任一公文的第五潜在关注度。Optionally, determining the fifth potential degree of attention of any official document according to the recommendation weight set in the preset rule includes: when the preset rule is one, recommending the recommendation set in the preset rule. The weight is determined as the fifth potential degree of attention of any official document; when there are multiple preset rules, the maximum recommended weight among the multiple preset rules is determined as the fifth potential degree of interest of the any official document.

可选地,所述基于所述第一潜在关注度获取目标用户对该任一公文的综合潜在关注度,包括:将所述第一潜在关注度、所述第二潜在关注度、所述第三潜在关注度、所述第四潜在关注度和第五潜在关注度的加权和确定为目标用户对该任一公文的综合潜在关注度。Optionally, the acquiring the comprehensive potential degree of attention of the target user to any official document based on the first potential degree of attention includes: combining the first potential degree of attention, the second potential degree of attention, the first The weighted sum of the three potential attention degrees, the fourth potential attention degree and the fifth potential attention degree is determined as the comprehensive potential attention degree of the target user to any official document.

可选地,所述在所述综合潜在关注度符合预设的推荐条件时,将该任一公文确定为待推荐公文,包括:将所述多个公文按照综合潜在关注度从大到小的顺序排列;将排列在前的预设数量的公文确定为待推荐公文。Optionally, when the comprehensive potential degree of attention meets a preset recommendation condition, determining any of the official documents as the official document to be recommended includes: sorting the plurality of official documents according to the comprehensive potential degree of interest from large to small. Arrange in order; determine the pre-arranged official documents as the official documents to be recommended.

可选地,目标用户的特征数据包括:目标用户在预设的历史时间段浏览、收藏或订阅过的公文;以及,所述根据所述关联关系和目标用户的特征数据确定所述多个公文中的待推荐公文,包括:将目标用户在预设的历史时间段浏览、收藏或订阅过的所述多个公文中的公文确定为目标公文;将与目标公文具有关联关系的至少一个公文确定为待推荐公文。Optionally, the characteristic data of the target user includes: official documents that the target user has browsed, favorited or subscribed to in a preset historical time period; The official documents to be recommended in the document include: determining the official document among the plurality of official documents that the target user browsed, favorited or subscribed to in the preset historical time period as the target official document; determining at least one official document that has an associated relationship with the target official document For the recommendation letter.

可选地,所述关联关系的类型包括:修订关系、废止关系、根据关系和提及关系;所述历史行为的类型包括:浏览、收藏、订阅、忽视、取消收藏和取消订阅。Optionally, the types of the association relationship include: revision relationship, revocation relationship, based on relationship, and mention relationship; and the types of the historical behavior include: browse, favorite, subscribe, ignore, unfavorite, and unsubscribe.

为实现上述目的,根据本发明的另一方面,提供了一种公文推荐装置。To achieve the above object, according to another aspect of the present invention, a device for recommending official documents is provided.

本发明实施例的公文推荐装置可以包括:信息抽取单元,用于获取多个公文之间的关联关系;公文确定单元,用于根据所述关联关系和目标用户的特征数据确定所述多个公文中的待推荐公文;推荐单元,用于将待推荐公文向目标用户推荐。The apparatus for recommending official documents according to the embodiment of the present invention may include: an information extraction unit, configured to acquire an association relationship between a plurality of official documents; an official document determination unit, configured to determine the multiple official documents according to the association relationship and characteristic data of a target user The document to be recommended in ; the recommendation unit is used to recommend the document to be recommended to the target user.

为实现上述目的,根据本发明的又一方面,提供了一种电子设备。To achieve the above object, according to yet another aspect of the present invention, an electronic device is provided.

本发明的一种电子设备包括:一个或多个处理器;存储装置,用于存储一个或多个程序,当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现本发明所提供的公文推荐方法。An electronic device of the present invention comprises: one or more processors; a storage device for storing one or more programs, when the one or more programs are executed by the one or more processors, the one or more programs make the One or more processors implement the document recommendation method provided by the present invention.

为实现上述目的,根据本发明的再一方面,提供了一种计算机可读存储介质。To achieve the above object, according to yet another aspect of the present invention, a computer-readable storage medium is provided.

本发明的一种计算机可读存储介质,其上存储有计算机程序,所述程序被处理器执行时实现本发明所提供的公文推荐方法。A computer-readable storage medium of the present invention stores a computer program thereon, and when the program is executed by a processor, the method for recommending official documents provided by the present invention is implemented.

根据本发明的技术方案,上述发明中的实施例具有如下优点或有益效果:According to the technical solution of the present invention, the embodiments in the above-mentioned invention have the following advantages or beneficial effects:

其一,首先提取多个公文之间的修订、废止、根据、提及等关联关系,之后基于这种关联关系以及目标用户的特征数据来确定待推荐公文,最后将待推荐公文向目标用户推荐。基于以上设置,能够充分考虑公文之间的逻辑关联关系来确定匹配于目标用户的公文,从而提高推荐准确性。First, first extract the association relationship between multiple official documents such as revision, revocation, basis, and mention, then determine the official document to be recommended based on this association relationship and the characteristic data of the target user, and finally recommend the official document to be recommended to the target user. . Based on the above settings, it is possible to fully consider the logical relationship between the official documents to determine the official document matching the target user, thereby improving the recommendation accuracy.

其二,在基于公文之间的关联关系确定待推荐公文时,本发明实施例可以首先计算任一公文的特征向量,之后根据该公文的特征向量以及目标用户的特征数据确定目标用户对该公文的第一潜在关注度(即基于公文关联关系的潜在关注度),最后基于第一潜在关注度确定待推荐公文。在计算任一公文的特征向量时,可以将该公文的原始特征、与该公文具有关联关系的公文的特征向量以及该关联关系的类型对应的权重值输入预设函数直接计算,也可以将公文的原始特征和公文之间的关联关系数据输入预先训练的特征提取模型进行计算。在计算第一潜在关注度时,可以依据公文特征向量与目标用户特征数据的相似度,也可以依据将公文特征向量与目标用户特征数据输入预先训练的意图识别模型得到的目标用户对公文的关注概率。通过以上设置,实现了适用多种具体场景的公文特征计算以及公文与用户匹配。Second, when determining an official document to be recommended based on the relationship between the official documents, the embodiment of the present invention may first calculate the feature vector of any official document, and then determine the target user for the official document according to the feature vector of the official document and the feature data of the target user. The first potential degree of attention (that is, the potential degree of attention based on the relationship of the official document), and finally the official document to be recommended is determined based on the first potential degree of interest. When calculating the feature vector of any official document, the original feature of the official document, the feature vector of the official document that has an associated relationship with the official document, and the weight value corresponding to the type of the association relationship can be input into the preset function for direct calculation, or the official document can be directly calculated. The correlation data between the original features and official documents is input into the pre-trained feature extraction model for calculation. When calculating the first potential degree of attention, it can be based on the similarity between the feature vector of the official document and the feature data of the target user, or the focus of the target user on the official document obtained by inputting the feature vector of the official document and the feature data of the target user into the pre-trained intent recognition model. probability. Through the above settings, document feature calculation suitable for a variety of specific scenarios and document and user matching are realized.

其三,在本发明实施例中,目标用户的特征数据可以是目标用户的当前特征向量,该当前特征向量可以在目标用户初始特征向量的基础上随目标用户针对公文的浏览、收藏、订阅等行为动态变化,这样,当目标用户浏览、收藏或订阅某一公文之后,其当前特征向量可发生倾向于该公文特征向量的变化,这种变化使得目标用户与该公文的关联公文(即与该公文具有关联关系的公文)之间会形成更高的匹配程度(匹配程度可以是特征向量相似度、关注概率或者第一潜在关注度),这样即可实现精准的关联公文推荐和实时推荐。Third, in the embodiment of the present invention, the feature data of the target user may be the current feature vector of the target user, and the current feature vector may follow the target user's browsing, collection, subscription, etc. for official documents on the basis of the initial feature vector of the target user. The behavior changes dynamically, so that when the target user browses, bookmarks or subscribes to a certain official document, its current feature vector may change towards the feature vector of the official document. A higher matching degree (the matching degree can be the similarity of the feature vector, the probability of attention, or the first potential degree of attention) will form a higher degree of matching between official documents that have an associated relationship, so that accurate and real-time recommendation of related official documents can be achieved.

其四,在本发明实施例中,除了考虑公文之间的关联关系之外,还可以结合以下的一种或多种方法进行协同推荐,这些方法有:基于用户行为的推荐、基于时间惩罚项的推荐、基于公文浏览数量的推荐、基于预设规则的推荐,这样能够进一步地提高推荐准确性。Fourth, in the embodiment of the present invention, in addition to considering the relationship between official documents, collaborative recommendation can also be performed in combination with one or more of the following methods. These methods include: recommendation based on user behavior, time penalty item based , recommendation based on the number of official document browsing, and recommendation based on preset rules, which can further improve the recommendation accuracy.

上述的非惯用的可选方式所具有的进一步效果将在下文中结合具体实施方式加以说明。Further effects of the above non-conventional alternatives will be described below in conjunction with specific embodiments.

附图说明Description of drawings

附图用于更好地理解本发明,不构成对本发明的不当限定。其中:The accompanying drawings are used for better understanding of the present invention and do not constitute an improper limitation of the present invention. in:

图1是本发明实施例中公文推荐方法的主要步骤示意图;1 is a schematic diagram of the main steps of a method for recommending official documents in an embodiment of the present invention;

图2是本发明实施例中多个公文构建的知识图谱示意图;2 is a schematic diagram of a knowledge map constructed by multiple official documents in an embodiment of the present invention;

图3是本发明实施例中公文推荐方法的具体实现步骤示意图;3 is a schematic diagram of specific implementation steps of a method for recommending official documents in an embodiment of the present invention;

图4是本发明实施例中公文推荐装置的组成部分示意图;4 is a schematic diagram of the components of the official document recommendation device in the embodiment of the present invention;

图5是根据本发明实施例可以应用于其中的示例性系统架构图;5 is an exemplary system architecture diagram to which an embodiment of the present invention may be applied;

图6是用来实现本发明实施例中公文推荐方法的电子设备结构示意图。FIG. 6 is a schematic structural diagram of an electronic device used to implement the method for recommending official documents in an embodiment of the present invention.

具体实施方式Detailed ways

以下结合附图对本发明的示范性实施例做出说明,其中包括本发明实施例的各种细节以助于理解,应当将它们认为仅仅是示范性的。因此,本领域普通技术人员应当认识到,可以对这里描述的实施例做出各种改变和修改,而不会背离本发明的范围和精神。同样,为了清楚和简明,以下的描述中省略了对公知功能和结构的描述。Exemplary embodiments of the present invention are described below with reference to the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding and should be considered as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted from the following description for clarity and conciseness.

需要指出的是,在不冲突的情况下,本发明的实施例以及实施例中的技术特征可以相互结合。It should be pointed out that the embodiments of the present invention and the technical features in the embodiments may be combined with each other without conflict.

图1是根据本发明实施例中公文推荐方法的主要步骤示意图。FIG. 1 is a schematic diagram of main steps of a method for recommending official documents according to an embodiment of the present invention.

如图1所示,本发明实施例的公文推荐方法可具体按照如下步骤执行:As shown in FIG. 1 , the method for recommending official documents according to the embodiment of the present invention may be specifically performed according to the following steps:

步骤S101:获取多个公文之间的关联关系。Step S101: Acquire the association relationship between multiple official documents.

在本发明实施例中,公文指的是各种类型的机构或组织按照一定体式形成的书面材料,既包括国家机关、各级政府机构的各种政策、发文和内部文件,也包括事业单位、企业单位的各种发文和内部文件。公文可以通过各种格式进行展示,例如文本、图片、音频、视频。公文一般具有特定体式,并经过一定的处理程序而形成,同时公文之间往往具有内容与逻辑上的关联关系,关联关系的类型可以包括修订关系、废止关系、根据关系、提及关系等。例如,当下级管理职能部门需要根据上级管理职能部门发出的公文A形成公文B时,公文B与公文A之间具有根据关系,即公文B根据了公文A;当公文F废止了公文B时,公文F与公文B之间具有废止关系;当公文F修订了公文E时,公文F与公文E之间具有修订关系;当公文C提及公文A时,公文C与公文A之间具有提及关系。需要说明的是,公文之间的关联关系不限于以上四种,可以根据实际情况进行扩展。In the embodiment of the present invention, official documents refer to written materials formed by various types of institutions or organizations in a certain manner, including various policies, issued documents and internal documents of state agencies and government agencies at all levels, as well as public institutions, Various issued documents and internal documents of enterprise units. Documents can be displayed in various formats, such as text, pictures, audio, and video. Official documents generally have a specific form and are formed through certain processing procedures. At the same time, official documents often have content and logical associations between them. The types of associations can include revision relations, revocation relations, basis relations, and mention relations. For example, when a lower-level management functional department needs to form an official document B based on the official document A issued by the superior management functional department, there is a basis relationship between the official document B and the official document A, that is, the official document B is based on the official document A; when the official document F abolishes the official document B, Official document F and official document B have an annulment relationship; when official document F revises official document E, official document F and official document E have a revision relationship; when official document C refers to official document A, official document C and official document A have a reference to each other. relation. It should be noted that the relationship between official documents is not limited to the above four types, and can be expanded according to the actual situation.

图2是本发明实施例中多个公文构建的知识图谱示意图,在图2中,以圆形节点代表公文,以节点之间的边表示公文之间的关联关系。如图2所示,公文A与公文B之间具有根据关系(公文B根据了公文A),公文E与公文B之间具有根据关系(公文E根据了公文B),公文C与公文A之间具有提及关系(公文C提及了公文A),公文F与公文E之间具有修订关系(公文F修订了公文E),公文F与公文C之间具有根据关系(公文F根据了公文C),公文D与公文B之间具有根据关系(公文D根据了公文B),公文D与公文C之间具有修订关系(公文D修订了公文C),公文F与公文B之间具有废止关系(公文F废止了公文B)。FIG. 2 is a schematic diagram of a knowledge graph constructed by multiple documents in an embodiment of the present invention. In FIG. 2 , circular nodes represent documents, and edges between nodes represent associations between documents. As shown in Figure 2, document A and document B have a basis relationship (document B is based on document A), document E and document B have a basis relationship (document E is based on document B), and the relationship between document C and document A There is a reference relationship between official documents (document C mentions official document A), there is a revision relationship between official document F and official document E (official document F revised official document E), and official document F and official document C have a basis relationship (official document F is based on official document E). C), there is a basis relationship between official document D and official document B (official document D is based on official document B), there is a revision relationship between official document D and official document C (official document D revised official document C), and official document F and official document B have a nullification relationship Relationship (Official Document F repealed Official Document B).

可以看到,任意两个公文之间的关联关系是具有方向性的,图2中使用带箭头的有向边来表示这种方向性。一般地,对于具有关联关系的任意两个公文来说,有向边的箭头指向的公文的发文时刻早于另一公文的发文时刻。可以理解,上述发文时刻可以对应于任一时间精度,如精确到年、月、日、时、分或秒。此外,具体场景中,公文之间不同的关联关系往往代表公文之间不同的相关程度,例如,废止关系和修订关系代表的相关程度高于根据关系,根据关系代表的相关程度高于提及关系。It can be seen that the relationship between any two official documents is directional, and the directional edge with arrows is used in Figure 2 to represent this direction. Generally, for any two official documents with an associated relationship, the sending time of the official document pointed to by the arrow of the directional edge is earlier than the sending time of the other official document. It can be understood that the above-mentioned posting moment may correspond to any time precision, such as year, month, day, hour, minute or second. In addition, in specific scenarios, different correlations between official documents often represent different degrees of correlation between official documents. For example, the revocation relationship and the revision relationship represent a higher degree of correlation than the based relationship, and the based relationship represents a higher degree of correlation than the mention relationship. .

特别地,以上说明的关联关系为公文之间的直接关联关系,除此之外,公文之间还可以具有间接关联关系,例如,图2中的公文F废止了公文B,公文B根据了公文A,公文F与公文A即具有二级的间接关联关系(可简称为二级关联关系),其中,第一级关联关系为废止,第二级关联关系为根据,这种二级关联关系可以表示为废止——根据关系。这样,能够从知识图谱中获取任一公文的一级关联公文(即与该任一公文具有直接关联关系的公文)和多级关联公文(即与该任一公文具有多级关联关系的公文)。例如,公文A的一级关联公文为公文B、C,二级关联公文为公文D、E、F。如果以公文A为根节点,则公文B、C处在第一层级,公文D、E、F处在第二层级。In particular, the association described above is a direct association between official documents. Besides, there may also be an indirect association between official documents. For example, official document F in FIG. 2 abolishes official document B, and official document B is based on official document A. Official document F and official document A have a second-level indirect association relationship (which can be referred to as a second-level association relationship), in which the first-level association relationship is abolished, and the second-level association relationship is the basis. This second-level association relationship can be Expressed as annulment - by relationship. In this way, the first-level related official documents (that is, the official documents that have a direct relationship with the any official document) and the multi-level related official documents (that is, the official documents that have a multi-level relationship with the any official document) can be obtained from the knowledge graph. . For example, the first-level related official documents of official document A are official documents B and C, and the second-level related official documents are official documents D, E, and F. If document A is the root node, documents B and C are at the first level, and documents D, E, and F are at the second level.

在步骤S101中,可以使用预先训练的信息抽取模型来获取预先获取的、用于后续推荐的多个公文之间的关联关系,上述关联关系可以包括:与多个公文中的任一公文具有关联关系的至少一个公文以及该关联关系的类型。实际应用中,可以将多个公文中的任一公文的文本信息输入信息抽取模型,从而得到与该公文具有关联关系的公文以及该关联关系的类型。In step S101, a pre-trained information extraction model may be used to obtain the pre-acquired association relationship between multiple official documents for subsequent recommendation, and the above-mentioned association relationship may include: having an association with any one of the multiple official documents At least one document of the relationship and the type of the relationship. In practical applications, the text information of any one of the multiple official documents can be input into the information extraction model, so as to obtain the official document that has an association relationship with the official document and the type of the association relationship.

步骤S102:根据关联关系和目标用户的特征数据确定多个公文中的待推荐公文。Step S102: Determine the official document to be recommended among the plurality of official documents according to the association relationship and the characteristic data of the target user.

在本步骤中,目标用户指的是推荐针对的用户,待推荐公文指的是经计算确定的、将要向用户推荐的公文,目标用户的特征数据指的是能够反映用户特征的数据,这些数据可以包括以下的一个方面或多个方面:目标用户的性别、年龄、职业、地址等属性特征,目标用户提供的偏好公文类别(公文类别可以从多种维度划分,例如可以划分为命令、决定、决议、指示等),目标用户针对公文的历史行为数据。实际应用中,用户针对公文可以有多种行为类型,如浏览、收藏、订阅、搜索、观看(可以针对视频形式呈现的公文)、反馈、忽视、取消收藏和取消订阅,上述历史行为数据可以包括:目标用户在历史时刻浏览、收藏、订阅、取消收藏和取消订阅的公文的相关数据、浏览某一公文的时长、观看某一公文的进度百分比、在搜索框输入的搜索词、对某一公文的反馈信息(例如评价信息)。示例性地,公文的相关数据可以包括公文的标题、摘要、内容、原始特征以及特征向量;其中,公文的原始特征指的是基于公文的标题、摘要或内容使用TF-IDF(词频-逆文件频率)、word2vec(一种词向量模型)、BERT(一种自然语言处理模型)等算法或模型得到的公文特征,公文的特征向量指的是在公文原始特征基础上考虑公文关联关系之后形成的公文特征。In this step, the target user refers to the user for which the recommendation is directed, the official document to be recommended refers to the official document determined by calculation and to be recommended to the user, and the characteristic data of the target user refers to the data that can reflect the characteristics of the user. It may include one or more of the following aspects: the target user's gender, age, occupation, address and other attributes, and the preferred document category provided by the target user (document category can be divided into various dimensions, such as commands, decisions, resolutions, instructions, etc.), historical behavior data of target users for official documents. In practical applications, users can have a variety of behavior types for official documents, such as browsing, favorites, subscriptions, searching, watching (can be for official documents presented in the form of videos), feedback, ignoring, canceling favorites, and unsubscribing. The above historical behavior data can include: : The relevant data of the official documents that the target users browsed, favorited, subscribed, un-favorites and unsubscribed at historical moments, the duration of browsing a certain official document, the percentage of progress in viewing a certain official document, the search terms entered in the search box, the feedback information (such as evaluation information). Exemplarily, the relevant data of the official document may include the title, abstract, content, original feature and feature vector of the official document; wherein, the original feature of the official document refers to the use of TF-IDF (Term Frequency-Inverse File) based on the title, abstract or content of the official document. frequency), word2vec (a word vector model), BERT (a natural language processing model) and other algorithms or models to obtain official document features, the feature vector of official documents refers to the original features of official documents after considering the relationship between official documents. Document features.

在本发明实施例中,步骤S102可以按照以下步骤具体执行:对于预先获取的多个公文中的任一公文,首先根据与该任一公文具有关联关系的公文以及该关联关系的类型确定该任一公文的特征向量,接着依据该任一公文的特征向量以及目标用户的特征数据确定目标用户对该任一公文的第一潜在关注度,之后基于第一潜在关注度获取目标用户对该任一公文的综合潜在关注度,最后,在综合潜在关注度符合预设的推荐条件时,将该任一公文确定为待推荐公文。其中,潜在关注度可以表征用户对公文的感兴趣概率,第一潜在关注度是考虑了公文关联关系之后得到的潜在关注度,综合潜在关注度是从至少一个方面评价用户对公文的感兴趣概率之后得到的潜在关注度指标。In this embodiment of the present invention, step S102 may be specifically performed according to the following steps: for any official document among the plurality of official documents acquired in advance, first determine the official document with an associated relationship with the any official document and the type of the association relationship. A feature vector of an official document, then according to the feature vector of any official document and the feature data of the target user to determine the first potential degree of attention of the target user to any official document, and then based on the first potential degree of interest to obtain the target user for any one The comprehensive potential attention degree of the official document, and finally, when the comprehensive potential attention degree meets the preset recommendation conditions, any official document is determined as the official document to be recommended. Among them, the potential degree of attention can represent the probability that the user is interested in the official document, the first potential degree of interest is the potential degree of interest obtained after considering the relationship between the official document, and the comprehensive potential degree of interest is the probability of evaluating the user's interest in the official document from at least one aspect Potential attention indicators obtained afterward.

较佳地,在一些实施例中,可以采用两种方法来计算公文的特征向量。在第一种方法中,对于预先获取的多个公文中的任一公文,可以将该任一公文的原始特征、与该任一公文具有关联关系的每一公文的特征向量、以及该关联关系的类型对应的预设权重值输入预设函数进行计算,从而得到该任一公文的特征向量。在执行上述计算之前,可以根据实际场景确定需要选取哪些层级的关联公文进行计算,并根据相关程度为不同的关联关系设置权重值,例如为废止关系和修订关系设置0.5的权重值,为根据关系设置0.3的权重值,为提及关系设置0.1的权重值,为根据——提及关系设置0.03的权重值。Preferably, in some embodiments, two methods can be used to calculate the feature vector of the official document. In the first method, for any official document among the pre-acquired multiple official documents, the original feature of the any official document, the feature vector of each official document that has an associated relationship with the any official document, and the association relationship may be The preset weight value corresponding to the type of is input into the preset function for calculation, so as to obtain the feature vector of any official document. Before performing the above calculation, you can determine which levels of related documents need to be selected for calculation according to the actual scenario, and set weight values for different related relationships according to the degree of correlation. Set a weight value of 0.3, a weight value of 0.1 for the mention relation, and a weight value of 0.03 for the based-mention relation.

此外,在一可选实现方式中,可以选取发文时刻早于该任一公文的关联公文进行计算。例如,在需要选取第一层级的关联公文进行计算的情况下,对于公文F,可以选取发文时刻较早的公文B、C、E进行计算;对于公文B,可以选取发文时间更早的公文A进行计算。在需要选取第一层级和第二层级的关联公文进行计算的情况下,对于公文F,可以选取发文时刻较早的一级关联公文B、C、E以及二级关联公文A、D进行计算。在另一可选实现方式中,可以选取任一公文的任何关联公文进行计算,而不局限于发文时刻早于该任一公文的公文,在这种情况下,可以使用关联公文的原始特征进行计算。以下将以选取第一层级关联公文计算的情况为例进行说明,可以理解的是,计算某一公文的特征向量时,可以根据需要选取任何层级的关联公文进行计算,以上示例并不对此形成任何限制。In addition, in an optional implementation manner, an associated official document whose time of posting the document is earlier than any one of the official documents may be selected for calculation. For example, when it is necessary to select the related official documents of the first level for calculation, for official document F, official documents B, C, and E with earlier issuance time can be selected for calculation; for official document B, official document A with earlier issuance time can be selected. Calculation. When it is necessary to select the first-level and second-level related documents for calculation, for the official document F, the first-level related documents B, C, and E and the second-level related documents A and D can be selected for calculation. In another optional implementation manner, any related official documents of any official document can be selected for calculation, and the calculation is not limited to the official documents whose time of issuance is earlier than any official document. calculate. The following will take the case of selecting the first level of related documents for calculation as an example. It can be understood that when calculating the feature vector of a certain official document, you can select any level of related documents for calculation as needed, and the above example does not form any limit.

用于计算公文特征向量的上述预设函数可以根据实际需要设置,例如可设置如下:The above-mentioned preset function for calculating the feature vector of official documents can be set according to actual needs, for example, it can be set as follows:

Figure BDA0002617559470000111
Figure BDA0002617559470000111

其中,Mi表示公文i(i为公文标识)的特征向量,Mi0表示公文i的原始特征,Mj表示与公文i具有关联关系的公文j,j为从1开始的正整数,表示关联公文的序号,kj表示公文j与公文i的关联关系的权重值,p、q为预设参数。可以理解,以上公式仅为示例,不对上述预设函数的形式形成任何限制。Among them, M i represents the feature vector of the official document i (i is the official document identification), M i0 represents the original feature of the official document i, M j represents the official document j that has an associated relationship with the official document i, and j is a positive integer starting from 1, indicating an association The serial number of the official document, k j represents the weight value of the relationship between the official document j and the official document i, and p and q are preset parameters. It can be understood that the above formula is only an example, and does not form any limitation on the form of the above preset function.

在第二种计算公文特征向量的方法中,可以将预先获取的多个公文的原始特征和公文之间的关联关系数据输入预先训练的特征提取模型,从而得到多个公文中任一公文的特征向量。其中,公文之间的关联关系数据可以包括上述多个公文中任一公文的关联公文的标识以及相应的关联关系的类型,特征提取模型可以是CNN(Convolutional NeuralNetworks,卷积神经网络)、RNN(Recurrent Neural Network,循环神经网络)等神经网络模型。在一个实施例中,特征提取模型内部可以使用以下公式计算公文的特征向量:In the second method of calculating the feature vector of official documents, the pre-acquired original features of multiple official documents and the correlation data between the official documents can be input into the pre-trained feature extraction model, so as to obtain the feature of any one of the multiple official documents. vector. Wherein, the association relationship data between the official documents may include the identification of the associated official document of any of the above-mentioned multiple official documents and the type of the corresponding association relationship, and the feature extraction model may be CNN (Convolutional Neural Networks, Convolutional Neural Networks), RNN ( Recurrent Neural Network, recurrent neural network) and other neural network models. In one embodiment, the following formula can be used inside the feature extraction model to calculate the feature vector of the official document:

Figure BDA0002617559470000112
Figure BDA0002617559470000112

其中,

Figure BDA0002617559470000113
表示特征提取模型中公文节点的特征向量,下标v表示公文节点的索引号,上标k表示该公文节点处在特征提取模型中的第几层,σ表示激活函数(例如线性整流函数ReLU),Wk、Bk均为权重矩阵,N(v)表示公文节点v的关联公文集合,AGG(·)为聚合函数,可以通过计算平均值、加权和、卷积池化等方式收集关联公文的特征,AGG(·)中可以含有关联公文相应关联关系的权重值。在以上两种计算公文特征向量的方法中,均使用了公文的原始特征,这样能够解决系统的冷启动问题。in,
Figure BDA0002617559470000113
Represents the feature vector of the document node in the feature extraction model, the subscript v indicates the index number of the document node, the superscript k indicates the layer of the document node in the feature extraction model, and σ indicates the activation function (such as the linear rectification function ReLU) , W k , B k are weight matrices, N(v) represents the set of related documents of document node v, AGG( ) is an aggregation function, which can collect related documents by calculating the average value, weighted sum, convolution pooling, etc. The feature of AGG(·) can contain the weight value of the corresponding relationship of the related official document. In the above two methods for calculating the feature vector of the official document, the original features of the official document are used, which can solve the problem of cold start of the system.

在本发明实施例中,目标用户的特征数据可以是目标用户的当前特征向量,该当前特征向量可以根据目标用户针对公文的历史行为数据进行动态更新。其根据以下步骤进行计算:首先,获取目标用户的初始特征向量。在一个实施例中,可以根据目标用户提供的偏好公文类别确定目标用户的初始特征向量,例如将目标用户在注册时输入的偏好公文类别的特征向量取平均,即可得到目标用户的初始特征向量。此后,可以根据目标用户的初始特征向量和目标用户针对公文的历史行为数据确定目标用户的当前特征向量。具体地,可以首先获取目标用户的历史行为数据对应的每一公文的特征向量,之后将任一公文的特征向量与该公文的影响分数相乘得到该公文的影响向量,其中,上述影响分数可以根据目标用户针对该公文的历史行为的类型确定,例如,可以将浏览的影响分数设置为0.1,将收藏的影响分数设置为0.2,将订阅的影响分数设置为0.3,将忽视的影响分数设置为-0.1,将取消收藏的影响分数设置为-0.2,将取消订阅的影响分数设置为-0.3。最后,可以根据每一公文的影响向量与目标用户的初始特征向量确定目标用户的当前特征向量,例如将每一公文的影响向量与目标用户的初始特征向量相加从而得到目标用户的当前特征向量。In this embodiment of the present invention, the feature data of the target user may be the current feature vector of the target user, and the current feature vector may be dynamically updated according to the historical behavior data of the target user on official documents. It is calculated according to the following steps: First, the initial feature vector of the target user is obtained. In one embodiment, the initial feature vector of the target user can be determined according to the preferred document category provided by the target user. For example, the initial feature vector of the target user can be obtained by averaging the feature vectors of the preferred document category entered by the target user during registration. . Thereafter, the current feature vector of the target user can be determined according to the initial feature vector of the target user and the historical behavior data of the target user with respect to the official document. Specifically, the feature vector of each official document corresponding to the historical behavior data of the target user can be obtained first, and then the feature vector of any official document is multiplied by the impact score of the official document to obtain the impact vector of the official document, wherein the impact score can be It is determined according to the type of historical behavior of the target user on the official document. For example, the impact score of browsing can be set to 0.1, the impact score of favorites can be set to 0.2, the impact score of subscription can be set to 0.3, and the impact score of ignored can be set to -0.1, sets the impact score for unfavorites to -0.2, and for unsubscribes to -0.3. Finally, the current feature vector of the target user can be determined according to the influence vector of each official document and the initial feature vector of the target user. For example, the current feature vector of the target user can be obtained by adding the impact vector of each official document and the initial feature vector of the target user. .

根据以上设置,能够实现用户当前特征向量随用户行为的实时更新,实际应用中,当目标用户浏览、收藏或订阅某一公文之后,其当前特征向量可发生倾向于该公文特征向量的变化,这种变化使得目标用户与该公文的关联公文之间会形成更高的匹配程度,这样即可实现精准的关联公文推荐和实时推荐。According to the above settings, the user's current feature vector can be updated in real time with the user's behavior. In practical applications, after the target user browses, bookmarks or subscribes to a certain official document, the current feature vector of the target user may change towards the feature vector of the official document. Such changes will result in a higher degree of matching between the target user and the related official documents of the official document, so that accurate and real-time recommendation of related official documents can be achieved.

在得到公文的特征向量以及目标用户的当前特征向量之后,即可计算目标用户对预先获取的多个公文中的任一公文的第一潜在关注度。以下将提供两种计算方法。在一种计算方法中,对于多个公文中的任一公文,可计算该任一公文的特征向量与目标用户的当前特征向量的相似度,并依据该相似度确定目标用户对该任一公文的第一潜在关注度,例如将该相似度与预设系数相乘从而得到第一潜在关注度、或者直接将该相似度作为第一潜在关注度。其中,上述相似度可以是余弦相似度、杰卡德相似度、皮尔逊相关系数或调整余弦相似度,After the feature vector of the official document and the current feature vector of the target user are obtained, the first potential degree of attention of the target user to any one of the pre-acquired multiple official documents can be calculated. Two calculation methods are provided below. In a calculation method, for any official document among the plurality of official documents, the similarity between the feature vector of the any official document and the current feature vector of the target user can be calculated, and the target user can determine the similarity of the any official document according to the similarity. For example, multiply the similarity by a preset coefficient to obtain the first potential degree of attention, or directly use the similarity as the first potential degree of attention. Wherein, the above similarity may be cosine similarity, Jaccard similarity, Pearson correlation coefficient or adjusted cosine similarity,

在第二种计算方法中,可以将多个公文中任一公文的特征向量和目标用户的当前特征向量输入预先训练的意图识别模型,从而得到目标用户对该任一公文的关注概率,最后依据该关注概率确定目标用户对该任一公文的第一潜在关注度,例如将该关注概率与预设系数相乘得到第一潜在关注度、或者直接将该关注概率作为第一潜在关注度。其中,意图识别模型可以是朴素贝叶斯、逻辑回归等分类模型。特别地,上述意图识别模型可以与前述特征提取模型进行连接后一起训练,例如,可以将特征提取模型的输出数据以及目标用户的当前特征向量输入意图识别模型从而实现特征提取模型与意图识别模型的连接,并且,根据历史时刻用户对推荐公文的反馈效果(例如是否浏览、浏览时长、评价信息)来确定正负样本,并基于正负正本构建训练集对上述特征提取模型与意图识别模型同时进行训练。In the second calculation method, the feature vector of any official document among the multiple official documents and the current feature vector of the target user can be input into the pre-trained intent recognition model, so as to obtain the attention probability of the target user to any official document, and finally based on The attention probability determines the first potential attention degree of the target user to any official document, for example, the attention probability is multiplied by a preset coefficient to obtain the first potential attention degree, or the attention probability is directly used as the first potential attention degree. The intent recognition model may be a classification model such as Naive Bayes and logistic regression. In particular, the above-mentioned intent recognition model can be connected with the aforementioned feature extraction model and trained together. For example, the output data of the feature extraction model and the current feature vector of the target user can be input into the intent recognition model, thereby realizing the integration between the feature extraction model and the intent recognition model. Connect, and determine the positive and negative samples according to the user’s feedback effect on the recommended official documents (such as whether to browse, browsing time, and evaluation information) at historical moments, and construct a training set based on the positive and negative positives. train.

在一些实施例中,得到目标用户对任一公文的第一潜在关注度之后,可以将第一潜在关注度确定为目标用户对该任一公文的综合潜在关注度用于后续推荐。In some embodiments, after obtaining the first potential degree of interest of the target user on any official document, the first potential degree of interest may be determined as the comprehensive potential degree of interest of the target user on any official document for subsequent recommendation.

较佳地,在本发明实施例中,可以将第一潜在关注度与其它方面的潜在关注度进行结合来计算综合潜在关注度,这样能够实现更为准确的公文推荐。具体应用中,对于预先获取的多个公文中的任一公文,可以计算该任一公文的原始特征与目标用户的特征数据的相似度,并依据该相似度确定目标用户对该任一公文的第二潜在关注度,例如将该相似度与预设系数相乘从而得到第二潜在关注度、或者直接将该相似度作为第二潜在关注度。其中,目标用户的特征数据可以根据以下步骤进行计算:首先根据目标用户提供的偏好公文类别确定目标用户的初始特征向量,此后获取目标用户的历史行为数据对应的每一公文的原始特征,并将任一公文的原始特征与该公文的影响分数相乘得到该公文的特定向量,最后,可以将每一公文的特定向量与目标用户的初始特征向量相加即可得到目标用户的特征数据。Preferably, in the embodiment of the present invention, the first potential attention degree can be combined with the potential attention degrees of other aspects to calculate the comprehensive potential attention degree, so that a more accurate official document recommendation can be realized. In a specific application, for any official document among a plurality of official documents obtained in advance, the similarity between the original feature of the official document and the feature data of the target user can be calculated, and the similarity of the target user to the any official document can be determined according to the similarity. For the second potential degree of attention, for example, multiplying the similarity by a preset coefficient to obtain the second potential degree of attention, or directly using the similarity as the second potential degree of attention. The characteristic data of the target user can be calculated according to the following steps: first, determine the initial characteristic vector of the target user according to the preferred document category provided by the target user, then obtain the original characteristic of each document corresponding to the historical behavior data of the target user, and use The original feature of any official document is multiplied by the impact score of the official document to obtain the specific vector of the official document. Finally, the characteristic data of the target user can be obtained by adding the specific vector of each official document and the initial feature vector of the target user.

实际应用中,还可以将任一公文的原始特征和目标用户的综合特征输入预先训练的意图判别模型,从而得到目标用户对该任一公文的关注概率,依据该关注概率确定目标用户对该任一公文的第二潜在关注度,例如将该关注概率与预设系数相乘得到第二潜在关注度、或者直接将该关注概率作为第二潜在关注度。其中,目标用户的综合特征可以由目标用户针对公文的历史行为数据和目标用户的属性特征确定,上述综合特征可以由以下几部分拼接而成:对目标用户行为数据的搜索词序列进行词向量计算并平均池化形成的数据,对目标用户行为数据中浏览的每一公文的标题、摘要等通过词向量计算并平均池化后的数据,目标用户的姓名、性别、职业等属性特征数据。具体场景中,上述意图判别模型可以是各种深度学习模型,对上述意图判别模型训练时,可以首先根据历史时刻用户对推荐公文的反馈效果(例如是否浏览、浏览时长、评价信息)确定正负样本,之后基于正负正本构建训练集对上述意图判别模型同时进行训练。In practical applications, the original features of any official document and the comprehensive features of the target user can also be input into the pre-trained intent discrimination model, so as to obtain the probability of the target user's attention to any official document, and determine the target user's attention to any official document according to the probability of interest. For the second potential attention degree of an official document, for example, multiplying the attention probability by a preset coefficient to obtain the second potential attention degree, or directly using the attention probability as the second potential attention degree. Among them, the comprehensive characteristics of the target user can be determined by the historical behavior data of the target user for the official document and the attribute characteristics of the target user. The above comprehensive characteristics can be spliced into the following parts: Perform word vector calculation on the search word sequence of the target user behavior data And the data formed by the average pooling, the title and abstract of each official document viewed in the target user behavior data are calculated through the word vector and the pooled data is averaged, and the target user's name, gender, occupation and other attribute characteristic data. In a specific scenario, the above-mentioned intent discrimination model may be various deep learning models. When training the above-mentioned intent discrimination model, the positive or negative value may be determined according to the user's feedback effect on the recommended official document (for example, whether to browse or not, browsing time, and evaluation information) at historical moments. Samples, and then build a training set based on positive and negative positives to train the above intent discrimination model at the same time.

在得到目标用户对任一公文的第二潜在关注度之后,可以结合第一潜在关注度和第二潜在关注度确定目标用户对该任一公文的综合潜在关注度,例如根据预设方式计算第一潜在关注度和第二潜在关注度的和或加权和作为综合潜在关注度,本发明不对计算综合潜在关注度的方式进行任何限制。After obtaining the second potential degree of attention of the target user to any official document, the comprehensive potential degree of interest of the target user to any official document can be determined by combining the first potential degree of attention and the second potential degree of attention, for example, calculating the first potential degree of interest according to a preset method. The sum or weighted sum of the first potential attention degree and the second potential attention degree is regarded as the comprehensive potential attention degree, and the present invention does not impose any limitation on the method of calculating the comprehensive potential attention degree.

实际应用场景中,相对来说,人们更加关注新发布的公文,某公文发布越久,人们对它的兴趣越低。另一方面,对于人们刚看过的公文,显然不需要反复地推荐。基于上述考虑,可以建立依据公文发布时刻的时间惩罚项进而确定相应的潜在关注度。具体地,对于任一公文,可以依据该任一公文发布时刻与当前时刻之间的时长确定第一时间惩罚项,并使用第一时间惩罚项确定目标用户对该任一公文的第三潜在关注度(例如直接将第一时间惩罚项作为第三潜在关注度)。例如,可以采用下式表示第一时间惩罚项:In practical application scenarios, relatively speaking, people pay more attention to newly released official documents. The longer a certain official document is published, the less people are interested in it. On the other hand, there is obviously no need for repeated recommendations for official documents that people have just read. Based on the above considerations, it is possible to establish a time penalty item based on the time of publication of the official document to determine the corresponding potential degree of attention. Specifically, for any official document, the first time penalty item can be determined according to the time length between the release time of the official document and the current moment, and the first time penalty item can be used to determine the third potential concern of the target user for the any official document degree (for example, the first time penalty item is directly used as the third potential attention degree). For example, the first time penalty term can be expressed as:

-log(tpublish(pi))*punishpublish -log(t publish ( pi ))*punish publish

其中,pi表示任一公文,tpublish(pi)表示该公文的发布时刻与当前时刻之间的时长,punishpublish为可调节的参数。Among them, pi represents any official document, t publish ( pi ) represents the duration between the publication time of the official document and the current time, and punish publish is an adjustable parameter.

在一可选实现方式中,对于任一公文,当目标用户在历史时刻浏览过该任一公文时,可以依据目标用户最近一次浏览该任一公文的时刻与当前时刻之间的时长确定第二时间惩罚项,并基于第一时间惩罚项和第二时间惩罚项计算目标用户对该任一公文的第三潜在关注度。第二时间惩罚项可如下式所示:In an optional implementation manner, for any official document, when the target user has browsed the any official document at a historical moment, the second can be determined according to the time length between the last time the target user browsed the any official document and the current moment. The time penalty item is calculated, and the third potential degree of attention of the target user to any official document is calculated based on the first time penalty item and the second time penalty item. The second time penalty term can be expressed as:

log(T-twatch(pi,uj)+1)*punishwatch log ( Tt watch (pi , u j ) +1)*punish watch

其中,T为预设的时间常数,uj表示目标用户,punishwatch为可调节的参数。Among them, T is a preset time constant, u j represents the target user, and punch watch is an adjustable parameter.

示例性地,基于第一时间惩罚项和第二时间惩罚项计算目标用户对该任一公文的第三潜在关注度可以是:将第一时间惩罚项和第二时间惩罚项相加第三潜在关注度。在得到第三潜在关注度之后,可以结合第一潜在关注度、第二潜在关注度和第三潜在关注度来确定目标用户对该任一公文的综合潜在关注度,例如根据预设方式计算第一潜在关注度、第二潜在关注度以及第三潜在关注度的和或加权和作为综合潜在关注度。Exemplarily, calculating the third potential degree of attention of the target user to any official document based on the first time penalty item and the second time penalty item may be: adding the first time penalty item and the second time penalty item to the third potential Attention. After the third potential degree of attention is obtained, the first potential degree of attention, the second potential degree of attention and the third potential degree of attention can be combined to determine the comprehensive potential degree of attention of the target user to any official document, for example, calculating the first potential degree of attention according to a preset method. The sum or weighted sum of the first potential interest degree, the second potential interest degree and the third potential interest degree is taken as the comprehensive potential interest degree.

在一些实施例中,对于多个公文中的任一公文,还可以根据该任一公文在至少一个历史时间间隔的浏览数量确定该任一公文的第四潜在关注度,例如可以通过下式进行计算:In some embodiments, for any official document among the plurality of official documents, the fourth potential degree of attention of any official document may also be determined according to the number of viewings of the any official document in at least one historical time interval, for example, the following formula can be used to perform calculate:

w4(pi,uj)=log(v1h(pi))*a1h+log(v1d(pi))*a1d+log(v7d(pi))*a7d w 4 (pi , u j ) =log(v 1h ( pi ))*a 1h +log(v 1d ( pi ))*a 1d +log(v 7d ( pi ))*a 7d

其中,pi表示公文,uj表示目标用户,w4(pi,uj)表示第四潜在关注度,v1h表示该公文在最近一小时内的浏览数量,v1d表示该公文在最近一天内的浏览数量,v7d表示该公文在最近一周内的浏览数量,a1h、a1d、a7d为预设参数。Among them, pi represents the official document, u j represents the target user, w 4 (pi , u j ) represents the fourth potential degree of attention, v 1h represents the number of views of the official document in the last hour, and v 1d represents the official document in the last hour. The number of views in one day, v 7d represents the number of views of the document in the last week, and a 1h , a 1d , and a 7d are preset parameters.

实际应用中,当多个公文中的任一公文与预先确定的热点事件相关时,可以按照预设策略提高该任一公文的第四潜在关注度,例如可以提高上述公式中a1h、a1d、a7d的数值从而提高第四潜在关注度。在得到第四潜在关注度之后,可以结合第一潜在关注度、第二潜在关注度、第三潜在关注度以及第四潜在关注度来确定目标用户对该任一公文的综合潜在关注度,例如根据预设方式计算第一潜在关注度、第二潜在关注度、第三潜在关注度以及第四潜在关注度的和或加权和作为综合潜在关注度。In practical applications, when any one of the multiple official documents is related to a predetermined hot event, the fourth potential degree of attention of any one of the official documents can be increased according to a preset strategy, for example, a 1h and a 1d in the above formula can be increased. , the value of a 7d to improve the fourth potential attention degree. After the fourth potential degree of attention is obtained, the first potential degree of attention, the second potential degree of attention, the third potential degree of attention, and the fourth potential degree of attention may be combined to determine the comprehensive potential degree of attention of the target user to any official document, for example The sum or weighted sum of the first potential attention degree, the second potential attention degree, the third potential attention degree, and the fourth potential attention degree is calculated according to a preset method as the comprehensive potential attention degree.

具体应用中,为实现特定的推荐要求,可以支持以确认任务方式将指定公文或指定类别的公文推荐给指定范围或指定类别的用户。因此,在本发明实施例中,当多个公文中的任一公文和目标用户满足至少一条预设规则时,根据预设规则中设置的推荐权重确定该任一公文的第五潜在关注度。具体而言,当满足的预设规则为一条时,可以将该预设规则中设置的推荐权重确定为该任一公文的第五潜在关注度;当满足的预设规则为多条时,可以将多条预设规则中推荐权重的最大值确定为该任一公文的第五潜在关注度。In a specific application, in order to achieve specific recommendation requirements, it can support the recommendation of a specified official document or a specified category of official documents to users of a specified range or a specified category by confirming the task. Therefore, in this embodiment of the present invention, when any official document among the multiple official documents and the target user satisfy at least one preset rule, the fifth potential degree of attention of any official document is determined according to the recommendation weight set in the preset rule. Specifically, when there is one preset rule that is satisfied, the recommendation weight set in the preset rule may be determined as the fifth potential degree of attention of any official document; when there are multiple preset rules that are satisfied, you can The maximum value of the recommendation weights in the plurality of preset rules is determined as the fifth potential degree of attention of any official document.

在得到第五潜在关注度之后,可以结合第一潜在关注度、第二潜在关注度、第三潜在关注度、第四潜在关注度以及第五潜在关注度来确定目标用户对该任一公文的综合潜在关注度,例如根据预设方式计算第一潜在关注度、第二潜在关注度、第三潜在关注度、第四潜在关注度以及第五潜在关注度的和或加权和作为综合潜在关注度。After the fifth potential degree of attention is obtained, the first potential degree of attention, the second potential degree of attention, the third potential degree of attention, the fourth potential degree of attention, and the fifth potential degree of attention can be combined to determine the target user's interest in any official document Comprehensive potential attention degree, for example, calculating the sum or weighted sum of the first potential attention degree, the second potential attention degree, the third potential attention degree, the fourth potential attention degree, and the fifth potential attention degree according to a preset method as the comprehensive potential attention degree .

在得到目标用户对任一公文的综合潜在关注度之后,可以将符合预设推荐条件的公文确定为待推荐公文。例如可以将多个公文按照综合潜在关注度从大到小的顺序排列,并将排列在前的预设数量的公文确定为待推荐公文。After obtaining the comprehensive potential degree of attention of the target user to any official document, the official document that meets the preset recommendation conditions may be determined as the official document to be recommended. For example, a plurality of official documents may be arranged in descending order of comprehensive potential attention degree, and a preset number of official documents arranged in the front may be determined as the official documents to be recommended.

特别地,在本发明实施例中,可以不依赖公文的特征向量实现推荐。具体地,当目标用户在预设的历史时间段(例如最近一小时)浏览、收藏或订阅过前述公文中的公文时,将这些公文确定为目标公文,并将与目标公文具有关联关系的至少一个公文确定为待推荐公文。例如,在图2中,当目标用户在最近一小时浏览过公文A时,可以将公文A的第一层级的关联公文B、C确定为待推荐公文,当然也可以将公文A的第一层级的关联公文B、C以及第二层级的关联公文D、E、F确定为待推荐公文。通过以上设置,能够基于公文关联关系实现简便快速的推荐。In particular, in the embodiment of the present invention, the recommendation can be implemented without relying on the feature vector of the official document. Specifically, when the target user has browsed, bookmarked or subscribed to the official documents in the aforementioned official documents in a preset historical time period (for example, the last hour), these official documents are determined as the target official documents, and at least the official documents that have an associated relationship with the target official documents are determined. An official document is determined as the official document to be recommended. For example, in Fig. 2, when the target user has browsed document A in the last hour, the related documents B and C of the first level of document A can be determined as the documents to be recommended. Of course, the first level of document A can also be determined. The related official documents B and C of , and the second-level related official documents D, E, and F are determined as the official documents to be recommended. Through the above settings, simple and fast recommendation can be implemented based on the relationship between official documents.

步骤S103:将待推荐公文向目标用户推荐。在本步骤中,可以将待推荐公文推送到用户对应的终端,从而完成公文推荐。Step S103: Recommend the official document to be recommended to the target user. In this step, the official document to be recommended may be pushed to the terminal corresponding to the user, thereby completing the official document recommendation.

图3是本发明实施例中公文推荐方法的具体实现步骤示意图,如图3所示,在本发明实施例中,可以通过五条路径计算第一潜在关注度、第二潜在关注度、第三潜在关注度、第四潜在关注度以及第五潜在关注度,并基于上述潜在关注度计算综合潜在关注度,最后基于综合潜在关注度进行公文推荐。在计算第一潜在关注度时,首先获取公文关联公式,并利用预设函数或特征提取模型计算公文的特征向量,之后通过计算相似度的方式或者利用意图识别模型即可得到第一潜在关注度。在计算第二潜在关注度时,可以使用计算相似度的方式或利用意图判别模型。在计算第三潜在关注度时,可以结合两个方面的时间惩罚项。以及,可以使用公文浏览数量计算第四潜在关注度,使用预设规则计算第五潜在关注度。FIG. 3 is a schematic diagram of specific implementation steps of a method for recommending official documents in an embodiment of the present invention. As shown in FIG. 3 , in an embodiment of the present invention, the first potential attention degree, the second potential attention degree, and the third potential attention degree can be calculated through five paths. The attention degree, the fourth potential attention degree, and the fifth potential attention degree are calculated, and the comprehensive potential attention degree is calculated based on the above potential attention degree, and finally the official document is recommended based on the comprehensive potential attention degree. When calculating the first potential degree of attention, first obtain the official document association formula, and use a preset function or feature extraction model to calculate the feature vector of the official document, and then calculate the similarity or use the intent recognition model to obtain the first potential degree of attention. . When calculating the second potential attention degree, the method of calculating the similarity degree or the intention discrimination model can be used. When calculating the third potential attention degree, the time penalty terms of the two aspects can be combined. And, the fourth potential degree of attention may be calculated using the number of official document views, and the fifth potential degree of interest may be calculated using a preset rule.

在本发明实施例的技术方案中,首先提取多个公文之间的修订、废止、根据、提及等关联关系,之后基于这种关联关系以及目标用户的特征数据来确定待推荐公文,最后将待推荐公文向目标用户推荐。基于以上设置,能够充分考虑公文之间的逻辑关联关系来确定匹配于目标用户的公文,从而提高推荐准确性。在基于公文之间的关联关系确定待推荐公文时,本发明实施例可以首先计算任一公文的特征向量,之后根据该公文的特征向量以及目标用户的特征数据确定目标用户对该公文的第一潜在关注度,最后基于第一潜在关注度确定待推荐公文。在计算任一公文的特征向量时,可以将该公文的原始特征、与该公文具有关联关系的公文的特征向量以及该关联关系的类型对应的权重值输入预设函数直接计算,也可以将公文的原始特征和公文之间的关联关系数据输入预先训练的特征提取模型进行计算。在计算第一潜在关注度时,可以依据公文特征向量与目标用户特征数据的相似度,也可以依据将公文特征向量与目标用户特征数据输入预先训练的意图识别模型得到的目标用户对公文的关注概率。通过以上设置,实现了适用多种具体场景的公文特征计算以及公文与用户匹配。在本发明实施例中,目标用户的特征数据可以是目标用户的当前特征向量,该当前特征向量可以在目标用户初始特征向量的基础上随目标用户针对公文的浏览、收藏、订阅等行为动态变化,这样,当目标用户浏览、收藏或订阅某一公文之后,其当前特征向量可发生倾向于该公文特征向量的变化,这种变化使得目标用户与该公文的关联公文之间会形成更高的匹配程度,这样即可实现精准的关联公文推荐和实时推荐。在本发明实施例中,除了考虑公文之间的关联关系之外,还可以结合以下的一种或多种方法进行协同推荐,这些方法有:基于用户行为的推荐、基于时间惩罚项的推荐、基于公文浏览数量的推荐、基于预设规则的推荐,这样能够进一步地提高推荐准确性。In the technical solution of the embodiment of the present invention, firstly, the association relationship between multiple official documents such as revision, revocation, basis, and mention is extracted, then the official document to be recommended is determined based on this association relationship and the characteristic data of the target user, and finally the official document to be recommended is determined. The official document to be recommended is recommended to the target user. Based on the above settings, it is possible to fully consider the logical relationship between the official documents to determine the official document matching the target user, thereby improving the recommendation accuracy. When determining the official document to be recommended based on the relationship between the official documents, the embodiment of the present invention may first calculate the feature vector of any official document, and then determine the target user's first feature vector of the official document according to the feature vector of the official document and the feature data of the target user. The potential attention degree, and finally the official document to be recommended is determined based on the first potential attention degree. When calculating the feature vector of any official document, the original feature of the official document, the feature vector of the official document that has an associated relationship with the official document, and the weight value corresponding to the type of the association relationship can be input into the preset function for direct calculation, or the official document can be directly calculated. The correlation data between the original features and official documents is input into the pre-trained feature extraction model for calculation. When calculating the first potential degree of attention, it can be based on the similarity between the feature vector of the official document and the feature data of the target user, or the focus of the target user on the official document obtained by inputting the feature vector of the official document and the feature data of the target user into the pre-trained intent recognition model. probability. Through the above settings, document feature calculation suitable for a variety of specific scenarios and document and user matching are realized. In this embodiment of the present invention, the feature data of the target user may be the current feature vector of the target user, and the current feature vector may dynamically change with the target user's behaviors such as browsing, saving, and subscribing to official documents on the basis of the initial feature vector of the target user. , in this way, after the target user browses, collects or subscribes to a certain document, its current feature vector may change towards the feature vector of the official document, and this change will make a higher relationship between the target user and the document associated with the official document. Matching degree, so that accurate related document recommendation and real-time recommendation can be achieved. In this embodiment of the present invention, in addition to considering the relationship between official documents, collaborative recommendation can also be performed in combination with one or more of the following methods. These methods include: recommendation based on user behavior, recommendation based on time penalty items, Recommendations based on the number of official document views and recommendations based on preset rules can further improve the accuracy of recommendations.

需要说明的是,对于前述的各方法实施例,为了便于描述,将其表述为一系列的动作组合,但是本领域技术人员应该知悉,本发明并不受所描述的动作顺序的限制,某些步骤事实上可以采用其它顺序进行或者同时进行。此外,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作和模块并不一定是实现本发明所必须的。It should be noted that, for the convenience of description, the foregoing method embodiments are expressed as a series of action combinations, but those skilled in the art should know that the present invention is not limited by the described action sequence, some The steps may in fact be performed in other orders or simultaneously. In addition, those skilled in the art should also know that the embodiments described in the specification are all preferred embodiments, and the related actions and modules are not necessarily necessary to realize the present invention.

为便于更好的实施本发明实施例的上述方案,下面还提供用于实施上述方案的相关装置。In order to better implement the above solutions of the embodiments of the present invention, related devices for implementing the above solutions are also provided below.

请参阅图4所示,本发明实施例提供的公文推荐装置400可以包括:信息抽取单元401、公文确定单元402和推荐单元403。Referring to FIG. 4 , an apparatus for recommending official documents 400 provided in an embodiment of the present invention may include: an information extracting unit 401 , a document determining unit 402 , and a recommending unit 403 .

其中,信息抽取单元401可用于获取多个公文之间的关联关系;公文确定单元402可用于根据所述关联关系和目标用户的特征数据确定所述多个公文中的待推荐公文;推荐单元403用于将待推荐公文向目标用户推荐。Wherein, the information extraction unit 401 can be used to obtain the association relationship between multiple official documents; the official document determination unit 402 can be used to determine the to-be-recommended official document among the multiple official documents according to the association relationship and the feature data of the target user; the recommendation unit 403 It is used to recommend official documents to be recommended to target users.

在本发明实施例中,所述多个公文之间的关联关系包括:与所述多个公文中的任一公文具有关联关系的至少一个公文以及该关联关系的类型;公文确定单元402可进一步用于:对于所述多个公文中的任一公文:根据与该任一公文具有关联关系的公文以及该关联关系的类型确定该任一公文的特征向量;依据该任一公文的特征向量以及目标用户的特征数据确定目标用户对该任一公文的第一潜在关注度;基于所述第一潜在关注度获取目标用户对该任一公文的综合潜在关注度;在所述综合潜在关注度符合预设的推荐条件时,将该任一公文确定为待推荐公文。In this embodiment of the present invention, the association relationship between the multiple official documents includes: at least one official document that has an association relationship with any official document in the multiple official documents and the type of the association relationship; the official document determining unit 402 may further Used for: for any official document in the plurality of official documents: determine the feature vector of the any official document according to the official document that has an associated relationship with the any official document and the type of the association relationship; according to the feature vector of the any official document and The feature data of the target user determines the first potential degree of attention of the target user to any official document; based on the first potential degree of interest, the comprehensive potential degree of interest of the target user to the any official document is obtained; When the preset recommendation condition is used, any official document is determined as the official document to be recommended.

在一些实施例中,公文确定单元402可进一步用于:对于所述多个公文中的任一公文:将该任一公文的原始特征、与该任一公文具有关联关系的每一公文的特征向量、以及该关联关系的类型对应的预设权重值输入预设函数进行计算,得到该任一公文的特征向量。In some embodiments, the official document determining unit 402 may be further configured to: for any official document in the plurality of official documents: the original feature of the any official document and the characteristics of each official document that has an associated relationship with the any official document The vector and the preset weight value corresponding to the type of the association relationship are input into the preset function for calculation, and the feature vector of any official document is obtained.

较佳地,公文确定单元402可进一步用于:将所述多个公文的原始特征和所述多个公文之间的关联关系数据输入预先训练的特征提取模型,得到所述多个公文中任一公文的特征向量。Preferably, the official document determining unit 402 may be further configured to: input the original features of the multiple official documents and the correlation data between the multiple official documents into a pre-trained feature extraction model, and obtain any of the multiple official documents. The feature vector of a document.

具体应用中,目标用户的特征数据包括目标用户的当前特征向量,所述装置400可进一步包括目标用户特征计算单元,其用于:获取目标用户的初始特征向量;根据目标用户的初始特征向量和目标用户针对公文的历史行为数据确定目标用户的当前特征向量。In a specific application, the feature data of the target user includes the current feature vector of the target user, and the apparatus 400 may further include a target user feature calculation unit, which is used for: acquiring the initial feature vector of the target user; according to the initial feature vector of the target user and The target user determines the current feature vector of the target user according to the historical behavior data of the official document.

实际应用中,目标用户特征计算单元可进一步用于:根据目标用户提供的偏好公文类别确定目标用户的初始特征向量。In practical applications, the target user feature calculation unit may be further configured to: determine the initial feature vector of the target user according to the preferred document category provided by the target user.

作为一个优选方案,目标用户特征计算单元可进一步用于:获取所述历史行为数据对应的每一公文的特征向量;将任一公文的特征向量与该公文的影响分数相乘得到该公文的影响向量;其中,所述影响分数根据目标用户针对该公文的历史行为的类型确定;根据每一公文的影响向量与目标用户的初始特征向量确定目标用户的当前特征向量。As a preferred solution, the target user feature calculation unit can be further used to: obtain the feature vector of each official document corresponding to the historical behavior data; multiply the feature vector of any official document by the impact score of the official document to obtain the impact of the official document The influence score is determined according to the type of historical behavior of the target user on the official document; the current feature vector of the target user is determined according to the influence vector of each official document and the initial feature vector of the target user.

具体应用场景中,公文确定单元402可进一步用于:对于所述多个公文中的任一公文,计算该任一公文的特征向量与目标用户的当前特征向量的相似度;依据该相似度确定目标用户对该任一公文的第一潜在关注度。In a specific application scenario, the official document determining unit 402 may be further configured to: for any official document in the plurality of official documents, calculate the similarity between the feature vector of the any official document and the current feature vector of the target user; determine the similarity according to the similarity The first potential degree of attention of the target user to any official document.

在一些实施例中,公文确定单元402可进一步用于:将所述多个公文中任一公文的特征向量和目标用户的当前特征向量输入预先训练的意图识别模型,得到目标用户对该任一公文的关注概率;依据该关注概率确定目标用户对该任一公文的第一潜在关注度。In some embodiments, the official document determining unit 402 may be further configured to: input the feature vector of any official document among the plurality of official documents and the current feature vector of the target user into the pre-trained intent recognition model, and obtain the target user for any one of the official documents. The attention probability of the official document; the first potential attention degree of the target user to any official document is determined according to the attention probability.

在本发明实施例中,公文确定单元402可进一步用于:将目标用户对所述多个公文中任一公文的第一潜在关注度确定为目标用户对该任一公文的综合潜在关注度。In the embodiment of the present invention, the official document determining unit 402 may be further configured to: determine the first potential degree of attention of the target user to any official document among the plurality of official documents as the comprehensive potential degree of interest of the target user to the any official document.

在一可选实现方式中,所述装置400可进一步包括第二关注度计算单元,其用于:对于所述多个公文中的任一公文,计算该任一公文的原始特征与目标用户的特征数据的相似度;依据该相似度确定目标用户对该任一公文的第二潜在关注度;公文确定单元402可进一步用于:结合所述第一潜在关注度和所述第二潜在关注度确定目标用户对该任一公文的综合潜在关注度。In an optional implementation manner, the apparatus 400 may further include a second attention degree calculation unit, which is used for: for any official document in the plurality of official documents, calculate the original feature of the any official document and the target user's original feature. The similarity of the feature data; the second potential degree of attention of the target user to any official document is determined according to the similarity; the official document determination unit 402 may be further configured to: combine the first potential degree of attention and the second potential degree of attention Determine the comprehensive potential attention of the target user to any official document.

在一些实施例中,第二关注度计算单元可进一步用于:将所述多个公文中任一公文的原始特征和目标用户的综合特征输入预先训练的意图判别模型,得到目标用户对该任一公文的关注概率,依据该关注概率确定目标用户对该任一公文的第二潜在关注度;其中,目标用户的综合特征由目标用户针对公文的历史行为数据和目标用户的属性特征确定;公文确定单元402可进一步用于:结合所述第一潜在关注度和所述第二潜在关注度确定目标用户对该任一公文的综合潜在关注度。In some embodiments, the second attention degree calculation unit may be further configured to: input the original feature of any official document among the plurality of official documents and the comprehensive feature of the target user into the pre-trained intention discrimination model, and obtain the target user's interest in any one of the official documents. The attention probability of an official document, the second potential degree of attention of the target user to any official document is determined according to the attention probability; wherein, the comprehensive feature of the target user is determined by the historical behavior data of the target user on the official document and the attribute characteristics of the target user; the official document The determining unit 402 may be further configured to: combine the first potential degree of interest and the second potential degree of interest to determine the comprehensive potential degree of interest of the target user to any official document.

在一可选实现方式中,所述装置400可进一步包括第三关注度计算单元,其用于:对于所述多个公文中的任一公文,依据该任一公文发布时刻与当前时刻之间的时长确定第一时间惩罚项,使用第一时间惩罚项确定目标用户对该任一公文的第三潜在关注度。公文确定单元402可进一步用于:结合所述第一潜在关注度、所述第二潜在关注度和所述第三潜在关注度确定目标用户对该任一公文的综合潜在关注度。In an optional implementation manner, the apparatus 400 may further include a third attention degree calculation unit, which is used for: for any official document in the plurality of official documents, according to the time between the publication time of the any official document and the current time Determine the first time penalty item, and use the first time penalty item to determine the third potential degree of attention of the target user to any official document. The official document determining unit 402 may be further configured to: combine the first potential interest degree, the second potential interest degree and the third potential interest degree to determine the comprehensive potential interest degree of the target user on any official document.

较佳地,第三关注度计算单元可进一步用于:对于所述多个公文中的任一公文:当目标用户在历史时刻浏览过该任一公文时,依据目标用户最近一次浏览该任一公文的时刻与当前时刻之间的时长确定第二时间惩罚项;以及,使用第一时间惩罚项和第二时间惩罚项确定目标用户对该任一公文的第三潜在关注度。Preferably, the third attention degree calculation unit may be further configured to: for any official document among the plurality of official documents: when the target user has browsed the any official document at a historical moment, according to the last time the target user browsed the any official document. The duration between the moment of the official document and the current moment determines the second time penalty item; and the first time penalty item and the second time penalty item are used to determine the third potential degree of attention of the target user to any official document.

在本发明实施例中,所述装置400可进一步包括第四关注度计算单元,其用于:对于所述多个公文中的任一公文,根据该任一公文在至少一个历史时间间隔的浏览数量确定该任一公文的第四潜在关注度;公文确定单元402可进一步用于:结合所述第一潜在关注度、所述第二潜在关注度、所述第三潜在关注度和所述第四潜在关注度确定目标用户对该任一公文的综合潜在关注度。In this embodiment of the present invention, the apparatus 400 may further include a fourth attention degree calculation unit, which is configured to: for any official document in the plurality of official documents, according to the browsing of the any official document in at least one historical time interval The fourth potential degree of attention of any official document is determined by the number; the official document determination unit 402 may be further configured to: combine the first potential degree of interest, the second potential degree of interest, the third potential degree of interest and the third potential degree of interest Four potential attention degrees Determine the comprehensive potential attention degree of the target user to any official document.

较佳地,第四关注度计算单元可进一步用于:当所述多个公文中的任一公文与预先确定的热点事件相关时,按照预设策略提高该任一公文的第四潜在关注度。Preferably, the fourth degree of attention calculation unit may be further configured to: when any official document in the plurality of official documents is related to a predetermined hot event, increase the fourth potential degree of interest of the any official document according to a preset strategy. .

作为一个优选方案,所述装置400可进一步包括第五关注度计算单元,其用于:当所述多个公文中的任一公文和目标用户满足至少一条预设规则时,根据所述预设规则中设置的推荐权重确定该任一公文的第五潜在关注度;公文确定单元402可进一步用于:结合所述第一潜在关注度、所述第二潜在关注度、所述第三潜在关注度、所述第四潜在关注度和第五潜在关注度确定目标用户对该任一公文的综合潜在关注度。As a preferred solution, the apparatus 400 may further include a fifth attention degree calculation unit, which is configured to: when any official document in the plurality of official documents and the target user satisfy at least one preset rule, according to the preset The recommendation weight set in the rule determines the fifth potential degree of attention of any official document; the official document determination unit 402 may be further configured to: combine the first potential degree of interest, the second potential degree of interest, and the third potential degree of interest degree, the fourth potential degree of attention and the fifth potential degree of attention determine the comprehensive potential degree of attention of the target user to any official document.

较佳地,第五关注度计算单元可进一步用于:当所述预设规则为一条时,将该预设规则中设置的推荐权重确定为该任一公文的第五潜在关注度;当所述预设规则为多条时,将多条预设规则中推荐权重的最大值确定为该任一公文的第五潜在关注度。Preferably, the fifth degree of attention calculation unit may be further configured to: when the preset rule is one, determine the recommendation weight set in the preset rule as the fifth potential degree of attention of any official document; When there are multiple preset rules, the maximum value of the recommendation weight in the multiple preset rules is determined as the fifth potential degree of attention of any official document.

在本发明实施例中,公文确定单元402可进一步用于:将所述第一潜在关注度、所述第二潜在关注度、所述第三潜在关注度、所述第四潜在关注度和第五潜在关注度的加权和确定为目标用户对该任一公文的综合潜在关注度。In this embodiment of the present invention, the official document determining unit 402 may be further configured to: determine the first potential attention degree, the second potential attention degree, the third potential attention degree, the fourth potential attention degree and the The weighted sum of the five potential attention degrees is determined as the comprehensive potential attention degree of the target user to any official document.

在一个实施例中,公文确定单元402可进一步用于:将所述多个公文按照综合潜在关注度从大到小的顺序排列;将排列在前的预设数量的公文确定为待推荐公文。In one embodiment, the official document determining unit 402 may be further configured to: arrange the plurality of official documents in descending order of comprehensive potential attention;

实际应用中,目标用户的特征数据包括:目标用户在预设的历史时间段浏览、收藏或订阅过的公文;公文确定单元402可进一步用于:将目标用户在预设的历史时间段浏览、收藏或订阅过的所述多个公文中的公文确定为目标公文;将与目标公文具有关联关系的至少一个公文确定为待推荐公文。In practical applications, the characteristic data of the target user includes: official documents that the target user has browsed, collected or subscribed to in a preset historical time period; the official document determining unit 402 can be further used for: An official document among the plurality of official documents that has been collected or subscribed is determined as a target official document; at least one official document having an associated relationship with the target official document is determined as an official document to be recommended.

此外,在本发明实施例中,所述关联关系的类型包括:修订关系、废止关系、根据关系和提及关系;所述历史行为的类型包括:浏览、收藏、订阅、忽视、取消收藏和取消订阅。In addition, in the embodiment of the present invention, the types of the association relationship include: revision relationship, abolition relationship, based on relationship, and mention relationship; the types of historical behavior include: browse, favorite, subscribe, ignore, unfavorite, and cancel subscription.

在本发明实施例的技术方案中,首先提取多个公文之间的修订、废止、根据、提及等关联关系,之后基于这种关联关系以及目标用户的特征数据来确定待推荐公文,最后将待推荐公文向目标用户推荐。基于以上设置,能够充分考虑公文之间的逻辑关联关系来确定匹配于目标用户的公文,从而提高推荐准确性。在基于公文之间的关联关系确定待推荐公文时,本发明实施例可以首先计算任一公文的特征向量,之后根据该公文的特征向量以及目标用户的特征数据确定目标用户对该公文的第一潜在关注度,最后基于第一潜在关注度确定待推荐公文。在计算任一公文的特征向量时,可以将该公文的原始特征、与该公文具有关联关系的公文的特征向量以及该关联关系的类型对应的权重值输入预设函数直接计算,也可以将公文的原始特征和公文之间的关联关系数据输入预先训练的特征提取模型进行计算。在计算第一潜在关注度时,可以依据公文特征向量与目标用户特征数据的相似度,也可以依据将公文特征向量与目标用户特征数据输入预先训练的意图识别模型得到的目标用户对公文的关注概率。通过以上设置,实现了适用多种具体场景的公文特征计算以及公文与用户匹配。在本发明实施例中,目标用户的特征数据可以是目标用户的当前特征向量,该当前特征向量可以在目标用户初始特征向量的基础上随目标用户针对公文的浏览、收藏、订阅等行为动态变化,这样,当目标用户浏览、收藏或订阅某一公文之后,其当前特征向量可发生倾向于该公文特征向量的变化,这种变化使得目标用户与该公文的关联公文之间会形成更高的匹配程度,这样即可实现精准的关联公文推荐和实时推荐。在本发明实施例中,除了考虑公文之间的关联关系之外,还可以结合以下的一种或多种方法进行协同推荐,这些方法有:基于用户行为的推荐、基于时间惩罚项的推荐、基于公文浏览数量的推荐、基于预设规则的推荐,这样能够进一步地提高推荐准确性。In the technical solution of the embodiment of the present invention, firstly, the association relationship between multiple official documents such as revision, revocation, basis, and mention is extracted, then the official document to be recommended is determined based on this association relationship and the characteristic data of the target user, and finally the official document to be recommended is determined. The official document to be recommended is recommended to the target user. Based on the above settings, it is possible to fully consider the logical relationship between the official documents to determine the official document matching the target user, thereby improving the recommendation accuracy. When determining the official document to be recommended based on the relationship between the official documents, the embodiment of the present invention may first calculate the feature vector of any official document, and then determine the target user's first feature vector of the official document according to the feature vector of the official document and the feature data of the target user. The potential attention degree, and finally the official document to be recommended is determined based on the first potential attention degree. When calculating the feature vector of any official document, the original feature of the official document, the feature vector of the official document that has an associated relationship with the official document, and the weight value corresponding to the type of the association relationship can be input into the preset function for direct calculation, or the official document can be directly calculated. The correlation data between the original features and official documents is input into the pre-trained feature extraction model for calculation. When calculating the first potential degree of attention, it can be based on the similarity between the feature vector of the official document and the feature data of the target user, or the focus of the target user on the official document obtained by inputting the feature vector of the official document and the feature data of the target user into the pre-trained intent recognition model. probability. Through the above settings, document feature calculation suitable for a variety of specific scenarios and document and user matching are realized. In this embodiment of the present invention, the feature data of the target user may be the current feature vector of the target user, and the current feature vector may dynamically change with the target user's behaviors such as browsing, saving, and subscribing to official documents on the basis of the initial feature vector of the target user. , in this way, after the target user browses, collects or subscribes to a certain document, its current feature vector may change towards the feature vector of the official document, and this change will make a higher relationship between the target user and the document associated with the official document. Matching degree, so that accurate related document recommendation and real-time recommendation can be achieved. In this embodiment of the present invention, in addition to considering the relationship between official documents, collaborative recommendation can also be performed in combination with one or more of the following methods. These methods include: recommendation based on user behavior, recommendation based on time penalty items, Recommendations based on the number of official document views and recommendations based on preset rules can further improve the accuracy of recommendations.

图5示出了可以应用本发明实施例的公文推荐方法或公文推荐装置的示例性系统架构500。FIG. 5 shows an exemplary system architecture 500 to which the method for recommending official documents or the apparatus for recommending official documents according to embodiments of the present invention can be applied.

如图5所示,系统架构500可以包括终端设备501、502、503,网络504和服务器505(此架构仅仅是示例,具体架构中包含的组件可以根据申请具体情况调整)。网络504用以在终端设备501、502、503和服务器505之间提供通信链路的介质。网络504可以包括各种连接类型,例如有线、无线通信链路或者光纤电缆等。As shown in FIG. 5, the system architecture 500 may include terminal devices 501, 502, 503, a network 504 and a server 505 (this architecture is only an example, and the components included in the specific architecture can be adjusted according to the specific application). The network 504 is a medium used to provide a communication link between the terminal devices 501 , 502 , 503 and the server 505 . Network 504 may include various connection types, such as wired, wireless communication links, or fiber optic cables, among others.

用户可以使用终端设备501、502、503通过网络504与服务器505交互,以接收或发送消息等。终端设备501、502、503上可以安装有各种客户端应用,例如公文阅读应用等(仅为示例)。The user can use the terminal devices 501, 502, 503 to interact with the server 505 through the network 504 to receive or send messages and the like. Various client applications may be installed on the terminal devices 501 , 502 , and 503 , such as an official document reading application (just an example).

终端设备501、502、503可以是具有显示屏并且支持网页浏览的各种电子设备,包括但不限于智能手机、平板电脑、膝上型便携计算机和台式计算机等等。The terminal devices 501, 502, 503 may be various electronic devices having a display screen and supporting web browsing, including but not limited to smart phones, tablet computers, laptop computers, desktop computers, and the like.

服务器505可以是提供各种服务的服务器,例如对用户利用终端设备501、502、503所操作的公文阅读应用提供支持的后台服务器(仅为示例)。后台服务器可以对接收到的公文推荐请求进行处理,并将处理结果(例如待推荐公文--仅为示例)反馈给终端设备501、502、503。The server 505 may be a server that provides various services, such as a background server (just an example) that provides support for the document reading application operated by the user using the terminal devices 501 , 502 , and 503 . The background server may process the received official document recommendation request, and feed back the processing result (for example, the official document to be recommended—just an example) to the terminal devices 501 , 502 , and 503 .

需要说明的是,本发明实施例所提供的公文推荐方法一般由服务器505执行,相应地,公文推荐装置一般设置于服务器505中。It should be noted that the method for recommending official documents provided by the embodiments of the present invention is generally executed by the server 505 , and accordingly, the device for recommending official documents is generally set in the server 505 .

应该理解,图5中的终端设备、网络和服务器的数目仅仅是示意性的。根据实现需要,可以具有任意数目的终端设备、网络和服务器。It should be understood that the numbers of terminal devices, networks and servers in FIG. 5 are only illustrative. There can be any number of terminal devices, networks and servers according to implementation needs.

本发明还提供了一种电子设备。本发明实施例的电子设备包括:一个或多个处理器;存储装置,用于存储一个或多个程序,当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现本发明所提供的公文推荐方法。The present invention also provides an electronic device. An electronic device according to an embodiment of the present invention includes: one or more processors; and a storage device for storing one or more programs, when the one or more programs are executed by the one or more processors, the one or more programs cause the One or more processors implement the document recommendation method provided by the present invention.

下面参考图6,其示出了适于用来实现本发明实施例的电子设备的计算机系统600的结构示意图。图6示出的电子设备仅仅是一个示例,不应对本发明实施例的功能和使用范围带来任何限制。Referring next to FIG. 6 , it shows a schematic structural diagram of a computer system 600 suitable for implementing an electronic device according to an embodiment of the present invention. The electronic device shown in FIG. 6 is only an example, and should not impose any limitation on the function and scope of use of the embodiments of the present invention.

如图6所示,计算机系统600包括中央处理单元(CPU)601,其可以根据存储在只读存储器(ROM)602中的程序或者从存储部分608加载到随机访问存储器(RAM)603中的程序而执行各种适当的动作和处理。在RAM603中,还存储有计算机系统600操作所需的各种程序和数据。CPU601、ROM 602以及RAM 603通过总线604彼此相连。输入/输出(I/O)接口605也连接至总线604。As shown in FIG. 6, a computer system 600 includes a central processing unit (CPU) 601, which can be loaded into a random access memory (RAM) 603 according to a program stored in a read only memory (ROM) 602 or a program from a storage section 608 Instead, various appropriate actions and processes are performed. In the RAM 603, various programs and data necessary for the operation of the computer system 600 are also stored. The CPU 601 , the ROM 602 , and the RAM 603 are connected to each other through a bus 604 . An input/output (I/O) interface 605 is also connected to bus 604 .

以下部件连接至I/O接口605:包括键盘、鼠标等的输入部分606;包括诸如阴极射线管(CRT)、液晶显示器(LCD)等以及扬声器等的输出部分607;包括硬盘等的存储部分608;以及包括诸如LAN卡、调制解调器等的网络接口卡的通信部分609。通信部分609经由诸如因特网的网络执行通信处理。驱动器610也根据需要连接至I/O接口605。可拆卸介质611,诸如磁盘、光盘、磁光盘、半导体存储器等等,根据需要安装在驱动器610上,以便从其上读出的计算机程序根据需要被安装入存储部分608。The following components are connected to the I/O interface 605: an input section 606 including a keyboard, a mouse, etc.; an output section 607 including a cathode ray tube (CRT), a liquid crystal display (LCD), etc., and a speaker, etc.; a storage section 608 including a hard disk, etc. ; and a communication section 609 including a network interface card such as a LAN card, a modem, and the like. The communication section 609 performs communication processing via a network such as the Internet. A drive 610 is also connected to the I/O interface 605 as needed. A removable medium 611, such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, etc., is mounted on the drive 610 as needed, so that a computer program read therefrom is installed into the storage section 608 as needed.

特别地,根据本发明公开的实施例,上文的主要步骤图描述的过程可以被实现为计算机软件程序。例如,本发明实施例包括一种计算机程序产品,其包括承载在计算机可读介质上的计算机程序,该计算机程序包含用于执行主要步骤图所示的方法的程序代码。在上述实施例中,该计算机程序可以通过通信部分609从网络上被下载和安装,和/或从可拆卸介质611被安装。在该计算机程序被中央处理单元601执行时,执行本发明的系统中限定的上述功能。In particular, according to the disclosed embodiments of the present invention, the processes described in the main step diagrams above may be implemented as computer software programs. For example, embodiments of the present invention include a computer program product comprising a computer program carried on a computer-readable medium, the computer program containing program code for executing the method shown in the main step diagram. In the above-described embodiment, the computer program can be downloaded and installed from the network through the communication section 609 and/or installed from the removable medium 611 . When the computer program is executed by the central processing unit 601, the above-described functions defined in the system of the present invention are performed.

需要说明的是,本发明所示的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本发明中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。在本发明中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述任意合适的组合。计算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:无线、电线、光缆、RF等等,或者上述的任意合适的组合。It should be noted that the computer-readable medium shown in the present invention may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two. The computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above. More specific examples of computer readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Programmable read only memory (EPROM or flash memory), fiber optics, portable compact disk read only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the foregoing. In the present invention, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device. In the present invention, a computer readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, carrying computer readable program code therein. Such propagated data signals may take a variety of forms including, but not limited to, electromagnetic signals, optical signals, or any suitable combination of the foregoing. A computer-readable signal medium can also be any computer-readable medium, other than a computer-readable storage medium, that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any suitable medium including, but not limited to, wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.

附图中的流程图和框图,图示了按照本发明各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,上述模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这根据所涉及的功能而定。也要注意的是,框图或流程图中的每个方框、以及框图或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more logical functions for implementing the specified functions executable instructions. It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It is also noted that each block of the block diagrams or flowchart illustrations, and combinations of blocks in the block diagrams or flowchart illustrations, can be implemented in special purpose hardware-based systems that perform the specified functions or operations, or can be implemented using A combination of dedicated hardware and computer instructions is implemented.

描述于本发明实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现。所描述的单元也可以设置在处理器中,例如,可以描述为:一种处理器包括信息抽取单元、公文确定单元和推荐单元。其中,这些单元的名称在某种情况下并不构成对该单元本身的限定,例如,信息抽取单元还可以被描述为“向公文确定单元提供公文之间关联关系的单元”。The units involved in the embodiments of the present invention may be implemented in a software manner, and may also be implemented in a hardware manner. The described unit can also be set in the processor, for example, it can be described as: a processor includes an information extraction unit, an official document determination unit and a recommendation unit. Wherein, the names of these units do not constitute a limitation on the unit itself under certain circumstances. For example, the information extraction unit can also be described as "a unit that provides the official document determination unit with the relationship between official documents".

作为另一方面,本发明还提供了一种计算机可读介质,该计算机可读介质可以是上述实施例中描述的设备中所包含的;也可以是单独存在,而未装配入该设备中的。上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该设备执行时,使得该设备执行的步骤包括:获取多个公文之间的关联关系;根据所述关联关系和目标用户的特征数据确定所述多个公文中的待推荐公文;将待推荐公文向目标用户推荐。As another aspect, the present invention also provides a computer-readable medium. The computer-readable medium may be included in the device described in the above embodiments; it may also exist independently without being assembled into the device. . The above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the device, the steps performed by the device include: acquiring the association relationship between multiple official documents; according to the association relationship and The feature data of the target user determines the official documents to be recommended among the plurality of official documents; the official documents to be recommended are recommended to the target user.

在本发明实施例的技术方案中,首先提取多个公文之间的修订、废止、根据、提及等关联关系,之后基于这种关联关系以及目标用户的特征数据来确定待推荐公文,最后将待推荐公文向目标用户推荐。基于以上设置,能够充分考虑公文之间的逻辑关联关系来确定匹配于目标用户的公文,从而提高推荐准确性。在基于公文之间的关联关系确定待推荐公文时,本发明实施例可以首先计算任一公文的特征向量,之后根据该公文的特征向量以及目标用户的特征数据确定目标用户对该公文的第一潜在关注度,最后基于第一潜在关注度确定待推荐公文。在计算任一公文的特征向量时,可以将该公文的原始特征、与该公文具有关联关系的公文的特征向量以及该关联关系的类型对应的权重值输入预设函数直接计算,也可以将公文的原始特征和公文之间的关联关系数据输入预先训练的特征提取模型进行计算。在计算第一潜在关注度时,可以依据公文特征向量与目标用户特征数据的相似度,也可以依据将公文特征向量与目标用户特征数据输入预先训练的意图识别模型得到的目标用户对公文的关注概率。通过以上设置,实现了适用多种具体场景的公文特征计算以及公文与用户匹配。在本发明实施例中,目标用户的特征数据可以是目标用户的当前特征向量,该当前特征向量可以在目标用户初始特征向量的基础上随目标用户针对公文的浏览、收藏、订阅等行为动态变化,这样,当目标用户浏览、收藏或订阅某一公文之后,其当前特征向量可发生倾向于该公文特征向量的变化,这种变化使得目标用户与该公文的关联公文之间会形成更高的匹配程度,这样即可实现精准的关联公文推荐和实时推荐。在本发明实施例中,除了考虑公文之间的关联关系之外,还可以结合以下的一种或多种方法进行协同推荐,这些方法有:基于用户行为的推荐、基于时间惩罚项的推荐、基于公文浏览数量的推荐、基于预设规则的推荐,这样能够进一步地提高推荐准确性。In the technical solution of the embodiment of the present invention, firstly, the association relationship between multiple official documents such as revision, revocation, basis, and mention is extracted, then the official document to be recommended is determined based on this association relationship and the characteristic data of the target user, and finally the official document to be recommended is determined. The official document to be recommended is recommended to the target user. Based on the above settings, it is possible to fully consider the logical relationship between the official documents to determine the official document matching the target user, thereby improving the recommendation accuracy. When determining the official document to be recommended based on the relationship between the official documents, the embodiment of the present invention may first calculate the feature vector of any official document, and then determine the target user's first feature vector of the official document according to the feature vector of the official document and the feature data of the target user. The potential attention degree, and finally the official document to be recommended is determined based on the first potential attention degree. When calculating the feature vector of any official document, the original feature of the official document, the feature vector of the official document that has an associated relationship with the official document, and the weight value corresponding to the type of the association relationship can be input into the preset function for direct calculation, or the official document can be directly calculated. The correlation data between the original features and official documents is input into the pre-trained feature extraction model for calculation. When calculating the first potential degree of attention, it can be based on the similarity between the feature vector of the official document and the feature data of the target user, or the focus of the target user on the official document obtained by inputting the feature vector of the official document and the feature data of the target user into the pre-trained intent recognition model. probability. Through the above settings, document feature calculation suitable for a variety of specific scenarios and document and user matching are realized. In this embodiment of the present invention, the feature data of the target user may be the current feature vector of the target user, and the current feature vector may dynamically change with the target user's behaviors such as browsing, saving, and subscribing to official documents on the basis of the initial feature vector of the target user. , in this way, after the target user browses, collects or subscribes to a certain document, its current feature vector may change towards the feature vector of the official document, and this change will make a higher relationship between the target user and the document associated with the official document. Matching degree, so that accurate related document recommendation and real-time recommendation can be achieved. In this embodiment of the present invention, in addition to considering the relationship between official documents, collaborative recommendation can also be performed in combination with one or more of the following methods. These methods include: recommendation based on user behavior, recommendation based on time penalty items, Recommendations based on the number of official document views and recommendations based on preset rules can further improve the accuracy of recommendations.

上述具体实施方式,并不构成对本发明保护范围的限制。本领域技术人员应该明白的是,取决于设计要求和其他因素,可以发生各种各样的修改、组合、子组合和替代。任何在本发明的精神和原则之内所作的修改、等同替换和改进等,均应包含在本发明保护范围之内。The above-mentioned specific embodiments do not constitute a limitation on the protection scope of the present invention. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may occur depending on design requirements and other factors. Any modifications, equivalent replacements and improvements made within the spirit and principle of the present invention shall be included within the protection scope of the present invention.

Claims (25)

1. An official document recommendation method, comprising:
acquiring an incidence relation among a plurality of documents;
determining the official documents to be recommended in the multiple official documents according to the incidence relation and the characteristic data of the target user;
and recommending the document to be recommended to the target user.
2. The method of claim 1, wherein the association between the plurality of documents comprises: at least one official document having an association relation with any one of the plurality of official documents and a type of the association relation; and determining the official documents to be recommended in the plurality of official documents according to the incidence relation and the characteristic data of the target user, wherein the method comprises the following steps:
for any of the plurality of documents:
determining a feature vector of any document according to the document having an association relation with the document and the type of the association relation;
determining a first potential attention degree of the target user to any official document according to the feature vector of the official document and the feature data of the target user;
acquiring the comprehensive potential attention degree of the target user to any official document based on the first potential attention degree;
and when the comprehensive potential attention degree meets a preset recommendation condition, determining any official document as an official document to be recommended.
3. The method of claim 2, wherein determining the feature vector of the document according to the document having an association relationship with the document and the type of the association relationship comprises:
for any of the plurality of documents:
inputting the original features of any document, the feature vector of each document having an association relation with the document and the preset weight value corresponding to the type of the association relation into a preset function for calculation to obtain the feature vector of any document.
4. The method of claim 2, wherein determining the feature vector of the document according to the document having an association relationship with the document and the type of the association relationship comprises:
inputting the original features of the multiple documents and the incidence relation data between the multiple documents into a pre-trained feature extraction model to obtain the feature vector of any document in the multiple documents.
5. The method of claim 2, wherein the feature data of the target user comprises a current feature vector of the target user; and, the method further comprises:
acquiring an initial feature vector of a target user;
and determining the current characteristic vector of the target user according to the initial characteristic vector of the target user and the historical behavior data of the target user for the official document.
6. The method of claim 5, wherein the obtaining the initial feature vector of the target user comprises:
and determining the initial characteristic vector of the target user according to the preference document category provided by the target user.
7. The method of claim 5, wherein determining the current feature vector of the target user according to the initial feature vector of the target user and historical behavior data of the target user for the official document comprises:
acquiring a feature vector of each official document corresponding to the historical behavior data;
multiplying the feature vector of any document with the influence fraction of the document to obtain the influence vector of the document; the influence score is determined according to the type of the historical behaviors of the target user aiming at the official document;
and determining the current feature vector of the target user according to the influence vector of each official document and the initial feature vector of the target user.
8. The method of claim 5, wherein determining the first potential degree of interest of the target user for the any document according to the feature vector of the any document and the feature data of the target user comprises:
for any document in the plurality of documents, calculating the similarity between the feature vector of the document and the current feature vector of the target user;
and determining a first potential attention degree of the target user to any document according to the similarity.
9. The method of claim 5, wherein determining the first potential degree of interest of the target user for the any document according to the feature vector of the any document and the feature data of the target user comprises:
inputting the feature vector of any one of the multiple documents and the current feature vector of the target user into a pre-trained intention recognition model to obtain the attention probability of the target user to the any document;
and determining a first potential attention degree of the target user to the any official document according to the attention probability.
10. The method of claim 2, wherein the obtaining of the comprehensive potential attention of the target user to the arbitrary document based on the first potential attention comprises:
and determining the first potential attention degree of the target user to any one official document in the plurality of official documents as the comprehensive potential attention degree of the target user to the any one official document.
11. The method of claim 2,
the method further comprises: for any official document in the multiple official documents, calculating the similarity between the original characteristic of the official document and the characteristic data of the target user; determining a second potential attention degree of the target user to any official document according to the similarity degree; and the number of the first and second groups,
the acquiring of the comprehensive potential attention degree of the target user to any official document based on the first potential attention degree comprises: and determining the comprehensive potential attention degree of the target user to any document by combining the first potential attention degree and the second potential attention degree.
12. The method of claim 2,
the method further comprises: inputting the original features of any one of the multiple documents and the comprehensive features of the target user into a pre-trained intention judging model to obtain the attention probability of the target user to the any one document, and determining a second potential attention degree of the target user to the any one document according to the attention probability; the comprehensive characteristics of the target user are determined by the historical behavior data of the target user aiming at the official document and the attribute characteristics of the target user; and the number of the first and second groups,
the acquiring of the comprehensive potential attention degree of the target user to any official document based on the first potential attention degree comprises: and determining the comprehensive potential attention degree of the target user to any document by combining the first potential attention degree and the second potential attention degree.
13. The method according to claim 11 or 12, characterized in that the method further comprises:
for any official document in the multiple official documents, determining a first time penalty item according to the time length between the issuing time of the any official document and the current time, and determining a third potential attention degree of a target user to the any official document by using the first time penalty item; and
the acquiring of the comprehensive potential attention degree of the target user to any official document based on the first potential attention degree comprises:
and determining the comprehensive potential attention degree of the target user to any document by combining the first potential attention degree, the second potential attention degree and the third potential attention degree.
14. The method of claim 13,
the method further comprises: for any of the plurality of documents: when the target user browses any official document at the historical moment, determining a second time penalty item according to the time length between the moment when the target user browses any official document at the last time and the current moment; and
the determining a third potential attention degree of the target user to any document by using the first time penalty item comprises the following steps: and determining a third potential attention degree of the target user to any document by using the first time penalty item and the second time penalty item.
15. The method of claim 13,
the method further comprises: for any official document in the multiple official documents, determining a fourth potential attention degree of the any official document according to the browsing number of the any official document in at least one historical time interval; and
the acquiring of the comprehensive potential attention degree of the target user to any official document based on the first potential attention degree comprises: and determining the comprehensive potential attention degree of the target user to any document by combining the first potential attention degree, the second potential attention degree, the third potential attention degree and the fourth potential attention degree.
16. The method of claim 15, further comprising:
and when any document in the plurality of documents is related to a predetermined hot spot event, increasing the fourth potential attention of the document according to a preset strategy.
17. The method of claim 15,
the method further comprises: when any official document in the multiple official documents and a target user meet at least one preset rule, determining a fifth potential attention degree of the official document according to a recommendation weight set in the preset rule;
the acquiring of the comprehensive potential attention degree of the target user to any official document based on the first potential attention degree comprises: and determining the comprehensive potential attention degree of the target user to any document by combining the first potential attention degree, the second potential attention degree, the third potential attention degree, the fourth potential attention degree and the fifth potential attention degree.
18. The method according to claim 17, wherein the determining a fifth potential attention degree of any document according to the recommendation weight set in the preset rule comprises:
when the preset rule is one, determining the recommendation weight set in the preset rule as a fifth potential attention degree of any official document;
and when the preset rules are multiple, determining the maximum value of the recommendation weights in the multiple preset rules as the fifth potential attention of any document.
19. The method of claim 17, wherein the obtaining of the comprehensive potential attention of the target user to the arbitrary document based on the first potential attention comprises:
and determining the weighted sum of the first potential attention degree, the second potential attention degree, the third potential attention degree, the fourth potential attention degree and the fifth potential attention degree as the comprehensive potential attention degree of the target user to any official document.
20. The method according to claim 2, wherein when the comprehensive potential attention degree meets a preset recommendation condition, determining any document as a document to be recommended comprises:
arranging the plurality of documents according to the sequence of the comprehensive potential attention degrees from large to small;
and determining the documents in the preset number arranged in the front as documents to be recommended.
21. The method of claim 1, wherein the target user profile comprises: browsing, collecting or subscribing the official documents by the target user in a preset historical time period; and determining the official documents to be recommended in the plurality of official documents according to the incidence relation and the characteristic data of the target user, wherein the method comprises the following steps:
determining the official documents in the plurality of official documents browsed, collected or subscribed by the target user in a preset historical time period as target official documents;
and determining at least one official document having an association relation with the target official document as an official document to be recommended.
22. The method of claim 7,
the type of the incidence relation comprises: revising relationships, revoking relationships, basing relationships and mentioning relationships;
the types of historical behaviors include: browse, collect, subscribe, ignore, cancel collect, and cancel subscribe.
23. An official document recommendation device, comprising:
the information extraction unit is used for acquiring the incidence relation among a plurality of documents;
the official document determining unit is used for determining the official documents to be recommended in the multiple official documents according to the incidence relation and the characteristic data of the target user;
and the recommending unit is used for recommending the official document to be recommended to the target user.
24. An electronic device, comprising:
one or more processors;
a storage device for storing one or more programs,
when executed by the one or more processors, cause the one or more processors to implement the method of any one of claims 1-22.
25. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the method according to any one of claims 1-22.
CN202010773625.7A 2020-08-04 2020-08-04 Official document recommendation method and device Active CN112000795B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010773625.7A CN112000795B (en) 2020-08-04 2020-08-04 Official document recommendation method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010773625.7A CN112000795B (en) 2020-08-04 2020-08-04 Official document recommendation method and device

Publications (2)

Publication Number Publication Date
CN112000795A true CN112000795A (en) 2020-11-27
CN112000795B CN112000795B (en) 2024-12-27

Family

ID=73463634

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010773625.7A Active CN112000795B (en) 2020-08-04 2020-08-04 Official document recommendation method and device

Country Status (1)

Country Link
CN (1) CN112000795B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114090848A (en) * 2021-10-25 2022-02-25 阿里巴巴(中国)有限公司 Data recommendation and classification method, feature fusion model and electronic equipment
CN118643803A (en) * 2024-08-13 2024-09-13 北京中数睿智科技有限公司 Data reliability detection method and system for large model generation of special official documents

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150363688A1 (en) * 2014-06-13 2015-12-17 Microsoft Corporation Modeling interestingness with deep neural networks
CN109299377A (en) * 2018-10-26 2019-02-01 东软集团股份有限公司 Article recommendation method, device, readable storage medium and electronic device
CN110888990A (en) * 2019-11-22 2020-03-17 深圳前海微众银行股份有限公司 Text recommending methods, devices, equipment and media
CN111046657A (en) * 2019-12-04 2020-04-21 东软集团股份有限公司 Method, device and equipment for realizing text information standardization

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150363688A1 (en) * 2014-06-13 2015-12-17 Microsoft Corporation Modeling interestingness with deep neural networks
CN109299377A (en) * 2018-10-26 2019-02-01 东软集团股份有限公司 Article recommendation method, device, readable storage medium and electronic device
CN110888990A (en) * 2019-11-22 2020-03-17 深圳前海微众银行股份有限公司 Text recommending methods, devices, equipment and media
CN111046657A (en) * 2019-12-04 2020-04-21 东软集团股份有限公司 Method, device and equipment for realizing text information standardization

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114090848A (en) * 2021-10-25 2022-02-25 阿里巴巴(中国)有限公司 Data recommendation and classification method, feature fusion model and electronic equipment
CN118643803A (en) * 2024-08-13 2024-09-13 北京中数睿智科技有限公司 Data reliability detection method and system for large model generation of special official documents

Also Published As

Publication number Publication date
CN112000795B (en) 2024-12-27

Similar Documents

Publication Publication Date Title
JP6745384B2 (en) Method and apparatus for pushing information
US11347752B2 (en) Personalized user feed based on monitored activities
US11151203B2 (en) Interest embedding vectors
US10026021B2 (en) Training image-recognition systems using a joint embedding model on online social networks
US9183282B2 (en) Methods and systems for inferring user attributes in a social networking system
US10771424B2 (en) Usability and resource efficiency using comment relevance
US9324112B2 (en) Ranking authors in social media systems
US10083379B2 (en) Training image-recognition systems based on search queries on online social networks
EP3547155A1 (en) Entity representation learning for improving digital content recommendations
US20190266257A1 (en) Vector similarity search in an embedded space
US11080287B2 (en) Methods, systems and techniques for ranking blended content retrieved from multiple disparate content sources
US10102255B2 (en) Categorizing objects for queries on online social networks
Tran et al. Hashtag recommendation approach based on content and user characteristics
US9779169B2 (en) System for ranking memes
CN112074857A (en) Combining machine learning and social data to generate personalized recommendations
US20120197750A1 (en) Methods, systems and devices for recommending products and services
US10061856B2 (en) Multimedia search using reshare text on online social networks
US11423096B2 (en) Method and apparatus for outputting information
CN111279334A (en) Search query enhancement with contextual analysis
CN107526718B (en) Method and apparatus for generating text
US20180101617A1 (en) Ranking Search Results using Machine Learning Based Models
JP2013517563A (en) User communication analysis system and method
CN108228906B (en) Method and apparatus for generating information
CN110059230B (en) Generalized linear mixture model for improved search
US20190258719A1 (en) Emoji classifier

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20220920

Address after: 25 Financial Street, Xicheng District, Beijing 100033

Applicant after: CHINA CONSTRUCTION BANK Corp.

Address before: 25 Financial Street, Xicheng District, Beijing 100033

Applicant before: CHINA CONSTRUCTION BANK Corp.

Applicant before: Jianxin Financial Science and Technology Co.,Ltd.

GR01 Patent grant
GR01 Patent grant