[go: up one dir, main page]

CN106339510B - Click estimation method and device based on artificial intelligence - Google Patents

Click estimation method and device based on artificial intelligence Download PDF

Info

Publication number
CN106339510B
CN106339510B CN201610972619.8A CN201610972619A CN106339510B CN 106339510 B CN106339510 B CN 106339510B CN 201610972619 A CN201610972619 A CN 201610972619A CN 106339510 B CN106339510 B CN 106339510B
Authority
CN
China
Prior art keywords
entity
recommended
determining
click
query statement
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610972619.8A
Other languages
Chinese (zh)
Other versions
CN106339510A (en
Inventor
闭玮
陈泽裕
石磊
何径舟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201610972619.8A priority Critical patent/CN106339510B/en
Publication of CN106339510A publication Critical patent/CN106339510A/en
Application granted granted Critical
Publication of CN106339510B publication Critical patent/CN106339510B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本申请提出一种基于人工智能的点击预估方法及装置,其中,该方法包括:根据用户输入的查询语句,获取待推荐实体的属性信息;对所述查询语句及待推荐实体的属性信息分别进行分词处理,确定所述查询语句及待推荐实体的特征;利用预设的深度神经网络模型,根据所述查询语句及待推荐实体的特征,确定所述待推荐实体的点击率。由此,方法通过将提取的特征融入到深度神经网络模型中,对待推荐实体的点击率进行预估,提高了点击率预估的准确性、使得推荐系统可以准确的为用户提供服务,提高了推荐系统的服务质量,改善了用户体验。

The present application proposes an artificial intelligence-based click estimation method and device, wherein the method includes: obtaining the attribute information of the entity to be recommended according to the query statement input by the user; Carry out word segmentation processing to determine the features of the query statement and the entity to be recommended; use the preset deep neural network model to determine the click-through rate of the entity to be recommended according to the query statement and the features of the entity to be recommended. Therefore, the method integrates the extracted features into the deep neural network model to estimate the click-through rate of the recommended entity, which improves the accuracy of the click-through rate estimation, enables the recommendation system to provide services for users accurately, and improves the accuracy of the click-through rate. The quality of service of the recommendation system improves the user experience.

Description

基于人工智能的点击预估方法及装置Click estimation method and device based on artificial intelligence

技术领域technical field

本申请涉及网络技术领域,尤其涉及一种基于人工智能的点击预估方法及装置。The present application relates to the field of network technology, in particular to an artificial intelligence-based click estimation method and device.

背景技术Background technique

人工智能(Artificial Intelligence),英文缩写为AI。它是研究、开发用于模拟、延伸和扩展人的智能的理论、方法、技术及应用系统的一门新的技术科学。人工智能是计算机科学的一个分支,它企图了解智能的实质,并生产出一种新的能以人类智能相似的方式做出反应的智能机器,该领域的研究包括机器人、语言识别、图像识别、自然语言处理和专家系统等。Artificial Intelligence (Artificial Intelligence), the English abbreviation is AI. It is a new technical science that studies and develops theories, methods, technologies and application systems for simulating, extending and expanding human intelligence. Artificial intelligence is a branch of computer science that attempts to understand the essence of intelligence and produce a new intelligent machine that can respond in a manner similar to human intelligence. Research in this field includes robotics, language recognition, image recognition, natural language processing and expert systems, etc.

点击率(Click through rate,简称CTR)预估是大数据技术应用的经典问题之一。点击率预估的一个重要应该就是把最合适的广告或推荐产品找出来呈现给用户。目前,在广告、金融等推荐领域中,通常用逻辑回归(Logistic regression,简称LR)模型对待推荐产品的点击率进行预估,将到用户输入的查询语句及获取的推荐实体的特征值,进行线性加权和非线性运算,即可确定待推荐实体的点击率。Click through rate (CTR) estimation is one of the classic problems in the application of big data technology. An important aspect of click-through rate estimation is to find out the most suitable advertisement or recommended product and present it to users. At present, in the recommendation fields such as advertising and finance, the click-through rate of the recommended product is usually estimated by the logistic regression (LR) model, and the query statement entered by the user and the feature value of the recommended entity obtained are calculated. Linear weighting and non-linear operations can determine the click-through rate of the entity to be recommended.

但是上述利用LR模型确定待推荐实体点击率的形式,由于LR模型的局限性,使得到的待推荐实体的点击率准确性较低。However, in the form of determining the click-through rate of the entity to be recommended by using the LR model, the accuracy of the click-through rate of the entity to be recommended is low due to limitations of the LR model.

发明内容Contents of the invention

本申请旨在至少在一定程度上解决相关技术中的技术问题之一。This application aims to solve one of the technical problems in the related art at least to a certain extent.

为此,本申请的第一个目的在于提出一种基于人工智能的点击预估方法,该方法通过将提取的特征融入到深度神经网络模型中,对待推荐实体的点击率进行预估,提高了点击率预估的准确性、使得推荐系统可以准确的为用户提供服务,提高了推荐系统的服务质量,改善了用户体验。For this reason, the first purpose of this application is to propose a click estimation method based on artificial intelligence. This method integrates the extracted features into the deep neural network model to estimate the click rate of the entity to be recommended, which improves the The accuracy of click-through rate estimation enables the recommendation system to accurately provide services for users, improves the service quality of the recommendation system, and improves user experience.

本申请的第二个目的在于提出一种基于人工智能的点击预估装置。The second purpose of the present application is to propose an artificial intelligence-based click estimation device.

本申请的第三个目的在于提出一种基于人工智能的点击预估设备。The third purpose of the present application is to propose an artificial intelligence-based click estimation device.

本申请的第四个目的在于提出一种非临时性计算机可读存储介质。The fourth objective of the present application is to provide a non-transitory computer-readable storage medium.

本申请的第五个目的在于提出一种计算机程序产品。The fifth object of the present application is to propose a computer program product.

为达上述目的,本申请第一方面实施例提出了一种基于人工智能的点击预估方法,包括:根据用户输入的查询语句,获取待推荐实体的属性信息;对所述查询语句及待推荐实体的属性信息分别进行分词处理,确定所述查询语句及待推荐实体的特征;利用预设的深度神经网络模型,根据所述查询语句及待推荐实体的特征,确定所述待推荐实体的点击率。In order to achieve the above purpose, the embodiment of the first aspect of the present application proposes a click prediction method based on artificial intelligence, which includes: obtaining the attribute information of the entity to be recommended according to the query sentence input by the user; The attribute information of the entity is separately processed to determine the characteristics of the query statement and the entity to be recommended; using the preset deep neural network model, according to the query statement and the characteristics of the entity to be recommended, determine the click of the entity to be recommended Rate.

本申请实施例的基于人工智能的点击预估方法,在根据用户输入的查询语句获取待推荐的实体的属性信息后,即可所述查询语句及待推荐实体的属性信息分别进行分词处理,确定所述查询语句及待推荐实体的特征,然后利用预设的深度神经网络模型,根据所述查询语句及待推荐实体的特征,确定所述待推荐实体的点击率。由此,通过将提取的特征融入到深度神经网络模型中,对待推荐实体的点击率进行预估,提高了点击率预估的准确性、使得推荐系统可以准确的为用户提供服务,提高了推荐系统的服务质量,改善了用户体验。According to the artificial intelligence-based click estimation method of the embodiment of the present application, after obtaining the attribute information of the entity to be recommended according to the query sentence input by the user, the query sentence and the attribute information of the entity to be recommended can be respectively subjected to word segmentation processing to determine The query statement and the characteristics of the entity to be recommended are then used to determine the click-through rate of the entity to be recommended according to the query statement and the characteristics of the entity to be recommended using a preset deep neural network model. Therefore, by integrating the extracted features into the deep neural network model, the click-through rate of the entity to be recommended is estimated, which improves the accuracy of the click-through rate estimation, enables the recommendation system to provide services for users accurately, and improves the recommendation system. The quality of service of the system improves the user experience.

为达上述目的,本申请第二方面实施例提出了一种基于人工智能的点击预估装置,包括:获取模块,用于根据用户输入的查询语句,获取待推荐实体的属性信息;分词模块,用于对所述查询语句及待推荐实体的属性信息分别进行分词处理,确定所述查询语句及待推荐实体的特征;处理模块,用于利用预设的深度神经网络模型,根据所述查询语句及待推荐实体的特征,确定所述待推荐实体的点击率。In order to achieve the above purpose, the embodiment of the second aspect of the present application proposes an artificial intelligence-based click prediction device, including: an acquisition module, which is used to acquire the attribute information of the entity to be recommended according to the query sentence input by the user; a word segmentation module, It is used to separately perform word segmentation processing on the query statement and the attribute information of the entity to be recommended, and determine the characteristics of the query statement and the entity to be recommended; the processing module is used to use the preset deep neural network model, according to the query statement and the characteristics of the entity to be recommended, and determine the click rate of the entity to be recommended.

本申请实施例的基于人工智能的点击预估装置,在根据用户输入的查询语句获取待推荐的实体的属性信息后,即可所述查询语句及待推荐实体的属性信息分别进行分词处理,确定所述查询语句及待推荐实体的特征,然后利用预设的深度神经网络模型,根据所述查询语句及待推荐实体的特征,确定所述待推荐实体的点击率。由此,通过将提取的特征融入到深度神经网络模型中,对待推荐实体的点击率进行预估,提高了点击率预估的准确性、使得推荐系统可以准确的为用户提供服务,提高了推荐系统的服务质量,改善了用户体验。According to the artificial intelligence-based click estimation device of the embodiment of the present application, after obtaining the attribute information of the entity to be recommended according to the query sentence input by the user, the query sentence and the attribute information of the entity to be recommended can be respectively subjected to word segmentation processing to determine The query statement and the characteristics of the entity to be recommended are then used to determine the click-through rate of the entity to be recommended according to the query statement and the characteristics of the entity to be recommended using a preset deep neural network model. Therefore, by integrating the extracted features into the deep neural network model, the click-through rate of the entity to be recommended is estimated, which improves the accuracy of the click-through rate estimation, enables the recommendation system to provide services for users accurately, and improves the recommendation system. The quality of service of the system improves the user experience.

为达上述目的,本申请第三方面实施例提出了一种基于人工智能的点击预估设备,包括:To achieve the above purpose, the embodiment of the third aspect of the present application proposes an artificial intelligence-based click estimation device, including:

处理器;processor;

用于存储处理器可执行指令的存储器;memory for storing processor-executable instructions;

其中,所述处理器被配置为执行如上述第一方面中的基于人工智能的点击预估方法。Wherein, the processor is configured to execute the artificial intelligence-based click estimation method in the first aspect above.

为达上述目的,本申请第四方面实施例提出了一种非临时性计算机可读存储介质,当所述存储介质中的指令由移动终端的处理器被执行时,使得移动终端能够执行一种如上述第一方面中的基于人工智能的点击预估方法。To achieve the above purpose, the embodiment of the fourth aspect of the present application proposes a non-transitory computer-readable storage medium. When the instructions in the storage medium are executed by the processor of the mobile terminal, the mobile terminal can execute a Such as the artificial intelligence-based click estimation method in the first aspect above.

为达上述目的,本申请第五方面实施例提出了一种计算机程序产品,当所述计算机程序产品中的指令处理器执行时,执行一种如上述第一方面中的基于人工智能的点击预估方法。In order to achieve the above purpose, the embodiment of the fifth aspect of the present application proposes a computer program product, when the instruction processor in the computer program product executes, an artificial intelligence-based click prediction as in the first aspect above is performed. estimation method.

附图说明Description of drawings

本发明上述的和/或附加的方面和优点从下面结合附图对实施例的描述中将变得明显和容易理解,其中:The above and/or additional aspects and advantages of the present invention will become apparent and easy to understand from the following description of the embodiments in conjunction with the accompanying drawings, wherein:

图1是本申请一个实施例的基于人工智能的点击预估方法的流程图;Fig. 1 is a flowchart of an artificial intelligence-based click estimation method according to an embodiment of the present application;

图2为本申请实施例提供的利用深度神经网络模型预估点击率的流程示意图;FIG. 2 is a schematic flow chart of estimating the click-through rate using a deep neural network model provided by the embodiment of the present application;

图3为本申请实施例提供的确定查询语句对应向量的过程示意图;FIG. 3 is a schematic diagram of the process of determining the vector corresponding to the query sentence provided by the embodiment of the present application;

图4为根据查询语句及待推荐实体的向量确定点击率的过程示意图;Fig. 4 is a schematic diagram of the process of determining the click-through rate according to the query statement and the vector of the entity to be recommended;

图5是本申请另一个实施例的基于人工智能的点击预估方法的流程图;FIG. 5 is a flowchart of an artificial intelligence-based click estimation method according to another embodiment of the present application;

图6为本申请实施例提供的点击标签预估过程示意图;FIG. 6 is a schematic diagram of the click label estimation process provided by the embodiment of the present application;

图7是本申请一个实施例的基于人工智能的点击预估装置的结构示意图;FIG. 7 is a schematic structural diagram of an artificial intelligence-based click estimation device according to an embodiment of the present application;

图8是本申请另一个实施例的基于人工智能的点击预估装置的结构示意图。FIG. 8 is a schematic structural diagram of an artificial intelligence-based click estimation device according to another embodiment of the present application.

具体实施方式Detailed ways

下面详细描述本申请的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,旨在用于解释本申请,而不能理解为对本申请的限制。Embodiments of the present application are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary, and are intended to explain the present application, and should not be construed as limiting the present application.

下面参考附图描述本申请实施例的基于人工智能的点击预估方法及装置。The artificial intelligence-based click estimation method and device according to the embodiments of the present application will be described below with reference to the accompanying drawings.

图1是本申请一个实施例的基于人工智能的点击预估方法的流程图。FIG. 1 is a flow chart of an artificial intelligence-based click estimation method according to an embodiment of the present application.

如图1所示,该基于人工智能的点击预估方法包括:As shown in Figure 1, the artificial intelligence-based click estimation method includes:

步骤101,根据用户输入的查询语句,获取待推荐实体的属性信息。Step 101, according to the query sentence input by the user, attribute information of the entity to be recommended is obtained.

具体地,本发明实施例提供的基于人工智能的点击预估方法的执行主体为本申请提供的基于人工智能的点击预估装置,该装置可以被配置在任何搜索、推荐服务系统中,以在用户进行搜索时,对待推荐实体的点击率进行预估。Specifically, the executor of the artificial intelligence-based click estimation method provided in the embodiment of the present invention is the artificial intelligence-based click estimation device provided by this application, which can be configured in any search and recommendation service system to When a user searches, the click rate of the entity to be recommended is estimated.

其中,待推荐实体的属性信息,包括以下信息的至少一个:待推荐实体名称、推荐理由及待推荐实体的标识。举例来说,若用户输入的查询语句为“陕西附近有什么好玩的地方”,点击预估装置通过查询检索数据库,确定其中一个待推荐实体为“陕西香山”,同时在实体库中,还包括“陕西香山”对应的唯一标识“sxxs”及对应的推荐利用“省级名胜风景区”。则与查询语句“陕西附近有什么好玩的地方”对应的待推荐实体“陕西香山”的属性信息可以为包括“陕西香山”、“sxxs”和“省级名胜风景区”。Wherein, the attribute information of the entity to be recommended includes at least one of the following information: the name of the entity to be recommended, the reason for recommendation, and the identifier of the entity to be recommended. For example, if the query sentence entered by the user is "what are the interesting places near Shaanxi", the click estimation device searches the database through query and determines that one of the entities to be recommended is "Shaanxi Xiangshan". At the same time, in the entity database, it also includes The unique identifier "sxxs" corresponding to "Shaanxi Xiangshan" and the corresponding recommended "provincial scenic spot". Then, the attribute information of the entity to be recommended "Shaanxi Xiangshan" corresponding to the query sentence "what interesting places are there near Shaanxi" may include "Shaanxi Xiangshan", "sxxs" and "provincial scenic spot".

可以理解的是,待推荐实体的属性信息中包括的内容越多,即对待推荐实体的画像越准确,对待推荐实体的点击率预估也就会越准确。It is understandable that the more content included in the attribute information of the entity to be recommended, that is, the more accurate the portrait of the entity to be recommended is, the more accurate the click rate estimation of the entity to be recommended will be.

步骤102,对所述查询语句及待推荐实体的属性信息分别进行分词处理,确定所述查询语句及待推荐实体的特征。Step 102, perform word segmentation processing on the query statement and attribute information of the entity to be recommended, and determine the characteristics of the query statement and the entity to be recommended.

具体实现时,为了使得到的查询语句及待推荐实体的特征尽量准确,点击预估装置可以采用多种切词方式,对查询语句及待推荐实体的属性信息进行分词处理。比如,按照不同的切词粒度,对述查询语句及待推荐实体的属性信息分别进行分词处理,确定所述查询语句及待推荐实体分别包括的不同粒度的分词。During specific implementation, in order to make the obtained query statement and the features of the entity to be recommended as accurate as possible, the click estimation device may adopt various word segmentation methods to perform word segmentation processing on the query statement and the attribute information of the entity to be recommended. For example, word segmentation processing is performed on the query statement and the attribute information of the entity to be recommended according to different granularities of word segmentation, and word segmentation of different granularities respectively included in the query statement and the entity to be recommended is determined.

举例来说,对查询语句“陕西附近有什么好玩的地方”,按照细粒度切词(basicword segment)得到的结果为陕西|附近|有|什么|好玩|的|地方,进行粗粒度切词(phraseword segment)得到的结果为陕西|附近|有|什么|好玩的地方。对于给定的实体entity,我们也对其进行两个粒度的切词得到的结果都为:陕西|香山。同时我们可以得到实体在实体库里的唯一标识id,以及实体的一个推荐理由reason=省级名胜风景区。因此我们可以得到推荐理由的两个粒度的切词分别为:省级|名胜|风景|区,以及,省级|名胜|风景区。因此,在搜索推荐的情景中,我们可以选定query的特征包括query细粒度切词结果以及粗粒度切词结果;entity的特征包括entity本身的粗细粒度的切词结果,id,以及推荐理由的粗细粒度的切词结果。For example, for the query sentence "what are the interesting places near Shaanxi", according to the fine-grained word segmentation (basicword segment), the result obtained is Shaanxi|nearby|has|what|interesting|places, and the coarse-grained word segmentation ( phraseword segment) the result obtained is Shaanxi|near|near|has|what|interesting places. For a given entity entity, we also perform word segmentation at two granularities and the result is: Shaanxi|Xiangshan. At the same time, we can get the unique identifier id of the entity in the entity database, and a recommendation reason of the entity=provincial scenic spot. Therefore, we can get the two granularity segmentation words of the recommendation reason respectively: provincial | scenic spot | scenic spot | area, and provincial | scenic spot | scenic spot. Therefore, in the context of search and recommendation, we can select query features including fine-grained word segmentation results and coarse-grained word segmentation results; entity features include coarse-grained word segmentation results of the entity itself, id, and recommendation reasons Coarse and fine-grained word segmentation results.

步骤103,利用预设的深度神经网络(Deep Neural Network,简称DNN)模型,根据所述查询语句及待推荐实体的特征,确定所述待推荐实体的点击率。Step 103, using a preset Deep Neural Network (DNN for short) model to determine the click-through rate of the entity to be recommended according to the query statement and the characteristics of the entity to be recommended.

具体的,点击预估装置可以通过对大量的查询语句及对应的待推荐实体,结合深度神经网络,训练出预设的深度神经网络模型,用于根据查询语句及待推荐实体的特征,对待推荐实体的点击率进行预估。Specifically, the click prediction device can train a preset deep neural network model by combining a large number of query sentences and corresponding entities to be recommended with a deep neural network, which is used to make recommendations based on the characteristics of query sentences and entities to be recommended. The click-through rate of the entity is estimated.

具体实现时,利用预设的深度神经网络模型,对待推荐实体的点击率进行预估时,可以主要通过图2所示的步骤实现。图2为本申请实施例提供的利用深度神经网络模型预估点击率的流程示意图。In specific implementation, when using the preset deep neural network model to estimate the click-through rate of the entity to be recommended, it can mainly be realized through the steps shown in FIG. 2 . FIG. 2 is a schematic flow chart of estimating the click-through rate by using the deep neural network model provided by the embodiment of the present application.

步骤201,根据所述查询语句及待推荐实体分别包括的不同粒度的分词,利用预设的分词与向量的映射关系,确定所述查询语句及待推荐实体分别对应的向量。Step 201 , according to word segmentations of different granularities included in the query sentence and the entity to be recommended, and using a preset mapping relationship between word segmentation and vectors, determine vectors corresponding to the query sentence and the entity to be recommended respectively.

具体的,首先通过向量映射的形式,将query及entity的每一个分词都映射到DNN模型中的嵌入(embedding)层,每一个分词都对应一个embedding表示。其中,对于同一个词,其对应的embedding表示是唯一的。然后把细粒度分词中各个分词的embedding向量相加即可得到query的细粒度切词时的一个embedding表示,如图3所示,图3为本申请实施例提供的确定查询语句对应向量的过程示意图。Specifically, firstly, each word segment of query and entity is mapped to the embedding layer in the DNN model through the form of vector mapping, and each word segment corresponds to an embedding representation. Among them, for the same word, its corresponding embedding representation is unique. Then add the embedding vectors of each word segmentation in the fine-grained word segmentation to obtain an embedding representation of the fine-grained word segmentation of the query, as shown in Figure 3, Figure 3 is the process of determining the corresponding vector of the query statement provided by the embodiment of the present application schematic diagram.

需要说明的是,除了采用向量相加的方式之外,还可以采取向量卷积等方式得到一个特征的embedding表示。对于其他query和entity的特征,我们也采用同样的做法得到不同特征的embedding表示。对于entity id,我们也把每个id映射为一个单独的embedding,得到entity id的embedding表示。It should be noted that, in addition to the method of vector addition, the embedding representation of a feature can also be obtained by vector convolution and other methods. For other query and entity features, we also use the same method to obtain embedding representations of different features. For the entity id, we also map each id to a separate embedding to obtain the embedding representation of the entity id.

可以理解的是,通过embedding映射后,query和entity分别对应的向量数量与对query和entity的切词粒度种类及不同的切词粒度对应的分词是否相同有关。举例来说,若对query和entity分别采用了两种粒度的切词,且两种粒度的切词后得到的分词构成不同,那么通过embedding映射后,query对应的向量数量为2,而entity的名称及推荐理由通过两种粒度的切词后,得到的分词构成都相同,那么entity对应的向量数量为3,分别为entity名称对应1个向量,entity id对应一个向量,entity的推荐理由对应1个向量。It can be understood that after embedding mapping, the number of vectors corresponding to query and entity is related to the type of word segmentation granularity for query and entity and whether the word segmentation corresponding to different word segmentation granularity is the same. For example, if two granularities of word segmentation are used for query and entity respectively, and the word segmentation obtained after the two granularities of word segmentation is different, then after embedding mapping, the number of vectors corresponding to query is 2, and the number of vectors corresponding to entity After the name and recommendation reason are segmented at two granularities, the obtained word segmentation is the same, so the number of vectors corresponding to the entity is 3, respectively, the entity name corresponds to 1 vector, the entity id corresponds to 1 vector, and the recommendation reason of the entity corresponds to 1 vectors.

步骤202,利用第一预设的运算规则,对所述查询语句及待推荐实体分别对应的向量进行运算,确定待推荐实体的点击率。Step 202, using a first preset operation rule to perform calculations on the vectors corresponding to the query statement and the entity to be recommended, to determine the click-through rate of the entity to be recommended.

具体的,点击预估装置在确定query和entity分别对应的向量后,即可通过一定的运算,确定该entity的点击率。Specifically, after determining the vectors corresponding to the query and the entity, the click estimation device can determine the click rate of the entity through certain calculations.

通常在DNN模型中,可以通过线性变换及非线性变换的方式,确定query及entity的最终embedding向量表示,即上述步骤202,具体为:Usually in the DNN model, the final embedding vector representation of query and entity can be determined through linear transformation and nonlinear transformation, that is, the above step 202, specifically:

对所述查询语句及待推荐实体分别对应的向量进行线性变换和非线性变换,确定所述查询语句及待推荐实体分别对应的新向量;Carrying out linear transformation and nonlinear transformation to the vector corresponding to the query statement and the entity to be recommended respectively, and determining the new vector corresponding to the query statement and the entity to be recommended respectively;

根据所述查询语句及待推荐实体分别对应的新向量的内积,确定所述待推荐实体的点击率。The click rate of the entity to be recommended is determined according to the inner product of the query statement and the new vector corresponding to the entity to be recommended.

举例来说,图4为根据查询语句及待推荐实体的向量确定点击率的过程示意图。如图4所示,得到各种特征的embedding表示之后,我们分别把query的所有特征的embedding向量相加得到query的唯一embedding表示,将entity的所有特征的embedding向量相加得到entity的唯一embedding表示。然后经过线性变换以及一个非线性变换之后,得到query和entity的新的向量表示。接着,我们通过对两个向量求内积,并把其映射到0到1之间的值,即为CTR值。For example, FIG. 4 is a schematic diagram of the process of determining the click rate according to the query statement and the vector of the entity to be recommended. As shown in Figure 4, after obtaining the embedding representations of various features, we add the embedding vectors of all the features of the query to obtain the unique embedding representation of the query, and add the embedding vectors of all the features of the entity to obtain the unique embedding representation of the entity . Then after a linear transformation and a nonlinear transformation, a new vector representation of query and entity is obtained. Next, we calculate the inner product of the two vectors and map it to a value between 0 and 1, which is the CTR value.

需要说明的是,上述非线性变换指通过任意的非线性运算,比如可以为余弦运算、正弦运算、正切运算、余切运算等等,本实施例对此不作限定。It should be noted that the above-mentioned nonlinear transformation refers to any nonlinear operation, such as cosine operation, sine operation, tangent operation, cotangent operation, etc., which is not limited in this embodiment.

可以理解的是,点击预估装置,通过上述方式,可以对与用户输入的查询语句对应的每一个待推荐实体的点击率进行预估,从而根据各待推荐实体的点击率,向用户进行推荐。It can be understood that the click estimation device can estimate the click-through rate of each entity to be recommended corresponding to the query sentence input by the user through the above method, so as to recommend to the user according to the click-through rate of each entity to be recommended .

本申请实施例的基于人工智能的点击预估方法,在根据用户输入的查询语句获取待推荐的实体的属性信息后,即可所述查询语句及待推荐实体的属性信息分别进行分词处理,确定所述查询语句及待推荐实体的特征,然后利用预设的深度神经网络模型,根据所述查询语句及待推荐实体的特征,确定所述待推荐实体的点击率。由此,通过将提取的特征融入到深度神经网络模型中,对待推荐实体的点击率进行预估,提高了点击率预估的准确性、使得推荐系统可以准确的为用户提供服务,提高了推荐系统的服务质量,改善了用户体验。According to the artificial intelligence-based click estimation method of the embodiment of the present application, after obtaining the attribute information of the entity to be recommended according to the query sentence input by the user, the query sentence and the attribute information of the entity to be recommended can be respectively subjected to word segmentation processing to determine The query statement and the characteristics of the entity to be recommended are then used to determine the click-through rate of the entity to be recommended according to the query statement and the characteristics of the entity to be recommended using a preset deep neural network model. Therefore, by integrating the extracted features into the deep neural network model, the click-through rate of the entity to be recommended is estimated, which improves the accuracy of the click-through rate estimation, enables the recommendation system to provide services for users accurately, and improves the recommendation system. The quality of service of the system improves the user experience.

通过上述分析可知,点击预估装置可以根据查询语句及待推荐实体的特征,利用预设的DNN模型,对待推荐实体的点击率进行预估。在实际实现时,由于待推荐实体的点击率与展现的位置有关,因此在确定了待推荐实体的点击率后,还可以结合该推荐实体的展现位置,对用户是否对该推荐实体进行点击进行预估,下面结合图5对本申请实施例提供的基于人工智能的点击预估方法进行进一步说明。From the above analysis, it can be seen that the click estimation device can estimate the click rate of the entity to be recommended by using the preset DNN model according to the query statement and the characteristics of the entity to be recommended. In actual implementation, since the click-through rate of the entity to be recommended is related to the displayed position, after determining the click-through rate of the entity to be recommended, it can also be combined with the display position of the recommended entity to determine whether the user clicks on the recommended entity. For estimation, the artificial intelligence-based click estimation method provided in the embodiment of the present application will be further described below in conjunction with FIG. 5 .

图5是本申请另一个实施例的基于人工智能的点击预估方法的流程图。FIG. 5 is a flowchart of an artificial intelligence-based click estimation method according to another embodiment of the present application.

如图5所示,该基于人工智能的点击预估方法包括以下步骤:As shown in Figure 5, the artificial intelligence-based click estimation method includes the following steps:

步骤501,根据用户输入的查询语句,获取待推荐实体的属性信息。Step 501, according to the query sentence input by the user, attribute information of the entity to be recommended is acquired.

步骤502,按照不同的切词粒度,对述查询语句及待推荐实体的属性信息分别进行分词处理,确定所述查询语句及待推荐实体分别包括的不同粒度的分词。Step 502: Perform word segmentation processing on the query statement and the attribute information of the entity to be recommended according to different word segmentation granularities, and determine the word segmentation of different granularities included in the query statement and the entity to be recommended respectively.

步骤503,根据所述查询语句及待推荐实体分别包括的不同粒度的分词,利用预设的分词与向量的映射关系,确定所述查询语句及待推荐实体分别对应的向量。Step 503 , according to the word segmentations of different granularities included in the query sentence and the entity to be recommended, and using the preset mapping relationship between word segmentation and vectors, determine the vectors corresponding to the query sentence and the entity to be recommended respectively.

步骤504,对所述查询语句及待推荐实体分别对应的向量进行线性变换和非线性变换,确定所述查询语句及待推荐实体分别对应的新向量。Step 504, perform linear transformation and nonlinear transformation on the vectors corresponding to the query statement and the entity to be recommended, respectively, and determine new vectors corresponding to the query statement and the entity to be recommended respectively.

步骤505,根据所述查询语句及待推荐实体分别对应的新向量的内积,确定所述待推荐实体的点击率。Step 505: Determine the click-through rate of the entity to be recommended according to the inner product of the query statement and the new vector corresponding to the entity to be recommended.

步骤506,确定所述待推荐实体的展现位置。Step 506, determine the display position of the entity to be recommended.

具体的,点击预估装置可以通过多种方式,确定待推荐实体的展现位置(entitydisplay position)。比如,根据用户的历史行为日志,确定待推荐实体的展现位置;或者,根据待推荐实体的点击率,确定待推荐实体的展现位置;或者,采用随机分配的方式,为待推荐实体分配展现位置等等。Specifically, the click estimation device may determine the display position (entity display position) of the entity to be recommended in various ways. For example, the display position of the entity to be recommended is determined according to the user's historical behavior log; or, the display position of the entity to be recommended is determined according to the click rate of the entity to be recommended; or, the display position of the entity to be recommended is assigned by random allocation and many more.

步骤507,根据所述点击率及所述待推荐实体展现位置,确定所述待推荐实体的点击标签。Step 507: Determine the click label of the entity to be recommended according to the click rate and the display position of the entity to be recommended.

具体的,点击预估装置,可以仍然采用DNN模型,根据点击率及待推荐实体的展现位置,确定待推荐实体的点击标签。即上述步骤507,包括:Specifically, the click estimation device may still use the DNN model to determine the click label of the entity to be recommended according to the click rate and the display position of the entity to be recommended. That is, the above step 507 includes:

根据所述点击率及所述待推荐实体展现位置,确定所述待推荐实体对应的修正向量;determining a correction vector corresponding to the entity to be recommended according to the click-through rate and the display position of the entity to be recommended;

利用第二预设的运算规则,对所述修正向量进行运算,确定所述待推荐实体对应的点击标签。The correction vector is calculated by using a second preset operation rule to determine the click label corresponding to the entity to be recommended.

其中,第二预设的运算规则包括的运算方式,与第一预设的运算规则中包括的运算方式可以向他,也可以不同,本实施例对此不作限定。Wherein, the calculation method included in the second preset operation rule may be the same as or different from the operation method included in the first preset operation rule, which is not limited in this embodiment.

举例来说,图6为本申请实施例提供的点击标签预估过程示意图。如图6所示,首先通过向量映射的形式,确定CTR score及entity display position分别对应的向量表示,然后将CTR score及entity display position的向量串联起来,作为待推荐实体的修正向量,再对修正向量进行线性变换和非线性变换,确定待推荐实体对应的点击标签。For example, FIG. 6 is a schematic diagram of a click label estimation process provided by the embodiment of the present application. As shown in Figure 6, first determine the vector representations corresponding to the CTR score and entity display position in the form of vector mapping, and then concatenate the vectors of the CTR score and entity display position as the correction vector of the entity to be recommended, and then correct the The vector is subjected to linear transformation and nonlinear transformation to determine the click label corresponding to the entity to be recommended.

需要说明的是,点击预估装置,可以通过线性变换及分线性变换对待推荐实体的修正向量处理后,再根据得到的向量值,确定待推荐实体对应的点击标签,举例来说,若得到的向量值最终为大于0.5的值,则对应的点击标签为“易点击”;若得到的向量值为小于0.2的值,则对应的点击标签为“点击困难”;若得到的向量值为小于0.5,且大于0.2的值,则对应的点击标签为“点击概率低”等。It should be noted that the click estimation device can process the correction vector of the entity to be recommended through linear transformation and sub-linear transformation, and then determine the click label corresponding to the entity to be recommended according to the obtained vector value. For example, if the obtained If the final vector value is greater than 0.5, the corresponding click label is "easy to click"; if the obtained vector value is less than 0.2, the corresponding click label is "difficult to click"; if the obtained vector value is less than 0.5 , and a value greater than 0.2, the corresponding click label is "low probability of click", etc.

本申请实施例的基于人工智能的点击预估方法,在获取到与用户输入的查询语句对应的待推荐实体的属性信息后,首先采用不同的切词粒度,对查询语句及推荐实体的属性信息进行切词处理,确定查询语句及待推荐实体分别包括的不同粒度的分词,然后通过向量映射,确定查询语句及待推荐实体的分词分别对应的向量表示,再通过线性变换及非线性变换,确定待推荐实体的点击率,然后再根据待推荐实体的展现位置,对待推荐实体是否被点击进行预估。由此,实现了利用DNN模型,对待推荐实体的点击率及点击标签进行预测,提高了待推荐实体点击率预估的准确性,提高了推荐系统的服务质量,改善了用户体验。In the artificial intelligence-based click prediction method of the embodiment of the present application, after obtaining the attribute information of the entity to be recommended corresponding to the query statement input by the user, firstly, different word segmentation granularities are used to analyze the query statement and the attribute information of the recommended entity. Carry out word segmentation processing to determine the word segmentation of different granularities included in the query statement and the entity to be recommended, and then determine the vector representations corresponding to the word segmentation of the query statement and the entity to be recommended through vector mapping, and then determine through linear transformation and nonlinear transformation Click rate of the entity to be recommended, and then predict whether the entity to be recommended is clicked based on the display position of the entity to be recommended. As a result, the DNN model is used to predict the click-through rate and click tags of the recommended entity, which improves the accuracy of the predicted click-through rate of the recommended entity, improves the service quality of the recommendation system, and improves the user experience.

为了实现上述实施例,本申请还提出一种基于人工智能的点击预估装置。In order to realize the above embodiments, the present application also proposes an artificial intelligence-based click estimation device.

图7是本申请一个实施例的基于人工智能的点击预估装置的结构示意图。FIG. 7 is a schematic structural diagram of an artificial intelligence-based click estimation device according to an embodiment of the present application.

如图7所示,该基于人工智能的点击预估装置包括:As shown in Figure 7, the artificial intelligence-based click estimation device includes:

获取模块71,用于根据用户输入的查询语句,获取待推荐实体的属性信息;An acquisition module 71, configured to acquire the attribute information of the entity to be recommended according to the query sentence input by the user;

分词模块72,用于对所述查询语句及待推荐实体的属性信息分别进行分词处理,确定所述查询语句及待推荐实体的特征;The word segmentation module 72 is used to perform word segmentation processing on the query sentence and the attribute information of the entity to be recommended, and determine the characteristics of the query sentence and the entity to be recommended;

处理模块73,用于利用预设的深度神经网络模型,根据所述查询语句及待推荐实体的特征,确定所述待推荐实体的点击率。The processing module 73 is configured to use a preset deep neural network model to determine the click-through rate of the entity to be recommended according to the query statement and the characteristics of the entity to be recommended.

其中,所述待推荐实体的属性信息包括以下信息中的至少一个:待推荐实体名称、推荐理由及待推荐实体的标识。Wherein, the attribute information of the entity to be recommended includes at least one of the following information: a name of the entity to be recommended, a reason for recommendation, and an identifier of the entity to be recommended.

在本实施例一种可能的实现形式中,上述分词模块72,具体用于:In a possible implementation form of this embodiment, the above word segmentation module 72 is specifically used for:

按照不同的切词粒度,对述查询语句及待推荐实体的属性信息分别进行分词处理,确定所述查询语句及待推荐实体分别包括的不同粒度的分词。According to different granularities of word segmentation, word segmentation processing is performed on the query statement and the attribute information of the entity to be recommended respectively, and word segmentation of different granularities included in the query statement and the entity to be recommended are respectively determined.

相应的,上述处理模块73,包括:Correspondingly, the above processing module 73 includes:

确定单元,用于根据所述查询语句及待推荐实体分别包括的不同粒度的分词,利用预设的分词与向量的映射关系,确定所述查询语句及待推荐实体分别对应的向量;The determination unit is used to determine the corresponding vectors of the query statement and the entity to be recommended according to the word segmentation of different granularities included in the query statement and the entity to be recommended respectively, using the preset mapping relationship between the word segmentation and the vector;

运算单元,用于利用第一预设的运算规则,对所述查询语句及待推荐实体分别对应的向量进行运算,确定待推荐实体的点击率。The calculation unit is configured to use a first preset calculation rule to perform calculations on the vectors corresponding to the query statement and the entity to be recommended, so as to determine the click-through rate of the entity to be recommended.

在本实施例一种可能的实现形式中,所述运算单元,具体用于:In a possible implementation form of this embodiment, the computing unit is specifically configured to:

对所述查询语句及待推荐实体分别对应的向量进行线性变换和非线性变换,确定所述查询语句及待推荐实体分别对应的新向量;Carrying out linear transformation and nonlinear transformation to the vector corresponding to the query statement and the entity to be recommended respectively, and determining the new vector corresponding to the query statement and the entity to be recommended respectively;

根据所述查询语句及待推荐实体分别对应的新向量的内积,确定所述待推荐实体的点击率。The click rate of the entity to be recommended is determined according to the inner product of the query statement and the new vector corresponding to the entity to be recommended.

需要说明的是,前述对基于人工智能的点击预估方法实施例的解释说明也适用于该实施例的基于人工智能的点击预估装置,此处不再赘述。It should be noted that the foregoing explanations of the embodiment of the artificial intelligence-based click estimation method are also applicable to the artificial intelligence-based click estimation device of this embodiment, and details are not repeated here.

本申请实施例的基于人工智能的点击预估装置,在根据用户输入的查询语句获取待推荐的实体的属性信息后,即可所述查询语句及待推荐实体的属性信息分别进行分词处理,确定所述查询语句及待推荐实体的特征,然后利用预设的深度神经网络模型,根据所述查询语句及待推荐实体的特征,确定所述待推荐实体的点击率。由此,通过将提取的特征融入到深度神经网络模型中,对待推荐实体的点击率进行预估,提高了点击率预估的准确性、使得推荐系统可以准确的为用户提供服务,提高了推荐系统的服务质量,改善了用户体验。According to the artificial intelligence-based click estimation device of the embodiment of the present application, after obtaining the attribute information of the entity to be recommended according to the query sentence input by the user, the query sentence and the attribute information of the entity to be recommended can be respectively subjected to word segmentation processing to determine The query statement and the characteristics of the entity to be recommended are then used to determine the click-through rate of the entity to be recommended according to the query statement and the characteristics of the entity to be recommended using a preset deep neural network model. Therefore, by integrating the extracted features into the deep neural network model, the click-through rate of the entity to be recommended is estimated, which improves the accuracy of the click-through rate estimation, enables the recommendation system to provide services for users accurately, and improves the recommendation system. The quality of service of the system improves the user experience.

图8是本申请另一个实施例的基于人工智能的点击预估装置的结构示意图。FIG. 8 is a schematic structural diagram of an artificial intelligence-based click estimation device according to another embodiment of the present application.

如图8所示,在上述图7所示的基础上,该基于人工智能的点击预估装置,还包括:As shown in Figure 8, on the basis of the above-mentioned Figure 7, the artificial intelligence-based click estimation device also includes:

第一确定模块81,用于确定所述待推荐实体的展现位置;The first determining module 81 is configured to determine the display position of the entity to be recommended;

第二确定模块82,用于根据所述点击率及所述待推荐实体展现位置,确定所述待推荐实体的点击标签。The second determination module 82 is configured to determine the click label of the entity to be recommended according to the click rate and the display position of the entity to be recommended.

具体的,上述第一确定模块81,具体用于:Specifically, the above-mentioned first determination module 81 is specifically used for:

根据所述用户的历史行为日志,确定所述待推荐实体的展现位置。According to the historical behavior log of the user, the display position of the entity to be recommended is determined.

相应的,上述第二确定模块82,具体用于:Correspondingly, the above-mentioned second determination module 82 is specifically used for:

根据所述点击率及所述待推荐实体展现位置,确定所述待推荐实体对应的修正向量;determining a correction vector corresponding to the entity to be recommended according to the click-through rate and the display position of the entity to be recommended;

利用第二预设的运算规则,对所述修正向量进行运算,确定所述待推荐实体对应的点击标签。The correction vector is calculated by using a second preset operation rule to determine the click label corresponding to the entity to be recommended.

需要说明的是,前述对基于人工智能的点击预估方法实施例的解释说明也适用于该实施例的基于人工智能的点击预估装置,此处不再赘述。It should be noted that the foregoing explanations of the embodiment of the artificial intelligence-based click estimation method are also applicable to the artificial intelligence-based click estimation device of this embodiment, and details are not repeated here.

本申请实施例的基于人工智能的点击预估装置,在获取到与用户输入的查询语句对应的待推荐实体的属性信息后,首先采用不同的切词粒度,对查询语句及推荐实体的属性信息进行切词处理,确定查询语句及待推荐实体分别包括的不同粒度的分词,然后通过向量映射,确定查询语句及待推荐实体的分词分别对应的向量表示,再通过线性变换及非线性变换,确定待推荐实体的点击率,然后再根据待推荐实体的展现位置,对待推荐实体是否被点击进行预估。由此,实现了利用DNN模型,对待推荐实体的点击率及点击标签进行预测,提高了待推荐实体点击率预估的准确性,提高了推荐系统的服务质量,改善了用户体验。In the artificial intelligence-based click prediction device of the embodiment of the present application, after obtaining the attribute information of the entity to be recommended corresponding to the query statement input by the user, firstly, different word segmentation granularities are used to analyze the query statement and the attribute information of the recommended entity. Carry out word segmentation processing to determine the word segmentation of different granularities included in the query statement and the entity to be recommended, and then determine the vector representations corresponding to the word segmentation of the query statement and the entity to be recommended through vector mapping, and then determine through linear transformation and nonlinear transformation Click rate of the entity to be recommended, and then predict whether the entity to be recommended is clicked based on the display position of the entity to be recommended. As a result, the DNN model is used to predict the click-through rate and click tags of the recommended entity, which improves the accuracy of the predicted click-through rate of the recommended entity, improves the service quality of the recommendation system, and improves the user experience.

基于上述各实施例,本申请再一个实施例提供一种基于人工智能的点击预估设备,包括:Based on the above-mentioned embodiments, another embodiment of the present application provides an artificial intelligence-based click estimation device, including:

处理器;processor;

用于存储处理器可执行指令的存储器;memory for storing processor-executable instructions;

其中,所述处理器被配置为执行以下操作:根据用户输入的查询语句,获取待推荐实体的属性信息;Wherein, the processor is configured to perform the following operations: acquire the attribute information of the entity to be recommended according to the query statement input by the user;

对所述查询语句及待推荐实体的属性信息分别进行分词处理,确定所述查询语句及待推荐实体的特征;Perform word segmentation processing on the query statement and the attribute information of the entity to be recommended, and determine the characteristics of the query statement and the entity to be recommended;

利用预设的深度神经网络模型,根据所述查询语句及待推荐实体的特征,确定所述待推荐实体的点击率。Using a preset deep neural network model, according to the query statement and the characteristics of the entity to be recommended, determine the click rate of the entity to be recommended.

进一步地,本申请实施例还提供一种非临时性计算机可读存储介质,当所述存储介质中的指令由移动终端的处理器被执行时,使得移动终端能够如上实施例中的基于人工智能的点击预估方法。Further, the embodiment of the present application also provides a non-transitory computer-readable storage medium. When the instructions in the storage medium are executed by the processor of the mobile terminal, the mobile terminal can The click estimation method for .

进一步地,本申请实施例还提供一种计算机程序产品,当所述计算机程序产品中的指令处理器执行时,执行一种如上述实施例所示的基于人工智能的点击预估方法。Further, the embodiment of the present application also provides a computer program product, and when the instruction processor in the computer program product executes, an artificial intelligence-based click estimation method as shown in the above-mentioned embodiments is executed.

在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本申请的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任一个或多个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of this specification, descriptions with reference to the terms "one embodiment", "some embodiments", "example", "specific examples", or "some examples" mean that specific features described in connection with the embodiment or example , structure, material or characteristic is included in at least one embodiment or example of the present application. In this specification, the schematic representations of the above terms are not necessarily directed to the same embodiment or example. Furthermore, the described specific features, structures, materials or characteristics may be combined in any suitable manner in any one or more embodiments or examples. In addition, those skilled in the art can combine and combine different embodiments or examples and features of different embodiments or examples described in this specification without conflicting with each other.

此外,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。在本申请的描述中,“多个”的含义是至少两个,例如两个,三个等,除非另有明确具体的限定。In addition, the terms "first" and "second" are used for descriptive purposes only, and cannot be interpreted as indicating or implying relative importance or implicitly specifying the quantity of indicated technical features. Thus, the features defined as "first" and "second" may explicitly or implicitly include at least one of these features. In the description of the present application, "plurality" means at least two, such as two, three, etc., unless otherwise specifically defined.

流程图中或在此以其他方式描述的任何过程或方法描述可以被理解为,表示包括一个或更多个用于实现定制逻辑功能或过程的步骤的可执行指令的代码的模块、片段或部分,并且本申请的优选实施方式的范围包括另外的实现,其中可以不按所示出或讨论的顺序,包括根据所涉及的功能按基本同时的方式或按相反的顺序,来执行功能,这应被本申请的实施例所属技术领域的技术人员所理解。Any process or method descriptions in flowcharts or otherwise described herein may be understood to represent a module, segment or portion of code comprising one or more executable instructions for implementing custom logical functions or steps of a process , and the scope of preferred embodiments of the present application includes additional implementations in which functions may be performed out of the order shown or discussed, including in substantially simultaneous fashion or in reverse order depending on the functions involved, which shall It should be understood by those skilled in the art to which the embodiments of the present application belong.

在流程图中表示或在此以其他方式描述的逻辑和/或步骤,例如,可以被认为是用于实现逻辑功能的可执行指令的定序列表,可以具体实现在任何计算机可读介质中,以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用,或结合这些指令执行系统、装置或设备而使用。就本说明书而言,"计算机可读介质"可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。计算机可读介质的更具体的示例(非穷尽性列表)包括以下:具有一个或多个布线的电连接部(电子装置),便携式计算机盘盒(磁装置),随机存取存储器(RAM),只读存储器(ROM),可擦除可编辑只读存储器(EPROM或闪速存储器),光纤装置,以及便携式光盘只读存储器(CDROM)。另外,计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质,因为可以例如通过对纸或其他介质进行光学扫描,接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得所述程序,然后将其存储在计算机存储器中。The logic and/or steps represented in the flowcharts or otherwise described herein, for example, can be considered as a sequenced listing of executable instructions for implementing logical functions, can be embodied in any computer-readable medium, For use with instruction execution systems, devices, or devices (such as computer-based systems, systems including processors, or other systems that can fetch instructions from instruction execution systems, devices, or devices and execute instructions), or in conjunction with these instruction execution systems, devices or equipment used. For the purposes of this specification, a "computer-readable medium" may be any device that can contain, store, communicate, propagate or transmit a program for use in or in conjunction with an instruction execution system, device or device. More specific examples (non-exhaustive list) of computer-readable media include the following: electrical connection with one or more wires (electronic device), portable computer disk case (magnetic device), random access memory (RAM), Read Only Memory (ROM), Erasable and Editable Read Only Memory (EPROM or Flash Memory), Fiber Optic Devices, and Portable Compact Disc Read Only Memory (CDROM). In addition, the computer-readable medium may even be paper or other suitable medium on which the program can be printed, since the program can be read, for example, by optically scanning the paper or other medium, followed by editing, interpretation or other suitable processing if necessary. The program is processed electronically and stored in computer memory.

应当理解,本申请的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中,多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。例如,如果用硬件来实现,和在另一实施方式中一样,可用本领域公知的下列技术中的任一项或他们的组合来实现:具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路,具有合适的组合逻辑门电路的专用集成电路,可编程门阵列(PGA),现场可编程门阵列(FPGA)等。It should be understood that each part of the present application may be realized by hardware, software, firmware or a combination thereof. In the embodiments described above, various steps or methods may be implemented by software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, it can be implemented by any one or combination of the following techniques known in the art: Discrete logic circuits, ASICs with suitable combinational logic gates, programmable gate arrays (PGAs), field programmable gate arrays (FPGAs), etc.

本技术领域的普通技术人员可以理解实现上述实施例方法携带的全部或部分步骤是可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,该程序在执行时,包括方法实施例的步骤之一或其组合。Those of ordinary skill in the art can understand that all or part of the steps carried by the methods of the above embodiments can be completed by instructing related hardware through a program, and the program can be stored in a computer-readable storage medium. During execution, one or a combination of the steps of the method embodiments is included.

此外,在本申请各个实施例中的各功能单元可以集成在一个处理模块中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。所述集成的模块如果以软件功能模块的形式实现并作为独立的产品销售或使用时,也可以存储在一个计算机可读取存储介质中。In addition, each functional unit in each embodiment of the present application may be integrated into one processing module, each unit may exist separately physically, or two or more units may be integrated into one module. The above-mentioned integrated modules can be implemented in the form of hardware or in the form of software function modules. If the integrated modules are realized in the form of software function modules and sold or used as independent products, they can also be stored in a computer-readable storage medium.

上述提到的存储介质可以是只读存储器,磁盘或光盘等。尽管上面已经示出和描述了本申请的实施例,可以理解的是,上述实施例是示例性的,不能理解为对本申请的限制,本领域的普通技术人员在本申请的范围内可以对上述实施例进行变化、修改、替换和变型。The storage medium mentioned above may be a read-only memory, a magnetic disk or an optical disk, and the like. Although the embodiments of the present application have been shown and described above, it can be understood that the above embodiments are exemplary and should not be construed as limitations on the present application, and those skilled in the art can make the above-mentioned The embodiments are subject to changes, modifications, substitutions and variations.

Claims (12)

1. A click pre-estimation method based on artificial intelligence is characterized by comprising the following steps:
acquiring attribute information of an entity to be recommended according to a query statement input by a user;
performing word segmentation processing on the attribute information of the query statement and the entity to be recommended respectively, and determining the characteristics of the query statement and the entity to be recommended;
determining the click rate of the entity to be recommended according to the query statement and the characteristics of the entity to be recommended by using a preset deep neural network model;
determining the click rate of the entity to be recommended according to the query statement and the characteristics of the entity to be recommended by using a preset deep neural network model, wherein the determining comprises the following steps:
Determining vectors corresponding to the query statement and the entity to be recommended respectively by utilizing a preset mapping relation between the participles and the vectors according to the participles with different granularities respectively included by the query statement and the entity to be recommended;
calculating vectors corresponding to the query statement and the entity to be recommended respectively by using a first preset operation rule, and determining the click rate of the entity to be recommended;
the method for calculating the vectors corresponding to the query statement and the entity to be recommended respectively by using the first preset operation rule comprises the following steps:
carrying out linear transformation and nonlinear transformation on vectors respectively corresponding to the query statement and the entity to be recommended, and determining new vectors respectively corresponding to the query statement and the entity to be recommended;
and determining the click rate of the entity to be recommended according to the inner products of the new vectors respectively corresponding to the query statement and the entity to be recommended.
2. the method of claim 1, wherein the performing word segmentation processing on the attribute information of the query statement and the entity to be recommended respectively to determine the feature values of the query statement and the entity to be recommended comprises:
And according to different word cutting particle sizes, performing word segmentation processing on the attribute information of the query sentence and the entity to be recommended respectively, and determining word segments with different particle sizes, which are respectively included by the query sentence and the entity to be recommended.
3. the method of claim 1, wherein after determining the click-through rate of the entity to be recommended, further comprising:
Determining the display position of the entity to be recommended;
And determining the click label of the entity to be recommended according to the click rate and the display position of the entity to be recommended.
4. the method of claim 3, wherein the determining the presentation location of the entity to be recommended comprises:
And determining the display position of the entity to be recommended according to the historical behavior log of the user.
5. The method of claim 3, wherein the determining the click label of the entity to be recommended according to the click rate and the presentation position of the entity to be recommended comprises:
Determining a correction vector corresponding to the entity to be recommended according to the click rate and the display position of the entity to be recommended;
Calculating the correction vector by using a second preset calculation rule, and determining a click label corresponding to the entity to be recommended;
the calculating the correction vector by using a second preset calculation rule, and determining the click label corresponding to the entity to be recommended includes:
and performing linear transformation and nonlinear transformation on the correction vector, and determining a click label corresponding to the entity to be recommended.
6. The method according to any of claims 1-5, wherein the attribute information of the entity to be recommended comprises at least one of the following information: the name of the entity to be recommended, the recommendation reason and the identification of the entity to be recommended.
7. the utility model provides a click and predict device based on artificial intelligence which characterized in that includes:
the acquisition module is used for acquiring attribute information of the entity to be recommended according to the query statement input by the user;
The word segmentation module is used for respectively carrying out word segmentation on the attribute information of the query statement and the entity to be recommended and determining the characteristics of the query statement and the entity to be recommended;
the processing module is used for determining the click rate of the entity to be recommended according to the query statement and the characteristics of the entity to be recommended by utilizing a preset deep neural network model;
Wherein the processing module comprises:
The determining unit is used for determining vectors corresponding to the query statement and the entity to be recommended respectively according to the participles with different granularities respectively included by the query statement and the entity to be recommended by utilizing a preset mapping relation between the participles and the vectors;
The operation unit is used for operating the vectors corresponding to the query statement and the entity to be recommended respectively by utilizing a first preset operation rule, and determining the click rate of the entity to be recommended;
Wherein, the arithmetic unit is specifically configured to:
Carrying out linear transformation and nonlinear transformation on vectors respectively corresponding to the query statement and the entity to be recommended, and determining new vectors respectively corresponding to the query statement and the entity to be recommended;
and determining the click rate of the entity to be recommended according to the inner products of the new vectors respectively corresponding to the query statement and the entity to be recommended.
8. The apparatus of claim 7, wherein the word segmentation module is specifically configured to:
And according to different word cutting particle sizes, performing word segmentation processing on the attribute information of the query sentence and the entity to be recommended respectively, and determining word segments with different particle sizes, which are respectively included by the query sentence and the entity to be recommended.
9. The apparatus of claim 7, further comprising:
The first determination module is used for determining the display position of the entity to be recommended;
and the second determining module is used for determining the click label of the entity to be recommended according to the click rate and the display position of the entity to be recommended.
10. the apparatus of claim 9, wherein the first determining module is specifically configured to:
And determining the display position of the entity to be recommended according to the historical behavior log of the user.
11. The apparatus of claim 10, wherein the second determining module is specifically configured to:
determining a correction vector corresponding to the entity to be recommended according to the click rate and the display position of the entity to be recommended;
Calculating the correction vector by using a second preset calculation rule, and determining a click label corresponding to the entity to be recommended;
the calculating the correction vector by using a second preset calculation rule, and determining the click label corresponding to the entity to be recommended includes:
And performing linear transformation and nonlinear transformation on the correction vector, and determining a click label corresponding to the entity to be recommended.
12. The apparatus according to any of claims 7-10, wherein the attribute information of the entity to be recommended includes at least one of the following information: the name of the entity to be recommended, the recommendation reason and the identification of the entity to be recommended.
CN201610972619.8A 2016-10-28 2016-10-28 Click estimation method and device based on artificial intelligence Active CN106339510B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610972619.8A CN106339510B (en) 2016-10-28 2016-10-28 Click estimation method and device based on artificial intelligence

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610972619.8A CN106339510B (en) 2016-10-28 2016-10-28 Click estimation method and device based on artificial intelligence

Publications (2)

Publication Number Publication Date
CN106339510A CN106339510A (en) 2017-01-18
CN106339510B true CN106339510B (en) 2019-12-06

Family

ID=57841712

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610972619.8A Active CN106339510B (en) 2016-10-28 2016-10-28 Click estimation method and device based on artificial intelligence

Country Status (1)

Country Link
CN (1) CN106339510B (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107463704B (en) * 2017-08-16 2021-05-07 北京百度网讯科技有限公司 Search method and device based on artificial intelligence
CN108255954A (en) * 2017-12-20 2018-07-06 广州优视网络科技有限公司 Using search method, device, storage medium and terminal
CN110415006B (en) * 2018-04-28 2022-03-08 阿里巴巴(中国)有限公司 Advertisement click rate estimation method and device
CN108875022B (en) * 2018-06-20 2021-03-02 北京奇艺世纪科技有限公司 Video recommendation method and device
CN109255070B (en) * 2018-08-01 2022-04-12 百度在线网络技术(北京)有限公司 Recommendation information processing method and device, computer equipment and storage medium
CN109359247B (en) * 2018-12-07 2021-07-06 广州市百果园信息技术有限公司 Content pushing method, storage medium and computer equipment
CN109784537B (en) * 2018-12-14 2019-12-06 北京达佳互联信息技术有限公司 advertisement click rate estimation method and device, server and storage medium
CN109829116B (en) * 2019-02-14 2021-07-30 北京达佳互联信息技术有限公司 Content recommendation method and device, server and computer readable storage medium
CN110399979B (en) * 2019-06-17 2022-05-13 深圳大学 Click rate pre-estimation system and method based on field programmable gate array
CN110390052B (en) * 2019-07-25 2022-10-28 腾讯科技(深圳)有限公司 Search recommendation method, training method, device and equipment of CTR (China train redundancy report) estimation model
CN110909182B (en) * 2019-11-29 2023-05-09 北京达佳互联信息技术有限公司 Multimedia resource searching method, device, computer equipment and storage medium
CN111445283B (en) * 2020-03-25 2023-09-01 北京百度网讯科技有限公司 Digital person processing method, device and storage medium based on interaction device
CN112559892A (en) * 2020-08-03 2021-03-26 广州知弘科技有限公司 Terminal information recommendation system based on big data
CN111967599B (en) * 2020-08-25 2023-07-28 百度在线网络技术(北京)有限公司 Method, apparatus, electronic device and readable storage medium for training model
CN113435523B (en) * 2021-06-29 2023-09-26 北京百度网讯科技有限公司 Method, device, electronic device and storage medium for predicting content click-through rate

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103425691A (en) * 2012-05-22 2013-12-04 阿里巴巴集团控股有限公司 Search method and search system
CN104750713A (en) * 2013-12-27 2015-07-01 阿里巴巴集团控股有限公司 Method and device for sorting search results
CN105045906A (en) * 2015-08-07 2015-11-11 百度在线网络技术(北京)有限公司 Estimation method and device of click rate of delivery information
CN105046515A (en) * 2015-06-26 2015-11-11 深圳市腾讯计算机系统有限公司 Advertisement ordering method and device

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102346745B (en) * 2010-08-02 2014-04-02 阿里巴巴集团控股有限公司 Method and device for predicting user behavior number for words
CN104572734B (en) * 2013-10-23 2019-04-30 腾讯科技(深圳)有限公司 Method for recommending problem, apparatus and system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103425691A (en) * 2012-05-22 2013-12-04 阿里巴巴集团控股有限公司 Search method and search system
CN104750713A (en) * 2013-12-27 2015-07-01 阿里巴巴集团控股有限公司 Method and device for sorting search results
CN105046515A (en) * 2015-06-26 2015-11-11 深圳市腾讯计算机系统有限公司 Advertisement ordering method and device
CN105045906A (en) * 2015-08-07 2015-11-11 百度在线网络技术(北京)有限公司 Estimation method and device of click rate of delivery information

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于深度学习的搜索广告点击率预测方法研究;李思琴;《中国优秀硕士学位论文全文数据库(电子期刊)信息科技辑》;20160215;第2016年卷(第2期);全文 *

Also Published As

Publication number Publication date
CN106339510A (en) 2017-01-18

Similar Documents

Publication Publication Date Title
CN106339510B (en) Click estimation method and device based on artificial intelligence
US20230237088A1 (en) Automatically detecting user-requested objects in digital images
JP7127120B2 (en) Video classification method, information processing method and server, and computer readable storage medium and computer program
JP6588572B2 (en) Information recommendation method and information recommendation device
CN108255857B (en) Statement detection method and device
WO2019174422A1 (en) Method for analyzing entity association relationship, and related apparatus
WO2021174811A1 (en) Prediction method and prediction apparatus for traffic flow time series
CN110543592B (en) Information searching method and device and computer equipment
CN110084374A (en) Construct method, apparatus and prediction technique, device based on the PU model learnt
CN110196908A (en) Data classification method, device, computer installation and storage medium
CN108121795A (en) User's behavior prediction method and device
CN106649774A (en) Artificial intelligence-based object pushing method and apparatus
CN107330023A (en) Content of text based on focus recommends method and apparatus
CN109766550B (en) Text brand recognition method, recognition device and storage medium
US11972578B2 (en) Method and system for object tracking using online training
CN103617146B (en) A kind of machine learning method and device based on hardware resource consumption
CN112347361B (en) Method for recommending object, neural network, training method, training equipment and training medium thereof
CN111178533B (en) Method and device for realizing automatic semi-supervised machine learning
CN111144109A (en) Text similarity determination method and device
CN114792256B (en) Crowd expansion method and device based on model selection
CN115422346A (en) Multi-round topic recommendation method based on knowledge graph
CN108510326A (en) Determination method or initial value and device
CN111125272B (en) Regional characteristic acquisition method, regional characteristic acquisition device, computer equipment and medium
CN112749293A (en) Image classification method and device and storage medium
CN110851600A (en) Text data processing method and device based on deep learning

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant