
CN113919333B - Text knowledge supplementation method and device based on knowledge graph - Google Patents


Info

Publication number
CN113919333B
CN113919333B (application CN202111235816.9A)
Authority
CN
China
Prior art keywords
vector
text
concept
weight
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111235816.9A
Other languages
Chinese (zh)
Other versions
CN113919333A (en)
Inventor
吴天博
王健宗
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Technology Shenzhen Co Ltd
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd
Priority to CN202111235816.9A
Publication of CN113919333A
Application granted
Publication of CN113919333B
Legal status: Active

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/42Data-driven translation
    • G06F40/44Statistical methods, e.g. probability models
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Animal Behavior & Ethology (AREA)
  • Databases & Information Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract


The embodiment of the present application provides a method and device for supplementing text knowledge based on a knowledge graph, the method comprising: obtaining and concatenating the character vector, word vector and topic vector of the text, and inputting them into a bidirectional GRU to obtain a hidden state; using a self-attention mechanism to process the hidden state, obtaining a feature matrix, and converting it into a feature vector through a pooling layer; calling the knowledge graph to conceptualize the text, obtaining a concept set including a concept vector; calculating the relationship weight between the concept vector and the feature vector; calculating the importance weight of the concept vector in the concept set; adjusting the corresponding relationship weight using the importance weight, and performing weighted calculation on each concept vector according to the adjusted relationship weight to obtain a concept set feature, so as to supplement the text with knowledge through the concept set feature. The present application expands the text features from the character granularity, word granularity and text granularity levels, and uses the knowledge graph to make up for the lack of text context information.

Description

Text knowledge supplementing method and device based on knowledge graph
Technical Field
The application relates to the field of natural language processing, in particular to a text knowledge supplementing method and device based on a knowledge graph.
Background
Text classification is a widely used natural language processing technique. By utilizing text classification, implicit information in massive texts in a network can be quickly mined, and the method is widely applied to the fields of information retrieval, question-answering systems, dialogue systems and the like.
Short texts on the web are brief, lack context, and are imprecisely worded, which makes semantic understanding difficult and features sparse in text classification. A knowledge graph can effectively alleviate the feature-sparsity problem of such text, but existing text knowledge supplementing methods that introduce external knowledge still suffer from the low quality of the introduced knowledge, so the text still lacks context information and the semantic understanding effect remains poor.
Disclosure of Invention
The application aims to solve the problems in the prior art to at least a certain extent, and provides a text knowledge supplementing method, a device, a computer device and a computer readable storage medium based on a knowledge graph, which can improve the quality of introduced knowledge so as to fully and effectively supplement the knowledge to a text.
The technical scheme of the embodiment of the application is as follows:
in a first aspect, the present application provides a text knowledge supplementing method based on a knowledge graph, the method comprising:
acquiring a character vector, a word vector and a theme vector of a text;
splicing the character vector, the word vector and the theme vector to obtain a word vector matrix;
inputting the word vector matrix to a bidirectional Gated Recurrent Unit (GRU) network to output a hidden state through the bidirectional GRU network;
processing the hidden state by using a self-attention mechanism to obtain a feature matrix of the text;
Invoking a knowledge graph to perform conceptual processing on the text to obtain a concept set comprising concept vectors;
inputting the feature matrix to a pooling layer to output feature vectors through the pooling layer;
calculating a relationship weight between the concept vector and the feature vector using an attention mechanism;
calculating importance weights of the concept vectors in the concept set using a self-attention mechanism;
and adjusting the corresponding relation weight by using the importance weight, and carrying out weighted calculation on each concept vector according to the adjusted relation weight to obtain concept set characteristics so as to carry out knowledge supplement on the text through the concept set characteristics.
According to some embodiments of the application, the relationship weight is calculated as:

αi = softmax(wa^T · tanh(Wa · [ci ; q] + b1))

where αi is the relationship weight between the i-th concept vector in the concept set and the feature vector q of the text, ci is the i-th concept vector in the concept set, i is an integer greater than 1, Wa is a weight matrix, wa is a weight vector, da is a hyper-parameter, and b1 is a bias.
According to some embodiments of the application, the importance weight is calculated as:

βi = softmax(wb^T · tanh(Wb · ci + b2))

where βi is the importance weight of the i-th concept vector within the concept set, ci is the i-th concept vector in the concept set, Wb is a weight matrix, wb is a weight vector, db is a hyper-parameter, and b2 is a bias.
According to some embodiments of the application, the calculation formula for adjusting the corresponding relation weight by using the importance weight is:
αi=softmax(γαi+(1-γ)βi)
where γ is an adaptive coefficient based on the neural network, and γ ∈ (0, 1).
According to some embodiments of the application, the method for obtaining the character vector of the text comprises the following steps:
establishing a convolutional neural network (CNN) model with character granularity as the input unit, and extracting character features of each word in the word sequence of the text through the CNN model to obtain a character vector.
According to some embodiments of the application, the method for obtaining the word vector of the text comprises the following steps:
and mapping words in the text into word vectors.
According to some embodiments of the application, the method for obtaining the topic vector of the text comprises the following steps:
inputting the text into a word2vec (word-to-vector) model to obtain a plurality of word vectors corresponding to the text;
inputting each word vector into a pre-trained Latent Dirichlet Allocation (LDA) model, so that the LDA model outputs a plurality of topics corresponding to the word vector and probability distribution values corresponding to the topics;
Selecting a topic with the largest probability distribution value from the topics as a target topic, and acquiring a topic word file corresponding to the target topic, wherein the topic word file comprises a plurality of topic words and probability values corresponding to the topic words;
Sorting the plurality of subject words according to the probability values from large to small, and selecting the first K subject words as target subject words;
acquiring the weight of each target subject word according to the probability value of each target subject word;
according to the weight of each target subject word, weighting operation is carried out on K target subject words, so that subject features corresponding to the word vectors are obtained;
And obtaining the topic vector corresponding to the text according to the topic feature corresponding to each word vector.
In a second aspect, the present application provides a text knowledge supplementing device based on a knowledge graph, including:
the vector acquisition module is used for acquiring character vectors, word vectors and theme vectors of the text;
the vector splicing module is used for splicing the character vector, the word vector and the theme vector to obtain a word vector matrix;
The hidden state acquisition module is used for inputting the word vector matrix into a bidirectional GRU network so as to output a hidden state through the bidirectional GRU network;
the hidden state processing module is used for processing the hidden state by using a self-attention mechanism to obtain a feature matrix of the text;
The text knowledge acquisition module is used for calling a knowledge graph to perform conceptual processing on the text to obtain a concept set comprising concept vectors;
The feature matrix processing module is used for inputting the feature matrix into a pooling layer so as to output feature vectors through the pooling layer;
A first weight acquisition module for calculating a relationship weight between the concept vector and the feature vector using an attention mechanism;
A second weight acquisition module for calculating importance weights of the concept vectors in the concept set using a self-attention mechanism;
and the text knowledge supplementing module is used for adjusting the corresponding relation weight by utilizing the importance weight, and carrying out weighted calculation on each concept vector according to the adjusted relation weight to obtain concept set characteristics so as to supplement knowledge to the text through the concept set characteristics.
In a third aspect, the present application provides a computer device comprising a memory and a processor, the memory having stored therein computer readable instructions which, when executed by one or more of the processors, cause the one or more processors to perform the steps of the method as described in any of the first aspects above.
In a fourth aspect, the present application also provides a computer readable storage medium readable and writable by a processor, the storage medium storing computer instructions which when executed by one or more processors cause the one or more processors to perform the steps of a method as described in any of the first aspects above.
The technical scheme provided by the embodiment of the application has the following beneficial effects:
According to the embodiment of the application, the character vector, the word vector and the topic vector of the text are acquired and spliced into a word vector matrix, which is used as the input of the bidirectional GRU network so that the hidden state is output through the network. The hidden state is then processed with a self-attention mechanism to obtain the feature matrix of the text, and the knowledge graph is invoked to conceptualize the text, obtaining a concept set comprising concept vectors. In addition, the feature matrix is input to the pooling layer to output a feature vector, the relationship weight between each concept vector and the feature vector is calculated with an attention mechanism, and the importance weight of each concept vector in the concept set is calculated with a self-attention mechanism; the corresponding relationship weights are then adjusted with the importance weights, and each concept vector is weighted according to the adjusted relationship weights to obtain the concept set feature, through which knowledge is supplemented to the text. This text knowledge supplementing method models the text at character, word and text granularity, expanding the text features, and introduces knowledge from the knowledge graph by associating the knowledge in the text with the graph. Adjusting the relationship weights with the importance weights increases the weight of key concept vectors more reasonably and reduces incorrect concepts introduced by entity ambiguity or irrelevant noise in the knowledge graph. Classifying text based on this method therefore improves the quality of the introduced knowledge and supplements knowledge more fully and effectively, so that the text is classified more accurately.
Drawings
FIG. 1 is a flow chart of a method for supplementing text knowledge based on a knowledge graph according to an embodiment of the application;
FIG. 2 is a flow chart of a text knowledge supplement method based on a knowledge graph according to another embodiment of the application;
FIG. 3 is a schematic diagram of a text classification model provided by an embodiment of the application;
FIG. 4 is a schematic diagram of a text knowledge supplement apparatus based on a knowledge graph according to an embodiment of the application;
fig. 5 is a schematic structural diagram of a computer device according to an embodiment of the present application.
Detailed Description
The present application will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present application more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the application.
In the embodiments of the present application, "at least one" means one or more, and "a plurality" means two or more. "A and/or B" describes an association between objects and covers three cases: A alone, A and B together, or B alone, where A and B may each be singular or plural. The character "/" generally indicates an "or" relationship between the associated objects. "At least one of the following items" and similar expressions mean any combination of those items, including any combination of single or plural items; for example, at least one of a, b and c may represent a, b, c, a and b, a and c, b and c, or a, b and c, where a, b and c may each be single or plural.
It should be appreciated that embodiments of the present application may acquire and process relevant data based on artificial intelligence techniques. Artificial intelligence (AI) is the theory, method, technique, and application system that uses a digital computer or a digital-computer-controlled machine to simulate, extend, and expand human intelligence, sense the environment, acquire knowledge, and use knowledge to obtain optimal results. AI infrastructure technologies generally include sensors, dedicated AI chips, cloud computing, distributed storage, big data processing, operation/interaction systems, and mechatronics. AI software technologies mainly include computer vision, robotics, biometric recognition, speech processing, natural language processing, and machine learning/deep learning.
Referring to fig. 1, fig. 1 shows a flow chart of a text knowledge supplementing method based on a knowledge graph according to an embodiment of the present application. It can be understood that the method of the embodiment of the application can be applied to a server, can also be applied to a terminal, can also be applied to a system comprising the terminal and the server, and can be realized through interaction of the terminal and the server. As shown in fig. 1, the method comprises the steps of:
In step S110, a character vector, a word vector, and a topic vector of the text are acquired.
In some embodiments, a CNN model with the granularity of characters as an input unit is established, and character feature extraction is performed on each word in the word sequence in the text through the CNN model to obtain a character vector.
Morphological information in the characters of a word, such as a prefix or suffix, can be extracted using a convolutional neural network, and character embeddings can provide additional information for words lacking word vectors by serving as an extension of the word vector. The character-level embedding of each word is trained and concatenated with the word vector as input to the text encoding model. For a text with word sequence {x1, x2, …, xi, …, xn}, where xi denotes the i-th word containing a character sequence of length L with character embedding vectors cj, the character sequence of each word xi is processed with a standard convolutional neural network:

ei^c = max( tanh( WCNN · cj:j+ke-1 + bCNN ) )

where WCNN and bCNN are training parameters, ke is the convolution kernel size, the window cj:j+ke-1 is the run of ke consecutive character embeddings starting at position j, and max denotes max pooling over all positions j.
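As a concrete illustration, the following is a minimal PyTorch sketch of this character-level extraction; the layer sizes, vocabulary size and padding choice are illustrative assumptions rather than parameters fixed by the application:

```python
import torch
import torch.nn as nn

class CharCNN(nn.Module):
    """Character-level feature extractor: character embeddings, a 1-D
    convolution (playing the role of W_CNN / b_CNN) and max-over-time pooling."""
    def __init__(self, n_chars=128, char_dim=30, out_dim=50, ke=3):
        super().__init__()
        self.char_emb = nn.Embedding(n_chars, char_dim)
        # conv.weight / conv.bias correspond to W_CNN / b_CNN above
        self.conv = nn.Conv1d(char_dim, out_dim, kernel_size=ke, padding=ke // 2)

    def forward(self, char_ids):                      # char_ids: (batch, L)
        c = self.char_emb(char_ids)                   # (batch, L, char_dim)
        h = torch.tanh(self.conv(c.transpose(1, 2)))  # (batch, out_dim, L)
        return h.max(dim=2).values                    # max pooling -> (batch, out_dim)

# One word of 6 characters -> a 50-dimensional character vector
vec = CharCNN()(torch.randint(0, 128, (1, 6)))
print(vec.shape)  # torch.Size([1, 50])
```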
In some embodiments, the word vector of the text is obtained by mapping the words in the text to word vectors.
The words in the word sequence {x1, x2, …, xi, …, xn} of the text are mapped one by one into word vectors; it will be appreciated that a word2vec model may be utilized to map each word in the word sequence to a word vector.
In some embodiments, referring to fig. 2, fig. 2 shows a flow chart of a text knowledge supplement method based on a knowledge graph, where the method for obtaining a topic vector of a text is as follows:
And step S210, inputting the text into a word2vec model to obtain a plurality of word vectors corresponding to the text.
Each word in the text is mapped to a vector using the word2vec model, obtaining a word vector corresponding to each word of the text.
Step S220, for each word vector, respectively inputting the word vectors into a pre-trained LDA model so that the LDA model outputs a plurality of topics corresponding to the word vector and probability distribution values corresponding to the topics.
It can be understood that topic model training on a text training set with an LDA model yields a topic-word file comprising X topics, with Y topic words and their probability values for each topic, where X and Y are integers greater than 1; inputting the word vector of a text word into the trained LDA model then yields a plurality of topics corresponding to that word vector and the probability distribution over those topics.
For example, 500 news articles are selected as a text training set. The training set is first cleaned (stop words and non-Chinese special characters are removed), then word segmentation is performed, each article is converted into a numeric representation, and the result is used to train the LDA model. LDA is an unsupervised learning method, and only the number of topics is specified during training, for example 4. After training, a topic-word file is obtained that includes the 4 topics and, for each of them, a plurality of topic words with probabilities, e.g. (0, "0.08*fund + 0.014*market + 0.013*company + 0.012*investment + 0.011*stock + 0.010*dividend + …"), where 0 denotes the first topic and 0.08 is the probability distribution value of the topic word "fund" under the first topic. Word vectors of text words are then input into the trained LDA model to obtain a topic distribution such as [(0, 0.07), (1, 0.46), (2, 0.24), (3, 0.23)]; taking (1, 0.46) as an example, 1 denotes the second topic and 0.46 the probability that the input word vector belongs to that topic.
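For readers who want to reproduce the LDA step, the following hedged sketch uses gensim's standard bag-of-words LDA as a stand-in for the trained topic model described above; the toy corpus and all variable names are illustrative assumptions:

```python
from gensim import corpora
from gensim.models import LdaModel

# Toy stand-in for the 500 cleaned and word-segmented news articles
docs = [["fund", "market", "investment"], ["stock", "dividend", "company"]]
dictionary = corpora.Dictionary(docs)
corpus = [dictionary.doc2bow(doc) for doc in docs]

# Unsupervised training; only the topic number is specified (4 in the example)
lda = LdaModel(corpus=corpus, id2word=dictionary, num_topics=4, passes=10)

# Topic-word file: per-topic lists of "prob*word" terms
print(lda.print_topics(num_words=6))

# Topic distribution of a new document, e.g. [(0, 0.07), (1, 0.46), ...]
bow = dictionary.doc2bow(["fund", "stock"])
print(lda.get_document_topics(bow, minimum_probability=0.0))
```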
Step S230, selecting a topic with the largest probability distribution value from the topics as a target topic, and acquiring a topic word file corresponding to the target topic, wherein the topic word file comprises a plurality of topic words and probability values corresponding to the topic words.
For example, when the topic probability distribution is [ (0,0.07), (1,0.46), (2,0.24), (3,0.23) ], it is known that the topic with the largest probability distribution value is the second topic, the second topic is taken as the target topic, and the topic word file including the target topic and Y topic words and probability values corresponding to the target topic is obtained.
Step S240, the plurality of subject words are ranked according to the probability values from large to small, and the first K subject words are selected as target subject words.
After a subject word file comprising a target subject, subject words corresponding to the subject and a probability value is acquired, sorting the subject words according to the probability value from large to small, and selecting the first K subject words as the target subject words, wherein K is an integer larger than 1.
Illustratively, a subject word file comprising the target subject and its subject words with probability values is obtained, and the subject words are sorted by probability value from largest to smallest, e.g. (0, "0.08*fund + 0.014*market + 0.013*company + 0.012*investment + 0.011*stock + 0.010*dividend + …"); the first 5 subject words are selected as target subject words, i.e., fund, market, company, investment and stock.
Step S250, according to the probability value of each target subject word, the weight of each target subject word is obtained.
The probability values {p1, p2, …, pK} of the target subject words {r1, r2, …, rK} are normalized to serve as the weights of the target subject words.
Illustratively, when the probability values corresponding to the K = 5 target subject words are (0.08, 0.014, 0.013, 0.012, 0.011), the probability values can be normalized with the following formula:

qi = pi / (p1 + p2 + … + pK)

where qi is the normalized value of the probability value pi, i.e., the weight corresponding to the i-th target subject word.
And step S260, carrying out weighting operation on K target subject words according to the weight of each target subject word to obtain subject features corresponding to the word vectors.
It can be appreciated that the subject feature of each subject word is obtained by multiplying the weight of the target subject word by the word vector corresponding to the target subject word.
Step S270, according to the topic features corresponding to each word vector, obtaining the topic vector corresponding to the text.
Illustratively, the topic vector t is calculated as:

t = Σ (m = 1..K) qm × C(rm)

where K is the number of target subject words, qm is the weight of the m-th target subject word, C(rm) is the word vector of that target subject word, and qm × C(rm) is its subject feature. The topic feature is thus obtained by summing the weighted subject features of the target subject words.
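Steps S240 to S270 can be summarized in a short sketch; the function and variable names below are illustrative assumptions, and the word embeddings C(r_m) are assumed to be given:

```python
import numpy as np

def topic_vector(topic_words, word_vec, k=5):
    """topic_words: list of (word, probability) pairs for the target topic;
    word_vec: dict mapping a word to its embedding C(r_m)."""
    # S240: sort by probability and keep the top-K subject words
    top = sorted(topic_words, key=lambda wp: wp[1], reverse=True)[:k]
    probs = np.array([p for _, p in top])
    weights = probs / probs.sum()          # S250: normalized weights q_i
    # S260/S270: weighted sum of the subject-word vectors
    return sum(q * word_vec[w] for (w, _), q in zip(top, weights))

# Illustrative call with random 50-dimensional embeddings
words = [("fund", 0.08), ("market", 0.014), ("company", 0.013),
         ("investment", 0.012), ("stock", 0.011), ("dividend", 0.010)]
emb = {w: np.random.randn(50) for w, _ in words}
print(topic_vector(words, emb).shape)      # (50,)
```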
And step S120, splicing the character vector, the word vector and the theme vector to obtain a word vector matrix.
Illustratively, the character vector, word vector and topic vector of each word are concatenated as Ei = [ei^c ; ei^w ; ei^t], where ei^c, ei^w and ei^t denote the character vector, the word vector and the topic vector of the i-th word of the text respectively, thereby obtaining the word vector matrix E = (E1, E2, …, En).
Step S130, inputting the word vector matrix into a bidirectional GRU network to output hidden states through the bidirectional GRU network.
It can be understood that the word vector matrix E = (E1, E2, …, En) is used as the input of the bidirectional GRU network: the forward GRU reads the input sequence in forward order (E1, E2, …, En), and the reverse GRU reads it in reverse order (En, En-1, …, E1). At each time t, the gated recurrent unit processes the input vector Et to produce a forward hidden state h→t and a reverse hidden state h←t, and the forward hidden state at each moment is concatenated with the reverse hidden state at that moment to obtain the hidden state at that moment: ht = [h→t ; h←t].
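In PyTorch terms, the forward and reverse hidden states and their concatenation are produced by a single bidirectional GRU layer; the dimensions below are illustrative assumptions:

```python
import torch
import torch.nn as nn

emb_dim, hidden = 200, 128          # illustrative sizes
bigru = nn.GRU(emb_dim, hidden, batch_first=True, bidirectional=True)

E = torch.randn(1, 10, emb_dim)     # word vector matrix E = (E_1, ..., E_n)
H, _ = bigru(E)                     # H[:, t] = [forward h_t ; reverse h_t]
print(H.shape)                      # torch.Size([1, 10, 256]) = 2 * hidden
```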
And step S140, processing the hidden state by using a self-attention mechanism to obtain a feature matrix of the text.
The hidden state at each time step is processed with a self-attention mechanism: the words input at each time step are re-weighted according to the attention calculation so that important words receive higher weight, and the feature matrix of the text is obtained.
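A minimal sketch of this step is given below; the application does not fix the attention variant, so the scaled dot-product form used here is an illustrative assumption:

```python
import torch
import torch.nn.functional as F

def self_attend(H):
    """Re-weight the BiGRU hidden states H (n, d) by their relevance to
    each other, so important time steps receive higher weight."""
    scores = H @ H.T / H.size(-1) ** 0.5   # (n, n) attention scores
    A = F.softmax(scores, dim=-1)
    return A @ H                           # feature matrix of the text

H = torch.randn(10, 256)                   # 10 time steps of BiGRU states
print(self_attend(H).shape)                # torch.Size([10, 256])
```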
And step S150, invoking a knowledge graph to perform conceptualization processing on the text to obtain a concept set comprising concept vectors.
Keywords in the text are matched against entities in the knowledge graph and mapped to target entities through entity linking, yielding the concept set of the text; the concept set is then vectorized to obtain a concept set comprising concept vectors.
Illustratively, keywords in the text are mapped to entities of the encyclopedic knowledge graph CN-DBpedia or the Microsoft Concept Graph; the text keywords are mapped to target entities through entity linking, yielding a text concept set C = (c1, c2, …, ci, …, cm) comprising concept vectors, where ci denotes the i-th concept vector in the concept set.
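As a toy illustration of this step, a hand-written dictionary stands in for the CN-DBpedia / Microsoft Concept Graph lookup and the entity-linking step; everything in this sketch, including the embedding function, is an assumption for exposition:

```python
import numpy as np

# Toy stand-in for the knowledge graph: keyword -> isA concepts
kg = {
    "fund":  ["financial product", "investment vehicle"],
    "stock": ["security", "financial product"],
}

def conceptualize(keywords, embed):
    """Map text keywords to target concepts via the KG lookup, then
    vectorize to obtain the concept set C = (c_1, ..., c_m)."""
    concepts = [c for w in keywords if w in kg for c in kg[w]]
    return np.stack([embed(c) for c in concepts])

# Illustrative call with random embeddings standing in for concept vectors
emb = lambda c: np.random.randn(100)
print(conceptualize(["fund", "stock"], emb).shape)   # (4, 100)
```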
Step S160, inputting the feature matrix to a pooling layer to output feature vectors through the pooling layer.
The feature matrix of the text is compressed into a feature vector by the max-pooling layer. Illustratively, the feature matrix h′ ∈ R^(n×2n) is transformed into the feature vector q ∈ R^(2n) through the max-pooling layer.
Step S170, calculating a relationship weight between the concept vector and the feature vector using an attention mechanism.
In some embodiments, the relationship weight is calculated as:

αi = softmax(wa^T · tanh(Wa · [ci ; q] + b1))

where αi is the relationship weight between the i-th concept vector in the concept set and the feature vector q of the text, ci is the i-th concept vector in the concept set, i is an integer greater than 1, Wa is a weight matrix, wa is a weight vector, da is a hyper-parameter, and b1 is a bias.
Step S180, calculating importance weights of the concept vectors in the concept set using a self-attention mechanism.
A self-attention mechanism is introduced over the concept set and an attention calculation is performed to obtain the importance weight of each concept vector within the whole concept set. It should be noted that the attention mechanism gives larger weights to important concepts and smaller weights to unimportant ones, so as to highlight the important concepts in the concept set.
In some embodiments, the importance weight is calculated as:

βi = softmax(wb^T · tanh(Wb · ci + b2))

where βi is the importance weight of the i-th concept vector within the concept set, ci is the i-th concept vector in the concept set, Wb is a weight matrix, wb is a weight vector, db is a hyper-parameter, and b2 is a bias.
And step S190, adjusting the corresponding relation weight by using the importance weight, and carrying out weighted calculation on each concept vector according to the adjusted relation weight to obtain a concept set feature so as to carry out knowledge supplement on the text through the concept set feature.
In some embodiments, the calculation formula for adjusting the corresponding relation weight by using the importance weight is:
αi=softmax(γαi+(1-γ)βi)
where γ is an adaptive coefficient based on the neural network, and γ ∈ (0, 1).
The importance weight is utilized to adjust the relation weight, so that the semantic relevance of the text and the corresponding concept set is better calculated, the weight of the key concept vector is more reasonably increased, and incorrect concepts introduced due to ambiguity or irrelevant noise of the entity are reduced.
A weighted sum of the concept vectors is calculated according to the adjusted relationship weights, thereby obtaining the semantic vector representing the concepts, i.e., the concept set feature:

p = Σ (i = 1..m) αi · ci

where m is the total number of concept vectors in the concept set, ci is the i-th concept vector in the concept set, and αi is the adjusted relationship weight of the corresponding concept vector ci.
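Steps S170 to S190 can be combined into one module; the following PyTorch sketch uses the additive-attention form reconstructed above, so the exact formula and all dimension names are assumptions consistent with, but not guaranteed identical to, the application's implementation:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ConceptAttention(nn.Module):
    """Relation weights, importance weights, adaptive fusion, and the
    weighted sum that yields the concept set feature."""
    def __init__(self, d_c, d_q, d_a=64, d_b=64):
        super().__init__()
        self.W_a = nn.Linear(d_c + d_q, d_a)      # weight matrix Wa + bias b1
        self.w_a = nn.Linear(d_a, 1, bias=False)  # weight vector wa
        self.W_b = nn.Linear(d_c, d_b)            # weight matrix Wb + bias b2
        self.w_b = nn.Linear(d_b, 1, bias=False)  # weight vector wb
        self.g = nn.Parameter(torch.tensor(0.0))  # gamma, kept in (0,1) via sigmoid

    def forward(self, C, q):
        # C: (m, d_c) concept vectors; q: (d_q,) text feature vector
        m = C.size(0)
        cq = torch.cat([C, q.unsqueeze(0).expand(m, -1)], dim=1)
        alpha = F.softmax(self.w_a(torch.tanh(self.W_a(cq))).squeeze(1), dim=0)
        beta = F.softmax(self.w_b(torch.tanh(self.W_b(C))).squeeze(1), dim=0)
        gamma = torch.sigmoid(self.g)
        a = F.softmax(gamma * alpha + (1 - gamma) * beta, dim=0)  # adjusted weights
        return a @ C                              # concept set feature p

C = torch.randn(7, 100)          # 7 concept vectors
q = torch.randn(256)             # text feature vector from the pooling layer
print(ConceptAttention(100, 256)(C, q).shape)   # torch.Size([100])
```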
Referring to fig. 3, fig. 3 shows a schematic structural diagram of a text classification model according to an embodiment of the present application, and a text knowledge supplementing method based on a knowledge graph according to an embodiment of the present application is described on the text classification model.
The text classification model integrates a knowledge graph, an attention mechanism and a bidirectional GRU network, and mainly comprises two parts:
In the first part, text encoding: after the character vector, word vector and topic vector of the text are acquired, they are spliced into a word vector matrix, which is input to the bidirectional GRU network to output hidden states; the hidden states are finally processed through a self-attention layer to obtain the feature matrix of the text.
In the second part, knowledge conceptualization encoding: the knowledge graph is invoked to conceptualize the text, yielding a concept set comprising concept vectors, and the feature matrix is input to the pooling layer to output the feature vector. The relationship weight between each concept vector and the feature vector is calculated with an attention mechanism, and the importance weight of each concept vector in the concept set is calculated with a self-attention mechanism. The corresponding relationship weights are adjusted with the importance weights, raising the weights of the concept vectors closely related to the text and reducing the introduction of incorrect concepts caused by entity ambiguity or irrelevant noise, and each concept vector is weighted according to the adjusted relationship weights to obtain the concept set feature.
It will further be appreciated that although operations are depicted in the drawings in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order, or that all illustrated operations be performed, to achieve desirable results. In certain circumstances, multitasking and parallel processing may be advantageous.
According to the embodiment of the application, the character vector, the word vector and the topic vector of the text are acquired and spliced into a word vector matrix, which is used as the input of the bidirectional GRU network so that the hidden state is output through the network. The hidden state is then processed with a self-attention mechanism to obtain the feature matrix of the text, and the knowledge graph is invoked to conceptualize the text, obtaining a concept set comprising concept vectors. In addition, the feature matrix is input to the pooling layer to output a feature vector, the relationship weight between each concept vector and the feature vector is calculated with an attention mechanism, and the importance weight of each concept vector in the concept set is calculated with a self-attention mechanism; the corresponding relationship weights are then adjusted with the importance weights, and each concept vector is weighted according to the adjusted relationship weights to obtain the concept set feature, through which knowledge is supplemented to the text. This text knowledge supplementing method models the text at character, word and text granularity, expanding the text features, and introduces knowledge from the knowledge graph by associating the knowledge in the text with the graph. Adjusting the relationship weights with the importance weights increases the weight of key concept vectors more reasonably and reduces incorrect concepts introduced by entity ambiguity or irrelevant noise in the knowledge graph. Classifying text based on this method therefore improves the quality of the introduced knowledge and supplements knowledge more fully and effectively, so that the text is classified more accurately.
Referring to fig. 4, a text knowledge supplementing device 200 based on a knowledge graph according to an embodiment of the present application includes:
a vector obtaining module 210, configured to obtain a character vector, a word vector, and a topic vector of a text;
a vector stitching module 220, configured to stitch the character vector, the word vector, and the topic vector to obtain a word vector matrix;
a hidden state acquisition module 230, configured to input the word vector matrix to a bidirectional GRU network, so as to output a hidden state through the bidirectional GRU network;
a hidden state processing module 240, configured to process the hidden state by using a self-attention mechanism to obtain a feature matrix of the text;
the text knowledge acquisition module 250 is configured to invoke a knowledge graph to perform conceptual processing on the text, so as to obtain a concept set including concept vectors;
a feature matrix processing module 260, configured to input the feature matrix to a pooling layer, so as to output a feature vector through the pooling layer;
A first weight acquisition module 270 for calculating a relationship weight between the concept vector and the feature vector using an attention mechanism;
a second weight acquisition module 280 for calculating importance weights of the concept vectors in the concept set using a self-attention mechanism;
The text knowledge supplementing module 290 is configured to adjust the corresponding relation weight by using the importance weight, and perform weighted calculation on each concept vector according to the adjusted relation weight, so as to obtain a concept set feature, so as to supplement knowledge to the text through the concept set feature.
In some embodiments, the vector acquisition module specifically includes:
A character vector acquisition unit for acquiring a character vector of a text;
a word vector acquisition unit for acquiring a word vector of a text;
And the theme vector acquisition unit is used for acquiring the theme vector of the text.
In some embodiments, the text knowledge supplementing module specifically includes:
the weight adjusting unit is used for adjusting the corresponding relation weight by utilizing the importance weight;
and the weighting calculation unit is used for carrying out weighting calculation on each concept vector according to the adjusted relation weight to obtain the concept set characteristics.
And the knowledge supplementing unit is used for supplementing knowledge to the text through the concept set characteristics.
According to the embodiment of the application, the vector acquisition module acquires the character vector, the word vector and the topic vector of the text, and the vector splicing module splices them into a word vector matrix. The hidden state acquisition module feeds the word vector matrix to the bidirectional GRU network to output hidden states, the hidden state processing module processes the hidden states with a self-attention mechanism to obtain the feature matrix of the text, and the text knowledge acquisition module invokes the knowledge graph to conceptualize the text, obtaining a concept set comprising concept vectors. In addition, the feature matrix processing module inputs the feature matrix to the pooling layer to output a feature vector, the first weight acquisition module calculates the relationship weight between each concept vector and the feature vector using an attention mechanism, and the second weight acquisition module calculates the importance weight of each concept vector in the concept set using a self-attention mechanism. Finally, the text knowledge supplementing module adjusts the corresponding relationship weights using the importance weights and performs a weighted calculation on each concept vector according to the adjusted relationship weights to obtain the concept set feature, through which knowledge is supplemented to the text. The device models the text at character, word and text granularity, expanding the text features, and introduces knowledge from the knowledge graph by associating the knowledge in the text with the graph. Adjusting the relationship weights with the importance weights increases the weight of key concept vectors more reasonably and reduces incorrect concepts introduced by entity ambiguity or irrelevant noise in the knowledge graph; classifying text on this basis improves the quality of the introduced knowledge and supplements it more fully and effectively, so that the text is classified more accurately.
It should be noted that, because the content of information interaction and execution process between the modules/units is based on the same concept as the method embodiment of the present application, specific functions and technical effects thereof may be referred to in the method embodiment section, and will not be described herein.
Fig. 5 shows a computer device 300 provided by an embodiment of the application. The computer device 300 may be a server or a terminal, and the internal structure of the computer device 300 includes, but is not limited to:
a memory 310 for storing a program;
The processor 320 is configured to execute the program stored in the memory 310, and when the processor 320 executes the program stored in the memory 310, the processor 320 is configured to execute the above-described text knowledge supplement method based on the knowledge graph.
The processor 320 and the memory 310 may be connected by a bus or other means.
The memory 310 serves as a non-transitory computer readable storage medium that can be used to store non-transitory software programs and non-transitory computer executable programs, such as the knowledge-graph-based text knowledge supplement method described in any embodiment of the invention. The processor 320 implements the above-described knowledge graph-based text knowledge supplement method by running non-transitory software programs and instructions stored in the memory 310.
The memory 310 may include a memory program area that may store an operating system, an application program required for at least one function, and a memory data area that may store a text knowledge supplement method based on a knowledge graph as described above. In addition, memory 310 may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid state storage device. In some implementations, memory 310 may optionally include memory located remotely from processor 320, which may be connected to processor 320 via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The non-transitory software programs and instructions required to implement the knowledge-graph-based text knowledge supplement method described above are stored in the memory 310, which when executed by the one or more processors 320, perform the knowledge-graph-based text knowledge supplement method provided by any embodiment of the invention.
The embodiment of the application also provides a computer-readable storage medium which stores computer-executable instructions for executing the text knowledge supplementing method based on the knowledge graph.
In one embodiment, the storage medium stores computer-executable instructions that are executed by one or more control processors, for example, by one or more processors 320 in the computer device 300, so that the one or more processors 320 perform the method for supplementing text knowledge based on knowledge graph according to any embodiment of the invention.
The embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separate, i.e. may be located in one place, or may be distributed over a plurality of network elements. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
Those of ordinary skill in the art will appreciate that all or some of the steps, systems, and methods disclosed above may be implemented as software, firmware, hardware, and suitable combinations thereof. Some or all of the physical components may be implemented as software executed by a processor, such as a central processing unit, digital signal processor, or microprocessor, or as hardware, or as an integrated circuit, such as an application specific integrated circuit. Such software may be distributed on computer readable media, which may include computer storage media (or non-transitory media) and communication media (or transitory media). The term computer storage media includes both volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data, as known to those skilled in the art. Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital Versatile Disks (DVD) or other optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by a computer. Furthermore, as is well known to those of ordinary skill in the art, communication media typically include computer readable instructions, data structures, program modules, or other data in a modulated data signal, such as a carrier wave or other transport mechanism, and may include any information delivery media.
The preferred embodiments of the present invention have been described in detail, but the present invention is not limited to the above embodiments. Those skilled in the art will appreciate that various equivalent modifications and substitutions may be made without departing from the spirit of the present invention, and such modifications and substitutions are intended to fall within the scope of the present invention as defined in the following claims.

Claims (8)

1. A text knowledge supplementing method based on a knowledge graph, characterized in that the method comprises:
acquiring a character vector, a word vector and a topic vector of a text;
splicing the character vector, the word vector and the topic vector to obtain a word vector matrix;
inputting the word vector matrix to a bidirectional GRU network, so as to output a hidden state through the bidirectional GRU network;
processing the hidden state with a self-attention mechanism to obtain a feature matrix of the text;
invoking a knowledge graph to conceptualize the text to obtain a concept set comprising concept vectors;
inputting the feature matrix to a pooling layer, so as to output a feature vector through the pooling layer;
calculating a relationship weight between each concept vector and the feature vector using an attention mechanism;
calculating an importance weight of each concept vector in the concept set using a self-attention mechanism;
adjusting the corresponding relationship weight using the importance weight, and performing a weighted calculation on each concept vector according to the adjusted relationship weight to obtain a concept set feature, so as to supplement the text with knowledge through the concept set feature;
wherein the relationship weight is calculated as
αi = softmax(wa^T · tanh(Wa · [ci ; q] + b1))
where αi is the relationship weight between the i-th concept vector in the concept set and the feature vector q of the text, ci denotes the i-th concept vector in the concept set, i is an integer greater than 1, Wa is a weight matrix, wa is a weight vector, da is a hyper-parameter, and b1 is a bias;
and the importance weight is calculated as
βi = softmax(wb^T · tanh(Wb · ci + b2))
where βi is the importance weight of the i-th concept vector within the concept set, ci denotes the i-th concept vector in the concept set, Wb is a weight matrix, wb is a weight vector, db is a hyper-parameter, and b2 is a bias.
2. The method according to claim 1, characterized in that the corresponding relationship weight is adjusted using the importance weight according to:
αi = softmax(γαi + (1-γ)βi)
where γ is an adaptive coefficient based on the neural network, γ ∈ (0, 1).
3. The method according to claim 1, characterized in that the character vector of the text is obtained as follows:
establishing a CNN model with character granularity as the input unit, and extracting character features of each word in the word sequence of the text through the CNN model to obtain the character vector.
4. The method according to claim 1, characterized in that the word vector of the text is obtained as follows:
mapping the words in the text to word vectors.
5. The method according to claim 1, characterized in that the topic vector of the text is obtained as follows:
inputting the text into a word2vec model to obtain a plurality of word vectors corresponding to the text;
inputting each word vector into a pre-trained LDA model, so that the LDA model outputs a plurality of topics corresponding to the word vector and probability distribution values corresponding to the topics;
selecting the topic with the largest probability distribution value from the topics as a target topic, and obtaining a topic-word file corresponding to the target topic, the topic-word file comprising a plurality of topic words and the probability values corresponding to the topic words;
sorting the plurality of topic words by probability value from largest to smallest, and selecting the first K topic words as target topic words;
obtaining the weight of each target topic word according to the probability value of each target topic word;
performing a weighted operation on the K target topic words according to the weight of each target topic word to obtain a topic feature corresponding to the word vector;
obtaining the topic vector corresponding to the text according to the topic features corresponding to the word vectors.
6. A text knowledge supplementing device based on a knowledge graph, characterized by comprising:
a vector acquisition module for acquiring a character vector, a word vector and a topic vector of a text;
a vector splicing module for splicing the character vector, the word vector and the topic vector to obtain a word vector matrix;
a hidden state acquisition module for inputting the word vector matrix to a bidirectional GRU network, so as to output a hidden state through the bidirectional GRU network;
a hidden state processing module for processing the hidden state with a self-attention mechanism to obtain a feature matrix of the text;
a text knowledge acquisition module for invoking a knowledge graph to conceptualize the text to obtain a concept set comprising concept vectors;
a feature matrix processing module for inputting the feature matrix to a pooling layer, so as to output a feature vector through the pooling layer;
a first weight acquisition module for calculating a relationship weight between each concept vector and the feature vector using an attention mechanism;
a second weight acquisition module for calculating an importance weight of each concept vector in the concept set using a self-attention mechanism;
a text knowledge supplementing module for adjusting the corresponding relationship weight using the importance weight, and performing a weighted calculation on each concept vector according to the adjusted relationship weight to obtain a concept set feature, so as to supplement the text with knowledge through the concept set feature;
wherein the relationship weight is calculated as
αi = softmax(wa^T · tanh(Wa · [ci ; q] + b1))
where αi is the relationship weight between the i-th concept vector in the concept set and the feature vector q of the text, ci denotes the i-th concept vector in the concept set, i is an integer greater than 1, Wa is a weight matrix, wa is a weight vector, da is a hyper-parameter, and b1 is a bias;
and the importance weight is calculated as
βi = softmax(wb^T · tanh(Wb · ci + b2))
where βi is the importance weight of the i-th concept vector within the concept set, ci denotes the i-th concept vector in the concept set, Wb is a weight matrix, wb is a weight vector, db is a hyper-parameter, and b2 is a bias.
7. A computer device, characterized by comprising:
a memory for storing a program; and
a processor for executing the program stored in the memory, wherein when the processor executes the program stored in the memory, the processor is configured to perform the method according to any one of claims 1 to 5.
8. A computer-readable storage medium, characterized in that it stores computer-executable instructions for performing the method according to any one of claims 1 to 5.
CN202111235816.9A 2021-10-22 2021-10-22 Text knowledge supplementation method and device based on knowledge graph Active CN113919333B (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
CN202111235816.9A | 2021-10-22 | 2021-10-22 | Text knowledge supplementation method and device based on knowledge graph

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
CN202111235816.9A | 2021-10-22 | 2021-10-22 | Text knowledge supplementation method and device based on knowledge graph

Publications (2)

Publication Number | Publication Date
CN113919333A (en) | 2022-01-11
CN113919333B (en) | 2025-04-04

Family

ID=79242459

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
CN202111235816.9A (Active, CN113919333B) | Text knowledge supplementation method and device based on knowledge graph | 2021-10-22 | 2021-10-22

Country Status (1)

Country Link
CN (1) CN113919333B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114742052A (en) * 2022-04-25 2022-07-12 平安普惠企业管理有限公司 Text subject extraction method, device, equipment and storage medium
CN117216268B (en) * 2023-08-29 2025-09-09 大连理工大学 Intelligent knowledge classification method
CN119692343B (en) * 2025-02-25 2025-06-17 小哆智能科技(北京)有限公司 A method and system for extracting topics from user question text

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110717334A (en) * 2019-09-10 2020-01-21 上海理工大学 Text emotion analysis method based on BERT model and double-channel attention
CN110781305A (en) * 2019-10-30 2020-02-11 北京小米智能科技有限公司 Text classification method and device based on classification model and model training method

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008046104A2 (en) * 2006-10-13 2008-04-17 Collexis Holding, Inc. Methods and systems for knowledge discovery
CN111223498A (en) * 2020-01-10 2020-06-02 平安科技(深圳)有限公司 Intelligent emotion recognition method and device and computer readable storage medium
CN112100401B (en) * 2020-09-14 2024-05-07 北京大学 Knowledge graph construction method, device, equipment and storage medium for science and technology services

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110717334A (en) * 2019-09-10 2020-01-21 上海理工大学 Text emotion analysis method based on BERT model and double-channel attention
CN110781305A (en) * 2019-10-30 2020-02-11 北京小米智能科技有限公司 Text classification method and device based on classification model and model training method

Also Published As

Publication number Publication date
CN113919333A (en) 2022-01-11

Similar Documents

Publication Publication Date Title
CN113919333B (en) Text knowledge supplementation method and device based on knowledge graph
CN109783817B (en) A Computational Model of Text Semantic Similarity Based on Deep Reinforcement Learning
CN110837738B (en) Method, device, computer equipment and storage medium for identifying similarity
US6173275B1 (en) Representation and retrieval of images using context vectors derived from image information elements
CN113239700A (en) Text semantic matching device, system, method and storage medium for improving BERT
US6760714B1 (en) Representation and retrieval of images using content vectors derived from image information elements
CN111400470A (en) Question processing method and device, computer equipment and storage medium
CN111221944B (en) Text intention recognition method, device, equipment and storage medium
CN113569094B (en) Video recommendation method, device, electronic device and storage medium
CN112257449A (en) Named entity recognition method and device, computer equipment and storage medium
CN110069627A (en) Classification method, device, electronic equipment and the storage medium of short text
CN109918507B (en) An Improved Text Classification Method Based on TextCNN
CN112100377B (en) Text classification method, apparatus, computer device and storage medium
CN114329029B (en) Object retrieval method, device, equipment and computer storage medium
CN112115702A (en) Intention recognition method, device, dialogue robot and computer readable storage medium
CN111881264B (en) A method and electronic device for long text retrieval in open domain question answering tasks
CN113704528B (en) Cluster center determining method, device and equipment and computer storage medium
CN113761868A (en) Text processing method and device, electronic equipment and readable storage medium
US20240249133A1 (en) Systems, apparatuses, methods, and non-transitory computer-readable storage devices for training artificial-intelligence models using adaptive data-sampling
CN111858878A (en) Method, system and storage medium for automatically extracting answer from natural language text
CN111639186A (en) Multi-class multi-label text classification model and device dynamically embedded with projection gate
CN118504643A (en) Compression method, device, storage medium and program product of neural network model
CN110728135A (en) Text theme indexing method and device, electronic equipment and computer storage medium
CN113849679B (en) Image retrieval method, device, electronic device and storage medium
CN116108140A (en) Method for matching legal provision by natural language

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant