[go: up one dir, main page]

CN103020267B - Based on the complex network community structure method for digging of triangular cluster multi-label - Google Patents

Based on the complex network community structure method for digging of triangular cluster multi-label Download PDF

Info

Publication number
CN103020267B
CN103020267B CN201210573195.XA CN201210573195A CN103020267B CN 103020267 B CN103020267 B CN 103020267B CN 201210573195 A CN201210573195 A CN 201210573195A CN 103020267 B CN103020267 B CN 103020267B
Authority
CN
China
Prior art keywords
label
network
update
nodes
array
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201210573195.XA
Other languages
Chinese (zh)
Other versions
CN103020267A (en
Inventor
李生红
赵郁忻
张爱新
刘超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Jiao Tong University
Original Assignee
Shanghai Jiao Tong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jiao Tong University filed Critical Shanghai Jiao Tong University
Priority to CN201210573195.XA priority Critical patent/CN103020267B/en
Publication of CN103020267A publication Critical patent/CN103020267A/en
Application granted granted Critical
Publication of CN103020267B publication Critical patent/CN103020267B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

Based on a complex network community mining method for triangular cluster multi-label, comprise the following steps: mutually disjoint triangular cluster in search network; For the initial state that the mutually different label of the Node configuration in different triangular cluster is propagated as label; According to many tag update rules, for all nodes in network, the propagation synchronously carrying out label upgrades; By the label array that obtains after upgrading with upgrade the label array obtained before and compare; If label is still in propagation, then proceed to upgrade; If label produces vibration, then proceed again after carrying out vibration Processing for removing to upgrade; If label no longer changes, then stop upgrading, obtain the community structure with overlap in network.Time complexity of the present invention is very low, is applicable to large-scale complex network.In addition, the present invention can detect the lap of community structure well, and has good robustness, and the accuracy of detection is very high.

Description

基于三角簇多标签传播的复杂网络社区结构挖掘方法Community Structure Mining Method in Complex Networks Based on Multi-label Propagation of Triangular Clusters

技术领域technical field

本发明涉及的是一种复杂网络领域的方法,具体是一种基于三角簇多标签传播的复杂网络社区结构挖掘方法。The invention relates to a method in the field of complex networks, in particular to a method for mining complex network community structures based on triangular cluster multi-label propagation.

背景技术Background technique

复杂网络是现实世界中复杂系统的一种抽象表现形式,复杂网络中的节点代表复杂系统中的个体,节点之间的连接则代表系统中个体之间按照某种规则自然形成或人为构造的一种关系。目前,复杂网络已经广泛应用于表征电力网络、通信网络、互联网、社交网络等各种复杂系统。A complex network is an abstract representation of complex systems in the real world. The nodes in a complex network represent the individuals in the complex system, and the connections between nodes represent the natural or man-made connections between individuals in the system according to certain rules. relationship. At present, complex networks have been widely used to represent various complex systems such as power networks, communication networks, the Internet, and social networks.

社区结构是复杂网络的一个重要的拓扑特性,整个复杂网络是由若干个社区结构组成,每个社区结构内部的节点连接非常紧密,而社区结构之间的连接则相对稀疏。复杂网络的社区结构对应于现实中拥有共同特点的功能单元或组织团体,例如在互联网中社区结构就是讨论共同话题的一些网站,而在社交网络中的社区结构就是拥有共同兴趣爱好的人组成的一个团体。因此,通过挖掘复杂网络中的社区结构来分析网络的特性和功能具有十分重要的现实意义。Community structure is an important topological characteristic of complex networks. The entire complex network is composed of several community structures. The nodes within each community structure are very tightly connected, while the connections between community structures are relatively sparse. The community structure of a complex network corresponds to functional units or organizational groups with common characteristics in reality. For example, the community structure in the Internet is some websites that discuss common topics, while the community structure in social networks is composed of people with common interests. a group. Therefore, it is of great practical significance to analyze the characteristics and functions of the network by mining the community structure in the complex network.

复杂网络的社区结构挖掘需要满足两个关键的性质:Community structure mining of complex networks needs to satisfy two key properties:

第一,复杂度低。复杂网络的规模往往非常庞大,可以包含几千甚至上万个节点,在这种规模的网络下,如果方法的复杂度较高,那么进行社区结构挖掘的时间开销将会非常大;First, the complexity is low. The scale of complex networks is often very large, and can contain thousands or even tens of thousands of nodes. Under such a network, if the complexity of the method is high, the time spent on community structure mining will be very large;

第二,有效的重叠结构检测机制。实际的复杂网络中,社区结构普遍存在重叠现象,即复杂网络中的一些节点可以同时属于多个社区结构,这就要求社区结构挖掘方法能够检测出复杂网络中社区结构的重叠部分。Second, an effective mechanism for detecting overlapping structures. In actual complex networks, community structures commonly overlap, that is, some nodes in complex networks can belong to multiple community structures at the same time, which requires community structure mining methods to be able to detect the overlapping parts of community structures in complex networks.

经文献检索发现,M.E.J.Newman和M.Girvan在文章“Communitystructureinsocialandbiologicalnetworks[J]”(《在社交和生物网络中的社区结构》)(Proc.Natl.Acad.Sci.USA99,7821-7826(2001))(美国科学院院刊)中提出了一种基于最短路径的社区挖掘方法。该方法首先通过计算网络中任意两点之间的最短路径,得到整个网络的最短路径表;然后利用该表统计每条边被最短路径通过的次数作为该边的权值,并移除网络中权值最大的边,之后重新计算整个网络中各条边的权值;重复以上步骤,直到整个网络被划分为合理的社区结构。该方案的检测精度不高,方法复杂度较高,并且无法检测具有重叠的社区结构。After literature search, M.E.J.Newman and M.Girvan found in the article "Community structure in social and biological networks [J]" ("Community structure in social and biological networks") (Proc.Natl.Acad.Sci.USA99,7821-7826(2001)) (Proceedings of the National Academy of Sciences) proposed a shortest path-based community mining method. This method first obtains the shortest path table of the entire network by calculating the shortest path between any two points in the network; Then recalculate the weight of each edge in the entire network; repeat the above steps until the entire network is divided into a reasonable community structure. The detection accuracy of this scheme is not high, the method complexity is high, and the community structure with overlap cannot be detected.

再经检索发现,FilippoRadicchi和ClaudioCastellano等人在文章“Definingandidentifyingcommunitiesinnetworks[J]”(《定义并识别网络中的社区结构》)(Proc.Natl.Acad.Sci.USA101,2658-2663(2004))(美国科学院院刊)中提出了一种基于局部网络拓扑结构的社区挖掘方案。其方法为:首先分析复杂网络中社区结构的局部拓扑特点,并将网络中的三角形结构作为网络社区最基本的结构,统计网络中每条边所从属的三角形结构的数量作为该边的权值;然后,移除网络中权值最大的边,重新计算整个网络中各条边的权值;重复以上步骤,直到整个网络被划分为合理的社区结构。该方案考虑到了网络拓扑结构的局部特点,检测精度有一定的提升,但是仍然无法解决复杂度高和社区结构重叠部分的检测这两个问题。After searching, it was found that Filippo Radicchi and Claudio Castellano et al. wrote in the article "Defining and identifying communities in networks [J]" ("Defining and identifying communities in networks") (Proc.Natl.Acad.Sci.USA101,2658-2663(2004)) (USA Proceedings of the Chinese Academy of Sciences) proposed a community mining scheme based on local network topology. The method is as follows: firstly analyze the local topological characteristics of the community structure in the complex network, take the triangular structure in the network as the most basic structure of the network community, and count the number of triangular structures to which each edge in the network belongs as the weight of the edge ; Then, remove the edge with the largest weight in the network, and recalculate the weight of each edge in the entire network; repeat the above steps until the entire network is divided into a reasonable community structure. This scheme takes into account the local characteristics of the network topology, and the detection accuracy has been improved to a certain extent, but it still cannot solve the two problems of high complexity and overlapping community structure detection.

再经检索发现,GergelyPalla和ImreDerenyi等人在文章“Uncoveringtheoverlappingcommunitystructureofcomplexnetworksinnatureandsociety[J]”(《挖掘自然和社会网络中具有重叠的社区结构》)(Nature435(7043),814-818(2005))(自然)中提出了一个基于完全子图的社区结构挖掘方法。对于复杂网络中的每个节点,首先计算包含该节点的所有完全子图,构建出这些完全子图的重叠矩阵;然后根据连通性对重叠矩阵进行过滤,从而得到具有重叠的社区结构。该方法虽然能够挖掘出具有重叠的社区结构,但是由于复杂度太高,并不具有实际的应用价值。After searching, it was found that GergelyPalla and ImreDerenyi et al. published the article "Uncovering the overlapping community structure of complex network sinnature and society [J]" ("Mining the overlapping community structure in natural and social networks") (Nature435(7043), 814-818(2005)) (Nature) A community structure mining method based on complete subgraph is proposed. For each node in the complex network, first calculate all the complete subgraphs containing the node, and construct the overlap matrix of these complete subgraphs; then filter the overlap matrix according to the connectivity, so as to obtain the community structure with overlap. Although this method can mine overlapping community structures, it has no practical application value due to its high complexity.

最后经检索发现,U.N.Raghavan和R.Albert等人在文章“Nearlineartimealgorithmtodetectcommunitystructuresinlarge-scalenetworks[J]”(应用于大规模网络中社区结构挖掘的一种接近线性时间复杂度的方法)(Phys.Rev.E76,036106(2007))(物理评论)中提出了一种应用于社区结构挖掘的标签传播方法。该方案初始化时首先给复杂网络中的每个节点分配一个独立的标签;之后在每次迭代时都将每个节点的标签更新为其邻居节点中占有比例最大的标签,一直到网络中所有节点的标签都不再发生改变时停止迭代,此时拥有相同标签的节点就属于同一个社区结构。由此,就可以划分出复杂网络的社区结构。该方法复杂度很低,具有接近线性的时间复杂度,可以适用于大规模的复杂网络。但是该方法的鲁棒性很差,检测精度不高,并且无法检测具有重叠的社区结构。Finally, after searching, it was found that U.N.Raghavan and R.Albert et al. wrote in the article "Nearlineartimealgorithmtodetectcommunitystructuresinlarge-scalenetworks[J]" (a method close to linear time complexity applied to community structure mining in large-scale networks) (Phys.Rev.E76 , 036106(2007)) (Physical Review) proposed a label propagation method applied to community structure mining. When the scheme is initialized, first assign an independent label to each node in the complex network; then update the label of each node to the label with the largest proportion among its neighbor nodes at each iteration, until all nodes in the network Stop iterating when the labels of all nodes no longer change. At this time, nodes with the same label belong to the same community structure. From this, the community structure of the complex network can be divided. The complexity of this method is very low, and it has nearly linear time complexity, which can be applied to large-scale complex networks. But this method has poor robustness, low detection accuracy, and cannot detect community structures with overlaps.

发明内容Contents of the invention

本发明的目的在于针对上述现有技术的不足,提出一个基于三角簇多标签传播的复杂网络社区结构挖掘方法。其主要思想是首先寻找社区结构的核心部分作为标签传播的起始状态,之后在标签的更新中采用多标签更新规则,保证社区结构重叠部分的检测效果。The object of the present invention is to propose a complex network community structure mining method based on triangular cluster multi-label propagation to address the above-mentioned deficiencies in the prior art. The main idea is to first find the core part of the community structure as the initial state of label propagation, and then use multi-label update rules in the label update to ensure the detection effect of the overlapping part of the community structure.

本发明是通过如下技术方案实现的:The present invention is achieved through the following technical solutions:

一种基于三角簇多标签传播的复杂网络社区挖掘方法,其特点在于:该方法包括以下步骤:搜索网络中互不相交的三角簇;为不同三角簇中的节点设置互不相同的标签作为标签传播的起始状态;按照多标签更新规则,对于网络中的所有节点,同步进行标签的传播更新;将更新后得到的标签数组与之前更新得到的标签数组进行比较;如果标签仍在传播,则继续进行更新;如果标签产生振荡,则进行振荡消除处理后再继续进行更新;如果标签不再改变,则停止更新,得到网络中的具有重叠的社区结构。A complex network community mining method based on triangular cluster multi-label propagation, which is characterized in that: the method includes the following steps: searching for mutually disjoint triangular clusters in the network; setting different labels for nodes in different triangular clusters as labels The initial state of propagation; according to the multi-label update rule, for all nodes in the network, the propagation update of the label is carried out synchronously; the label array obtained after the update is compared with the previously updated label array; if the label is still propagating, then Continue to update; if the label oscillates, perform oscillation elimination processing before continuing to update; if the label does not change anymore, stop updating to obtain an overlapping community structure in the network.

该方法的具体步骤如下:The concrete steps of this method are as follows:

①为待测的复杂网络G的每个节点赋予一个空标签;① Assign an empty label to each node of the complex network G to be tested;

②按下列公式计算每条边eij的三角簇系数TC(eij):②Calculate the triangular cluster coefficient TC(e ij ) of each edge e ij according to the following formula:

TC(eij)=|N(vi)∩N(vj)|TC(e ij )=|N(v i )∩N(v j )|

其中:N(vi)是节点vi的邻居节点的集合,∩是求交集的运算符,|·|是求集合基数的运算符,i为1、2、3、……,N;Among them: N(v i ) is the set of neighbor nodes of node v i , ∩ is an operator for seeking intersection, |·| is an operator for seeking set base, i is 1, 2, 3, ..., N;

③搜索出三角簇系数最大的边;③ Search out the side with the largest triangular cluster coefficient;

④将该边的三角簇中的所有节点赋予一个不同于网络中其他标签的基数为1的标签;之后将这些节点从网络中删除,得到新的网络G’,④ Give all nodes in the triangular cluster of the edge a label with a cardinality of 1 that is different from other labels in the network; then delete these nodes from the network to obtain a new network G',

⑤判断是否存在三角簇系数大于0的边,存在,则返回步骤②,否则将此时网络中所有节点的标签保存为一个标签数组L(0),作为标签传播的初始状态,进入步骤⑥;⑤ Determine whether there is an edge with a triangular cluster coefficient greater than 0, if it exists, then return to step ②, otherwise save the labels of all nodes in the network as a label array L (0) at this time, as the initial state of label propagation, enter step ⑥;

⑥利用下式同步进行标签的传播更新:⑥ Use the following formula to synchronize the propagation and update of tags:

LL ii (( tt )) == argarg maxmax ll ΣΣ vv jj ∈∈ NN (( vv ii )) δδ (( LL jj (( tt -- 11 )) ,, ll ))

δδ (( LL jj (( tt -- 11 )) ,, ll )) == 11 ,, ll ∈∈ LL jj (( tt -- 11 )) 00 ,, ll ∉∉ LL jj (( tt -- 11 ))

其中:是节点vi进行第t次更新后的标签,t为1以上的正整数,N(vi)是节点vi的邻居节点的集合,是节点vi的邻居节点vj在第t-1次更新后的标签,是求元素l的集合的运算符,所求得的l使得max后所定义的函数取得最大值。在第t次更新时,利用第t-1次更新得到的标签数组L(t-1),对于复杂网络G中的所有节点按照多标签更新的规则,同步进行标签的传播更新,并保存至新的标签数组L(t)in: is the label of node v i after the tth update, t is a positive integer greater than 1, N(v i ) is the set of neighbor nodes of node v i , is the label of node v i 's neighbor node v j after the t-1th update, It is an operator for finding the set of elements l, and the obtained l makes the function defined after max obtain the maximum value. In the t-time update, using the label array L (t-1) obtained from the t-1-th update, for all nodes in the complex network G, according to the multi-label update rule, the label propagation update is performed synchronously, and saved to new label array L (t) ;

⑦比较标签数组:将第t次更新得到的标签数组L(t)与之前t-1次更新得到的标签数组L(1),L(2),…,L(t-1)进行比较,⑦Comparing the label array: compare the label array L (t) obtained by the t-th update with the label array L (1) , L (2) ,...,L (t-1) obtained by the previous t-1 update,

当t次的标签数组均不相同,则返回步骤⑥;When the label arrays of t times are not the same, return to step ⑥;

当L(1),L(2),…,L(t-2)中存在和L(t)相同的标签数组,则进入表明更新出现了振荡,则进行步骤⑧;When L (1) , L (2) ,...,L (t-2) has the same label array as L (t) , it will enter to indicate that the update has oscillated, and then proceed to step ⑧;

当标签数组L(t-1)和L(t)相同,则进入步骤⑨When the label array L (t-1) and L (t) are the same, go to step ⑨

⑧振荡消除处理:假定标签数组L(1),L(2),…,L(t-2)中存在L(k)和L(t)相同,则在标签数组L(k+1),…,L(t)中搜索出所有从第k+1次更新至第t次更新标签发生过改变的节点,在L(t)中将这些节点的标签分别替换成不同于网络中其他标签的基数为1的标签;然后返回步骤⑥;⑧ Oscillation Elimination Processing: Assume that L (k) and L (t) are the same in the label array L (1) , L (2) , ..., L (t-2) , then in the label array L (k+1) , ...,L (t) searches out all the nodes whose labels have changed from the k+1th update to the tth update, and replaces the labels of these nodes in L (t) with different labels from other labels in the network The label whose cardinality is 1; then return to step ⑥;

⑨得到网络的具有重叠的社区结构,网络中拥有相同标签元素的节点属于同一个网络社区结构,标签基数大于1的节点即为网络社区结构的重叠部分。⑨ Obtain the overlapping community structure of the network, the nodes with the same label elements in the network belong to the same network community structure, and the nodes with label cardinality greater than 1 are the overlapping parts of the network community structure.

对本发明的技术解决方案作如下说明:The technical solution of the present invention is described as follows:

1.计算标签传播的初始状态:1. Calculate the initial state of label propagation:

对于待测的复杂网络G=(V,E),其中V={vi|i=1,2,...,N}是复杂网络G的节点集合,vi表示第i个节点,N为复杂网络中节点的数量,是复杂网络G的边集合,eij=(vi,vj)表示网络中连接节点vi和vj的边,为网络G中的每个节点vi(i=1,2,...,N)赋予一个空标签。首先,计算每条边eij(i,j=1,2,...,N)的三角簇系数;然后,搜索出三角簇系数最大的边,将该边的三角簇中的所有节点重新设置一个不同于网络中其他标签的基数为1的标签;之后将这些节点从网络中删除,得到新的网络G'=(V',E′);For the complex network G=(V,E) to be tested, where V={v i |i=1,2,...,N} is the node set of the complex network G, v i represents the i-th node, N is the number of nodes in the complex network, is the edge set of complex network G, e ij = (v i , v j ) represents the edge connecting nodes v i and v j in the network, and is each node v i (i=1,2,.. .,N) assigns an empty label. First, calculate the triangular cluster coefficient of each edge e ij (i,j=1,2,...,N); then, search for the edge with the largest triangular cluster coefficient, and re- Set a label with a cardinality of 1 that is different from other labels in the network; then delete these nodes from the network to obtain a new network G'=(V',E');

重复上述过程,直到网络中不存在三角簇系数大于0的边,将此时网络中所有节点的标签保存为一个标签数组作为标签传播的初始状态,其中表示节点vi在标签传播初始状态时的标签;Repeat the above process until there is no edge with a triangular cluster coefficient greater than 0 in the network, and save the labels of all nodes in the network at this time as a label array As the initial state propagated by labels, where Indicates the label of node vi when the label propagates the initial state;

2.标签的传播更新:2. Tag propagation update:

在第t次更新时,利用第t-1次更新得到的标签数组L(t-1),对于复杂网络G=(V,E)中的所有节点按照多标签更新的规则,同步进行标签的传播更新,并保存至新的标签数组 L ( t ) = [ L 1 t , L 2 t , . . . , L i t , . . . , L N t ] , 其中表示节点vi在第t次更新后的标签;In the t-time update, using the label array L (t-1) obtained from the t-1-th update, for all nodes in the complex network G=(V,E), the label is updated synchronously according to the rule of multi-label update Propagate updates and save to new labels array L ( t ) = [ L 1 t , L 2 t , . . . , L i t , . . . , L N t ] , in Indicates the label of node v i after the tth update;

3、标签数组比较:3. Tag array comparison:

将第t次更新得到的标签数组L(t)与之前t-1次更新得到的标签数组L(1),L(2),…,L(t-1)进行比较,Compare the label array L (t) obtained by the t-th update with the label array L (1) , L (2) ,...,L (t-1) obtained by the previous t-1 update,

当t次的标签数组均不相同,则继续按照步骤2进行标签的传播更新;When the label arrays of t times are not the same, continue to update the labels according to step 2;

当L(1),L(2),…,L(t-2)中存在和L(t)相同的标签数组,则表明更新出现了振荡,须进行振荡消除处理,之后再继续按照步骤2进行标签的传播更新;When L (1) , L (2) ,...,L (t-2) has the same label array as L (t) , it indicates that there is oscillation in the update, and the oscillation elimination process must be performed, and then continue to follow step 2 Propagate and update labels;

当标签数组L(t-1)和L(t)相同,则表明网络G=(V,E)中所有节点的标签都不再发生改变,此时终止标签的传播更新,网络中拥有相同标签元素的节点即属于同一个网络社区结构,标签基数大于1的节点即为网络社区结构的重叠部分。When the label array L (t-1) and L (t) are the same, it means that the labels of all nodes in the network G=(V,E) will no longer change. At this time, the propagation update of the label is terminated, and the network has the same label The nodes of the elements belong to the same network community structure, and the nodes whose label cardinality is greater than 1 are the overlapping parts of the network community structure.

所述的计算标签传播的初始状态。对于待测的复杂网络G=(V,E),其中V={vi|i=1,2,...,N是复杂网络G的节点集合,vi表示第i个节点,N为复杂网络中节点的数量,E={eij=(vi,vj)|i,j=1,2,...,N}是复杂网络G的边集合,eij=(vi,vj)表示网络中连接节点vi和vj的边,为网络G中的每个节点vi(i=1,2,...N,赋予一个空标签。首先,计算每条边eij(i,j=1,2,...,N)的三角簇系数;然后,搜索出三角簇系数最大的边,将该边的三角簇中的所有节点重新设置一个不同于网络中其他标签的基数为1的标签;之后将这些节点从网络中删除,得到新的网络G'=(V',E')。重复上述过程,直到网络中不存在三角簇系数大于0的边。将此时网络中所有节点的标签保存为一个标签数组作为标签传播的初始状态,其中表示节点vi在标签传播初始状态时的标签。具体为:The initial state of the computed label propagation. For the complex network G=(V,E) to be tested, where V={v i |i=1,2,..., N is the node set of the complex network G, vi represents the i-th node, and N is the complex The number of nodes in the network, E={e ij =(v i ,v j )|i,j=1,2,...,N} is the edge set of complex network G, e ij =(v i ,v j ) represents the edge connecting nodes v i and v j in the network, and assigns an empty label to each node v i (i=1,2,...N) in the network G. First, calculate each edge e ij (i,j=1,2,...,N) triangular cluster coefficient; then, search for the edge with the largest triangular cluster coefficient, and reset all nodes in the triangular cluster of the edge to a label different from other labels in the network Labels whose cardinality is 1; these nodes are then deleted from the network to obtain a new network G'=(V', E'). Repeat the above process until there is no edge with a triangular cluster coefficient greater than 0 in the network. This The labels of all nodes in the network are saved as a label array As the initial state propagated by labels, where Indicates the label of node v i at the initial state of label propagation. Specifically:

所述的边eij的三角簇,是指网络中包含边eij的所有三角形组成的拓扑结构。The triangle cluster of edge e ij refers to the topology structure composed of all triangles including edge e ij in the network.

所述的边eij的三角簇系数,是指网络中包含边eij的所有三角形的数量,其计算公式如下:The triangular cluster coefficient of the edge e ij refers to the number of all triangles including the edge e ij in the network, and its calculation formula is as follows:

TC(eij)=|N(vi)∩N(vj)|TC(e ij )=|N(v i )∩N(v j )|

其中,TC(eij)是边eij的三角簇系数,N(vi)是节点vi的邻居节点的集合,∩是求交集的运算符,|·|是求集合基数的运算符。Among them, TC(e ij ) is the triangular cluster coefficient of edge e ij , N(v i ) is the set of neighbor nodes of node v i , ∩ is an operator for intersection, and |·| is an operator for set base.

所述的节点vi的邻居节点,是指网络中和节点vi之间有连接边的节点。The neighbor nodes of the node v i refer to the nodes in the network that have connection edges with the node v i .

所述的集合的基数,是指集合中元素的个数。The cardinality of the set refers to the number of elements in the set.

所述的节点vi的标签,是一个集合Li={lik|k=1,2,...,K},其元素lik用来表征节点vi所属的社区结构的编号,K为集合的基数。如果该标签的基数为1,则表明节点vi只属于一个社区结构;如果该标签的基数大于1,则节点vi同时属于多个社区结构。The label of the node v i is a set L i ={ li ik |k=1,2,...,K}, whose element li ik is used to represent the number of the community structure to which the node v i belongs, K is the base of the set. If the cardinality of the label is 1, it indicates that the node v i only belongs to one community structure; if the cardinality of the label is greater than 1, the node v i belongs to multiple community structures at the same time.

所述的空标签,是指集合为空集的标签,即基数为0的标签。节点拥有的标签为空标签,则说明该节点不属于任何社区结构。The empty label refers to a label whose set is an empty set, that is, a label whose cardinality is 0. If the label owned by the node is an empty label, it means that the node does not belong to any community structure.

所述的标签数组,是一个1×N的数组,用于保存网络中所有节点的标签。其中,N为网络中节点的数量,表示在标签传播初始状态时所有节点所拥有的标签,为标签传播初始状态时节点vi所拥有的标签;表示第t次更新后所有节点所拥有的标签,为第t次更新后节点vi所拥有的标签。The label array is a 1×N array for storing labels of all nodes in the network. Among them, N is the number of nodes in the network, Represents the labels owned by all nodes when labels propagate the initial state, The label owned by node v i when propagating the initial state for the label; Indicates the labels owned by all nodes after the tth update, is the label owned by node v i after the tth update.

所述的标签的传播更新。在第t次更新时,利用第t-1次更新得到的标签数组L(t-1),对于复杂网络G=(V,E)中的所有节点按照多标签更新的规则,同步进行标签的传播更新,并保存至新的标签数组其中表示节点vi在第t次更新后的标签。具体为:The propagating update of the label. In the t-time update, using the label array L (t-1) obtained from the t-1-th update, for all nodes in the complex network G=(V,E), the label is updated synchronously according to the rule of multi-label update Propagate updates and save to new labels array in Indicates the label of node v i after the tth update. Specifically:

所述的多标签更新规则为:The described multi-label update rule is:

LL ii (( tt )) == argarg maxmax ll ΣΣ vv jj ∈∈ NN (( vv ii )) δδ (( LL jj (( tt -- 11 )) ,, ll ))

δδ (( LL jj (( tt -- 11 )) ,, ll )) == 11 ,, ll ∈∈ LL jj (( tt -- 11 )) 00 ,, ll ∉∉ LL jj (( tt -- 11 ))

其中,是节点vi进行第t次更新后的标签,t为1以上的正整数,N(vi)是节点vi的邻居节点的集合,是节点vi的邻居节点vj在第t-1次更新后的标签,是求元素l的集合的运算符,所求得的l使得max后所定义的函数取得最大值。in, is the label of node v i after the tth update, t is a positive integer greater than 1, N(v i ) is the set of neighbor nodes of node v i , is the label of node v i 's neighbor node v j after the t-1th update, It is an operator for finding the set of elements l, and the obtained l makes the function defined after max obtain the maximum value.

所述的同步进行标签的传播更新,是指第t次更新任意节点vi的标签只能由之前t-1次更新得到的标签数组L(1),L(2),…,L(t-1)计算得到,而与第t次更新已得到的其他任何节点vj(j≠i)的标签无关。The synchronous label propagation update refers to updating the label of any node v i for the tth time It can only be calculated from the label array L (1) , L (2) ,…,L (t-1) obtained by the previous t-1 update, and any other node v j (j ≠ i) label irrelevant.

所述的标签数组比较。将第t次更新得到的标签数组L(t)与之前t-1次更新得到的标签数组L(1),L(2),…,L(t-1)进行比较。如果t次的标签数组均不相同,则继续按照步骤2进行标签的传播更新;如果L(1),L(2),…,L(t-2)中存在和L(t)相同的标签数组,则表明更新出现了振荡,须进行振荡消除处理,之后再继续按照步骤2进行标签的传播更新;如果标签数组L(t-1)和L(t)相同,则表明网络G=(V,E)中所有节点的标签都不再发生改变,此时终止标签的传播更新,网络中拥有相同标签元素的节点即属于同一个网络社区结构,标签基数大于1的节点即为网络社区结构的重叠部分。具体为:The labeled arrays to compare. Compare the label array L (t) obtained by the t-th update with the label arrays L (1) , L (2) , ..., L (t-1) obtained by the previous t-1 update. If the label arrays of t times are not the same, continue to follow step 2 to propagate and update the label; if L (1) , L (2) , ..., L (t-2) have the same label as L (t) array, it indicates that the update has oscillated, and the oscillation elimination process must be performed, and then continue to follow step 2 to carry out the label propagation update; if the label array L (t-1) and L (t) are the same, it indicates that the network G=(V ,E) The labels of all nodes in E) are no longer changed. At this time, the propagation update of labels is terminated. The nodes with the same label elements in the network belong to the same network community structure, and the nodes with a label base greater than 1 are the network community structure. Overlap. Specifically:

所述的振荡消除,其步骤为:假定标签数组L(1),L(2),…,L(t-2)中存在L(k)和L(t)相同,则在标签数组L(k+1),…,L(t)中搜索出所有从第k+1次更新至第t次更新标签发生过改变的节点,在L(t)中将这些节点的标签分别替换成不同于网络中其他标签的基数为1的标签。The steps of the vibration elimination are as follows: assuming that L (k) and L (t) are the same in the label array L (1) , L (2) , ..., L (t-2) , then in the label array L ( k+1) ,...,L (t) , search out all the nodes whose labels have changed from the k+1th update to the tth update, and replace the labels of these nodes in L (t) with different The cardinality-1 label of other labels in the network.

本发明从三个方面提高了标签传播的性能与效果。首先,复杂网络社区结构的核心结构为三角簇。本发明从网络中划分出不相交的三角簇,并且分配不同的标签作为标签传播的起始状态,充分利用了复杂网络的拓扑特性,初步构建了社区结构的基本形状,相比为网络中所有节点分配互不相同的标签作为起始状态的方法有了很大的优化。其次,本发明引入了多标签更新规则,使得每个节点的标签传播更新充分利用了邻居节点的标签信息,使得节点的局部信息不会损失,与随机选择最优标签的更新规则相比,多标签更新规则大大提高了标签传播的鲁棒性和准确度。最后,采用振荡消除处理对振荡节点的标签重新赋值,较好地解决了同步更新方式下的振荡问题,彻底消除了异步更新方式中由于节点的更新顺序而引入的随机性缺陷,并且可以通过并行计算,提高计算效率。The present invention improves the performance and effect of label propagation from three aspects. First, the core structure of the complex network community structure is a triangular cluster. The invention divides disjoint triangular clusters from the network, and assigns different labels as the initial state of label propagation, fully utilizes the topological characteristics of the complex network, and preliminarily constructs the basic shape of the community structure. The method of assigning different labels to nodes as starting states has been greatly optimized. Secondly, the present invention introduces a multi-label update rule, so that the label propagation update of each node makes full use of the label information of neighboring nodes, so that the local information of the node will not be lost. Compared with the update rule of randomly selecting the optimal label, more The label update rule greatly improves the robustness and accuracy of label propagation. Finally, the oscillation elimination process is used to reassign the label of the oscillation node, which better solves the oscillation problem in the synchronous update mode, and completely eliminates the randomness defect introduced by the update sequence of the nodes in the asynchronous update mode, and can pass parallel Computing, improving computing efficiency.

附图说明Description of drawings

图1为本发明的方法流程图。Fig. 1 is a flow chart of the method of the present invention.

图2为本方法在空手道俱乐部网络下的标签传播的初始状态示意图。Figure 2 is a schematic diagram of the initial state of label propagation under the karate club network of this method.

图3为本方法在空手道俱乐部网络下得到的具有重叠的社区结构示意图。Figure 3 is a schematic diagram of the overlapping community structure obtained by this method under the karate club network.

具体实施方式detailed description

下面对本发明的实施例作详细说明,本实施例在以本发明技术方案为前提下进行实施,给出了详细的实施方式和具体的操作过程,但本发明的保护范围不限于下述的实施例。The embodiments of the present invention are described in detail below. This embodiment is implemented on the premise of the technical solution of the present invention, and detailed implementation methods and specific operating procedures are provided, but the protection scope of the present invention is not limited to the following implementation example.

本实施例采用社会网络中的经典数据集Zachary空手道俱乐部网络,该网络包含34个节点和78条边。具体包括步骤如下:This embodiment adopts the Zachary karate club network, a classic data set in social networks, which contains 34 nodes and 78 edges. The specific steps are as follows:

①为待测的空手道俱乐部网络的每个节点赋予一个空标签;① Assign an empty label to each node of the karate club network to be tested;

②按下列公式计算每条边ej的三角簇系数TC(eij):②Calculate the triangular cluster coefficient TC(e ij ) of each edge ej according to the following formula:

TC(eij)=|N(vi)∩N(vj)|TC(e ij )=|N(v i )∩N(v j )|

其中:N(vi)是节点vi的邻居节点的集合,∩是求交集的运算符,|·|是求集合基数的运算符,i为1、2、3、……,N;Among them: N(v i ) is the set of neighbor nodes of node v i , ∩ is an operator for seeking intersection, |·| is an operator for seeking set base, i is 1, 2, 3, ..., N;

③搜索出三角簇系数最大的边;③ Search out the side with the largest triangular cluster coefficient;

④将该边的三角簇中的所有节点赋予一个不同于网络中其他标签的基数为1的标签;之后将这些节点从网络中删除,得到新的网络G’,④ Give all nodes in the triangular cluster of the edge a label with a cardinality of 1 that is different from other labels in the network; then delete these nodes from the network to obtain a new network G',

⑤判断是否存在三角簇系数大于0的边,存在,则返回步骤②,否则将此时网络中所有节点的标签保存为一个标签数组L(0),作为标签传播的初始状态,进入步骤⑥;⑤ Determine whether there is an edge with a triangular cluster coefficient greater than 0, if it exists, then return to step ②, otherwise save the labels of all nodes in the network as a label array L (0) at this time, as the initial state of label propagation, enter step ⑥;

由步骤⑤得到的标签传播初始状态的标签数组L(0)共包含3种不同的非空标签l1、l2和l3,对应于空手道俱乐部网络中存在的3个互不相交的三角簇,如图2所示,其中每个标签用一个圆圈表示,不同圆圈内的节点拥有不同的非空标签,空白的节点表示拥有空标签的节点。The label array L (0) of the initial state of label propagation obtained from step ⑤ contains three different non-empty labels l 1 , l 2 and l 3 , corresponding to the three disjoint triangular clusters in the karate club network , as shown in Figure 2, where each label is represented by a circle, nodes in different circles have different non-empty labels, and blank nodes represent nodes with empty labels.

⑥利用下式同步进行标签的传播更新:⑥ Use the following formula to synchronize the propagation and update of labels:

LL ii (( tt )) == argarg maxmax ll ΣΣ vv jj ∈∈ NN (( vv ii )) δδ (( LL jj (( tt -- 11 )) ,, ll ))

δδ (( LL jj (( tt -- 11 )) ,, ll )) == 11 ,, ll ∈∈ LL jj (( tt -- 11 )) 00 ,, ll ∉∉ LL jj (( tt -- 11 ))

其中:是节点vi进行第t次更新后的标签,t为1以上的正整数,N(vi)是节点vi的邻居节点的集合,是节点vi的邻居节点vj在第t-1次更新后的标签,是求元素l的集合的运算符,所求得的l使得max后所定义的函数取得最大值,在第t次更新时,利用第t-1次更新得到的标签数组L(t-1),对于复杂网络G中的所有节点按照多标签更新的规则,同步进行标签的传播更新,并保存至新的标签数组L(t)in: is the label of node v i after the tth update, t is a positive integer greater than 1, N(v i ) is the set of neighbor nodes of node v i , is the label of node v i 's neighbor node v j after the t-1th update, It is an operator for finding the set of elements l. The obtained l makes the function defined after max obtain the maximum value. When the t-th update is performed, the label array L (t-1) obtained by the t-1th update is used. , for all nodes in the complex network G, according to the rule of multi-label update, the label is propagated and updated synchronously, and saved to a new label array L (t) ;

⑦比较标签数组:将第t次更新得到的标签数组L(t)与之前t-1次更新得到的标签数组L(1),L(2),…,L(t-1)进行比较,⑦Comparing the label array: compare the label array L (t) obtained by the t-th update with the label array L (1) , L (2) ,...,L (t-1) obtained by the previous t-1 update,

当t次的标签数组均不相同,则返回步骤⑥;When the label arrays of t times are not the same, return to step ⑥;

当L(1),L(2),…,L(t-2)中存在和L(t)相同的标签数组,则进入表明更新出现了振荡,则进行步骤⑧;When L (1) , L (2) ,...,L (t-2) has the same label array as L (t) , it will enter to indicate that the update has oscillated, and then proceed to step ⑧;

当标签数组L(t-1)和L(t)相同,则进入步骤⑨When the label array L (t-1) and L (t) are the same, go to step ⑨

⑧振荡消除处理:假定标签数组L(1),L(2),…,L(t-2)中存在L(k)和L(t)相同,则在标签数组L(k+1),…,L(t)中搜索出所有从第k+1次更新至第t次更新标签发生过改变的节点,在L(t)中将这些节点的标签分别替换成不同于网络中其他标签的基数为1的标签;然后返回步骤⑥;⑧ Oscillation Elimination Processing: Assume that L (k) and L (t) are the same in the label array L (1) , L (2) , ..., L (t-2) , then in the label array L (k+1) , ...,L (t) searches out all the nodes whose labels have changed from the k+1th update to the tth update, and replaces the labels of these nodes in L (t) with different labels from other labels in the network The label whose cardinality is 1; then return to step ⑥;

⑨得到网络的具有重叠的社区结构,网络中拥有相同标签元素的节点属于同一个网络社区结构,标签基数大于1的节点即为网络社区结构的重叠部分。⑨ Obtain the overlapping community structure of the network, the nodes with the same label elements in the network belong to the same network community structure, and the nodes with the label cardinality greater than 1 are the overlapping parts of the network community structure.

利用本方法,在第2次更新后得到的标签数组L(2)和第1次更新后得到的标签数组L(1)相同,所以终止标签的传播更新,得到空手道俱乐部网络的具有重叠的社区结构,如图3所示。可以看出,空手道俱乐部网络中共存在3个社区结构C1、C2和C3,分别用一个圆圈表示,圆圈内的节点为该社区结构的成员节点;节点5、节点10和节点11为社区结构的重叠部分,用黑色的节点表示;社区结构C1和C2共同拥有节点5和节点11,社区C2和C3共同拥有节点10。Using this method, the label array L (2) obtained after the second update is the same as the label array L (1) obtained after the first update, so the propagation update of the label is terminated, and the overlapping communities of the karate club network are obtained structure, as shown in Figure 3. It can be seen that there are three community structures C 1 , C 2 and C 3 in the karate club network, each represented by a circle, and the nodes in the circle are the member nodes of the community structure; nodes 5, 10 and 11 are community The overlapping parts of the structures are represented by black nodes; community structures C 1 and C 2 share nodes 5 and 11, and communities C 2 and C 3 share node 10.

以上实验使用典型网络数据集,对本发明的方法流程进行了详细的说明。该实验结果与数据集的背景知识基本一致,被划分进入正确社区结构的节点比例达到了95.59%。此外,模块度是一个用于衡量社区结构优劣的非常重要的指标参数,经计算本方法在空手道俱乐部网络下得到的社区结构的模块度为0.3912,非常接近于理论上的最大值0.4020,这也验证了本方法的准确性和有效性。The above experiments use typical network data sets to describe the flow of the method of the present invention in detail. The experimental results are basically consistent with the background knowledge of the dataset, and the proportion of nodes classified into the correct community structure reaches 95.59%. In addition, modularity is a very important index parameter used to measure the quality of the community structure. After calculation, the modularity of the community structure obtained by this method under the karate club network is 0.3912, which is very close to the theoretical maximum value of 0.4020, which is The accuracy and effectiveness of this method are also verified.

Claims (1)

1.一种基于三角簇多标签传播的复杂网络社区挖掘方法,包括以下步骤:搜索网络中互不相交的三角簇;为不同三角簇中的节点设置互不相同的标签作为标签传播的起始状态;按照多标签更新规则,对于网络中的所有节点,同步进行标签的传播更新;将更新后得到的标签数组与之前更新得到的标签数组进行比较;如果标签仍在传播,则继续进行更新;如果标签产生振荡,则进行振荡消除处理后再继续进行更新;如果标签不再改变,则停止更新,得到网络中的具有重叠的社区结构,其特征在于:该方法的具体步骤如下:1. A complex network community mining method based on triangular cluster multi-label propagation, comprising the following steps: searching for disjoint triangular clusters in the network; setting different labels for nodes in different triangular clusters as the starting point of label propagation Status; according to the multi-label update rule, for all nodes in the network, the label propagation update is performed synchronously; the updated label array is compared with the previously updated label array; if the label is still propagating, continue to update; If the label oscillates, continue to update after performing oscillation elimination processing; if the label no longer changes, then stop updating to obtain an overlapping community structure in the network, characterized in that: the specific steps of the method are as follows: ①为待测的复杂网络G的每个节点赋予一个空标签;① Assign an empty label to each node of the complex network G to be tested; ②按下列公式计算每条边eij的三角簇系数TC(eij):②Calculate the triangular cluster coefficient TC(e ij ) of each edge e ij according to the following formula: TC(eij)=|N(vi)∩N(vj)|TC(e ij )=|N(v i )∩N(v j )| 其中:N(vi)是节点vi的邻居节点的集合,∩是求交集的运算符,|·|是求集合基数的运算符,i为1、2、3、……,N;Among them: N(v i ) is the set of neighbor nodes of node v i , ∩ is an operator for seeking intersection, |·| is an operator for seeking set base, i is 1, 2, 3, ..., N; ③搜索出三角簇系数最大的边;③ Search out the side with the largest triangular cluster coefficient; ④将该边的三角簇中的所有节点赋予一个不同于网络中其他标签的基数为1的标签;之后将这些节点从网络中删除,得到新的网络G’,④ Give all nodes in the triangular cluster of the edge a label with a cardinality of 1 that is different from other labels in the network; then delete these nodes from the network to obtain a new network G', ⑤判断是否存在三角簇系数大于0的边,存在,则返回步骤②,否则将此时网络中所有节点的标签保存为一个标签数组L(0),作为标签传播的初始状态,进入步骤⑥;⑤ Determine whether there is an edge with a triangular cluster coefficient greater than 0, if it exists, then return to step ②, otherwise save the labels of all nodes in the network as a label array L (0) at this time, as the initial state of label propagation, enter step ⑥; ⑥利用下式同步进行标签的传播更新:⑥ Use the following formula to synchronize the propagation and update of labels: LL ii (( tt )) == argarg maxmax ll ΣΣ vv jj ∈∈ NN (( vv ii )) δδ (( LL jj (( tt -- 11 )) ,, ll )) δδ (( LL jj (( tt -- 11 )) ,, ll )) == 11 ,, ll ∈∈ LL jj (( tt -- 11 )) 00 ,, ll ∉∉ LL jj (( tt -- 11 )) 其中:是节点vi进行第t次更新后的标签,t为1以上的正整数,N(vi)是节点vi的邻居节点的集合,是节点vi的邻居节点vj在第t-1次更新后的标签,|·|是求集合基数的运算符,是求元素l的集合的运算符,所求得的l使得max后所定义的函数取得最大值,在第t次更新时,利用第t-1次更新得到的标签数组L(t-1),对于复杂网络G中的所有节点按照多标签更新的规则,同步进行标签的传播更新,并保存至新的标签数组L(t)in: is the label of node v i after the tth update, t is a positive integer greater than 1, N(v i ) is the set of neighbor nodes of node v i , is the label of the neighbor node v j of node v i after the t-1th update, |·| is an operator for calculating the cardinality of the set, It is an operator for finding the set of elements l. The obtained l makes the function defined after max obtain the maximum value. When the t-th update is performed, the label array L (t-1) obtained by the t-1th update is used. , for all nodes in the complex network G, according to the rule of multi-label update, the label is propagated and updated synchronously, and saved to a new label array L (t) ; ⑦比较标签数组:将第t次更新得到的标签数组L(t)与之前t-1次更新得到的标签数组L(1),L(2),…,L(t-1)进行比较,⑦Comparing the label array: compare the label array L (t) obtained by the t-th update with the label array L (1) , L (2) ,...,L (t-1) obtained by the previous t-1 update, 当t次的标签数组均不相同,则返回步骤⑥;When the label arrays of t times are not the same, return to step ⑥; 当L(1),L(2),…,L(t-2)中存在和L(t)相同的标签数组,则进入表明更新出现了振荡,则进行步骤⑧;When L (1) , L (2) ,...,L (t-2) has the same label array as L (t) , it will enter to indicate that the update has oscillated, and then proceed to step ⑧; 当标签数组L(t-1)和L(t)相同,则进入步骤⑨;When the label array L (t-1) and L (t) are the same, enter step ⑨; ⑧振荡消除处理:假定标签数组L(1),L(2),…,L(t-2)中存在L(k)和L(t)相同,则在标签数组L(k+1),…,L(t)中搜索出所有从第k+1次更新至第t次更新标签发生过改变的节点,在L(t)中将这些节点的标签分别替换成不同于网络中其他标签的基数为1的标签;然后返回步骤⑥;⑧ Oscillation Elimination Processing: Assume that L (k) and L (t) are the same in the label array L (1) , L (2) ,...,L (t-2) , then in the label array L (k+1) , ...,L (t) searches out all the nodes whose labels have changed from the k+1th update to the tth update, and replaces the labels of these nodes in L (t) with different labels from other labels in the network The label whose cardinality is 1; then return to step ⑥; ⑨得到网络的具有重叠的社区结构,网络中拥有相同标签元素的节点属于同一个网络社区结构,标签基数大于1的节点即为网络社区结构的重叠部分。⑨ Obtain the overlapping community structure of the network, the nodes with the same label elements in the network belong to the same network community structure, and the nodes with label cardinality greater than 1 are the overlapping parts of the network community structure.
CN201210573195.XA 2012-12-26 2012-12-26 Based on the complex network community structure method for digging of triangular cluster multi-label Expired - Fee Related CN103020267B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210573195.XA CN103020267B (en) 2012-12-26 2012-12-26 Based on the complex network community structure method for digging of triangular cluster multi-label

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210573195.XA CN103020267B (en) 2012-12-26 2012-12-26 Based on the complex network community structure method for digging of triangular cluster multi-label

Publications (2)

Publication Number Publication Date
CN103020267A CN103020267A (en) 2013-04-03
CN103020267B true CN103020267B (en) 2016-01-20

Family

ID=47968870

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210573195.XA Expired - Fee Related CN103020267B (en) 2012-12-26 2012-12-26 Based on the complex network community structure method for digging of triangular cluster multi-label

Country Status (1)

Country Link
CN (1) CN103020267B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105279187A (en) * 2014-07-15 2016-01-27 天津科技大学 Edge clustering coefficient-based social network group division method
CN104102745B (en) * 2014-07-31 2017-12-29 上海交通大学 Complex network community method for digging based on Local Minimum side
CN104199852B (en) * 2014-08-12 2018-01-12 上海交通大学 Label based on node degree of membership propagates community structure method for digging
CN105893381A (en) * 2014-12-23 2016-08-24 天津科技大学 Semi-supervised label propagation based microblog user group division method
CN108898264B (en) * 2018-04-26 2021-10-29 深圳大学 A kind of calculation method and device of overlapping community set quality measurement index
CN110309419A (en) * 2018-05-14 2019-10-08 桂林远望智能通信科技有限公司 A kind of overlapping anatomic framework method for digging and device propagated based on balance multi-tag
CN112015954B (en) * 2020-08-28 2021-08-27 平顶山学院 Martha effect-based community detection method
CN112381360B (en) * 2020-10-28 2023-06-27 广西大学 Power system parallel recovery partitioning method based on label propagation algorithm and game theory
CN112598549B (en) * 2020-12-23 2022-05-03 广东技术师范大学 Learner potential overlapping community detection method, device, equipment and medium
CN112967146B (en) * 2021-02-03 2023-08-04 北京航空航天大学 A method and device for discovering scientific research communities based on label dissemination

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102819611A (en) * 2012-08-27 2012-12-12 方平 Local community digging method of complicated network

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7958120B2 (en) * 2005-05-10 2011-06-07 Netseer, Inc. Method and apparatus for distributed community finding

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102819611A (en) * 2012-08-27 2012-12-12 方平 Local community digging method of complicated network

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Identification of Overlapping Communities by Label Propatation;Lin LI等;《Journal of Information & Computational Science》;20121130;第4413-4419页 *
在线社会网络团结构分析;马延妮;《中国优秀硕士学位论文全文数据库》;20100215(第2期);第15,20-29,34,50页 *
基于概率模型的重叠社区发现算法研究;杨荟蓉;《中国优秀硕士学位论文全文数据库》;20110915(第9期);第11-12页 *

Also Published As

Publication number Publication date
CN103020267A (en) 2013-04-03

Similar Documents

Publication Publication Date Title
CN103020267B (en) Based on the complex network community structure method for digging of triangular cluster multi-label
Zhang et al. Exact solution for mean first-passage time on a pseudofractal scale-free web
CN104199852B (en) Label based on node degree of membership propagates community structure method for digging
Liu et al. Community detection in large-scale bipartite networks
Elseidy et al. Grami: Frequent subgraph and pattern mining in a single large graph
CN104102745B (en) Complex network community method for digging based on Local Minimum side
CN103678671B (en) A kind of dynamic community detection method in social networks
WO2016078368A1 (en) Community search algorithm based on k-kernel
CN105279187A (en) Edge clustering coefficient-based social network group division method
CN105893382A (en) Priori knowledge based microblog user group division method
CN104699883A (en) Circuit design evaluation with compact multi-waveform representations
CN101174261B (en) Multi-regular expression joint search method based on extended finite state machine
CN102722566B (en) Method for inquiring potential friends in social network
CN108765180A (en) The overlapping community discovery method extended with seed based on influence power
CN105893381A (en) Semi-supervised label propagation based microblog user group division method
CN105335438A (en) Local shortest loop based social network group division method
CN108062360A (en) A kind of method, system and device of large-scale complex community structure detection
CN110136016A (en) A multi-label propagation method and system based on implicit association
CN108428006A (en) A kind of Internetwork link prediction technique based on common neighbor node and community structure
CN112182306A (en) A Community Discovery Method Based on Uncertainty Graph
CN107885797A (en) A kind of multi-mode figure matching process based on structural dependence
CN103455612A (en) Method for detecting non-overlapping network communities and overlapping network communities based on two-stage strategy
CN104700311B (en) A kind of neighborhood in community network follows community discovery method
Zhang et al. Discovering frequency bursting patterns in temporal graphs
CN105488247A (en) K-mean community structure mining method and apparatus

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160120

CF01 Termination of patent right due to non-payment of annual fee