[go: up one dir, main page]

CN105956040B - Song popularity analysis method in music information network under the influence of social networks - Google Patents

Song popularity analysis method in music information network under the influence of social networks Download PDF

Info

Publication number
CN105956040B
CN105956040B CN201610264331.5A CN201610264331A CN105956040B CN 105956040 B CN105956040 B CN 105956040B CN 201610264331 A CN201610264331 A CN 201610264331A CN 105956040 B CN105956040 B CN 105956040B
Authority
CN
China
Prior art keywords
song
songs
information
genre
singer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201610264331.5A
Other languages
Chinese (zh)
Other versions
CN105956040A (en
Inventor
余春艳
郑晓燕
苏金池
王秀
郭文忠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fuzhou University
Original Assignee
Fuzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fuzhou University filed Critical Fuzhou University
Priority to CN201610264331.5A priority Critical patent/CN105956040B/en
Publication of CN105956040A publication Critical patent/CN105956040A/en
Application granted granted Critical
Publication of CN105956040B publication Critical patent/CN105956040B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/63Querying
    • G06F16/635Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本发明涉及一种音乐信息网络中社交关系影响下的歌曲流行度分析方法,先从音乐信息网络中获取信息子网和信息子网的网络模式,并获取与信息子网相关的社交关系,然后在信息子网中计算不同类型关系边的平均边介数,再根据平均边介数计算得到不同类型关系边的传播因子,接着根据社交关系在信息子网中设计边权重,根据边权重在同一类型节点间选择节点进行随机游走,最后结合传播因子和边权重在信息子网中随机游走,得到网络中节点影响力排名,从而得到歌曲流行度。本发明提出的该方法考虑到了用户群体对歌曲的影响,从而使得最后的流行度分析更为合理。

The invention relates to a method for analyzing the popularity of songs under the influence of social relations in a music information network. First, the information subnet and the network mode of the information subnet are obtained from the music information network, and the social relations related to the information subnet are obtained, and then the information subnet is obtained from the music information network. Calculate the average edge betweenness of different types of relationship edges in the information subnet, and then calculate the propagation factors of different types of relationship edges according to the average edge betweenness, and then design edge weights in the information subnet according to the social relationship. Select nodes to perform random walk among the type nodes, and finally combine the propagation factors and edge weights to randomly walk in the information subnet to obtain the ranking of the nodes' influence in the network, so as to obtain the popularity of songs. The method proposed by the present invention takes into account the influence of user groups on songs, so that the final popularity analysis is more reasonable.

Description

音乐信息网络中社交关系影响下的歌曲流行度分析方法Analysis method of song popularity under the influence of social relations in music information network

技术领域technical field

本发明涉及信息检索领域,特别是一种音乐信息网络中社交关系影响下的歌曲流行度分析方法。The invention relates to the field of information retrieval, in particular to a method for analyzing the popularity of songs under the influence of social relations in a music information network.

背景技术Background technique

音乐信息网络实际上是由一个信息子网和社交子网构成,而信息子网通常包含歌曲以及与歌曲相关的类型节点,如歌手、作词者、流派等等,网络中存在的关系有歌曲-歌手之间的被演唱与演唱关系、歌曲-作词者之间的被作词与作词关系、歌曲-流派之间的被包含与包含关系等等,社交子网中包含用户之间的好友关系、歌曲-用户之间的被播放与播放关系、用户-分组之间的被包含与包含关系。The music information network is actually composed of an information subnet and a social subnet, and the information subnet usually contains songs and genre nodes related to songs, such as singers, lyricists, genres, etc. The relationships that exist in the network are song- The relationship between being sung and singing between singers, the relationship between being written and writing lyrics between songs and lyricists, the relationship between being included and included between songs and genres, etc. The social subnet includes friend relationships and songs between users. -Played-to-play relationship between users, contained-to-included relationship between users-groups.

目前,对于异构信息网络中歌曲流行度分析方法,基本上都是利用与歌曲相关的信息子网中不同类型的对象和关系等全面的结构信息和丰富的语义信息来分析得到流行度。然而这种信息子网中更多注重于与歌曲相关的的歌手、作词者等这些静态信息,因而在这种网络中用传统方法分析得到的歌曲流行度只是表明在这一信息子网中的普适性,并不能真正地体现出歌曲在用户群体中的受欢迎程度;而在引入与歌曲相关的社交关系后,考虑到信息子网与社交子网的网络差异性问题,如前者往往关注于网络中静态信息,后者更多关注于网络拓扑结构,因而不能直接在音乐信息网络这一整个网络中使用传统方法分析歌曲流行度。At present, the analysis methods of song popularity in heterogeneous information networks basically use comprehensive structural information and rich semantic information such as different types of objects and relationships in the information subnet related to songs to analyze the popularity. However, this kind of information subnet pays more attention to the static information such as singers, lyricists, etc. related to songs, so the popularity of songs analyzed by traditional methods in this kind of network only indicates that in this information subnet Universality does not really reflect the popularity of songs in the user group; after the introduction of social relationships related to songs, considering the network differences between information subnets and social subnets, for example, the former often pays attention to Because of the static information in the network, the latter is more concerned with the network topology, so it cannot directly use the traditional method to analyze the popularity of songs in the whole network of music information network.

发明内容SUMMARY OF THE INVENTION

本发明的目的在于提供一种音乐信息网络中社交关系影响下的歌曲流行度分析方法,以克服现有的音乐信息网络中歌曲流行度分析方法中存在的问题。The purpose of the present invention is to provide a method for analyzing the popularity of songs under the influence of social relations in the music information network, so as to overcome the problems existing in the methods for analyzing the popularity of songs in the existing music information network.

为实现上述目的,本发明的技术方案是:一种音乐信息网络中社交关系影响下的歌曲流行度分析方法,步骤包括如下:In order to achieve the above object, the technical scheme of the present invention is: a method for analyzing the popularity of songs under the influence of social relations in a music information network, the steps include the following:

步骤S1:获取音乐信息网络,从所述音乐信息网络中提取没有社交关系影响下的信息子网以及信息子网的网络模式,并从所述音乐信息网络中提取出与所述信息子网相关的社交关系;Step S1: Obtain a music information network, extract information subnets without the influence of social relationships and network modes of the information subnets from the music information network, and extract information related to the information subnets from the music information network. social relationships;

步骤S2:根据所述信息子网的网络模式,计算所述信息子网中不同类型关系边的平均边介数,并根据平均边介数计算随机游走过程中不同类型关系边的传播因子;Step S2: according to the network mode of the information subnet, calculate the average edge betweenness of different types of relation edges in the information subnet, and calculate the propagation factor of different types of relation edges in the random walk process according to the average edge betweenness;

步骤S3:根据所述社交关系,计算在社交关系影响下的信息子网中各种类型边权重;Step S3: According to the social relationship, calculate the edge weights of various types in the information subnet under the influence of the social relationship;

步骤S4:结合所述传播因子与各种类型边权重在所述信息子网中进行随机游走,计算社交关系影响下各种类型的节点影响力排序,从而得到歌曲流行度。Step S4: Perform random walks in the information sub-network in combination with the propagation factor and various types of edge weights, and calculate the influence ranking of various types of nodes under the influence of social relations, thereby obtaining the popularity of the song.

在本发明一实施例中,在所述步骤S1中,在音乐信息网络中,去除用户之间的社交关系以及用户与歌曲之间的播放关系与被播放关系后,得到仅与歌曲相关的且没有社交关系影响下的信息子网G0,该信息子网G0的网络模式有四种类型节点,分别为:歌曲、歌手、流派以及作词者。In an embodiment of the present invention, in the step S1, in the music information network, after removing the social relationship between users and the playback relationship and the played relationship between the user and the song, the music information only related to the song is obtained. The information subnet G 0 without the influence of social relations has four types of nodes in the network mode of the information subnet G 0 , namely: songs, singers, genres, and lyricists.

在本发明一实施例中,所述信息子网络中四种类型节点存在如下类型关系:歌手-歌曲之间的演唱与被演唱关系、作词者-歌曲之间的作词与被作词关系、流派-歌曲之间的包含与被包含关系;并从音乐信息网络中提取用户-歌曲之间的播放关系,作为所述信息子网相关的社交关系SG0In an embodiment of the present invention, the four types of nodes in the information sub-network have the following type relationships: the relationship between singer-song singing and being sung, lyricist-song relationship between writing lyrics and being written, genre- The inclusion and inclusion relationship between songs; and the user-song playing relationship is extracted from the music information network as the social relationship SG 0 related to the information sub-network.

在本发明一实施例中,在所述步骤S2中,基于信息子网的网络模式,计算如下六种类型边的传播因子,即,演唱关系、被演唱关系、作词关系、被作词关系、包含关系以及被包含关系,按照如下步骤计算:In an embodiment of the present invention, in the step S2, based on the network mode of the information subnet, the propagation factors of the following six types of edges are calculated, that is, the relationship between singing, being sung, writing lyrics, writing lyrics, including Relationships and included relationships are calculated as follows:

步骤S21:获取所述信息子网G0Step S21: acquiring the information subnet G 0 ;

步骤S22:计算所述信息子网G0中所述被演唱关系边、所述被作词关系以边及所述被包含关系边的边介数,并计算该三种类型边的平均边介数的比值ems:ema:emtStep S22: Calculate the edge betweenness of the sung relationship edge, the lyric relationship edge, and the included relationship edge in the information subnet G 0 , and calculate the average edge betweenness of the three types of edges The ratio of e ms :e ma :e mt ;

步骤S23:根据λmsmamt=ems:ema:emt和λmsmamt=1计算该三种类型边的传播因子λmsmamt,且所述演唱关系边、所述作词关系边以及所述包含关系边的传播因子均为1,即λsm=1,λam=1,λtm=1。Step S23: Calculate the propagation factors λ ms , λ ma , λ mt of the three types of edges according to λ msmamt =e ms :e ma :e mt and λ msmamt =1, And the propagation factors of the singing relationship edge, the lyric writing relationship edge, and the inclusion relationship edge are all 1, that is, λ sm =1, λ am =1, λ tm =1.

在本发明一实施例中,在所述步骤S3中,各所述种类型边权重计算过程如下:In an embodiment of the present invention, in the step S3, the calculation process of each type of edge weight is as follows:

步骤S31:歌曲-流派之间的被包含关系边的权重按照如下方式计算:Step S31: The weight of the included relationship edge between song-genre is calculated as follows:

其中,Nm T表示歌曲m所属于的流派个数,Nm U表示播放歌曲m的用户人数,P(t)表示流派t包含的歌曲列表;Among them, N m T represents the number of genres to which the song m belongs, N m U represents the number of users who play the song m, and P(t) represents the song list contained in the genre t;

步骤S32:流派-歌曲之间的包含关系边的权重按照如下方式计算:Step S32: The weight of the inclusion relationship edge between genres and songs is calculated as follows:

其中,NM t表示流派t包含的歌曲数;Among them, N M t represents the number of songs contained in genre t;

步骤S33:歌曲-歌手之间的被演唱关系边的权重按照如下方式计算:Step S33: The weight of the sung relationship edge between the song and the singer is calculated as follows:

其中,Nm S表示歌曲m的歌手个数,Nm U表示播放歌曲m的用户人数,P(s)表示歌手s演唱的歌曲列表;Among them, N m S represents the number of singers of song m, N m U represents the number of users who play song m, and P(s) represents the list of songs sung by singer s;

步骤S34:歌手-歌曲之间的演唱关系边的权重按照如下方式计算:Step S34: The weight of the singing relationship edge between singers and songs is calculated as follows:

其中,表示歌手s演唱的歌曲数;in, Represents the number of songs sung by singer s;

步骤S35:歌曲-作词者之间的被作词关系边的权重按照如下方式计算:Step S35: The weight of the lyricist relationship edge between the song and the lyricist is calculated as follows:

其中,Nm A表示歌曲m的作词者个数,Nm U表示播放歌曲m的用户人数,P(a)表示作词者a作词的歌曲列表;Among them, N m A represents the number of lyricists of song m, N m U represents the number of users who play song m, and P(a) represents the list of songs written by lyricist a;

步骤S36:作词者-歌曲之间的作词关系边的权重按照如下方式计算:Step S36: The weight of the lyric relationship edge between the lyricist and the song is calculated as follows:

其中,Na M表示作词者a作词的歌曲数。Among them, N a M represents the number of songs written by lyricist a.

在本发明一实施例中,所述步骤S4还包括如下步骤:In an embodiment of the present invention, the step S4 further includes the following steps:

步骤S41:将所述信息子网中歌曲-歌曲之间边权重Wmm,、歌手-歌手之间边权重Wss、作词者-作词者之间边权重Waa、流派-流派之间边权重Wtt、歌手-作词者之间边权重Wsa、作词者-歌手之间边权重Was、歌手-流派之间边权重Wst、流派-歌手之间边权重Wts、作词者-流派之间边权重Wat、流派-作词者之间边权重Wta均设置为0,即Wmm,Wss,Waa,Wtt,Wsa,Was,Wst,Wts,Wat,Wta都设置为对应大小的零矩阵;将歌曲-歌曲之间传播因子λmm、歌手-歌手之间传播因子λss、作词者-作词者之间传播因子λaa、流派-流派之间传播因子λtt、歌手-作词者之间传播因子λsa、作词者-歌手之间传播因子λas、歌手-流派之间传播因子λst、流派-歌手之间传播因子λts、作词者-流派之间传播因子λat、流派-作词者之间传播因子λta均设置为0,即λmm=λss=λaa=λtt=λsa=λas=λst=λts=λat=λta=0;Step S41: the edge weight W mm between songs and songs in the information subnet, the edge weight W ss between singers and singers, the edge weight W aa between lyricists and the lyricists, and the edge weights between genres and genres. W tt , singer-singer edge weight W sa , lyricist-singer edge weight W as , singer-genre edge weight W st , genre-singer edge weight W ts , lyricist-genre The edge weight W at and the genre-writer edge weight W ta are all set to 0, namely W mm ,W ss ,W aa ,W tt ,W sa ,W as ,W st ,W ts ,W at ,W ta are set to zero matrices of corresponding sizes; the spread factor between song-song λ mm , the spread factor between singer-singer λ ss , the spread factor between lyricist and lyricist λ aa , the spread factor between genres and genres λ tt , singer-singer propagation factor λ sa , lyricist-singer propagation factor λ as , singer-genre propagation factor λ st , genre-singer propagation factor λ ts , lyricist-genre propagation factor λ ts The inter-distribution factor λ at and the genre-writer propagation factor λ ta are all set to 0, that is, λ mm = λ ss = λ aa = λ tt = λ sa = λ as = λ st = λ ts = λ at = λ ta = 0;

步骤S42:在所述信息子网中,节点之间进行随机游走的转移概率矩阵TPM按如下方式计算:Step S42: In the information subnet, the transition probability matrix TPM for random walk between nodes is calculated as follows:

步骤S43:设定两个长度为N的向量Vec_C与Vec_R;Vec_R中的值为所述信息子网中每个节点的影响力值,Vec_C初始为并通过如下方式计算Vec_R:Step S43: Set two vectors of length N, Vec_C and Vec_R; the value in Vec_R is the influence value of each node in the information subnet, and Vec_C is initially and calculate Vec_R as follows:

其中,ε为全图随机跳转概率,N取值为所述信息子网中节点总个数;Among them, ε is the random jump probability of the whole graph, and N is the total number of nodes in the information subnet;

步骤S44:通过以下两个公式计算Vec_C以及Vec_R:Step S44: Calculate Vec_C and Vec_R by the following two formulas:

Vec_C=Vec_R,Vec_C=Vec_R,

且当||Vec_R-Vec_C||≥ξ时,则继续以上两个公式的计算,否则得到Vec_R,ξ为预设误差阈值;And when ||Vec_R-Vec_C||≥ξ, continue the calculation of the above two formulas, otherwise get Vec_R, and ξ is the preset error threshold;

步骤S45:分别对Vec_R中类型节点歌曲、歌手、作词者以及流派值进行排序,得到歌曲、歌手、作词者以及流派的排序结果序列,进而得到歌曲流行度。Step S45: Sort the genre node songs, singers, lyricists and genre values in Vec_R respectively to obtain the sorted result sequence of songs, singers, lyricists and genres, and then obtain the popularity of the songs.

相较于现有技术,本发明具有以下有益效果:本发明提出了一种音乐信息网络中社交关系影响下的歌曲流行度分析方法,该方法不仅有效地利用与歌曲相关的信息子网中不同类型的对象和关系等全面的结构信息和丰富的语义信息,同时又考虑到用户群体对歌曲的影响,并且能够避免音乐信息网络中信息子网和社交子网网络差异性问题。这种在信息子网中借助社交子网的社交关系来分析歌曲流行度的方法更合理。Compared with the prior art, the present invention has the following beneficial effects: the present invention proposes a method for analyzing the popularity of songs under the influence of social relations in a music information network, which not only effectively utilizes the different information in the information sub-network related to songs The comprehensive structural information and rich semantic information such as types of objects and relationships, while taking into account the influence of user groups on songs, can avoid the difference between the information subnet and the social subnet in the music information network. This method is more reasonable to analyze the popularity of songs with the help of the social relations of the social subnet in the information subnet.

附图说明Description of drawings

图1为本发明中一种音乐信息网络中社交关系影响下的歌曲流行度分析方法。FIG. 1 is a method for analyzing the popularity of songs under the influence of social relations in a music information network according to the present invention.

图2为本发明一实施例中音乐信息网络。FIG. 2 is a music information network in an embodiment of the present invention.

图3为本发明一实施例中音乐信息网络中的信息子网的网络模式。FIG. 3 is a network mode of an information subnet in a music information network according to an embodiment of the present invention.

图4为本发明-实施例中音乐信息网络中社交关系影响下的信息子网实例。FIG. 4 is an example of an information subnet under the influence of social relations in a music information network in an embodiment of the present invention.

具体实施方式Detailed ways

下面结合附图,对本发明的技术方案进行具体说明。The technical solutions of the present invention will be described in detail below with reference to the accompanying drawings.

下面通过具体实施例对本发明做进一步的说明,但是需要注意的是,公布实施例的目的在于帮助进一步理解本发明,但是本领域的技术人员可以理解:在不脱离本发明及所附的权利要求的精神和范围内,各种替换和修改都是可能的。因此,本发明不应局限于实施例所公开的内容,本发明要求保护的范围以权利要求书界定的范围为准。The present invention will be further described below by specific embodiments, but it should be noted that the purpose of publishing the embodiments is to help further understand the present invention, but those skilled in the art can understand: without departing from the present invention and the appended claims Various substitutions and modifications are possible within the spirit and scope of the Therefore, the present invention should not be limited to the contents disclosed in the embodiments, and the scope of protection of the present invention shall be subject to the scope defined by the claims.

如图1所示,为本发明所提出的音乐信息网络中社交关系影响下的歌曲流行度分析方法的的流程图,该方法包括如下步骤:As shown in Figure 1, it is a flowchart of a method for analyzing the popularity of songs under the influence of social relationships in a music information network proposed by the present invention, and the method includes the following steps:

步骤S1:获取音乐信息网络,从音乐信息网络中提取没有社交关系影响下的信息子网以及信息子网的网络模式,并从音乐信息网络中提取出与信息子网相关的社交关系。Step S1 : acquiring the music information network, extracting the information subnet without the influence of social relations and the network mode of the information subnet from the music information network, and extracting the social relations related to the information subnet from the music information network.

进一步的,在本实施例中,在音乐信息网络中,去除用户之间的社交关系以及用户与歌曲之间的播放关系与被播放关系等社交信息后,得到只与歌曲相关的没有社交关系影响下的信息子网G0,在这个信息子网G0的网络模式中,有四种类型节点,分别为歌曲、歌手、流派、作词者,网络中存在的关系有:歌手-歌曲之间的演唱与被演唱关系、作词者-歌曲之间的作词与被作词关系、流派-歌曲之间的包含与被包含关系;再从音乐信息网络中提取用户-歌曲之间的播放关系,也就是与信息子网相关的社交关系SG0Further, in this embodiment, in the music information network, after removing the social information such as the social relationship between users and the playing relationship and the played relationship between the user and the song, it is obtained only related to the song without the influence of the social relationship. Below the information subnet G 0 , in the network mode of this information subnet G 0 , there are four types of nodes, namely songs, singers, genres, and lyricists. The relationships in the network are: singer-song The relationship between singing and being sung, the relationship between lyricist and lyricist between the lyricist and the song, and the relationship between genre-song and inclusion; and then extract the playback relationship between users and songs from the music information network, that is, with Information subnet-related social relations SG 0 .

进一步的,在本实施例中,从豆瓣音乐网站中获取音乐网络数据信息,提取网络实体,包括歌曲(M),歌手(S),作词者(A),流派(T),用户(U),分组(G),这些数据中存在关系有:歌曲-歌手之间的被演唱与演唱关系、歌曲-作词者之间的被作词与作词关系、歌曲-流派之间的被包含与包含关系、用户-歌曲之间的播放与被播放关系、分组-用户之间的包含与被包含关系,从而生成音乐信息网络,音乐信息网络例子如图2所示。Further, in this embodiment, music network data information is obtained from the Douban Music website, and network entities are extracted, including song (M), singer (S), lyricist (A), genre (T), user (U) , grouping (G), there are relationships in these data: sung and sung relationship between song-singer, lyric and lyric relationship between song-lyricist, included and included relationship between song-genre, The relationship of playing and being played between users and songs, and the relationship between grouping and containing and being included between users, so as to generate a music information network. An example of the music information network is shown in FIG. 2 .

进一步的,在本实施例中,音乐信息网络中信息子网的网络模式中有节点歌曲(M),歌手(S),作词者(A),流派(T),边代表节点之间的关系,分别为歌曲-歌手之间的被演唱与演唱关系、歌曲-作词者之间的被作词与作词关系、歌曲-流派之间的被包含与包含关系,信息子网的网络模式如图3所示。而与信息子网相关的社交关系是指用户-歌曲之间的播放与被播放关系。Further, in this embodiment, there are node songs (M), singers (S), lyricists (A), genres (T) in the network mode of the information subnet in the music information network, and edges represent the relationship between nodes. , which are the relationship between being sung and singing between songs and singers, the relationship between being written and writing lyrics between songs and lyricists, and the relationship between being included and included between songs and genres. The network mode of the information subnet is shown in Figure 3. Show. The social relationship related to the information subnet refers to the relationship between user and song being played and played.

步骤S2:根据信息子网的网络模式,计算信息子网中不同类型关系边的平均边介数,并根据平均边介数计算随机游走过程中不同类型关系边的传播因子。Step S2: Calculate the average edge betweenness of different types of relation edges in the information subnet according to the network mode of the information subnet, and calculate the propagation factors of different types of relation edges in the random walk process according to the average edge betweenness.

进一步的,基于信息子网的网络模式,计算如下六种类型边的传播因子,即,演唱关系、被演唱关系、作词关系、被作词关系、包含关系、被包含关系,计算方法如下:Further, based on the network mode of the information subnet, the propagation factors of the following six types of edges are calculated, namely, the singing relationship, the singing relationship, the lyric relationship, the lyric writing relationship, the inclusion relationship, and the included relationship. The calculation method is as follows:

步骤S21.获取信息子网G0Step S21. Obtain the information subnet G 0 ;

步骤S22.计算出信息子网G0中被演唱关系、被作词关系、被包含关系边的边介数,并统计出这三种类型边的平均边介数的比值ems:ema:emtStep S22. Calculate the edge betweenness of the sung relationship, the lyric relationship, and the included relationship edge in the information subnet G 0 , and count the ratio of the average edge betweenness of these three types of edges ems :e ma :e mt ;

步骤S23.根据λmsmamt=ems:ema:emt和λmsmamt=1计算出这三种类型边的传播因子λmsmamt,而演唱关系边、作词关系边和包含关系边的传播因子都为1,即λsm=1,λam=1,λtm=1。Step S23. According to λ ms : λ ma : λ mt =e ms :e ma :e mt and λ msmamt =1, calculate the propagation factors λ ms , λ ma , λ mt of these three types of edges , and the propagation factors of the singing relation side, the lyric writing relation side and the inclusion relation side are all 1, that is, λ sm =1, λ am =1, λ tm =1.

如图4所示,在本实施例中,ems:ema:emt=9.5332:7.5288:8.7495,λms=0.3693,λma=0.2917,λmt=0.3390。As shown in FIG. 4 , in this embodiment, e ms :e ma :e mt =9.5332:7.5288:8.7495, λ ms =0.3693, λ ma =0.2917, λ mt =0.3390.

步骤S3:基于信息子网的网络模式,并根据社交关系,设计出在社交关系影响下的信息子网G0中各种类型边权重,具体计算公式如下:Step S3: Designing various types of edge weights in the information subnet G 0 under the influence of the social relationship based on the network mode of the information subnet and according to the social relationship, and the specific calculation formula is as follows:

步骤S31:歌曲-流派之间的被包含关系边的权重计算公式:Nm T表示歌曲m所属于的流派个数,Nm U表示播放歌曲m的用户人数,P(t)表示流派t包含的歌曲列表;Step S31: The weight calculation formula of the included relation edge between songs and genres: N m T represents the number of genres to which the song m belongs, N m U represents the number of users who play the song m, and P(t) represents the list of songs included in the genre t;

步骤S32:流派-歌曲之间的包含关系边的权重计算公式:Nt M表示流派t包含的歌曲数;Step S32: The formula for calculating the weight of the inclusion relationship edge between genres and songs: N t M represents the number of songs contained in genre t;

步骤S33:歌曲-歌手之间的被演唱关系边的权重计算公式:Nm S表示歌曲m的歌手个数,Nm U表示播放歌曲m的用户人数,P(s)表示歌手s演唱的歌曲列表;Step S33: The weight calculation formula of the sung relationship edge between the song and the singer: N m S represents the number of singers of song m, N m U represents the number of users who play song m, and P(s) represents the list of songs sung by singer s;

步骤S34:歌手-歌曲之间的演唱关系边的权重计算公式:Ns M表示歌手s演唱的歌曲数;Step S34: The weight calculation formula of the singing relationship edge between singer-song: N s M represents the number of songs sung by singer s;

步骤S35:歌曲-作词者之间的被作词关系边的权重计算公式:Nm A表示歌曲m的作词者个数,Nm U表示播放歌曲m的用户人数,P(a)表示作词者a作词的歌曲列表;Step S35: the calculation formula of the weight of the lyric relationship edge between the song and the lyricist: N m A represents the number of lyricists of song m, N m U represents the number of users who play song m, and P(a) represents the list of songs written by lyricist a;

步骤S36:作词者-歌曲之间的作词关系边的权重计算公式:Na M表示作词者a作词的歌曲数;Step S36: The weight calculation formula of the lyric relationship edge between the lyricist and the song: N a M represents the number of songs written by lyricist a;

步骤S4:结合传播因子与边权重来在信息子网中进行随机游走,得到歌曲、歌手、作词者、类型这四种类型节点的影响力排序,具体计算方法如下:Step S4: Combine the propagation factor and the edge weight to perform a random walk in the information subnet, and obtain the influence ranking of the four types of nodes: song, singer, lyricist, and genre. The specific calculation method is as follows:

步骤S41.将网络中歌曲-歌曲之间边权重Wmm、歌手-歌手之间边权重Wss、作词者-作词者之间边权重Waa、流派-流派之间边权重Wtt、歌手-作词者之间边权重Wsa、作词者-歌手之间边权重Was、歌手-流派之间边权重Wst、流派-歌手之间边权重Wts、作词者-流派之间边权重Wat、流派-作词者之间边权重Wta均设置为0,在本实施例中,Wmm是6×6的零矩阵、Wss是4×4的零矩阵、Waa是2×2的零矩阵、Wtt是3×3的零矩阵、Wsa是4×2的零矩阵、Was是2×4的零矩阵、Wst是4×3的零矩阵、Wts是3×4的零矩阵、Wat是2×3的零矩阵、Wta是3×2的零矩阵;将歌曲-歌曲之间传播因子λmm、歌手-歌手之间传播因子λss、作词者-作词者之间传播因子λaa、流派-流派之间传播因子λtt、歌手-作词者之间传播因子λsa、作词者-歌手之间传播因子λas、歌手-流派之间传播因子λst、流派-歌手之间传播因子λts、作词者-流派之间传播因子λat、流派-作词者之间传播因子λta均设置为0,即λmm=λss=λaa=λtt=λsa=λas=λst=λts=λat=λta=0;Step S41. Put the song-song edge weight W mm in the network, the singer-singer edge weight W ss , the lyricist-lyricist edge weight W aa , the genre-genre edge weight W tt , the singer- Edge weight between lyricists W sa , edge weight between lyricist-singer W as , edge weight between singer-genre W st , edge weight between genre-singer W ts , edge weight between lyricist-genre W at , The edge weight W ta between genres and lyricists is set to 0. In this embodiment, W mm is a 6×6 zero matrix, W ss is a 4×4 zero matrix, and W aa is a 2×2 zero matrix Matrix, W tt is a 3×3 zero matrix, W sa is a 4×2 zero matrix, W as is a 2×4 zero matrix, W st is a 4×3 zero matrix, W ts is a 3×4 zero matrix matrix, W at is a 2×3 zero matrix, W ta is a 3×2 zero matrix; the song-song propagation factor λ mm , the singer-singer propagation factor λ ss , the lyricist-lyricist Propagation factor λ aa , genre-genre propagation factor λ tt , singer-songwriter propagation factor λ sa , lyricist-singer propagation factor λ as , singer-genre propagation factor λ st , genre-singer The propagation factor λ ts between lyricists, the propagation factor λ at between lyricists and genres, and the propagation factor λ ta between genres and lyricists are all set to 0, that is, λ mm = λ ss = λ aa = λ tt = λ sa = λ assttsatta =0;

步骤S42.在信息子网中,节点进行随机游走需要考虑两点,即,下一步随机游走应选择何种类型节点以及应选择该类型节点集中具体哪一个节点,实际上前者就是各种类型边的传播因子,后者是各种类型边权重,因此,从节点i到节点j的转移概率,就是将这两者进行乘积,即转移概率矩阵TPM计算公式如下:Step S42. In the information subnet, two points need to be considered when a node performs a random walk, that is, what type of node should be selected in the next random walk and which specific node in the set of nodes of this type should be selected. In fact, the former is a variety of The propagation factor of type edges, the latter is the weight of various types of edges, therefore, the transition probability from node i to node j is the product of these two, that is, the calculation formula of transition probability matrix TPM is as follows:

步骤S43.设定两个长度为N的向量Vec_C,Vec_R,Vec_R中的值为信息子网中每个节点的影响力值,Vec_C初始为通过公式计算Vec_R;在本实施例中,N=15;Step S43. Set two vectors of length N, Vec_C, Vec_R, and the values in Vec_R are the influence values of each node in the information subnet, and Vec_C is initially pass The formula calculates Vec_R; in this embodiment, N=15;

步骤S44.通过以下两个公式计算Vec_C和Vec_RStep S44. Calculate Vec_C and Vec_R by the following two formulas

Vec_C=Vec_RVec_C=Vec_R

当||Vec_R-Vec_C||≥ξ时继续以上两个公式的计算,否则得到Vec_R,ξ为事先设定误差阈值;When ||Vec_R-Vec_C||≥ξ, continue the calculation of the above two formulas, otherwise Vec_R is obtained, and ξ is the pre-set error threshold;

步骤S45.分别对Vec_R中歌曲、歌手、作词者和流派值进行排序,得到歌曲、歌手、作词者和流派这四种类型节点的排序结果序列,得到歌曲流行度,如表1所示。Step S45. Sort songs, singers, lyricists and genre values in Vec_R respectively, obtain the sorting result sequence of the four types of nodes of song, singer, lyricist and genre, and obtain song popularity, as shown in Table 1.

表1Table 1

以上是本发明的较佳实施例,凡依本发明技术方案所作的改变,所产生的功能作用未超出本发明技术方案的范围时,均属于本发明的保护范围。The above are the preferred embodiments of the present invention, all changes made according to the technical solutions of the present invention, when the resulting functional effects do not exceed the scope of the technical solutions of the present invention, belong to the protection scope of the present invention.

Claims (2)

1. A song popularity analysis method under the influence of social relations in a music information network is characterized by comprising the following steps: the method comprises the following steps:
step S1: acquiring a music information network, extracting an information subnet and a network mode of the information subnet without the influence of social relations from the music information network, and extracting the social relations related to the information subnet from the music information network;
step S2: calculating the average edge betweenness of different types of relation edges in the information subnet according to the network mode of the information subnet, and calculating the propagation factors of the different types of relation edges in the random walk process according to the average edge betweenness;
step S3: calculating various types of side weights in the information subnet under the influence of the social relationship according to the social relationship;
step S4: combining the propagation factors and various types of edge weights to carry out random walk in the information subnet, and calculating various types of node influence sequencing under the influence of social relations, thereby obtaining song popularity;
in step S1, after removing the social relationship between the users and the playing relationship and played relationship between the users and the songs in the music information network, an information subnet G related to the songs only and without the influence of the social relationship is obtained0The information subnet G0The network mode of (1) has four types of nodes, which are respectively: song, singer, genre, and word writer;
the following type relationships exist among four types of nodes in the information sub-network: the relation between singers and songs, the relation between word makers and songs, and the relation between inclusion and inclusion between genres and songs; and extracting the playing relation between the user and the song from the music information network as the social relation SG related to the information subnet0
In step S2, based on the network model of the information subnet, propagation factors of the following six types of edges, i.e., singing relation, word-making relation, inclusion relation, and inclusion relation, are calculated according to the following steps:
step S21: obtaining the information subnet G0
Step S22: calculating said information subnet G0The edge betweenness of the sung relation edge, the word relation edge and the included relation edge, and calculating the ratio e of the average edge betweenness of the three types of edgesms:ema:emt
Step S23: according to λmsmamt=ems:ema:emtAnd λmsmamtCalculating the propagation factor lambda of the three types of edges as 1msmamtAnd the propagation factors of the singing relation edge, the word making relation edge and the inclusion relation edge are all 1, namely lambdasm=1,λam=1,λtm=1;
In step S3, the calculation procedure of each type edge weight is as follows:
step S31: the weights of the contained relationship edges between song-genres are calculated as follows:
wherein N ism TIndicates the number of genres to which song m belongs, Nm URepresents the number of users playing song m, and p (t) represents a song list contained in genre t;
step S32: genre-weight of inclusion relation edges between songs is calculated as follows:
wherein N ist MRepresents the number of songs included in genre t;
step S33: the weight of the sung relationship edge between song-singer is calculated as follows:
wherein N ism SIndicates the number of singers of Song m, Nm UA list of songs representing the number of users playing song m, p(s) representing singers singing;
step S34: the weights of the singer-song singing relationship edges are calculated as follows:
wherein N iss MRepresents the number of songs sung by singers;
step S35: the weights of the wordled relationship edges between song-worders are calculated as follows:
wherein N ism ANumber of word makers of Song m, Nm UA song list representing the number of users playing song m, and p (a) representing the word writer a;
step S36: the weights of the wordwise relationship edges between the worders-songs are calculated as follows:
wherein N isa MIndicating the number of songs the word writer a makes.
2. The method for analyzing popularity of songs under the influence of social relationships in a music information network as claimed in claim 1, wherein said step S4 further comprises the steps of:
step S41: weighting W between songs in the information subnetmmThe side weight W between the singer and the singerssThe edge weight W between the word writer and the word writeraaGenre-genre boundary weight WttInter-singer-word-maker side weight WsaSide weight W between speaker and singerasInter-artist-genre weight WstGenre-singer side weight WtsEdge weights W between speaker-genresatSide weight W between genre and authortaAre all set to 0; transmitting a song-song propagation factor lambdammSinger-singer propagation factor lambdassPropagation factor lambda between word makers and word makersaaGenre-genre spread factor lambdattPropagation factor lambda between singer and word writersaPropagation factor lambda between speaker and singerasSinger-genre transmission factor lambdastGenre-singer propagation factor lambdatsInterwriter-genre propagation factor lambdaatGenre-to-writer propagation factor lambdataAre all set to 0, i.e. λmm=λss=λaa=λtt=λsa=λas=λst=λts=λat=λta=0;
Step S42: in the information subnet, a transition probability matrix TPM for random walk among nodes is calculated as follows:
step S43: setting two vectors Vec _ C and Vec _ R with the length of N; the value in Vec _ R is the influence value of each node in the information subnet, and Vec _ C is initiallyAnd Vec _ R is calculated as follows:
wherein epsilon is the whole-graph random jump probability, and N is the total number of nodes in the information subnet;
step S44: vec _ C and Vec _ R are calculated by the following two formulas:
Vec_C=Vec_R,
when | | | Vec _ R-Vec _ C | | | | is equal to or greater than ξ, continuing the calculation of the two formulas, otherwise, obtaining Vec _ R, wherein ξ is a preset error threshold;
step S45: and respectively sequencing the songs, singers, word makers and genre values of the type nodes in the Vec _ R to obtain sequencing result sequences of the songs, the singers, the word makers and the genres, and further obtain the song popularity.
CN201610264331.5A 2016-04-26 2016-04-26 Song popularity analysis method in music information network under the influence of social networks Expired - Fee Related CN105956040B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610264331.5A CN105956040B (en) 2016-04-26 2016-04-26 Song popularity analysis method in music information network under the influence of social networks

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610264331.5A CN105956040B (en) 2016-04-26 2016-04-26 Song popularity analysis method in music information network under the influence of social networks

Publications (2)

Publication Number Publication Date
CN105956040A CN105956040A (en) 2016-09-21
CN105956040B true CN105956040B (en) 2019-06-07

Family

ID=56915385

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610264331.5A Expired - Fee Related CN105956040B (en) 2016-04-26 2016-04-26 Song popularity analysis method in music information network under the influence of social networks

Country Status (1)

Country Link
CN (1) CN105956040B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108009933B (en) * 2016-10-27 2021-06-11 中国科学技术大学先进技术研究院 Graph centrality calculation method and device
CN107133288B (en) * 2017-04-19 2019-12-17 福州大学 Personalized recommendation method in music information network
CN108880846B (en) * 2017-05-16 2020-10-09 清华大学 Method and apparatus for determining a vector representation for a node in a network
CN116010598A (en) * 2022-12-23 2023-04-25 中国电信股份有限公司 Song emotion classification method, device, electronic equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104133843A (en) * 2014-06-25 2014-11-05 福州大学 Academic influence cooperative sequencing method of nodes in scientific and technical literature heterogeneous network
CN104573105A (en) * 2015-01-30 2015-04-29 福州大学 Method of recommending hit songs and singers in music on-demand network
CN104657488A (en) * 2015-03-05 2015-05-27 中南大学 Method for calculating author influence based on citation propagation network

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9323836B2 (en) * 2008-02-11 2016-04-26 Popular Metrics, Inc. Internet based method and system for ranking artists using a popularity profile

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104133843A (en) * 2014-06-25 2014-11-05 福州大学 Academic influence cooperative sequencing method of nodes in scientific and technical literature heterogeneous network
CN104573105A (en) * 2015-01-30 2015-04-29 福州大学 Method of recommending hit songs and singers in music on-demand network
CN104657488A (en) * 2015-03-05 2015-05-27 中南大学 Method for calculating author influence based on citation propagation network

Also Published As

Publication number Publication date
CN105956040A (en) 2016-09-21

Similar Documents

Publication Publication Date Title
CN111143604B (en) Similarity matching method and device for audio frequency and storage medium
US20170300567A1 (en) Media content items sequencing
US8260778B2 (en) Mood based music recommendation method and system
Ragno et al. Inferring similarity between music objects with application to playlist generation
US10108708B2 (en) System and method of classifying, comparing and ordering songs in a playlist to smooth the overall playback and listening experience
CN105956040B (en) Song popularity analysis method in music information network under the influence of social networks
CN105161116B (en) The determination method and device of multimedia file climax segment
CN103455538B (en) Information processing unit, information processing method and program
CN106202502B (en) User interest discovery method in music information network
CN109992694A (en) A kind of music intelligent recommendation method and system
Sandag et al. Predictive models for popularity of solo and group singers in Spotify using decision tree
CN109471951A (en) Lyrics generation method, device, equipment and storage medium neural network based
JP2005166035A (en) Information retrieval apparatus, information retrieval method, program for information retrieval and information recording medium
CN106528653A (en) Context-aware music recommendation method based on graph embedding model
CN107133288B (en) Personalized recommendation method in music information network
CN108932308B (en) A Song Recommendation Method Based on Hybrid Recommendation Algorithm
CN106484833A (en) A kind of source of sound screening technique and electronic equipment
Velankar et al. Modeling melody similarity using music synthesis and perception
Caronongan III et al. Predicting chart difficulty in rhythm games through classification using chart pattern derived attributes
Yang et al. Improving Musical Concept Detection by Ordinal Regression and Context Fusion.
Zhang et al. Musical preference in an online music community in China
Mahajan et al. Prediction of Yahoo! Music Sequences on user’s musical taste
Zhang et al. A Study of Music Genres Based on Social Networks and Statistical Methods
Barina et al. MuSeNet: natural patterns in the music artists industry
WANG et al. A Study on the Evolution and Influence of Music from the Perspective of Data in the Last Hundred Years

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20190607