[go: up one dir, main page]

CN113096671B - A method and system for reversible information hiding of large-capacity audio files - Google Patents

A method and system for reversible information hiding of large-capacity audio files Download PDF

Info

Publication number
CN113096671B
CN113096671B CN202010022517.6A CN202010022517A CN113096671B CN 113096671 B CN113096671 B CN 113096671B CN 202010022517 A CN202010022517 A CN 202010022517A CN 113096671 B CN113096671 B CN 113096671B
Authority
CN
China
Prior art keywords
audio
point
information
embedded
embedding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010022517.6A
Other languages
Chinese (zh)
Other versions
CN113096671A (en
Inventor
马宾
侯金程
徐健
王春鹏
李健
杨美红
吴晓明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qilu University of Technology
Original Assignee
Qilu University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qilu University of Technology filed Critical Qilu University of Technology
Priority to CN202010022517.6A priority Critical patent/CN113096671B/en
Publication of CN113096671A publication Critical patent/CN113096671A/en
Application granted granted Critical
Publication of CN113096671B publication Critical patent/CN113096671B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005Correction of errors induced by the transmission channel, if related to the coding algorithm

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)

Abstract

The invention discloses a reversible information hiding method and a reversible information hiding system for a high-capacity audio file, which comprise an embedding process: dividing different audio points of the initial file into a cross set and a point set according to positions; respectively constructing a first cross set prediction error matrix and a first point set prediction error matrix by using local consistency characteristics between adjacent audio frequency points and adopting a target audio point amplitude prediction algorithm; dividing data to be hidden into two equal halves, a first part being embedded in a cross set and a second part being embedded in a dot set; reversibly constructing a cross set embedded audio sequence and a point set embedded audio sequence according to an error prediction algorithm; and finally combining to form the secret audio file. The invention embeds the secret information into the audio carrier file for many times based on the characteristic that the extension sequences are mutually orthogonal, most elements of different extension sequences are mutually offset in the information embedding process, and the audio fidelity capability of the file is improved while the embedding capacity of the reversible information of the carrier audio is increased.

Description

一种大容量音频文件可逆信息隐藏方法及系统A method and system for reversible information hiding of large-capacity audio files

技术领域technical field

本发明涉及信息隐藏技术领域,尤其涉及一种大容量音频文件可逆信息隐藏方法及系统。The invention relates to the technical field of information hiding, in particular to a reversible information hiding method and system for large-capacity audio files.

背景技术Background technique

本部分的陈述仅仅是提供了与本发明相关的背景技术信息,不必然构成在先技术。The statements in this section merely provide background information related to the present invention and do not necessarily constitute prior art.

信息隐藏通过将秘密信息嵌入到载体当中实现数字隐写、秘密通信、版权保护等功能。可逆信息隐藏是其中一个重要的分支,在保障完整提取所嵌隐秘信息的同时,还可以无损的恢复原始载体信号。因而,在军事、医疗等对原始图像或音视频文件要求较高的领域,基于可逆信息隐藏的版权认证与隐蔽通信算法具有广泛的应用需求。可逆信息隐藏有两个标准,即嵌入容量和失真率,优秀的可逆信息隐藏算法要求在较高的信息嵌入容量的同时,还需要保持载体文件具有较低的失真度。Information hiding realizes functions such as digital steganography, secret communication, and copyright protection by embedding secret information into the carrier. Reversible information hiding is one of the important branches. While ensuring the complete extraction of the embedded secret information, it can also restore the original carrier signal losslessly. Therefore, in military, medical and other fields that require high original images or audio and video files, copyright authentication and covert communication algorithms based on reversible information hiding have a wide range of application requirements. There are two standards for reversible information hiding, namely, the embedding capacity and the distortion rate. An excellent reversible information hiding algorithm requires a higher information embedding capacity while keeping the carrier file with a lower degree of distortion.

近年来随着数字化社会的推进,海量的音频数据在网络上传播,音频信息隐藏技术的发展也越来越成熟。秘密信息通常以比特流的形式嵌入到载体样本的LSB(最低有效位)上,载体音频的失真在感知上几乎可以忽略不计,而通过可逆信息隐藏可以实现载体音频数据的零失真。In recent years, with the advancement of the digital society, massive audio data has been spread on the Internet, and the development of audio information hiding technology has become more and more mature. The secret information is usually embedded in the LSB (least significant bit) of the carrier sample in the form of a bit stream, the distortion of the carrier audio is almost negligible perceptually, and the zero distortion of the carrier audio data can be achieved by reversible information hiding.

可逆信息隐藏(RDH)算法近年来有了很大的发展,但是,可逆信息隐藏误差预测与信息嵌入算法越来越复杂,计算代价越来越高。同时,在大多数情况下,随着有效载荷的增加,音频质量迅速下降。The reversible information hiding (RDH) algorithm has been greatly developed in recent years, but the reversible information hiding error prediction and information embedding algorithms are becoming more and more complex, and the computational cost is getting higher and higher. At the same time, in most cases, audio quality degrades rapidly as the payload increases.

发明内容SUMMARY OF THE INVENTION

为了解决上述问题,本发明提出了一种大容量音频文件可逆信息隐藏方法及系统,根据码分多址(CDMA)的原理,采用不同的正交序列表示待嵌入信息并嵌入到载体音频中;基于正交向量的线性无关特性,隐秘信息可以重叠嵌入而不相互干扰,从而可以实现大容量信息的嵌入。同时,由于不同扩展序列的大部分元素在重叠嵌入时相互抵消,在实现高容量可逆信息嵌入的同时降低音频文件失真,从而有效提升音频文件可逆信息隐藏性能。In order to solve the above problems, the present invention proposes a reversible information hiding method and system for large-capacity audio files. According to the principle of Code Division Multiple Access (CDMA), different orthogonal sequences are used to represent the information to be embedded and embedded in the carrier audio; Based on the linear independence of orthogonal vectors, covert information can be overlapped and embedded without interfering with each other, thus enabling the embedding of large-capacity information. At the same time, since most elements of different extension sequences cancel each other during overlapping embedding, high-capacity reversible information embedding is achieved while reducing audio file distortion, thereby effectively improving the reversible information hiding performance of audio files.

在一些实施方式中,采用如下技术方案:In some embodiments, the following technical solutions are adopted:

一种大容量音频文件可逆信息隐藏方法,包括嵌入过程:A reversible information hiding method for large-capacity audio files, including the embedding process:

将初始文件不同音频点按照位置分为叉集合和点集合;Divide the different audio points of the initial file into fork sets and point sets according to their positions;

利用相邻音频点间的局部一致性特征,采用目标音频点幅值预测算法,分别构建第一叉集合预测误差矩阵和第一点集合预测误差矩阵;Using the local consistency feature between adjacent audio points, the target audio point amplitude prediction algorithm is used to construct the first fork set prediction error matrix and the first point set prediction error matrix respectively;

将待隐藏的数据被分成相等的两半,第一部分被嵌入到叉集中,第二部分被嵌入到点集中;The data to be hidden is divided into two equal halves, the first part is embedded in the fork set, and the second part is embedded in the point set;

按照误差预测算法可逆的构建叉集合嵌密音频序列和点集合嵌密音频序列;最后合并形成载密音频文件。According to the error prediction algorithm, the fork-set ciphered audio sequence and the point-set ciphered audio sequence are reversibly constructed; finally, the ciphered audio file is formed by merging.

进一步地,还包括提取过程:Further, the extraction process is also included:

将载密音频文件分为叉集合和点集合;Divide the encrypted audio files into fork sets and point sets;

根据目标误差预测算法,分别构建第二点集合预测误差矩阵和第二叉集合预测误差矩阵;According to the target error prediction algorithm, construct the second point set prediction error matrix and the second fork set prediction error matrix respectively;

分别恢复点集合预测误差矩阵和叉集合预测误差矩阵中嵌入的附件信息,按照码分多址可逆信息嵌入算法提取误差矩阵中嵌入的秘密信息,并恢复目标音频点原始幅值;Recover the attachment information embedded in the point set prediction error matrix and the cross set prediction error matrix respectively, extract the secret information embedded in the error matrix according to the code division multiple access reversible information embedding algorithm, and restore the original amplitude of the target audio point;

将提取的秘密信息相连接重建隐藏的秘密信息序列,并采用恢复的目标音频点原始幅值矩阵无损恢复原始音频文件。The extracted secret information is connected to reconstruct the hidden secret information sequence, and the original audio file is restored losslessly by using the original amplitude matrix of the restored target audio point.

在另一些实施方式中,采用如下技术方案:In other embodiments, the following technical solutions are adopted:

一种大容量音频文件可逆信息隐藏系统,包括:A reversible information hiding system for large-capacity audio files, comprising:

用于将初始文件中不同音频点按照位置分为叉集合和点集合的装置;A device for dividing different audio points in the initial file into fork sets and point sets according to their positions;

用于利用相邻音频点间的局部一致性特征,采用目标音频点幅值预测算法,分别构建第一叉集合预测误差矩阵和第一点集合预测误差矩阵的装置;A device for constructing a first fork set prediction error matrix and a first point set prediction error matrix respectively by utilizing the local consistency feature between adjacent audio points and adopting the target audio point amplitude prediction algorithm;

用于将待隐藏的数据被分成相等的两半,第一部分被嵌入到叉集中,第二部分被嵌入到点集中的装置;A means for dividing the data to be hidden into equal halves, the first part is embedded in the fork set, and the second part is embedded in the point set;

用于按照误差预测算法可逆的构建叉集合嵌密音频序列和点集合嵌密音频序列;最后合并形成载密音频文件的装置。A device for reversibly constructing a fork-set embedded cipher audio sequence and a point-set cipher audio sequence according to an error prediction algorithm; and finally merging to form a ciphered audio file.

进一步地,还包括:Further, it also includes:

用于将载密音频文件分为叉集合和点集合的装置;means for dividing the encrypted audio file into fork sets and point sets;

用于根据目标误差预测算法,分别构建第二点集合预测误差矩阵和第二叉集合预测误差矩阵的装置;A device for constructing the second point set prediction error matrix and the second cross set prediction error matrix respectively according to the target error prediction algorithm;

用于分别恢复点集合预测误差矩阵和叉集合预测误差矩阵中嵌入的附件信息,按照码分多址可逆信息嵌入算法提取误差矩阵中嵌入的秘密信息,并恢复目标音频点原始幅值的装置;A device for restoring the attachment information embedded in the point set prediction error matrix and the cross set prediction error matrix respectively, extracting the secret information embedded in the error matrix according to the code division multiple access reversible information embedding algorithm, and restoring the original amplitude value of the target audio point;

用于将提取的秘密信息相连接重建隐藏的秘密信息序列,并采用恢复的目标音频点原始幅值矩阵无损恢复原始音频文件的装置。The device is used to connect the extracted secret information to reconstruct the hidden secret information sequence, and use the restored original amplitude matrix of the target audio point to restore the original audio file losslessly.

在另一些实施方式中,采用如下技术方案:In other embodiments, the following technical solutions are adopted:

一种终端设备,其包括处理器和计算机可读存储介质,处理器用于实现各指令;计算机可读存储介质用于存储多条指令,所述指令适于由处理器加载并执行上述的大容量音频文件可逆信息隐藏方法。A terminal device, which includes a processor and a computer-readable storage medium, where the processor is used to implement various instructions; the computer-readable storage medium is used to store a plurality of instructions, the instructions are suitable for being loaded by the processor and executing the above-mentioned large capacity A reversible information hiding method for audio files.

在另一些实施方式中,采用如下技术方案:In other embodiments, the following technical solutions are adopted:

一种计算机可读存储介质,其中存储有多条指令,所述指令适于由终端设备的处理器加载并执行上述的大容量音频文件可逆信息隐藏方法。A computer-readable storage medium stores a plurality of instructions, wherein the instructions are adapted to be loaded by a processor of a terminal device and execute the above-mentioned method for reversible information hiding of large-capacity audio files.

与现有技术相比,本发明的有益效果是:Compared with the prior art, the beneficial effects of the present invention are:

本发明基于码分多址的音频可逆信息隐藏方法,首先设计目标音频点预测算法,形成原始音频文件的预测误差矩阵,然后采用不同的正交扩展序列实现秘密数据的可逆信息隐藏;在解码的过程中,不仅可以无损的提取嵌入的机密信息,而且能够完全恢复原始音频。基于扩展序列拥有相互正交的特点,将秘密信息多次嵌入到音频载体文件中,不同的扩展序列的大部分元素在信息嵌入过程中相互抵消,在增加载体音频可逆信息嵌入容量的同时提高文件音频保真能力。The present invention is based on a code division multiple access audio reversible information hiding method, firstly designing a target audio point prediction algorithm to form a prediction error matrix of the original audio file, and then using different orthogonal expansion sequences to realize the reversible information hiding of secret data; In the process, not only can the embedded confidential information be extracted losslessly, but also the original audio can be completely restored. Based on the mutual orthogonality of the extension sequences, the secret information is embedded into the audio carrier file for many times, and most of the elements of different extension sequences cancel each other during the information embedding process, which increases the reversible information embedding capacity of the carrier audio and improves the file. Audio fidelity capability.

附图说明Description of drawings

图1(a)-(f)分别表示本发明实施例中6段测试音频预测误差直方图;Fig. 1(a)-(f) respectively represent 6-segment test audio prediction error histograms in the embodiment of the present invention;

图2是本发明实施例中大容量音频文件可逆信息隐藏方法流程图;2 is a flowchart of a method for reversible information hiding of large-capacity audio files in the embodiment of the present invention;

图3(a)-(f)分别表示本发明实施例中不同嵌入强度下6段音频文件嵌入率(BPP)与信噪比(SNR);3(a)-(f) respectively represent the embedding rate (BPP) and signal-to-noise ratio (SNR) of 6-segment audio files under different embedding strengths in the embodiment of the present invention;

图4(a)-(f)分别表示本发明实施例中不同长度扩展序列下6段音频文件嵌入率(BPP)与信噪比(SNR);4(a)-(f) respectively represent the embedding rate (BPP) and signal-to-noise ratio (SNR) of 6-segment audio files under extension sequences of different lengths in the embodiment of the present invention;

图5是本发明实施例中不同音频文件在4种嵌入率下的SNR值;Fig. 5 is the SNR value of different audio files under 4 kinds of embedding rates in the embodiment of the present invention;

图6(a)-(f)分别表示本发明实施例中针对数据库中6段代表性测试音频在不同嵌入率下的测试结果。Figures 6(a)-(f) respectively show the test results under different embedding rates for 6 representative test audios in the database in the embodiment of the present invention.

具体实施方式Detailed ways

应该指出,以下详细说明都是例示性的,旨在对本申请提供进一步的说明。除非另有指明,本发明使用的所有技术和科学术语具有与本申请所属技术领域的普通技术人员通常理解的相同含义。It should be noted that the following detailed description is exemplary and intended to provide further explanation of the application. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.

需要注意的是,这里所使用的术语仅是为了描述具体实施方式,而非意图限制根据本申请的示例性实施方式。如在这里所使用的,除非上下文另外明确指出,否则单数形式也意图包括复数形式,此外,还应当理解的是,当在本说明书中使用术语“包含”和/或“包括”时,其指明存在特征、步骤、操作、器件、组件和/或它们的组合。It should be noted that the terminology used herein is for the purpose of describing specific embodiments only, and is not intended to limit the exemplary embodiments according to the present application. As used herein, unless the context clearly dictates otherwise, the singular is intended to include the plural as well, furthermore, it is to be understood that when the terms "comprising" and/or "including" are used in this specification, it indicates that There are features, steps, operations, devices, components and/or combinations thereof.

在不冲突的情况下,本发明中的实施例及实施例中的特征可以相互组合。Embodiments of the invention and features of the embodiments may be combined with each other without conflict.

实施例一Example 1

在一个或多个实施方式中,公开了一种大容量音频文件可逆信息隐藏方法,包括嵌入过程和提取过程,其中,嵌入过程具体包括:In one or more embodiments, a reversible information hiding method for large-capacity audio files is disclosed, including an embedding process and an extraction process, wherein the embedding process specifically includes:

将初始文件中不同音频点按照位置分为叉集合和点集合;Divide different audio points in the initial file into fork sets and point sets according to their positions;

利用相邻音频点间的局部一致性特征,采用目标音频点幅值预测算法,分别构建第一叉集合预测误差矩阵和第一点集合预测误差矩阵;Using the local consistency feature between adjacent audio points, the target audio point amplitude prediction algorithm is used to construct the first fork set prediction error matrix and the first point set prediction error matrix respectively;

将待隐藏的数据被分成相等的两半,第一部分被嵌入到叉集中,第二部分被嵌入到点集中;The data to be hidden is divided into two equal halves, the first part is embedded in the fork set, and the second part is embedded in the point set;

按照误差预测算法可逆的构建叉集合嵌密音频序列和点集合嵌密音频序列;最后合并形成载密音频文件。According to the error prediction algorithm, the fork-set ciphered audio sequence and the point-set ciphered audio sequence are reversibly constructed; finally, the ciphered audio file is formed by merging.

提取过程具体包括:The extraction process specifically includes:

将载密音频文件分为叉集合和点集合;Divide the encrypted audio files into fork sets and point sets;

根据目标误差预测算法,分别构建第二点集合预测误差矩阵和第二叉集合预测误差矩阵;According to the target error prediction algorithm, construct the second point set prediction error matrix and the second fork set prediction error matrix respectively;

分别恢复点集合预测误差矩阵和叉集合预测误差矩阵中嵌入的附件信息,按照码分多址可逆信息嵌入算法提取误差矩阵中嵌入的秘密信息,并恢复目标音频点原始幅值;Recover the attachment information embedded in the point set prediction error matrix and the cross set prediction error matrix respectively, extract the secret information embedded in the error matrix according to the code division multiple access reversible information embedding algorithm, and restore the original amplitude of the target audio point;

将提取的秘密信息相连接重建隐藏的秘密信息序列,并采用恢复的目标音频点原始幅值矩阵无损恢复原始音频文件。The extracted secret information is connected to reconstruct the hidden secret information sequence, and the original audio file is restored losslessly by using the original amplitude matrix of the restored target audio point.

下面对本实施例公开的方法进行详细说明。The method disclosed in this embodiment will be described in detail below.

1、基于码分多址(CDMA)的音频可逆信息隐藏1. Audio Reversible Information Hiding Based on Code Division Multiple Access (CDMA)

音频信号是一种典型的一维非平稳信号,其自相关函数和均值函数会随时间的变化而变化,信号具有缓变性的特征。在某些时段中音频信号表现出随机噪声的特性,在另一些时间段又会表现出周期信号的特性,或表现为两者的混合特性。在实际的处理过程中一般首先对音频信号进行逐段的分割,然后对分段对样本进行处理。根据音频信号连续相关性特性,将可逆信息隐藏与码分多址相结合,充分利用相邻音频样本间的一致相关性,将水印信息嵌入到样本向量中,可实现音频文件的可逆信息隐藏。Audio signal is a typical one-dimensional non-stationary signal, its autocorrelation function and mean function will change with time, and the signal has the characteristics of gradual change. The audio signal exhibits random noise characteristics in some time periods, periodic signal characteristics in other time periods, or a mixture of the two. In the actual processing process, the audio signal is generally first segmented segment by segment, and then the segment samples are processed. According to the continuous correlation characteristics of audio signals, the reversible information hiding and code division multiple access are combined, and the consistent correlation between adjacent audio samples is fully utilized, and the watermark information is embedded in the sample vector, which can realize the reversible information hiding of audio files.

码分多址(CDMA)是一种频谱扩展算法,用于无线通信中的信息安全传输和信道复用。基于CDMA的通信系统被认为是一种非常安全的通信系统,它通过一个预定的正交扩展序列载波待发送信息,将被传输的信号嵌入到不同的正交扩展序列当中进行传输,只有采用相同扩展序列才可以正确解码和提取被传输的秘密信息,因而系统拥有非常高的安全性;并且基于码分多址的扩频传输特性,秘密信息可以在相同信道传播且互不干扰,极大地节省了频率资源。Code Division Multiple Access (CDMA) is a spectrum spreading algorithm used for secure information transmission and channel multiplexing in wireless communications. The CDMA-based communication system is considered to be a very secure communication system. It transmits the information to be sent through a predetermined orthogonal spreading sequence carrier, and embeds the transmitted signal into different orthogonal spreading sequences for transmission. Only the spread sequence can correctly decode and extract the transmitted secret information, so the system has very high security; and based on the spread spectrum transmission characteristics of code division multiple access, the secret information can be transmitted in the same channel without interfering with each other, which greatly saves money frequency resources.

由于可逆信息隐藏系统基于载体音频实现秘密信息的传递,基于码分多址的可逆信息隐藏系统可以类比为一个隐蔽通信系统,其中,载体为公开传输信道,秘密信息为编码传输的内容。在基于码分多址的可逆信息隐藏系统中,一般采用Walsh Hardmard矩阵的行(列)来产生正交扩展序列,正交序列的元素由“1”或“-1”组成,由于每个序列中“1”和“-1”的数量相等,因而还具有零均值的特点。在基于码分多址的可逆信息隐藏传输过程中,为发送方分配正交扩展序列,二进制位“1”用序列本身表示,二进制位“0”代表对应的负序列,基于载秘序列的正交特性和0均值特征,可以实现秘密信息的叠加传送,同时,在信息叠加传送过程中,不同序列元素间产生相互抵消,保障基于码分多址的音频可逆信息隐藏系统取得更好的安全性和更高的可逆信息隐藏性能。Since the reversible information hiding system realizes the transmission of secret information based on the carrier audio, the reversible information hiding system based on code division multiple access can be analogous to a covert communication system, in which the carrier is the open transmission channel, and the secret information is the content of the encoded transmission. In the reversible information hiding system based on code division multiple access, the rows (columns) of the Walsh Hardmard matrix are generally used to generate orthogonal spreading sequences, and the elements of the orthogonal sequences are composed of "1" or "-1". The number of "1" and "-1" is equal, so it also has the characteristics of zero mean. In the process of reversible information hiding transmission based on code division multiple access, an orthogonal spreading sequence is allocated to the sender, the binary bit "1" is represented by the sequence itself, and the binary bit "0" represents the corresponding negative sequence. The intersection characteristic and 0-mean feature can realize the superposition transmission of secret information. At the same time, in the process of information superposition transmission, the elements of different sequences cancel each other, which ensures the audio reversible information hiding system based on code division multiple access to achieve better security. and higher reversible information hiding performance.

1.1嵌入信息1.1 Embedding information

设嵌入的秘密信息W=[b1,b2,b3,····,bk](bi∈{1,0},1≤i≤n},基于公式(1)可以将秘密信息bi转化为嵌入的水印信息ωiAssuming the embedded secret information W=[b 1 ,b 2 ,b 3 ,...,b k ]( bi ∈{1,0}, 1≤i≤n}, based on formula (1), the secret can be Information b i is transformed into embedded watermark information ω i .

ωi=-cosπbi (1)ω i = -cosπb i (1)

经过转化后的隐秘信息变为W=[ω123,····,ωk](ωi∈{1,-1},1≤i≤k},在Walsh Hadamard矩阵中选择k个行生成相互正交的扩展序列Si~Sk,构建扩展序列矩阵E:The transformed secret information becomes W=[ω 123 ,...,ω k ](ω i ∈{1,-1}, 1≤i≤k}, in the Walsh Hadamard matrix Select k rows to generate mutually orthogonal spreading sequences S i ~S k , and construct the spreading sequence matrix E:

Figure BDA0002361310310000071
Figure BDA0002361310310000071

令序列Si={ti1,ti2,ti3,····til}(1≤i≤k)的长度为l(偶数),其中“1”和“-1”的数量是相等的,扩展序列具有零均值和相互正交的特点。Let the length of the sequence S i ={t i1 ,t i2 ,t i3 ,...t il } (1≤i≤k) be l (even), where the number of "1" and "-1" is equal , the spreading sequence has the characteristics of zero mean and mutual orthogonality.

设A是一个长度为N的初始音频序列,选择A中目标样本构建载体向量矩阵X:Let A be an initial audio sequence of length N, select the target samples in A to construct a vector matrix X:

Figure BDA0002361310310000081
Figure BDA0002361310310000081

其中,载体向量Vj=[u1,u2,u3,….,ul](1≤j≤k)长度l与扩展序列Si的长度相同。按照公式(4)将水印信息嵌入到载体向量Vj中,嵌入信息后的向量矩阵

Figure BDA0002361310310000085
如式(5)所示。Wherein, the vector vector V j =[u 1 , u 2 , u 3 , . . . , u l ] (1≤j≤k), the length l is the same as the length of the extension sequence Si. According to formula (4), the watermark information is embedded into the carrier vector V j , and the vector matrix after embedding the information
Figure BDA0002361310310000085
As shown in formula (5).

Figure BDA0002361310310000082
Figure BDA0002361310310000082

Figure BDA0002361310310000083
Figure BDA0002361310310000083

公式(5)中的

Figure BDA0002361310310000084
是嵌入信息后的第j个载体向量,ω12,…,ωk是嵌入的秘密信息,S1,S2,…,Sk是正交扩展序列,k为嵌入信息位数,每个扩展序列都和一位嵌入信息相关。in formula (5)
Figure BDA0002361310310000084
is the j-th vector vector after embedding information, ω 1 , ω 2 ,…,ω k is the embedded secret information, S 1 , S 2 ,…,S k is the orthogonal expansion sequence, k is the number of bits of embedded information, Each spreading sequence is associated with a bit of embedded information.

当需要嵌入大量水印信息(例如n·k位)Wn=[ω11123,····,ω1k2122,····,ω2k,····ωn1n2,····,ωnk]时,可以将水印信息序列进行分段处理,按照WM所示的形式分组进行嵌入,如式(6)所示,同时,基于传播序列的正交性,还可以通过多级嵌入将转化后的水印信息序列Wn嵌入到同一个向量矩阵中X中实现大容量可逆信息嵌入,如式(7)所示。When a large amount of watermark information (such as n·k bits) needs to be embedded, W n =[ω 11123 ,····,ω 1k2122 ,····,ω 2k ,·· ·· ωn1 , ωn2 ,····, ωnk ], the watermark information sequence can be segmented and embedded in groups according to the form shown by W M , as shown in equation (6). At the same time, based on Due to the orthogonality of the propagation sequence, the transformed watermark information sequence W n can also be embedded into the same vector matrix X through multi-level embedding to achieve large-capacity reversible information embedding, as shown in equation (7).

Figure BDA0002361310310000091
Figure BDA0002361310310000091

Figure BDA0002361310310000092
Figure BDA0002361310310000092

其中E1~E1为每次嵌入对应的扩展序列矩阵,

Figure BDA0002361310310000093
为多次嵌入后的向量矩阵,n为嵌入的次数,n·k为嵌入信息的比特数,α为正整数,代表嵌入过程中的嵌入强度,α越大,可以嵌入的向量个数越多,嵌入容量越高,对样本的改变程度也会越大,相对应的也会带给音频较大的失真。where E 1 ~E 1 is the corresponding extended sequence matrix for each embedding,
Figure BDA0002361310310000093
is the vector matrix after multiple embeddings, n is the number of times of embedding, n·k is the number of bits of embedded information, α is a positive integer, representing the embedding strength in the embedding process, the larger the α, the more vectors that can be embedded , the higher the embedding capacity is, the greater the degree of change to the sample will be, and the corresponding audio will be distorted.

1.2提取信息1.2 Extract information

设嵌入信息后的向量矩阵为

Figure BDA0002361310310000094
嵌入的水印信息可以通过计算向量矩阵
Figure BDA0002361310310000095
和扩展序列矩阵E的内积提取出来,如式(8)所示:Let the vector matrix after embedding information be
Figure BDA0002361310310000094
The embedded watermark information can be calculated by the vector matrix
Figure BDA0002361310310000095
and the inner product of the extended sequence matrix E is extracted, as shown in formula (8):

Figure BDA0002361310310000096
Figure BDA0002361310310000096

当需要提取第i位秘密信息时,可以通过下式(9)进行计算When the ith secret information needs to be extracted, it can be calculated by the following formula (9)

Figure BDA0002361310310000097
Figure BDA0002361310310000097

如式(9)所示,α为正整数,

Figure BDA0002361310310000098
的计算结果也是正整数,所以表达式
Figure BDA0002361310310000099
的正负由ωi决定,由于ωi的取值只有1和-1两种可能,因此当满足条件
Figure BDA00023613103100000910
嵌入的信息可以通过公式(10)进行提取:As shown in formula (9), α is a positive integer,
Figure BDA0002361310310000098
The result of the calculation is also a positive integer, so the expression
Figure BDA0002361310310000099
The positive or negative of ω i is determined by ω i. Since there are only two possibilities for the value of ω i , 1 and -1, so when the condition is satisfied
Figure BDA00023613103100000910
The embedded information can be extracted by formula (10):

Figure BDA00023613103100000911
Figure BDA00023613103100000911

式(9)和(10)可以进一步转化得到公式(11)判断载体是否有信息嵌入。Formulas (9) and (10) can be further transformed to obtain formula (11) to determine whether the carrier has information embedded.

Figure BDA00023613103100000912
Figure BDA00023613103100000912

由以上分析可知,当满足公式(10)时,可以通过

Figure BDA00023613103100000913
将嵌入到载体向量Vj中的秘密信息ωi提取出来,因为扩展序列Si具有零均值的特点,
Figure BDA0002361310310000101
运算等同于计算载体向量Vj中相邻元素之间的差值的和。因此嵌入强度α的值越大、音频数据越平滑,可以嵌入到载体音频中的信息就越多。It can be seen from the above analysis that when formula (10) is satisfied, it can be obtained by
Figure BDA00023613103100000913
Extract the secret information ω i embedded in the vector vector V j , because the extended sequence Si has the characteristic of zero mean,
Figure BDA0002361310310000101
The operation is equivalent to computing the sum of the differences between adjacent elements in the vector vector Vj . Therefore, the larger the value of the embedding strength α, the smoother the audio data, and the more information that can be embedded in the carrier audio.

另一方面,如当

Figure BDA0002361310310000102
时,载体向量Vj不满足可逆信息嵌入条件,为了区分不可嵌入信息向量,可以将一个伪信息位嵌入到不满足条件的向量中,在解码阶段通过提取向量中的伪位,判断一个向量是否嵌入了水印信息,具体处理如表1所示:On the other hand, if
Figure BDA0002361310310000102
When the carrier vector V j does not meet the reversible information embedding condition, in order to distinguish the non-embeddable information vector, a pseudo-information bit can be embedded into the vector that does not meet the condition. In the decoding stage, the pseudo-bit in the vector is extracted to determine whether a vector is The watermark information is embedded, and the specific processing is shown in Table 1:

表1对于不满足条件的向量处理Table 1 for the vector processing that does not meet the conditions

Figure BDA0002361310310000103
Figure BDA0002361310310000103

1.3原始音频文件恢复1.3 Original Audio File Recovery

根据码分多址嵌入序列的正交特性,从嵌入信息后的载体提取原始水印信息后,可以通过公式(11)获取原始载体向量矩阵X,并重构初始载体音频。According to the orthogonal characteristic of the CDMA embedding sequence, after extracting the original watermark information from the embedded information carrier, the original carrier vector matrix X can be obtained by formula (11), and the original carrier audio can be reconstructed.

Figure BDA0002361310310000104
Figure BDA0002361310310000104

本算法利用嵌入向量正交特性实现多级信息叠加嵌入以增加载体音频可逆信息嵌入能力。由于嵌入向量Si中元素仅由“1”和“-1”组成,多数嵌入向量中的元素在多级信息嵌入过程中相互抵消,因而,即使在增大数据嵌入容量的情况下,本算法仍可以获得较高的听觉质量。This algorithm utilizes the orthogonal characteristic of embedding vector to realize multi-level information superposition embedding to increase the reversible information embedding capability of carrier audio. Since the elements in the embedding vector Si only consist of "1" and "-1", most of the elements in the embedding vector cancel each other out in the multi-level information embedding process. Therefore, even in the case of increasing the data embedding capacity, this algorithm can A high listening quality can still be obtained.

1.4CDMAA具体计算实例1.4 CDMAA specific calculation example

设音频载体样本A=[15,15,16,16,14,14,13,15],基于其构建向量矩阵

Figure BDA0002361310310000111
令扩展序列矩阵
Figure BDA0002361310310000112
嵌入强度α=1。待嵌入的秘密信息W=[1,0],根据公式(1)得出将秘密信息转化为对应位水印信息序列W=[1,-1]进行嵌入。在嵌入过程中,先计算
Figure BDA0002361310310000113
Figure BDA0002361310310000114
的值,满足条件
Figure BDA0002361310310000115
待嵌信息ω1=1,通过
Figure BDA0002361310310000116
嵌入到向量V1中,嵌入信息后的向量变为
Figure BDA0002361310310000117
同理,可将水印信息ω2=-1嵌入到向量V2中,修改后的向量矩阵
Figure BDA0002361310310000118
二次嵌入信息后的载体向量变为A′=[16,14,17,15,15,13,12,16]。当接收方获取嵌入信息的载体向量后,通过sign(<Vj·Si>)分别提取出嵌入到两个向量中的水印信息1和-1,并采用
Figure BDA0002361310310000119
计算出向量V,构建出原始向量矩阵
Figure BDA00023613103100001110
Figure BDA00023613103100001111
得到初始样本序列A=[15,15,16,16,14,14,13,15]。Set the audio carrier sample A = [15, 15, 16, 16, 14, 14, 13, 15], and construct a vector matrix based on it
Figure BDA0002361310310000111
Let the extended sequence matrix
Figure BDA0002361310310000112
Embedded strength α=1. The secret information to be embedded is W=[1,0], according to formula (1), it is obtained that the secret information is converted into the corresponding bit watermark information sequence W=[1,-1] for embedding. In the embedding process, first calculate
Figure BDA0002361310310000113
and
Figure BDA0002361310310000114
value, which satisfies the condition
Figure BDA0002361310310000115
The information to be embedded ω 1 =1, through
Figure BDA0002361310310000116
Embed into vector V 1 , the vector after embedding information becomes
Figure BDA0002361310310000117
Similarly, the watermark information ω 2 =-1 can be embedded into the vector V 2 , the modified vector matrix
Figure BDA0002361310310000118
The vector vector after the secondary embedding information becomes A′=[16, 14, 17, 15, 15, 13, 12, 16]. When the receiver obtains the carrier vector of the embedded information, it extracts the watermark information 1 and -1 embedded in the two vectors through sign(<V j · S i >), and uses
Figure BDA0002361310310000119
Calculate the vector V and construct the original vector matrix
Figure BDA00023613103100001110
Figure BDA00023613103100001111
The initial sample sequence A=[15, 15, 16, 16, 14, 14, 13, 15] is obtained.

当待嵌入的信息量较大,需要多级嵌入时,可以按照以下方法叠加嵌入信息。假设需要嵌入的隐秘信息序列为W=[1,0,1,1],通过式(1)将隐秘信息转化为对应位信息序列W=[1,-1,1,1],构建待嵌信息矩阵

Figure BDA00023613103100001112
嵌入向量矩阵
Figure BDA00023613103100001113
将W1嵌入到向量矩阵X中,经过嵌入后向量矩阵转变为
Figure BDA00023613103100001114
再将W2嵌入到向量矩阵
Figure BDA0002361310310000121
中,最终得到嵌秘向量矩阵
Figure BDA0002361310310000122
在解码端,嵌入的水印信息和原始向量矩阵可以按照如下方法进行恢复(如表2所示):When the amount of information to be embedded is large and multi-level embedding is required, the embedded information can be superimposed according to the following method. Assuming that the secret information sequence to be embedded is W=[1,0,1,1], the secret information is converted into the corresponding bit information sequence W=[1,-1,1,1] by formula (1), and the to-be-embedded information is constructed. information matrix
Figure BDA00023613103100001112
Embedding vector matrix
Figure BDA00023613103100001113
Embed W 1 into the vector matrix X, after embedding, the vector matrix is transformed into
Figure BDA00023613103100001114
Then embed W 2 into the vector matrix
Figure BDA0002361310310000121
, and finally get the embedded secret vector matrix
Figure BDA0002361310310000122
At the decoding end, the embedded watermark information and the original vector matrix can be restored as follows (as shown in Table 2):

表2详细的信息提取阶段的多级信息提取Table 2 Detailed information extraction stages of multi-level information extraction

Figure BDA0002361310310000123
Figure BDA0002361310310000123

在这个过程中,针对初始向量V1与嵌入后的

Figure BDA0002361310310000124
嵌入两位比特信息后,初始向量V1中只有两个元素值发生改变,两个扩展序列的第一个和第四个元素经过两次信息嵌入后相互抵消;对于初始向量V2与嵌入后的
Figure BDA0002361310310000125
两个扩展序列的第二个和第四个元素经过两次信息嵌入后相互抵消。实际上,由于不同扩展序列的大部分元素在多级信息嵌入中是相互抵消的,使得本方法能够在大容量可逆信息嵌入的同时,将失真抑制在较低水平。In this process, for the initial vector V 1 and the embedded
Figure BDA0002361310310000124
After embedding two bits of information, only two element values in the initial vector V 1 change, and the first and fourth elements of the two extended sequences cancel each other after two information embeddings; for the initial vector V 2 and the embedded of
Figure BDA0002361310310000125
The second and fourth elements of the two spreading sequences cancel each other after two information embeddings. In fact, since most elements of different extension sequences cancel each other in multi-level information embedding, this method can suppress distortion at a low level while embedding large-capacity reversible information.

本实施例公开的大容量音频文件可逆信息隐藏方法中,首先构建载体向量;In the reversible information hiding method for large-capacity audio files disclosed in this embodiment, a carrier vector is first constructed;

由以上分析可知,当载体向量满足条件

Figure BDA0002361310310000131
时,可以在向量Vj中嵌入信息。由于嵌入向量中1和-1个数相同且分布均匀,计算
Figure BDA0002361310310000132
等于对向量Vj中相邻元素的差值进行求和,所以,载体向量Vj中的元素值越相近,
Figure BDA0002361310310000133
会更小。即载体向量中的元素值越近似,满足嵌入条件的载体向量越多,信息嵌入生成的音频失真越小,音频文件的嵌入能力越强。It can be seen from the above analysis that when the vector vector meets the conditions
Figure BDA0002361310310000131
, information can be embedded in the vector V j . Since the number of 1 and -1 in the embedded vector is the same and evenly distributed, computing
Figure BDA0002361310310000132
is equal to summing the difference values of adjacent elements in the vector V j , so the closer the element values in the vector vector V j are,
Figure BDA0002361310310000133
will be smaller. That is, the more similar the element values in the carrier vector, the more carrier vectors that satisfy the embedding conditions, the smaller the audio distortion generated by the information embedding, and the stronger the embedding ability of the audio file.

因此,根据音频文件的时序相关性,利用相邻音频点间的局部一致性特征,采用目标音频点幅值预测算法,构建目标音频点幅值预测误差载体矩阵,并以此形成载体向量实现音频文件可逆信息嵌入,可有效提升音频文件的可逆信息隐藏性能。根据音频文件特性,相邻音频样本点之间的数值具有较强的相似性,本实施例采用最近邻均值预测的方法对目标音频点进行预测,产生预测误差,如公式(12)所示:Therefore, according to the time-series correlation of audio files, using the local consistency characteristics between adjacent audio points, the target audio point amplitude prediction algorithm is used to construct the target audio point amplitude prediction error carrier matrix, and the carrier vector is formed to realize the audio frequency. File reversible information embedding can effectively improve the reversible information hiding performance of audio files. According to the characteristics of the audio file, the numerical values between adjacent audio sample points have strong similarity. In this embodiment, the method of nearest neighbor mean prediction is used to predict the target audio point, resulting in a prediction error, as shown in formula (12):

Figure BDA0002361310310000134
Figure BDA0002361310310000134

其中x′i为样本点xi的预测值,xi-1和xi+1为样本点xi左侧和右侧的样本,所以预测误差dj=x′j-xj,由于音频相邻样本点之间密切相关,本算法可产生很小的预测误差,最终形成紧密分布在“0”值附近的陡峭直方图(如图1所示)。保障基于预测误差矩阵构建载体向量元素具有更小的数值和一致相似性特征,有效提升载体音频文件可逆信息隐藏的性能。where x′ i is the predicted value of the sample point x i , and x i-1 and x i+1 are the samples on the left and right side of the sample point x i , so the prediction error d j =x′ j -x j , due to the audio Adjacent sample points are closely related, and this algorithm can generate a small prediction error, and finally form a steep histogram closely distributed around the "0" value (as shown in Figure 1). It is guaranteed that the vector elements constructed based on the prediction error matrix have smaller numerical values and consistent similarity characteristics, which effectively improves the performance of reversible information hiding of the carrier audio files.

图1(a)-(f)分别为Clip27、Clip40、Clip51、Clip58、Clip66和Clip70预测误差直方图,由直方图可以清楚的观察到基于本算法的预测误差紧密地分布在“0”值附近,因而,基于此误差预测算法生成的载体矩阵实现可逆信息嵌入,在取得较高的信息嵌入容量的同时也可以保持相对较低的音频失真率。Figure 1(a)-(f) are the prediction error histograms of Clip27, Clip40, Clip51, Clip58, Clip66 and Clip70, respectively. From the histograms, it can be clearly observed that the prediction errors based on this algorithm are closely distributed around the "0" value Therefore, the carrier matrix generated based on the error prediction algorithm realizes reversible information embedding, which can maintain a relatively low audio distortion rate while achieving higher information embedding capacity.

嵌入过程embedding process

信息的嵌入和提取过程示意图如图2所示,下面是嵌入过程和提取过程的描述:The schematic diagram of the embedding and extraction process of information is shown in Figure 2, and the description of the embedding process and extraction process is as follows:

1)将音频文件内容按照顺序分为两组:奇数音频点(点集合)和偶数音频点(叉集合)。根据音频文件中相邻音频点的密切相关性,首先采用均值预测算法,利用偶数音频点两侧的奇数点对其进行预测,计算音频点实际幅值与预测之间的误差,生成叉集合误差矩阵;此后,采用相同的策略对奇数音频点进行预测,并生成点集合预测误差矩阵。将待隐藏的数据被分成相等的两半,第一部分被嵌入到叉集中,第二部分被嵌入到点集中。1) Divide the audio file content into two groups in order: odd-numbered audio points (point set) and even-numbered audio points (fork set). According to the close correlation of adjacent audio points in the audio file, the mean prediction algorithm is firstly used to predict the even-numbered audio points by using the odd-numbered points on both sides of them, and the error between the actual amplitude of the audio point and the prediction is calculated, and the cross-set error is generated. matrix; thereafter, the same strategy is used to predict odd audio points and generate a point set prediction error matrix. The data to be hidden is divided into two equal halves, the first part is embedded in the fork set and the second part is embedded in the point set.

2)将叉集合中的样本序列分为嵌入区E和保留区S,仅在预留的嵌入区E中使用每个音频点承载秘密信息,而在预留的保存区S中每个音频字节的后三位(最低有效位LSB)中存放信息嵌入过程中的附加信息,如嵌入序列,嵌入强度,嵌入信息的数量等。数据保存区S中的信息长度预设为4096个二进制位,其中前12位用来表示保存区S的长度并设置标识符用于确定保存区的位置,其他位用于保存附加信息。并将S区中每个音频字节的后三位原始信息与秘密信息一起形成待嵌信息流。2) The sample sequence in the fork set is divided into an embedded area E and a reserved area S, and each audio point is used to carry secret information only in the reserved embedded area E, and each audio word in the reserved reserved area S is used. The last three bits of the section (the least significant bit LSB) store additional information in the information embedding process, such as embedding sequence, embedding strength, and the quantity of embedding information. The length of the information in the data storage area S is preset to 4096 binary bits, of which the first 12 bits are used to represent the length of the storage area S and the identifier is set to determine the location of the storage area, and the other bits are used to store additional information. The last three bits of original information of each audio byte in the S area are combined with the secret information to form the information stream to be embedded.

3)同理,采用相同的策略将待嵌入的另一部分秘密信息嵌入到点集合对应的误差矩阵中。3) Similarly, the same strategy is used to embed another part of the secret information to be embedded into the error matrix corresponding to the point set.

4)将基于码分多址的可逆信息隐藏算法所需要的辅助信息通过LSB替换嵌入到保留区中。4) The auxiliary information required by the reversible information hiding algorithm based on code division multiple access is embedded into the reserved area through LSB replacement.

5)基于嵌入信息后的误差矩阵与目标音频点的值,按照误差预测算法可逆的构建嵌密音频序列。按照误差预测顺序,首先构建叉集合嵌密音频序列,然后构建点集合嵌密音频序列,最后合并形成载密音频文件。5) Based on the error matrix after the embedded information and the value of the target audio point, the embedded secret audio sequence is reversibly constructed according to the error prediction algorithm. According to the error prediction sequence, firstly construct the fork set embedded cipher audio sequence, then construct the point set cipher audio sequence, and finally merge to form the ciphered audio file.

其中,目标音频点的值为:基于预测算法利用前后音频样本的幅值线性组合,得出当前目标样本的预测值,预测值为预测算法求取出来的幅值,与样本实际幅值相近似,利用预测值和真实值产生的误差值嵌入数据。Among them, the value of the target audio point is: based on the prediction algorithm, the predicted value of the current target sample is obtained by using the linear combination of the amplitudes of the audio samples before and after, and the predicted value is the amplitude obtained by the prediction algorithm, which is similar to the actual amplitude of the sample. , using the error value generated by the predicted value and the true value to embed the data.

利用点集合预测叉集合的样本,求出预测样本值,计算预测误差,通过基于CDMA的可逆信息隐藏算法,将秘密信息嵌入到预测误差中,得到嵌入信息后改变的预测误差,将改变后的预测误差与预测值相加,得到嵌入信息后样本值,便求出叉集合嵌密音频序列。Using the point set to predict the samples of the fork set, the predicted sample value is obtained, the prediction error is calculated, and the secret information is embedded into the prediction error through the reversible information hiding algorithm based on CDMA, and the prediction error changed after the embedded information is obtained. The prediction error is added to the predicted value to obtain the sample value after the embedded information, and then the cross-set embedded secret audio sequence is obtained.

提取过程Extraction process

接收方在收到在载密的音频文件后,按照以下步骤无损的提取嵌入的秘密信息并恢复原始音频文件:After receiving the encrypted audio file, the receiver extracts the embedded secret information losslessly and restores the original audio file according to the following steps:

1)首先将嵌密的音频样本分为叉集合和点集合。按照信息嵌入的逆序,首先采用叉集合中的数据与点集合中的目标音频点,根据目标误差预测算法预测目标样本点构建预测误差矩阵。提取点集合预测误差矩阵中保留区音频点后三位(LSB)值,恢复信息嵌入的附件信息。根据附加信息,按照码分多址可逆信息嵌入算法提取误差矩阵种所嵌入的秘密信息,并恢复目标音频点原始幅值。1) First, the embedded audio samples are divided into fork sets and point sets. According to the reverse order of information embedding, firstly, the data in the cross set and the target audio points in the point set are used to predict the target sample points according to the target error prediction algorithm to construct a prediction error matrix. Extract the last three (LSB) values of audio points in the reserved area in the prediction error matrix of the point set, and restore the attached information embedded in the information. According to the additional information, the secret information embedded in the error matrix is extracted according to the code division multiple access reversible information embedding algorithm, and the original amplitude value of the target audio point is restored.

2)按照同样的方法,提取叉集合中所嵌入的秘密信息,并恢复目标音频点原始幅值。2) According to the same method, extract the secret information embedded in the fork set, and restore the original amplitude value of the target audio point.

3)使用点集合和叉集合中所恢复的数据,将提取的秘密信息相连接重建隐藏的秘密信息序列,并采用恢复的原始音频点幅值矩阵无损恢复原始音频文件。3) Using the recovered data in the point set and the fork set, connect the extracted secret information to reconstruct the hidden secret information sequence, and use the restored original audio point amplitude matrix to restore the original audio file losslessly.

下面通过实验对本实施例方法进行验证分析。The method of this embodiment is verified and analyzed by experiments below.

本实验采用包含70个标准音频文件的测试数据库,音频文件采样频率为44.1KHz,16位采样位数,测试文件包括人工合成信号、单乐器、声乐、语言等多个类型。选取不同音频信号特征的音频文件进行实验结果,已验证所提算法的详细性能。并分别采用嵌入强度和扩展序列长度两种参数进行性能评价。This experiment uses a test database containing 70 standard audio files. The sampling frequency of the audio files is 44.1KHz and the number of sampling bits is 16. The test files include synthetic signals, single instrument, vocal music, language and other types. The audio files with different audio signal characteristics are selected for experimental results, and the detailed performance of the proposed algorithm has been verified. And two parameters of embedding strength and extended sequence length are used for performance evaluation.

1、不同嵌入强度的实验结果1. Experimental results of different embedding strengths

实验首先选择具有不同音频特性的6段测试音频包含Clip27、Clip40、Clip51、Clip58、Clip66和Clip70进行算法性能验证。所选6段音频分别是单乐器高音、单乐器低音、中文语音、独奏乐器、管弦音乐和流行音乐,因而可以从不同的方面验证所提算法的可行性。实验结果采用SNR-BPP关系曲线进行描述,信噪比(SNR)和嵌入率(BPP)分别是是衡量音频失真与可逆信息嵌入的容量标准参数,有利于实现与其他算法的性能比较。图3(a)-(f)分别表示6段音频文件在不同的嵌入强度下信噪比(SNR)随嵌入率(BPP)的变化关系。由于实验所选取的音频文件的长度不一样,最短为18秒,最长为2分17秒,同时,因为不同的音频类型不一致,其波形走向差异也比较大,为了更好的验证算法性能,本文针对不同的音频文件自适应选择不同的嵌入强度可逆的嵌入信息,检验不同音频文件在不同嵌入强度α时音频文件的嵌入率和信噪比之间的关系。The experiment first selects 6 test audios with different audio characteristics including Clip27, Clip40, Clip51, Clip58, Clip66 and Clip70 to verify the algorithm performance. The selected 6 audio segments are single-instrumental treble, single-instrumental bass, Chinese voice, solo instrument, orchestral music and pop music, so the feasibility of the proposed algorithm can be verified from different aspects. The experimental results are described by the SNR-BPP relationship curve. The signal-to-noise ratio (SNR) and the embedding rate (BPP) are the capacity standard parameters to measure the audio distortion and reversible information embedding respectively, which is beneficial to compare the performance with other algorithms. Figure 3(a)-(f) respectively show the relationship between the signal-to-noise ratio (SNR) and the embedding rate (BPP) of the 6-segment audio files under different embedding strengths. Due to the different lengths of the audio files selected in the experiment, the shortest is 18 seconds and the longest is 2 minutes and 17 seconds. At the same time, due to the inconsistency of different audio types, the waveform trends are also quite different. In order to better verify the performance of the algorithm, This paper adaptively selects reversible embedding information with different embedding strengths for different audio files, and examines the relationship between the embedding rate and the signal-to-noise ratio of audio files when different audio files have different embedding strengths α.

从图3(a)-(f)中的6幅图中可以看出,随着嵌入率的增加,不同音频文件的SNR随之下降。如图3(a)所示,在嵌入容量为0.1BPP时,嵌入强度α=14对应的信噪比为40.12dB,嵌入强度α=16时对应的信噪比为38.14dB。实验结果表明:在相同嵌入容量下,随着嵌入强度α的增加,SNR随之下降,音频文件质量下降明显。这是因为随着嵌入强度的增加,满足迁入调教的音频点比较多,大量的信息可以一次性嵌入到音频文件中;同时,信息嵌入对原始音频文件的修改程度增大,信息嵌入造成原始文件的失真比较明显。但当嵌入强度较小时,满足可逆信息嵌入条件的载体向量不多,单次信息嵌入的容量较小,为增加嵌入容量,信息需要多次嵌入。在多次信息嵌入过程中,基于码分多址的特点,不同正交序列的元素在叠加嵌入过程中互相抵消,随着嵌入容量的增加原始音频文件的质量下降曲线变缓,保障原始音频文件失真变小。所以,在基于码分多址的音频文件可逆信息嵌入过程中,当嵌入容量较大时,采用合适的可逆信息嵌入强度进行多次重叠信息嵌入,可使嵌密文件取得更好的可逆信息嵌入性能,在大量秘密信息嵌入的同时保障原始音频文件具有较小的失真。如图3(b)所示,在嵌入强度α=4,嵌入容量为0.1BPP时,SNR为53.12dB,嵌入容量增加到0.2BPP时,载体音频的SNR下降到47.52dB;然而,当嵌入强度为6时,载体音频的嵌入容量在0.1BPP对应的SNR为46.1dB,在0.2BPP下对应的SNR为44.8dB,SNR的下降程度曲线大于嵌入强度为4时表现,所以在较高的嵌入容量下,采用较小的嵌入强度多次嵌入可取得更好的可逆信息嵌入性能。As can be seen from the 6 graphs in Fig. 3(a)-(f), as the embedding rate increases, the SNR of different audio files decreases accordingly. As shown in Figure 3(a), when the embedding capacity is 0.1 BPP, the SNR corresponding to the embedding strength α=14 is 40.12 dB, and the corresponding SNR is 38.14 dB when the embedding strength α=16. The experimental results show that: under the same embedding capacity, with the increase of the embedding strength α, the SNR decreases, and the quality of the audio file decreases significantly. This is because with the increase of the embedding strength, there are more audio points that can be moved in and adjusted, and a large amount of information can be embedded into the audio file at one time; at the same time, the degree of modification of the original audio file by the information embedding increases, and the The distortion of the file is obvious. However, when the embedding strength is small, there are not many carrier vectors that meet the reversible information embedding conditions, and the capacity of a single information embedding is small. In order to increase the embedding capacity, the information needs to be embedded multiple times. In the process of multiple information embedding, based on the characteristics of code division multiple access, the elements of different orthogonal sequences cancel each other in the process of superimposing and embedding. With the increase of the embedding capacity, the quality degradation curve of the original audio file becomes slower, ensuring the original audio file. Distortion is reduced. Therefore, in the process of reversible information embedding of audio files based on code division multiple access, when the embedding capacity is large, the appropriate reversible information embedding strength is used to perform multiple overlapping information embedding, which can make the embedded encrypted file obtain better reversible information embedding. performance, while ensuring that the original audio file has less distortion while a large amount of secret information is embedded. As shown in Fig. 3(b), when the embedding strength α=4 and the embedding capacity is 0.1BPP, the SNR is 53.12dB, and when the embedding capacity increases to 0.2BPP, the SNR of the carrier audio drops to 47.52dB; however, when the embedding strength When it is 6, the SNR corresponding to the embedded capacity of the carrier audio is 46.1dB at 0.1BPP, and the corresponding SNR is 44.8dB at 0.2BPP. In this case, multiple embeddings with smaller embedding strength can achieve better reversible information embedding performance.

2、不同长度的扩展序列的实验结果2. Experimental results of extended sequences of different lengths

不同长度的扩展序列也会对算法的性能产生影响。如图4(a)所示,当序列长度为2时,一次信息嵌入的最大容量为0.13bpp;但当序列长度为4时,单次嵌入的最大容量仅为0.087bpp,长度较短的传播序列对于音频载体文件的修改程度小(当扩展序列的长度为2时,嵌入1bit信息需要修改两个音频预测误差点的值;而当扩展序列的长度为4时,嵌入1bit的信息则需要修改四个预测误差点的值)。同时,根据相邻音频点之间的密切相关性,较短的载体向量比较长载体向量更容易满足码分多址算法可逆信息嵌入的条件。为了深入探究相邻音频点之间的关联性,本实施例通过构建4种不同长度的载体向量探究不同长度的扩展序列对于算法性能的影响。Spreading sequences of different lengths also have an impact on the performance of the algorithm. As shown in Figure 4(a), when the sequence length is 2, the maximum capacity of a single information embedding is 0.13bpp; but when the sequence length is 4, the maximum capacity of a single embedding is only 0.087bpp, and the propagation of shorter lengths The modification of the sequence to the audio carrier file is small (when the length of the extended sequence is 2, the value of two audio prediction error points needs to be modified to embed 1-bit information; and when the length of the extended sequence is 4, the embedded 1-bit information needs to be modified. value of the four prediction error points). At the same time, according to the close correlation between adjacent audio points, the shorter carrier vector is easier to satisfy the condition of reversible information embedding of the code division multiple access algorithm than the long carrier vector. In order to deeply explore the correlation between adjacent audio points, this embodiment explores the influence of extended sequences of different lengths on the performance of the algorithm by constructing 4 vector vectors of different lengths.

由图4(a)-(f)可以看出,当信息嵌入容量不大时,短扩展序列的嵌入性能明显优于较长的扩展序列,当长度为8扩展序列(1×8个载体音频预测误差点组成)时,嵌入性能相比长度为2的扩展序列显著下降。原因分析如下:一方面,采用长度为8扩展序列每嵌入1bit的信息需要修改8个样本;另一方面,在同样的嵌入强度下,基于载体文件相邻音频点间的相似性关系,满足嵌入条件的长度为1×8个载体向量的个数较少,因此当嵌入容量不高时,SNR-BPP曲线低于其余三种长度较短的扩展序列的曲线。然而,随着有效载荷的增加,需要采用多级叠加信息嵌入来扩大嵌入容量,不同扩展序列的大部分元素在可逆信息嵌入过程中相互抵消,不同长度扩展序列的嵌入性能差异逐渐减小。It can be seen from Figure 4(a)-(f) that when the information embedding capacity is not large, the embedding performance of the short extension sequence is obviously better than that of the longer extension sequence. When the length of the extension sequence is 8 (1 × 8 carrier audio prediction error points), the embedding performance drops significantly compared to extended sequences of length 2. The reasons are analyzed as follows: on the one hand, 8 samples need to be modified for each embedded 1 bit of information using the extended sequence of length 8; The length of the condition is 1 × 8 and the number of vector vectors is small, so when the embedding capacity is not high, the SNR-BPP curve is lower than that of the remaining three extended sequences with shorter lengths. However, as the payload increases, multi-level superposition information embedding is required to expand the embedding capacity, most elements of different extension sequences cancel each other during the reversible information embedding process, and the difference in embedding performance of extension sequences with different lengths gradually decreases.

由图4(a)-(f)可知,随着嵌入容量增加,扩展序列长度为4和2的嵌密载体信噪比(SNR)的差异逐渐减小,尤其是音频文件Clip 40和Clip 66中,当嵌入容量大于0.9时,长度为4的扩展序列嵌入性能优于长度为2的扩展序列。这是因为在音频文件中4位相邻音频点之间的相关性还比较强,满足码分多址可逆信息嵌入条件的载体向量比较多,载体向量中的元素在多次嵌入后相互抵消的数量优于长度为2的扩展序列,因而,在大容量可逆信息嵌入过程中,载体向量长度为4时一般可取得最优的可逆信息嵌入性能。然而,当扩展序列的长度为6和8时,如图4(a)-(f)所示,载体音频SNR随着嵌入容量的增大快速下降,远远低于长度为4的扩展序列的嵌入性能。这是由于音频文件一维性和瞬时突变性的特点,使得较长的扩展序列需要对应更大的嵌入强度才能满足可逆信息嵌入的条件,随着嵌入容量的增大,较长的扩展序在信息嵌入式会造成原始音频发生较大幅度的形变,音频质量下降明显。It can be seen from Fig. 4(a)-(f) that with the increase of the embedding capacity, the difference of the signal-to-noise ratio (SNR) of the embedded dense carrier with extended sequence lengths 4 and 2 gradually decreases, especially for the audio files Clip 40 and Clip 66 , when the embedding capacity is greater than 0.9, the embedding performance of the extended sequence of length 4 is better than that of the extended sequence of length 2. This is because the correlation between 4-bit adjacent audio points in the audio file is still relatively strong, and there are many carrier vectors that satisfy the reversible information embedding condition of CDMA, and the elements in the carrier vector cancel each other after multiple embeddings. The number is better than the extended sequence of length 2. Therefore, in the process of large-capacity reversible information embedding, the optimal reversible information embedding performance can generally be obtained when the length of the vector vector is 4. However, when the lengths of the extension sequences are 6 and 8, as shown in Fig. 4(a)-(f), the carrier audio SNR drops rapidly with the increase of the embedding capacity, which is much lower than that of the extension sequences of length 4. Embedded performance. This is due to the characteristics of one-dimensionality and instantaneous mutation of audio files, so that longer extension sequences need to correspond to larger embedding strengths to meet the conditions of reversible information embedding. With the increase of embedding capacity, longer extension sequences are in The embedded information will cause the original audio to be greatly deformed, and the audio quality will decrease significantly.

为进一步探究基于码分多址的可逆信息隐藏算法在高嵌入容量下的音频文件失真程度,本文测试了数据库所有70段音频文件在嵌入容量为1.0BPP、1.1BPP、1.2BPP和1.3BPP时的嵌入失真率,不同音频文件在4种嵌入率下的SNR值如图5所示。In order to further explore the degree of audio file distortion of the reversible information hiding algorithm based on code division multiple access under high embedding capacity, this paper tests all 70 audio files in the database when the embedding capacity is 1.0BPP, 1.1BPP, 1.2BPP and 1.3BPP. Embedding distortion rate, the SNR values of different audio files under 4 embedding rates are shown in Figure 5.

由图5可以知,当嵌入率大于1.0BPP,本文算法仍可以保障原始音频文件具有较高SNR数值。对于大部分音频文件而言,即便嵌入容量达到1.3BPP时,仍然可以保持较低的声音失真度。这是因为基于码分多址的可逆信息隐藏算法在多级嵌入的过程中,由于扩展序列的正交特性,嵌入向量中的大部分元素相互抵消,锁着嵌入容量的增加,SNR曲线下降程度逐渐变换,原始音频文件即使在大容量可逆信息嵌入的同时仍能保持很高的声音保真度。It can be seen from Figure 5 that when the embedding rate is greater than 1.0BPP, the algorithm in this paper can still ensure that the original audio file has a higher SNR value. For most audio files, even when the embedded capacity reaches 1.3BPP, the sound distortion can still be kept low. This is because in the process of multi-level embedding of the reversible information hiding algorithm based on code division multiple access, due to the orthogonality of the spreading sequence, most of the elements in the embedding vector cancel each other out, which locks the increase of the embedding capacity and reduces the degree of the SNR curve. Gradually transformed, the original audio file retains high sonic fidelity even while bulk reversible information is embedded.

3、与其他方法进行对比3. Compare with other methods

为了进一步评价该方法的性能优越性,将本算法与其他两个最新的高性能音频文件可逆信息隐藏那个算法进行对比,本实施例将上述的两个最新的算法分别简称为Ref.[30]算法和Ref.[32]算法;其中,Ref.[30]算法利用左右两侧的音频样本预测目标音频点的值,并采用直方图平移技术嵌入有效信息,算法耗费时间短,尤其在高嵌入容量的时候,原始音频文件质量下降很快。Ref.[32]算法首先计算预测点与目标音频点间的距离,并根据距离确定预测系数以减低目标音频点预测误差的值,然后采用差值扩展算法实现可逆信息嵌入,此算法较大程度上提高了目标音频样本的信息隐藏能力,但当嵌入容量较大时,仍会造成音频文件失真严重。In order to further evaluate the performance superiority of this method, this algorithm is compared with the other two latest algorithms for reversible information hiding of high-performance audio files. In this embodiment, the above two latest algorithms are referred to as Ref. [30] algorithm and Ref.[32] algorithm; among them, the Ref.[30] algorithm uses the audio samples on the left and right sides to predict the value of the target audio point, and uses the histogram translation technology to embed effective information, the algorithm consumes a short time, especially in high embedding The quality of the original audio file degrades rapidly when the capacity is increased. The Ref.[32] algorithm first calculates the distance between the prediction point and the target audio point, and determines the prediction coefficient according to the distance to reduce the value of the prediction error of the target audio point, and then uses the difference expansion algorithm to achieve reversible information embedding. The information hiding ability of the target audio sample is improved, but when the embedding capacity is large, the audio file will still be seriously distorted.

本实验中仍然采用SNR-BPP作为衡量可逆信息隐藏性能评价标准,图6(a)-(f)为针对数据库中6段代表性测试音频Clip27、Clip40、Clip51、Clip58、Clip66和Clip70在不同嵌入率下的测试结果:In this experiment, SNR-BPP is still used as the evaluation standard to measure the reversible information hiding performance. Test results at rate:

从图6(a)-(f)中可以看到,本方案在低载荷的情况下可逆信息隐藏性能与其他两种算法相比并没有太大优势。然而,随着有效载荷的增加,本算法适用于大容量信息隐藏的优越性也得以表现出来。在音频文件Clip27、Clip40、Clip66和Clip70中,当嵌入率分别大于0.38bpp、0.56bpp、0.39bpp和0.40bpp时,本算法的SNR值开始超越其他两种音频文件可逆信息隐藏算法。而对于音频文件Clip51和Clip58来说,当嵌入率达到0.95bpp和0.79bpp时,本文算法的性能开始超越其它两种算法。实验结果表明本算法在大容量可逆信息隐藏方面性能明显优于其它算法。其原因如下:一方面,当嵌入载荷较小时,本算法至少需要修改两个样本才能嵌入一个信息比特位,因而在低嵌入容量下相对其它两种算法性能表现一般。然而,随着嵌入容量的增加,本算法可以采用多级嵌入来扩大嵌入容量,不同扩展序列的大部分元素相互抵消,原始音频文件的失真相对于其它算法明显减小。因此,在中到高嵌入容量的情况下本方案具有更高的信噪比(SNR)。另一方面,如图1(c)和图1(d)所示,由于音频Clip51和Clip58的变化比较剧烈,其预测误差直方图左右分布均匀,需要较大的嵌入强度才可以实现秘密信息的嵌入,信息嵌入导致原文件的失真较大,因而需要多次信息叠加嵌入后,本算法的性能才能逐渐超过其他两种算法,这与以上理论分析是一致的。As can be seen from Figure 6(a)-(f), the reversible information hiding performance of this scheme does not have much advantage compared with the other two algorithms in the case of low load. However, with the increase of payload, the superiority of this algorithm for large-capacity information hiding is also shown. In the audio files Clip27, Clip40, Clip66 and Clip70, when the embedding rate is greater than 0.38bpp, 0.56bpp, 0.39bpp and 0.40bpp, respectively, the SNR value of this algorithm begins to surpass the other two reversible information hiding algorithms for audio files. For the audio files Clip51 and Clip58, when the embedding rate reaches 0.95bpp and 0.79bpp, the performance of this algorithm begins to surpass the other two algorithms. The experimental results show that the performance of this algorithm is significantly better than other algorithms in large-capacity reversible information hiding. The reasons are as follows: On the one hand, when the embedding load is small, the algorithm needs to modify at least two samples to embed one information bit, so the performance of the algorithm is average compared to the other two algorithms under low embedding capacity. However, with the increase of the embedding capacity, the algorithm can use multi-level embedding to expand the embedding capacity, most of the elements of different extension sequences cancel each other, and the distortion of the original audio file is significantly reduced compared with other algorithms. Therefore, the present scheme has a higher signal-to-noise ratio (SNR) in the case of medium to high embedding capacity. On the other hand, as shown in Figure 1(c) and Figure 1(d), since the audio Clip51 and Clip58 change drastically, their prediction error histograms are evenly distributed on the left and right, and a large embedding strength is required to realize the secret information. Embedding, information embedding leads to large distortion of the original file, so the performance of this algorithm can gradually surpass the other two algorithms after multiple times of information superposition and embedding, which is consistent with the above theoretical analysis.

实施例二Embodiment 2

在一个或多个实施方式中,公开了一种大容量音频文件可逆信息隐藏系统,包括:In one or more embodiments, a reversible information hiding system for large-capacity audio files is disclosed, including:

将初始文件不同音频点按照位置分为叉集合和点集合;Divide the different audio points of the initial file into fork sets and point sets according to their positions;

用于利用相邻音频点间的局部一致性特征,采用目标音频点幅值预测算法,分别构建第一叉集合预测误差矩阵和第一点集合预测误差矩阵的装置;A device for constructing a first fork set prediction error matrix and a first point set prediction error matrix respectively by utilizing the local consistency feature between adjacent audio points and adopting the target audio point amplitude prediction algorithm;

用于将待隐藏的数据被分成相等的两半,第一部分被嵌入到叉集中,第二部分被嵌入到点集中的装置;A means for dividing the data to be hidden into equal halves, the first part is embedded in the fork set, and the second part is embedded in the point set;

用于按照误差预测算法可逆的构建叉集合嵌密音频序列和点集合嵌密音频序列;最后合并形成载密音频文件的装置。A device for reversibly constructing a fork-set embedded cipher audio sequence and a point-set cipher audio sequence according to an error prediction algorithm; and finally merging to form a ciphered audio file.

还包括:Also includes:

用于将载密音频文件分为叉集合和点集合的装置;means for dividing the encrypted audio file into fork sets and point sets;

用于根据目标误差预测算法,分别构建第二点集合预测误差矩阵和第二叉集合预测误差矩阵的装置;A device for constructing the second point set prediction error matrix and the second cross set prediction error matrix respectively according to the target error prediction algorithm;

用于分别恢复点集合预测误差矩阵和叉集合预测误差矩阵中嵌入的附件信息,按照码分多址可逆信息嵌入算法提取误差矩阵中嵌入的秘密信息,并恢复目标音频点原始幅值的装置;A device for restoring the attachment information embedded in the point set prediction error matrix and the cross set prediction error matrix respectively, extracting the secret information embedded in the error matrix according to the code division multiple access reversible information embedding algorithm, and restoring the original amplitude value of the target audio point;

用于将提取的秘密信息相连接重建隐藏的秘密信息序列,并采用恢复的目标音频点原始幅值矩阵无损恢复原始音频文件的装置。The device is used to connect the extracted secret information to reconstruct the hidden secret information sequence, and use the restored original amplitude matrix of the target audio point to restore the original audio file losslessly.

上述装置的具体工作过程均采用实施例一中公开的方法,在此不再赘述。The specific working process of the above device adopts the method disclosed in the first embodiment, and is not repeated here.

实施例三Embodiment 3

在一个或多个实施方式中,公开了一种终端设备,包括服务器,所述服务器包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述程序时实现实施例一中的大容量音频文件可逆信息隐藏方法。为了简洁,在此不再赘述。In one or more embodiments, a terminal device is disclosed, including a server, the server including a memory, a processor, and a computer program stored on the memory and executable on the processor, the processor executing the The program implements the reversible information hiding method for a large-capacity audio file in the first embodiment. For brevity, details are not repeated here.

应理解,本实施例中,处理器可以是中央处理单元CPU,处理器还可以是其他通用处理器、数字信号处理器DSP、专用集成电路ASIC,现成可编程门阵列FPGA或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that, in this embodiment, the processor may be a central processing unit CPU, and the processor may also be other general-purpose processors, digital signal processors DSP, application-specific integrated circuits ASIC, off-the-shelf programmable gate array FPGA or other programmable logic devices , discrete gate or transistor logic devices, discrete hardware components, etc. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.

存储器可以包括只读存储器和随机存取存储器,并向处理器提供指令和数据、存储器的一部分还可以包括非易失性随机存储器。例如,存储器还可以存储设备类型的信息。The memory may include read-only memory and random access memory and provide instructions and data to the processor, and a portion of the memory may also include non-volatile random access memory. For example, the memory may also store device type information.

在实现过程中,上述方法的各步骤可以通过处理器中的硬件的集成逻辑电路或者软件形式的指令完成。In the implementation process, each step of the above-mentioned method can be completed by a hardware integrated logic circuit in a processor or an instruction in the form of software.

实施例一中的大容量音频文件可逆信息隐藏方法可以直接体现为硬件处理器执行完成,或者用处理器中的硬件及软件模块组合执行完成。软件模块可以位于随机存储器、闪存、只读存储器、可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质中。该存储介质位于存储器,处理器读取存储器中的信息,结合其硬件完成上述方法的步骤。为避免重复,这里不再详细描述。The method for reversible information hiding of a large-capacity audio file in the first embodiment may be directly embodied in the execution of a hardware processor, or executed in a combination of hardware and software modules in the processor. The software modules may be located in random access memory, flash memory, read-only memory, programmable read-only memory or electrically erasable programmable memory, registers and other storage media mature in the art. The storage medium is located in the memory, and the processor reads the information in the memory, and completes the steps of the above method in combination with its hardware. To avoid repetition, detailed description is omitted here.

本领域普通技术人员可以意识到,结合本实施例描述的各示例的单元即算法步骤,能够以电子硬件或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。Those of ordinary skill in the art can realize that the unit, that is, the algorithm step of each example described in conjunction with this embodiment, can be implemented by electronic hardware or a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may implement the described functionality using different methods for each particular application, but such implementations should not be considered beyond the scope of this application.

上述虽然结合附图对本发明的具体实施方式进行了描述,但并非对本发明保护范围的限制,所属领域技术人员应该明白,在本发明的技术方案的基础上,本领域技术人员不需要付出创造性劳动即可做出的各种修改或变形仍在本发明的保护范围以内。Although the specific embodiments of the present invention have been described above in conjunction with the accompanying drawings, they do not limit the scope of protection of the present invention. Those skilled in the art should understand that on the basis of the technical solutions of the present invention, those skilled in the art do not need to pay creative work. Various modifications or deformations that can be made are still within the protection scope of the present invention.

Claims (7)

1.一种大容量音频文件可逆信息隐藏方法,其特征在于,包括嵌入过程:1. a large-capacity audio file reversible information hiding method, is characterized in that, comprises embedding process: 将初始文件不同音频点按照位置分为叉集合和点集合;Divide the different audio points of the initial file into fork sets and point sets according to their positions; 所述叉集合为偶数音频点,点集合为奇数音频点;The fork set is an even-numbered audio point, and the point set is an odd-numbered audio point; 利用相邻音频点间的局部一致性特征,采用目标音频点幅值预测算法,分别构建第一叉集合预测误差矩阵和第一点集合预测误差矩阵;Using the local consistency feature between adjacent audio points, the target audio point amplitude prediction algorithm is used to construct the first fork set prediction error matrix and the first point set prediction error matrix respectively; 将待隐藏的数据被分成相等的两半,第一部分被嵌入到叉集中,第二部分被嵌入到点集中;The data to be hidden is divided into two equal halves, the first part is embedded in the fork set, and the second part is embedded in the point set; 按照误差预测算法可逆的构建叉集合嵌密音频序列和点集合嵌密音频序列;最后合并形成载密音频文件;According to the error prediction algorithm, the fork set embedded secret audio sequence and the point set embedded secret audio sequence are reversibly constructed; finally, the secret audio file is formed by merging; 构建叉集合预测误差矩阵,具体为:利用偶数音频点两侧的奇数点对偶数音频点进行预测,计算偶数音频点实际幅值与预测之间的误差,生成第一叉集合误差矩阵;Constructing a fork set prediction error matrix, specifically: using the odd-numbered points on both sides of the even-numbered audio points to predict the even-numbered audio points, calculating the error between the actual amplitude of the even-numbered audio points and the prediction, and generating a first fork set error matrix; 构建点集合预测误差矩阵,具体为:利用奇数音频点两侧的偶数点对奇数音频点进行预测,计算奇数音频点实际幅值与预测之间的误差,生成第一点集合误差矩阵;Constructing a point set prediction error matrix, specifically: using the even-numbered points on both sides of the odd-numbered audio points to predict the odd-numbered audio points, calculating the error between the actual amplitude of the odd-numbered audio points and the prediction, and generating the first point set error matrix; 按照误差预测算法可逆的构建叉集合嵌密音频序列,具体为:According to the error prediction algorithm, the fork set-embedded audio sequence is reversibly constructed, specifically: 将叉集合中的样本序列分为嵌入区E和保留区S,仅在预留的嵌入区E中使用每个音频点承载秘密信息,而在预留的保存区S中每个音频点的后三位中存放信息嵌入过程中的附加信息;将S区中每个音频点的后三位原始信息与秘密信息一起形成待嵌信息流;The sample sequence in the fork set is divided into an embedded area E and a reserved area S, and each audio point is used to carry secret information only in the reserved embedded area E, while the back of each audio point in the reserved save area S is used to carry secret information. The additional information in the information embedding process is stored in the three bits; the last three original information of each audio point in the S area is formed with the secret information to form the information stream to be embedded; 将基于码分多址的可逆信息隐藏算法所需要的辅助信息通过LSB替换嵌入到保留区S中;The auxiliary information required by the code division multiple access-based reversible information hiding algorithm is embedded in the reserved area S through LSB replacement; 基于嵌入信息后的误差矩阵与目标音频点的值,按照误差预测算法可逆的构建叉集合嵌密音频序列。Based on the error matrix after embedding information and the value of the target audio point, according to the error prediction algorithm, the cross-set embedded cipher audio sequence is reversibly constructed. 2.如权利要求1所述的一种大容量音频文件可逆信息隐藏方法,其特征在于,采用最近邻均值预测的方法对目标音频点进行预测,产生预测误差,具体为:2. a kind of large-capacity audio file reversible information hiding method as claimed in claim 1, is characterized in that, adopts the method of nearest neighbor mean value prediction to predict target audio frequency point, produces prediction error, is specially:
Figure FDA0003580657230000021
Figure FDA0003580657230000021
其中,x′i为样本点xi的预测值,xi-1和xi+1为样本点xi左侧和右侧的样本,所以预测误差di=x′i-xiAmong them, x' i is the predicted value of the sample point x i , and x i-1 and x i+1 are the samples on the left and right sides of the sample point x i , so the prediction error d i =x' i -xi .
3.如权利要求1所述的一种大容量音频文件可逆信息隐藏方法,其特征在于,3. a kind of large-capacity audio file reversible information hiding method as claimed in claim 1, is characterized in that, 还包括提取过程:Also includes the extraction process: 将载密音频文件分为叉集合和点集合;Divide the encrypted audio files into fork sets and point sets; 根据目标误差预测算法,分别构建第二点集合预测误差矩阵和第二叉集合预测误差矩阵;According to the target error prediction algorithm, construct the second point set prediction error matrix and the second fork set prediction error matrix respectively; 分别恢复点集合预测误差矩阵和叉集合预测误差矩阵中嵌入的附件信息,按照码分多址可逆信息嵌入算法提取误差矩阵中嵌入的秘密信息,并恢复目标音频点原始幅值;Recover the attachment information embedded in the point set prediction error matrix and the cross set prediction error matrix respectively, extract the secret information embedded in the error matrix according to the code division multiple access reversible information embedding algorithm, and restore the original amplitude of the target audio point; 将提取的秘密信息相连接重建隐藏的秘密信息序列,并采用恢复的目标音频点原始幅值矩阵无损恢复原始音频文件。The extracted secret information is connected to reconstruct the hidden secret information sequence, and the original audio file is restored losslessly by using the original amplitude matrix of the restored target audio point. 4.一种大容量音频文件可逆信息隐藏系统,其特征在于,包括:4. a large-capacity audio file reversible information hiding system, is characterized in that, comprises: 用于将初始文件中不同音频点按照位置分为叉集合和点集合的装置;A device for dividing different audio points in the initial file into fork sets and point sets according to their positions; 所述叉集合为偶数音频点,点集合为奇数音频点;The fork set is an even-numbered audio point, and the point set is an odd-numbered audio point; 用于利用相邻音频点间的局部一致性特征,采用目标音频点幅值预测算法,分别构建第一叉集合预测误差矩阵和第一点集合预测误差矩阵的装置;A device for constructing a first fork set prediction error matrix and a first point set prediction error matrix respectively by utilizing the local consistency feature between adjacent audio points and adopting the target audio point amplitude prediction algorithm; 用于将待隐藏的数据被分成相等的两半,第一部分被嵌入到叉集中,第二部分被嵌入到点集中的装置;A means for dividing the data to be hidden into equal halves, the first part is embedded in the fork set, and the second part is embedded in the point set; 用于按照误差预测算法可逆的构建叉集合嵌密音频序列和点集合嵌密音频序列;最后合并形成载密音频文件的装置;For reversibly constructing the fork set embedded encrypted audio sequence and the point set embedded encrypted audio sequence according to the error prediction algorithm; finally merge to form the device of the encrypted audio file; 构建叉集合预测误差矩阵,具体为:利用偶数音频点两侧的奇数点对偶数音频点进行预测,计算偶数音频点实际幅值与预测之间的误差,生成第一叉集合误差矩阵;Constructing a fork set prediction error matrix, specifically: using the odd-numbered points on both sides of the even-numbered audio point to predict the even-numbered audio point, calculating the error between the actual amplitude value of the even-numbered audio point and the prediction, and generating the first fork set error matrix; 构建点集合预测误差矩阵,具体为:利用奇数音频点两侧的偶数点对奇数音频点进行预测,计算奇数音频点实际幅值与预测之间的误差,生成第一点集合误差矩阵;Constructing a point set prediction error matrix, specifically: using the even-numbered points on both sides of the odd-numbered audio points to predict the odd-numbered audio points, calculating the error between the actual amplitude of the odd-numbered audio points and the prediction, and generating the first point set error matrix; 按照误差预测算法可逆的构建叉集合嵌密音频序列,具体为:According to the error prediction algorithm, the fork set-embedded audio sequence is reversibly constructed, specifically: 将叉集合中的样本序列分为嵌入区E和保留区S,仅在预留的嵌入区E中使用每个音频点承载秘密信息,而在预留的保存区S中每个音频点的后三位中存放信息嵌入过程中的附加信息;将S区中每个音频点的后三位原始信息与秘密信息一起形成待嵌信息流;The sample sequence in the fork set is divided into an embedded area E and a reserved area S, and each audio point is used to carry secret information only in the reserved embedded area E, while the back of each audio point in the reserved save area S is used to carry secret information. The additional information in the information embedding process is stored in the three bits; the last three original information of each audio point in the S area is formed with the secret information to form the information stream to be embedded; 将基于码分多址的可逆信息隐藏算法所需要的辅助信息通过LSB替换嵌入到保留区S中;The auxiliary information required by the code division multiple access-based reversible information hiding algorithm is embedded in the reserved area S through LSB replacement; 基于嵌入信息后的误差矩阵与目标音频点的值,按照误差预测算法可逆的构建叉集合嵌密音频序列。Based on the error matrix after embedding information and the value of the target audio point, according to the error prediction algorithm, the cross-set embedded cipher audio sequence is reversibly constructed. 5.如权利要求4所述的一种大容量音频文件可逆信息隐藏系统,其特征在于,还包括:5. a kind of large-capacity audio file reversible information hiding system as claimed in claim 4, is characterized in that, also comprises: 用于将载密音频文件分为叉集合和点集合的装置;means for dividing the encrypted audio file into fork sets and point sets; 用于根据目标误差预测算法,分别构建第二点集合预测误差矩阵和第二叉集合预测误差矩阵的装置;A device for constructing the second point set prediction error matrix and the second fork set prediction error matrix respectively according to the target error prediction algorithm; 用于分别恢复点集合预测误差矩阵和叉集合预测误差矩阵中嵌入的附件信息,按照码分多址可逆信息嵌入算法提取误差矩阵中嵌入的秘密信息,并恢复目标音频点原始幅值的装置;A device for restoring the attachment information embedded in the point set prediction error matrix and the cross set prediction error matrix respectively, extracting the secret information embedded in the error matrix according to the code division multiple access reversible information embedding algorithm, and restoring the original amplitude value of the target audio point; 用于将提取的秘密信息相连接重建隐藏的秘密信息序列,并采用恢复的目标音频点原始幅值矩阵无损恢复原始音频文件的装置。The device is used to connect the extracted secret information to reconstruct the hidden secret information sequence, and use the restored original amplitude matrix of the target audio point to restore the original audio file losslessly. 6.一种终端设备,其包括处理器和计算机可读存储介质,处理器用于实现各指令;计算机可读存储介质用于存储多条指令,其特征在于,所述指令适于由处理器加载并执行权利要求1-3任一项所述的大容量音频文件可逆信息隐藏方法。6. A terminal device, comprising a processor and a computer-readable storage medium, wherein the processor is used to implement each instruction; the computer-readable storage medium is used to store a plurality of instructions, wherein the instructions are suitable for being loaded by the processor And execute the reversible information hiding method for large-capacity audio files described in any one of claims 1-3. 7.一种计算机可读存储介质,其中存储有多条指令,其特征在于,所述指令适于由终端设备的处理器加载并执行权利要求1-3任一项所述的大容量音频文件可逆信息隐藏方法。7. A computer-readable storage medium, wherein a plurality of instructions are stored, wherein the instructions are adapted to be loaded by the processor of the terminal device and execute the large-capacity audio file according to any one of claims 1-3 Reversible information hiding method.
CN202010022517.6A 2020-01-09 2020-01-09 A method and system for reversible information hiding of large-capacity audio files Active CN113096671B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010022517.6A CN113096671B (en) 2020-01-09 2020-01-09 A method and system for reversible information hiding of large-capacity audio files

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010022517.6A CN113096671B (en) 2020-01-09 2020-01-09 A method and system for reversible information hiding of large-capacity audio files

Publications (2)

Publication Number Publication Date
CN113096671A CN113096671A (en) 2021-07-09
CN113096671B true CN113096671B (en) 2022-05-13

Family

ID=76663680

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010022517.6A Active CN113096671B (en) 2020-01-09 2020-01-09 A method and system for reversible information hiding of large-capacity audio files

Country Status (1)

Country Link
CN (1) CN113096671B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114466112B (en) * 2022-01-11 2025-04-04 山东师范大学 Reversible data hiding method and system based on adaptive modification of multiple histograms
CN114202453B (en) * 2022-01-26 2025-01-21 北京奇艺世纪科技有限公司 Information hiding method, device, equipment and medium
CN115131189A (en) * 2022-06-30 2022-09-30 齐鲁工业大学 Image reversible information hiding method and system based on convolutional neural network
CN119561735A (en) * 2024-11-19 2025-03-04 华中师范大学 Network dating system based on audio information hiding and audio information hiding method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103050120A (en) * 2012-12-28 2013-04-17 暨南大学 High-capacity digital audio reversible watermark processing method
CN105900169A (en) * 2014-01-09 2016-08-24 杜比实验室特许公司 Spatial error metrics of audio content
CN106205627A (en) * 2016-07-14 2016-12-07 暨南大学 Based on side information prediction and the DAB reversible water mark algorithm of rectangular histogram translation
CN109617682A (en) * 2018-12-12 2019-04-12 北京邮电大学 A Reversible Information Hiding Method in Ciphertext Domain Based on Left and Right Shift of Histogram
CN110110318A (en) * 2019-01-22 2019-08-09 清华大学 Text Stego-detection method and system based on Recognition with Recurrent Neural Network

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6065452B2 (en) * 2012-08-14 2017-01-25 富士通株式会社 Data embedding device and method, data extraction device and method, and program

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103050120A (en) * 2012-12-28 2013-04-17 暨南大学 High-capacity digital audio reversible watermark processing method
CN105900169A (en) * 2014-01-09 2016-08-24 杜比实验室特许公司 Spatial error metrics of audio content
CN106205627A (en) * 2016-07-14 2016-12-07 暨南大学 Based on side information prediction and the DAB reversible water mark algorithm of rectangular histogram translation
CN109617682A (en) * 2018-12-12 2019-04-12 北京邮电大学 A Reversible Information Hiding Method in Ciphertext Domain Based on Left and Right Shift of Histogram
CN110110318A (en) * 2019-01-22 2019-08-09 清华大学 Text Stego-detection method and system based on Recognition with Recurrent Neural Network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
音频中的信息隐藏和隐藏分析技术研究;魏一方;《中国优秀硕士学位论文全文数据库 信息科技辑》;20111015(第10期);第1-53页 *

Also Published As

Publication number Publication date
CN113096671A (en) 2021-07-09

Similar Documents

Publication Publication Date Title
CN113096671B (en) A method and system for reversible information hiding of large-capacity audio files
Zhao et al. Desynchronization attacks resilient watermarking method based on frequency singular value coefficient modification
JP3986150B2 (en) Digital watermarking to one-dimensional data
JP2022185105A (en) Method and device for generating mixed spatial/coefficient domain representation of hoa signal from coefficient domain representation of the hoa signal
Qiao et al. MP3 audio steganalysis
CN111755018B (en) Audio hiding method and device based on wavelet transform and quantized embedded key
Abdulrazzaq et al. A novel steganography approach for audio files
CN103050120B (en) high-capacity digital audio reversible watermark processing method
Altinbaş et al. Bit Reduction based audio steganography algorithm
CN116072129A (en) Robust audio watermark embedding and extracting method capable of resisting large-scale shearing
Bazyar et al. A recent review of MP3 based steganography methods
Sultani et al. Image and audio steganography based on indirect LSB
CN102664014B (en) Blind audio watermark implementing method based on logarithmic quantization index modulation
Hu et al. FFT-based dual-mode blind watermarking for hiding binary logos and color images in audio
CN113766084B (en) Reversible information hiding method and system for enhancing image smoothness
Mushgil et al. An efficient selective method for audio watermarking against de-synchronization attacks
Amrulloh et al. Minimizing Sample Space to Optimize Quality of Stego-Audio.
CN113362835A (en) Audio watermark processing method and device, electronic equipment and storage medium
Sadeghzadeh et al. A new method for watermarking using genetic algorithms
Ma et al. A High-performance reversible data hiding scheme for audio streams base on code division multiplexing
Li et al. Data hiding with tree based parity check
Pooyan et al. Adaptive and robust audio watermarking in wavelet domain
Kuriakose et al. A novel method for MP3 steganalysis
Peng et al. Optimal audio watermarking scheme using genetic optimization
Setiawan et al. Improving the performance of data hiding by designing three samples-based smoothing in audio

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address
CP03 Change of name, title or address

Address after: 250353 University Road, Changqing District, Ji'nan, Shandong Province, No. 3501

Patentee after: Qilu University of Technology (Shandong Academy of Sciences)

Country or region after: China

Address before: 250353 University Road, Changqing District, Ji'nan, Shandong Province, No. 3501

Patentee before: Qilu University of Technology

Country or region before: China