WO2018120819A1 - Method and device for producing presentation - Google Patents
Method and device for producing presentation Download PDFInfo
- Publication number
- WO2018120819A1 WO2018120819A1 PCT/CN2017/094598 CN2017094598W WO2018120819A1 WO 2018120819 A1 WO2018120819 A1 WO 2018120819A1 CN 2017094598 W CN2017094598 W CN 2017094598W WO 2018120819 A1 WO2018120819 A1 WO 2018120819A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- audio data
- presentation
- target time
- time interval
- audio
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/14—Tree-structured documents
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
Definitions
- the present application relates to the field of web technologies, and in particular, to a method for fabricating a presentation and a device for making a presentation.
- the user In order to realize distance learning, the user usually records the operation of the presentation while the user is speaking, keeping the user's speech synchronized with the presentation.
- the video data obtained by recording the operation of the presentation is bulky and takes up a lot of storage space.
- the video data is often compressed to reduce the resolution of the video data, resulting in blurry content of the presentation.
- the present application has been made in order to provide a method of fabricating a presentation and a corresponding apparatus for producing a presentation that overcomes the above problems or at least partially solves or alleviates the above problems.
- a method of making a presentation including:
- the audio data is re-added to the target time interval.
- a production apparatus for a presentation including:
- a web page loading module adapted to load a web page generated for the presentation
- a presentation element configuration module adapted to configure a presentation element in the web page
- An audio data adding module adapted to add audio data to the presentation element on a time axis to synchronously play the audio data when the presentation element is played according to the time axis;
- a target time interval selection module adapted to select a target time interval on the time axis
- An audio data adding module adapted to re-add audio data to the target time interval.
- a computer program comprising computer readable code causing the terminal device to perform the production of any of the aforementioned presentations when the computer readable code is run on a terminal device method.
- a computer readable medium storing a computer program of a method of fabricating a presentation as described above.
- the embodiment of the present application loads a web page generated for a presentation in a client, and configures a presentation element in the web page, and further adds audio data to the presentation element on the timeline, so that the presentation can be played according to the timeline
- the elements are synchronized to play audio data
- the web page is used as a carrier to create a presentation
- the audio data is used to synchronize the presentation of the presentation elements and audio data, allowing the user to view the contents of the presentation and listen to the presentation of the presentation.
- Using the web element as a presentation element compared to the video data, can greatly reduce the mention, reduce the occupation of the storage space, and, because the web element is directly drawn and loaded on the web page, without compression processing, the web element can be guaranteed.
- Sharpness on the other hand, by re-adding audio data to the target time interval, the user is prevented from manually deleting the uncovered audio data, and the limitation on the length of the re-added audio data is removed, thereby improving the efficiency of production.
- FIG. 1 is a flow chart showing the steps of an embodiment of a method for creating a presentation according to an embodiment of the present application
- FIGS. 2A-2C illustrate example diagrams of a configuration presentation element in accordance with one embodiment of the present application
- 3A-3D illustrate example diagrams of editing a presentation element and audio data playback order, in accordance with one embodiment of the present application
- FIGS. 4A-4D illustrate example diagrams of playing presentation elements and audio data in accordance with one embodiment of the present application
- FIGS. 5A-5B illustrate example diagrams of recording audio data in accordance with one embodiment of the present application
- 6A-6C illustrate example diagrams of a selection re-recording in accordance with one embodiment of the present application
- FIG. 7 is a structural block diagram of a device for fabricating a presentation according to an embodiment of the present application.
- Figure 8 schematically shows a block diagram of a terminal device for performing the method according to the present application
- Fig. 9 schematically shows a storage unit for holding or carrying program code implementing the method according to the present application.
- FIG. 1 is a flow chart showing the steps of an embodiment of a method for creating a presentation according to an embodiment of the present application. Specifically, the method may include the following steps:
- Step 101 Load a web page generated for the presentation.
- the user can log in to the server by using a user account on a client such as a browser, and send a request for generating a presentation to the server.
- a client such as a browser
- the server can configure a new presentation and configure the presentation with a unique presentation identifier, such as slide_id (slide ID), which is used to generate a unique one for the presentation. Edit the URL (Uniform Resource Locator) and return the URL for editing to the client.
- slide_id segment ID
- URL Uniform Resource Locator
- the client accesses the URL for editing to load a web page, which is the carrier of the presentation, ie the presentation can edit the content in the web page.
- the information of the presentation can be displayed in the area such as the user center.
- the client can directly load the web page by using the URL for editing, which is not used in the embodiment of the present application. limit.
- the presentation ID is used to generate a unique URL for the presentation, and the URL for the presentation is returned to the client.
- the client can access the URL for the presentation to load the web page, which is the carrier of the presentation, ie the presentation can be played in the web page.
- Step 102 configuring a presentation element in the web page.
- the presentation elements can include one or more of the following:
- Text images, images of specified shapes, lines, tables, frames, and code.
- the user can trigger the presentation element to edit state by clicking or the like.
- the editing operation bar of the presentation element is popped up in the web page, and the user can display the element of the presentation element in the editing operation column. Parameters for the user to adjust.
- the edit operation bar of the text box can be popped up in the web page, and the user can set the font alignment and font. Play multiplier, font color, line spacing, font spacing and other element parameters.
- the edit operation column of the table may be popped up in the web page, and the user may set the number of rows, the number of columns, and the cell.
- Element parameters such as margins, border width, and border color.
- the user can save it manually, or the script of the client executing the web page can be automatically saved.
- the parameters configured in the presentation element of the web page can be synchronized with the server during saving, and the server takes the parameter. Store under the presentation (represented by the presentation ID) for subsequent loading.
- the client loads the web page by using the URL for editing.
- the corresponding presentation element is loaded according to the previously set element parameters, so that the user can continue editing, which is not limited by the embodiment of the present application.
- Step 103 adding audio data to the presentation element on a time axis to synchronously play the audio data when the presentation element is played according to the time axis.
- the client in order to control the playing of the presentation, can configure a timeline and set the playing time of the presentation element on the timeline.
- the user can record audio data
- the client adds audio data to the presentation element, such as a user's speech, so that the presentation elements can be played while the audio data is being played on the time axis, so that the two can be synchronized.
- the user can set the playing time of the presentation element. With the passage of time, when the audio data is set to be played, the speech can be set to be switched in order.
- the manuscript elements that is, the text "Quiet Night Thinking", “Li Bai”, “Before the Moon”.
- the timing control is displayed in the lower left corner, and as time passes, the audio presentation data is played, and the presentation document elements are switched in order, that is, the text is displayed. "Quiet night thinking”, “Li Bai”, “before the bed bright moonlight”.
- step 103 may include the following sub-steps:
- Sub-step S11 the recorder is called to record audio data to the presentation element.
- the microphone can be called to collect the original audio data, and the recorder is called to record the audio data.
- a recording control can be loaded, after clicking the recording control, recording is started, and a visual element of the audio data is displayed on the axis element of the visual axis of the time axis.
- the sub-step S11 may include the following sub-steps:
- Sub-step S111 acquiring original audio stream data collected by the microphone
- Sub-step S112 the original audio stream data is transmitted to the recorder
- Sub-step S113 the original audio stream data is visualized in the recorder according to the recording parameters, and the original audio stream data is converted into audio data of a specified format.
- the client can obtain the original audio stream data collected by the microphone through the getUserMedia interface provided by WebRTC (Web Real-Time Communication).
- WebRTC Web Real-Time Communication
- a script processing node is created by the createScriptProcess method of the Web Audio API, which is used to process raw audio stream data using Javascript.
- the audio source node is connected to the processing node, and the processing node is connected to the audio output node to form a complete processing flow.
- the processing node can listen to the AudioProcessingEvent event through the onaudioprocess method, and the event acquires a certain length of data from the original audio stream data for processing at regular intervals.
- the original audio stream data is visualized by the drawAudioWave method (the visualized elements are generated based on the frequency, waveform and other attributes of the original audio stream data), and the audio data is transmitted to the Web Worker for audio.
- the drawAudioWave method the visualized elements are generated based on the frequency, waveform and other attributes of the original audio stream data
- the audio data is transmitted to the Web Worker for audio.
- the audio processing is paused, and a format file such as WAV is requested from the Web Worker, and the Web Worker converts the existing original audio stream data into audio data of a format such as WAV and returns it.
- a format file such as WAV
- the computing power of the client (such as a browser) is mostly limited, and the temporary storage and processing of the original audio stream data generally requires a large amount of computing power, another thread is opened by introducing a Web Worker.
- the temporary storage and processing of the original audio stream data is performed to ensure that other processing of the client (such as a browser) can be performed normally.
- step 103 may include the following sub-steps:
- Sub-step S21 inputting text information to the presentation element
- Sub-step S22 converting the text information into audio data.
- the terminal where the client is located is not configured with a microphone, the user may be allowed to By entering text information into the presentation elements, text information can be converted to audio data by speech synthesis (The Emperor Waltz, TEW).
- speech synthesis The Emperor Waltz, TEW.
- Speech synthesis also known as Text to Speech (TTS) technology
- TTS Text to Speech
- the characteristics of the segment such as pitch, length and intensity, are made, so that the synthesized speech can correctly express the semantics and sound more natural.
- the phonetic primitives of the single words or phrases corresponding to the processed text are extracted from the speech synthesis library, and the prosody characteristics of the speech primitives are adjusted and modified by using a specific speech synthesis technique, and finally synthesized. Meet the required voice data.
- the manner of adding audio data is only an example.
- other manners of adding audio data may be set according to actual conditions, for example, directly importing existing audio data, and the like. This is not limited.
- those skilled in the art may also adopt other manners of adding audio data according to actual needs, and the embodiment of the present application does not limit this.
- the audio data on the time axis can be uploaded to the server.
- the audio data can be retrieved from the Web Worker, and the audio file is compressed by the amrnb.js library, compressed into a specified format such as amr, and then uploaded to the server, the server stores Under the presentation (represented by the presentation ID) for subsequent loading.
- Step 104 selecting a target time interval on the time axis.
- the unsatisfactory area may be selected for re-recording on the time axis, and the area is referred to as a target time interval.
- the timeline has a visual axis element on the web page with a time scale on the axis element of the visualization, such as 00:00, 00:05, 00:10, etc. .
- a scrolling marker strip is inserted over the visualized axis element, as shown in Figure 6A, with a solid dot, like a pin.
- the interval between the start position and the end position of the scroll bar is taken as the target time interval as shown in the rectangular area of the visual axis element as shown in FIG. 6A.
- Step 105 re-adding audio data to the target time interval.
- the audio data in the uncovered area is automatically deleted to ensure the consistency of the audio data, and the re-recording is selected.
- the client can replace the target time interval with audio data of any length, that is, the time of the added audio data can be longer than the length of the target time interval.
- the client automatically deletes the remaining 8 seconds of audio data.
- step 105 may include the following sub-steps:
- Sub-step S31 deleting original audio data located in the target time interval
- Sub-step S32 moving original audio data located after the target time interval to a start time of the target time zone
- Sub-step S33 inserting new audio data from the start time of the target time interval, and moving the original audio data located after the target time interval to the end time of the new audio data.
- the original audio data is audio data before re-adding audio data
- the new audio data is re-added audio data
- the timeline has visual axis elements on the web page, and the audio data has visual audio elements on the visual axis elements.
- the client can delete the original audio data located in the target time interval, and move the original audio data located after the target time interval to the start time of the target time zone to ensure the consistency of the audio data.
- the visualized audio elements of the original audio data located in the target time interval can be deleted on the visualized axis elements.
- the visualized audio element of the original audio data located after the target time interval is moved to the start time of the target time zone.
- new audio data can be inserted from the start time of the target time interval, and the original audio data located after the target time interval is moved to the end time of the new audio data, ensuring continuity of the audio data.
- the visualized audio element of the new audio data can be inserted from the start time of the target time zone, and the visualized audio element of the original audio data located after the target time zone is moved to the new audio.
- the data is visualized after the audio element.
- some recording editing applications may support selection re-recording, but generally delete the audio data in the covered area, the audio data in the uncovered area will be retained, and the user needs to manually delete it.
- the re-recorded The length of the audio data generally cannot exceed the area covered.
- the selected target time interval is 10 seconds, but the newly recorded audio data is 2 seconds, the remaining 8 seconds of audio data is not overwritten, and the remaining 8 seconds of audio data still exists, the user needs to manually The remaining 8 seconds of audio data is selected for deletion, and the length of the recording cannot exceed 10 seconds.
- the embodiment of the present application loads a web page generated for a presentation in a client, and configures a presentation element in the web page, and further adds audio data to the presentation element on the timeline, so that the presentation can be played according to the timeline
- the elements are synchronized to play audio data
- the web page is used as a carrier to create a presentation
- the audio data is used to synchronize the presentation of the presentation elements and audio data, allowing the user to view the contents of the presentation and listen to the presentation of the presentation.
- Using the web element as a presentation element compared to the video data, can greatly reduce the mention, reduce the occupation of the storage space, and, because the web element is directly drawn and loaded on the web page, without compression processing, the web element can be guaranteed. Sharpness, on the other hand, through the target time interval
- the audio data is re-added to prevent the user from manually deleting the uncovered audio data, thereby removing the limitation on the length of the re-added audio data and improving the efficiency of the production.
- FIG. 7 a structural block diagram of a device for creating a presentation according to an embodiment of the present application is shown, which may specifically include the following modules:
- a web page loading module 701, configured to load a web page generated for the presentation
- a presentation element configuration module 702 adapted to configure a presentation element in the web page
- An audio data adding module 703, configured to add audio data to the presentation element on a time axis to synchronously play the audio data when the presentation element is played according to the time axis;
- a target time interval selection module 704 adapted to select a target time interval on the time axis;
- the audio data adding module 705 is adapted to re-add audio data to the target time interval.
- the audio data adding module 701 includes:
- a recording sub-module adapted to call the recorder to record audio data to the presentation element.
- the recording submodule includes:
- the original audio stream data acquiring unit is adapted to acquire original audio stream data collected in the microphone
- a recorder incoming unit adapted to transmit the raw audio stream data to the recorder
- a recorder processing unit adapted to visualize the original audio stream data in the recorder according to recording parameters, and convert the original audio stream data into audio data of a specified format.
- the audio data adding module 701 includes:
- a text information input submodule adapted to input text information to the presentation element
- a text information conversion sub-module adapted to convert the text information into audio data.
- the timeline has a visual axis element on the web page
- the target time interval selection module 703 includes:
- a scrolling marker strip insertion sub-module adapted to insert a scrolling marker strip on the visualized axis element
- the interval selection sub-module is adapted to use a section between the start position and the end position of the scroll marker as the target time interval.
- the audio data adding module 704 includes:
- the original audio data deletion submodule is adapted to delete original audio data located in the target time interval
- An original audio data moving submodule adapted to move original audio data located after the target time interval to a start time of the target time zone
- a new audio data insertion sub-module adapted to insert new audio data from a start time of the target time interval and to move original audio data located after the target time interval to an end time of the new audio data.
- the timeline has a visual axis element on the web page, the audio data having a visualized audio element on the visualized axis element;
- the audio data adding module 704 further includes:
- An audio element deletion submodule adapted to delete a visualized audio element of the original audio data located in the target time interval on the visualized axis element
- An audio element moving submodule adapted to move a visualized audio element of the original audio data located after the target time interval to a start time of the target time zone;
- An audio element insertion sub-module adapted to insert a visualized audio element of the new audio data from a start time of the target time zone and to move the visualized audio element of the original audio data located after the target time interval to the After the audio elements of the new audio data are visualized.
- the method further includes:
- An audio uploading module adapted to upload audio data on the timeline to a server.
- the description is relatively simple, and the relevant parts can be referred to the description of the method embodiment.
- the various component embodiments of the present application can be implemented in hardware, or in a software module running on one or more processors, or in a combination thereof.
- a microprocessor or digital signal processor may be used in practice to implement some or all of the functionality of some or all of the components of the presentation device in accordance with embodiments of the present application.
- the application can also be implemented as a device or device program (e.g., a computer program and a computer program product) for performing some or all of the methods described herein.
- Such a program implementing the present application may be stored on a computer readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
- FIG. 8 illustrates a terminal device that can implement the production of a presentation according to the present application.
- the terminal device conventionally includes a processor 810 and a computer program product or computer readable medium in the form of a memory 820.
- the memory 820 may be an electronic memory such as a flash memory, an EEPROM (Electrically Erasable Programmable Read Only Memory), an EPROM, a hard disk, or a ROM.
- Memory 820 has a memory space 830 for program code 831 for performing any of the method steps described above.
- storage space 830 for program code may include various program code 831 for implementing various steps in the above methods, respectively.
- the program code can be read from or written to one or more computer program products.
- Such computer program products include program code carriers such as hard disks, compact disks (CDs), memory cards or floppy disks.
- Such a computer program product is typically a portable or fixed storage unit as described with reference to FIG.
- the storage unit may have a storage section, a storage space, and the like arranged similarly to the storage 820 in the terminal device of FIG.
- the program code can be compressed, for example, in an appropriate form.
- the storage unit includes computer readable code 831', i.e., code readable by a processor, such as 810, that when executed by the terminal device causes the terminal device to perform each of the methods described above step.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- User Interface Of Digital Computer (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
本申请涉及web技术领域,特别是涉及一种演示文稿的制作方法和一种演示文稿的制作装置。The present application relates to the field of web technologies, and in particular, to a method for fabricating a presentation and a device for making a presentation.
在人们的日常工作、学习和技术交流中,为了方便而有条理的讲解,经常采用演示文稿辅助演讲。In people's daily work, study and technical exchanges, for the convenience of a structured explanation, often use presentations to assist the speech.
为了实现进行远程教学,用户在进行演讲时,通常对演示文稿的操作进行录屏,保持用户的演讲与演示文稿同步。In order to realize distance learning, the user usually records the operation of the presentation while the user is speaking, keeping the user's speech synchronized with the presentation.
此后,其他用户可以播放录屏获得的视频数据,收听该用户的演讲以及观看演示文稿的内容,而无需该用户实时面对其他用户进行演讲。Thereafter, other users can play the video data obtained by the screen recording, listen to the user's speech and view the content of the presentation without the user having to speak to other users in real time.
但是,对演示文稿的操作进行录屏获得的视频数据,体积较大,占用大量的存储空间。However, the video data obtained by recording the operation of the presentation is bulky and takes up a lot of storage space.
为了减少存储空间,经常会对视频数据进行压缩,降低视频数据的分辨率,导致演示文稿的内容模糊。In order to reduce the storage space, the video data is often compressed to reduce the resolution of the video data, resulting in blurry content of the presentation.
发明内容Summary of the invention
鉴于上述问题,提出了本申请以便提供一种克服上述问题或者至少部分地解决或减缓上述问题的一种演示文稿的制作方法和相应的一种演示文稿的制作装置。In view of the above problems, the present application has been made in order to provide a method of fabricating a presentation and a corresponding apparatus for producing a presentation that overcomes the above problems or at least partially solves or alleviates the above problems.
依据本申请的一个方面,提供了一种演示文稿的制作方法,包括:According to an aspect of the present application, a method of making a presentation is provided, including:
加载对演示文稿生成的web页面;Load a web page generated for the presentation;
在所述web页面中配置演示文稿元素;Configuring a presentation element in the web page;
在时间轴上对所述演示文稿元素添加音频数据,以按照所述时间轴播放所述演示文稿元素时同步播放所述音频数据;Adding audio data to the presentation element on a timeline to synchronously play the audio data when the presentation element is played in accordance with the timeline;
在所述时间轴上选定目标时间区间; Selecting a target time interval on the time axis;
对所述目标时间区间重新添加音频数据。The audio data is re-added to the target time interval.
根据本申请的另一方面,提供了一种演示文稿的制作装置,包括:According to another aspect of the present application, a production apparatus for a presentation is provided, including:
web页面加载模块,适于加载对演示文稿生成的web页面;a web page loading module adapted to load a web page generated for the presentation;
演示文稿元素配置模块,适于在所述web页面中配置演示文稿元素;a presentation element configuration module adapted to configure a presentation element in the web page;
音频数据添加模块,适于在时间轴上对所述演示文稿元素添加音频数据,以按照所述时间轴播放所述演示文稿元素时同步播放所述音频数据;An audio data adding module adapted to add audio data to the presentation element on a time axis to synchronously play the audio data when the presentation element is played according to the time axis;
目标时间区间选定模块,适于在所述时间轴上选定目标时间区间;a target time interval selection module adapted to select a target time interval on the time axis;
音频数据添加模块,适于对所述目标时间区间重新添加音频数据。An audio data adding module adapted to re-add audio data to the target time interval.
根据本申请的另一方面,提供了一种计算机程序,包括计算机可读代码,当所述计算机可读代码在终端设备上运行时,导致所述终端设备执行前述任一个所述演示文稿的制作方法。According to another aspect of the present application, there is provided a computer program comprising computer readable code causing the terminal device to perform the production of any of the aforementioned presentations when the computer readable code is run on a terminal device method.
根据本申请的另一方面,提供了一种计算机可读介质,其中存储了如前述演示文稿的制作方法的计算机程序。According to another aspect of the present application, there is provided a computer readable medium storing a computer program of a method of fabricating a presentation as described above.
本申请实施例在客户端中加载对演示文稿生成的web页面,并在该web页面中配置演示文稿元素,此外,在时间轴上对演示文稿元素添加音频数据,使得可以按照时间轴播放演示文稿元素时同步播放音频数据,将web页面作为载体制作演示文稿,并配以音频数据,实现在同步播放演示文稿元素及音频数据,可供用户观看演示文稿的内容、收听演示文稿的演讲,一方面,将web元素作为演示文稿元素,相比视频数据,可以大大减少提及,减少存储空间的占用,并且,由于web元素是直接在web页面绘制、加载,无需进行压缩处理,可以保证web元素的清晰度,另一方面,通过对目标时间区间重新添加音频数据,避免用户手动删除未被覆盖的音频数据,解除了对重新添加的音频数据的长度的限制,提高了制作的效率。The embodiment of the present application loads a web page generated for a presentation in a client, and configures a presentation element in the web page, and further adds audio data to the presentation element on the timeline, so that the presentation can be played according to the timeline The elements are synchronized to play audio data, the web page is used as a carrier to create a presentation, and the audio data is used to synchronize the presentation of the presentation elements and audio data, allowing the user to view the contents of the presentation and listen to the presentation of the presentation. Using the web element as a presentation element, compared to the video data, can greatly reduce the mention, reduce the occupation of the storage space, and, because the web element is directly drawn and loaded on the web page, without compression processing, the web element can be guaranteed. Sharpness, on the other hand, by re-adding audio data to the target time interval, the user is prevented from manually deleting the uncovered audio data, and the limitation on the length of the re-added audio data is removed, thereby improving the efficiency of production.
上述说明仅是本申请技术方案的概述,为了能够更清楚了解本申请的技术手段,而可依照说明书的内容予以实施,并且为了让本申请的上述和其它目的、特征和优点能够更明显易懂,以下特举本申请的具体实施方式。 The above description is only an overview of the technical solutions of the present application, and the technical means of the present application can be more clearly understood, and the above and other objects, features and advantages of the present application can be more clearly understood. The following is a specific embodiment of the present application.
通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目的,而并不认为是对本申请的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:Various other advantages and benefits will become apparent to those skilled in the art from a The drawings are only for the purpose of illustrating the preferred embodiments and are not intended to be limiting. Throughout the drawings, the same reference numerals are used to refer to the same parts. In the drawing:
图1示出了根据本申请一个实施例的一种演示文稿的制作方法实施例的步骤流程图;1 is a flow chart showing the steps of an embodiment of a method for creating a presentation according to an embodiment of the present application;
图2A-图2C示出了根据本申请一个实施例的一种配置演示文稿元素的示例图;2A-2C illustrate example diagrams of a configuration presentation element in accordance with one embodiment of the present application;
图3A-图3D示出了根据本申请一个实施例的一种编辑演示文稿元素与音频数据播放顺序的示例图;3A-3D illustrate example diagrams of editing a presentation element and audio data playback order, in accordance with one embodiment of the present application;
图4A-图4D示出了根据本申请一个实施例的一种播放演示文稿元素与音频数据的示例图;4A-4D illustrate example diagrams of playing presentation elements and audio data in accordance with one embodiment of the present application;
图5A-图5B示出了根据本申请一个实施例的一种录制音频数据的示例图;5A-5B illustrate example diagrams of recording audio data in accordance with one embodiment of the present application;
图6A-图6C示出了根据本申请一个实施例的一种选区重录的示例图;以及6A-6C illustrate example diagrams of a selection re-recording in accordance with one embodiment of the present application;
图7示出了根据本申请一个实施例的一种演示文稿的制作装置的结构框图;FIG. 7 is a structural block diagram of a device for fabricating a presentation according to an embodiment of the present application;
图8示意性地示出了用于执行根据本申请的方法的终端设备的框图;Figure 8 schematically shows a block diagram of a terminal device for performing the method according to the present application;
图9示意性地示出了用于保持或者携带实现根据本申请的方法的程序代码的存储单元。Fig. 9 schematically shows a storage unit for holding or carrying program code implementing the method according to the present application.
下面将参照附图更详细地描述本公开的示例性实施例。虽然附图中显示了本公开的示例性实施例,然而应当理解,可以以各种形式实现本公开而不应被这里阐述的实施例所限制。相反,提供这些实施例是为了能够更透彻地 理解本公开,并且能够将本公开的范围完整的传达给本领域的技术人员。Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While the embodiments of the present invention have been shown in the drawings, the embodiments Rather, these embodiments are provided to provide a more thorough The disclosure is to be understood, and the scope of the present disclosure can be fully conveyed to those skilled in the art.
参照图1,示出了根据本申请一个实施例的一种演示文稿的制作方法实施例的步骤流程图,具体可以包括如下步骤:1 is a flow chart showing the steps of an embodiment of a method for creating a presentation according to an embodiment of the present application. Specifically, the method may include the following steps:
步骤101,加载对演示文稿生成的web页面。Step 101: Load a web page generated for the presentation.
在本申请实施例中,用户可以在浏览器等客户端采用用户账号登录服务器,并向服务器发送演示文稿的生成请求。In the embodiment of the present application, the user can log in to the server by using a user account on a client such as a browser, and send a request for generating a presentation to the server.
服务器若接收到客户端发送的生成请求,则可以配置一个新的演示文稿,并对该演示文稿配置唯一的演示文稿标识,如slide_id(幻灯片ID),采用该演示文稿标识生成唯一的用于编辑的URL(Uniform Resource Locator,统一资源定位符),并将该用于编辑的URL返回客户端。If the server receives the build request sent by the client, it can configure a new presentation and configure the presentation with a unique presentation identifier, such as slide_id (slide ID), which is used to generate a unique one for the presentation. Edit the URL (Uniform Resource Locator) and return the URL for editing to the client.
例如,假设某个演示文稿标识的slide_id为abcd1234,则可以生成用于编辑的URL为https://ppt.***.com/editor?slide_id=abcd1234,其中,“***”为该服务器的域名。For example, if the slide_id of a presentation ID is abcd1234, the URL that can be generated for editing is https://ppt.***.com/editor? Slide_id=abcd1234, where "***" is the domain name of the server.
客户端访问该用于编辑的URL加载web(网页)页面,该web页面为演示文稿的载体,即演示文稿可以在该web页面中编辑内容。The client accesses the URL for editing to load a web page, which is the carrier of the presentation, ie the presentation can edit the content in the web page.
当然,若在先已生成演示文稿,则在用户中心等区域可以显示该演示文稿的信息,用户点击之后,客户端可以直接采用用于编辑的URL加载web页面,本申请实施例对此不加以限制。Of course, if the presentation has been generated in the first place, the information of the presentation can be displayed in the area such as the user center. After the user clicks, the client can directly load the web page by using the URL for editing, which is not used in the embodiment of the present application. limit.
此外,当演示文稿编辑完成之后,采用演示文稿标识生成唯一的用于演讲的URL,并将该用于演讲的URL返回客户端。In addition, when the presentation is edited, the presentation ID is used to generate a unique URL for the presentation, and the URL for the presentation is returned to the client.
例如,假设某个演示文稿标识的slide_id为abcd1234,则可以生成用于编辑的URL为http://ppt.***.com/present?slide_id=abcd1234#/,其中,“***”为该服务器的域名。For example, if the slide_id of a presentation ID is abcd1234, the URL for editing can be generated as http://ppt.***.com/present? Slide_id=abcd1234#/, where "***" is the domain name of the server.
客户端可以访问该用于演讲的URL加载web页面,该web页面为演示文稿的载体,即可以在该web页面中播放演示文稿。The client can access the URL for the presentation to load the web page, which is the carrier of the presentation, ie the presentation can be played in the web page.
步骤102,在所述web页面中配置演示文稿元素。
在web页面中具有一个或多个web操作控件,用户可以通过鼠标点击 等方式触发该web操作控件,从而添加演示文稿元素。Have one or more web action controls in the web page, the user can click through the mouse This way, the web action control is triggered to add a presentation element.
在一个示例中,如图2A所示,演示文稿元素可以包括如下的一种或多种:In one example, as shown in FIG. 2A, the presentation elements can include one or more of the following:
文字、图片、指定形状的图像、直线、表格、框架(iframe)、代码。Text, images, images of specified shapes, lines, tables, frames, and code.
在web页面中,可以直接对这些演示文稿元素进行操作,编辑部分元素参数,例如,调整字体的文本框、图片、图像、直线、表格框架、代码等演示文稿元素在web页面的大小,旋转字体的文本框、图片、图像、直线、表格框架、代码等演示文稿元素,拖动字体的文本框、图片、图像、直线、表格框架、代码等演示文稿元素在web页面的位置,在文字的文本框、表格等演示文稿元素中输入文本,等等。In the web page, you can directly manipulate these presentation elements, edit some of the element parameters, for example, adjust the font's text box, image, image, line, table frame, code, etc. The size of the presentation element in the web page, rotate the font Presentation elements such as text boxes, images, images, lines, table frames, code, etc., dragging text boxes, images, images, lines, table frames, code, etc. of the presentation elements in the position of the web page, text in the text Enter text in presentation elements such as boxes, tables, and more.
此外,用户可以通过点击等方式触发该演示文稿元素进行编辑状态,此时,在web页面中弹出该演示文稿元素的编辑操作栏,用户可以在该编辑操作栏中,显示该演示文稿元素的元素参数,以供用户进行调整。In addition, the user can trigger the presentation element to edit state by clicking or the like. At this time, the editing operation bar of the presentation element is popped up in the web page, and the user can display the element of the presentation element in the editing operation column. Parameters for the user to adjust.
例如,如图2B所示,如果在web页面中插入了文字的文本框,若用户点击该文本框,则可以在web页面中弹出该文本框的编辑操作栏,用户可以设置字体对齐方式、字体播放倍数、字体颜色、行间距、字体间距等元素参数。For example, as shown in FIG. 2B, if a text box of text is inserted in a web page, if the user clicks on the text box, the edit operation bar of the text box can be popped up in the web page, and the user can set the font alignment and font. Play multiplier, font color, line spacing, font spacing and other element parameters.
又例如,如图2C所示,如果在web页面中插入了表格,若用户点击该表格,则可以在web页面中弹出该表格的编辑操作栏,用户可以设置行数、列数、单元格内边距、边框宽度、边框颜色等元素参数。For another example, as shown in FIG. 2C, if a form is inserted in the web page, if the user clicks on the form, the edit operation column of the table may be popped up in the web page, and the user may set the number of rows, the number of columns, and the cell. Element parameters such as margins, border width, and border color.
需要说明的是,这些元素参数与可直接对演示文稿元素进行的编辑的种类可以相同,也可以不相同,本申请实施例对此不加以限制。It should be noted that these element parameters may be the same as or different from the types of editing that can be directly performed on the presentation elements, and the embodiment of the present application does not limit this.
对于演示文稿元素配置的编辑,用户可以手动保存,也可以客户端执行web页面的脚本自动保存,在web页面中对演示文稿元素配置的参数,可以在保存时与服务器进行同步,服务器将该参数存储至该演示文稿(以演示文稿标识表示)之下,以便后续进行加载。For the editing of the presentation element configuration, the user can save it manually, or the script of the client executing the web page can be automatically saved. The parameters configured in the presentation element of the web page can be synchronized with the server during saving, and the server takes the parameter. Store under the presentation (represented by the presentation ID) for subsequent loading.
当然,若在先已生成演示文稿,则在用户中心等区域可以显示该演示文稿的信息,用户点击之后,客户端在采用用于编辑的URL加载web页面的 同时,按照在先设置的元素参数加载相应的演示文稿元素,以供用户继续进行编辑,本申请实施例对此不加以限制。Of course, if the presentation has been generated before, the information of the presentation can be displayed in the user center and the like area, and after the user clicks, the client loads the web page by using the URL for editing. At the same time, the corresponding presentation element is loaded according to the previously set element parameters, so that the user can continue editing, which is not limited by the embodiment of the present application.
步骤103,在时间轴上对所述演示文稿元素添加音频数据,以按照所述时间轴播放所述演示文稿元素时同步播放所述音频数据。
在本申请实施例中,为了控制演示文稿的播放,客户端可以配置一时间轴,并设置演示文稿元素在该时间轴的播放时间。In the embodiment of the present application, in order to control the playing of the presentation, the client can configure a timeline and set the playing time of the presentation element on the timeline.
此外,用户可以录制音频数据,客户端对演示文稿元素添加音频数据,例如用户的演讲讲话,以便按照时间轴在播放音频数据进行演讲等操作的同时,播放演示文稿元素,使得两者可以同步。In addition, the user can record audio data, and the client adds audio data to the presentation element, such as a user's speech, so that the presentation elements can be played while the audio data is being played on the time axis, so that the two can be synchronized.
在编辑状态下,如图3A、图3B、图3C、图3D所示,用户可以设置演示文稿元素的播放时间,随着时间的推移,在设置播放音频数据的同时,可以设置按照顺序切换演讲文稿元素,即显示文本“静夜思”、“李白”、“床前明月光”。In the editing state, as shown in FIG. 3A, FIG. 3B, FIG. 3C, and FIG. 3D, the user can set the playing time of the presentation element. With the passage of time, when the audio data is set to be played, the speech can be set to be switched in order. The manuscript elements, that is, the text "Quiet Night Thinking", "Li Bai", "Before the Moon".
在演讲状态下,如图4A、图4B、图4C、图4D所示,左下角显示计时的控件,随着时间的推移,在播放音频数据的同时,按照顺序切换演讲文稿元素,即显示文本“静夜思”、“李白”、“床前明月光”。In the speech state, as shown in FIG. 4A, FIG. 4B, FIG. 4C, and FIG. 4D, the timing control is displayed in the lower left corner, and as time passes, the audio presentation data is played, and the presentation document elements are switched in order, that is, the text is displayed. "Quiet night thinking", "Li Bai", "before the bed bright moonlight".
在本申请的一个实施例中,步骤103可以包括如下子步骤:In an embodiment of the present application,
子步骤S11,对所述演示文稿元素调用录音器录制音频数据。Sub-step S11, the recorder is called to record audio data to the presentation element.
在本申请实施例中,若客户端所处的终端配置有麦克风,则可以调用该麦克风采集原始的音频数据,并调用录音器录制成音频数据。In the embodiment of the present application, if the terminal where the client is located is configured with a microphone, the microphone can be called to collect the original audio data, and the recorder is called to record the audio data.
例如,如图5A和图5B所示,在web页面中,可以加载一录音控件,点击该录音控件之后,开始录音,并生成时间轴的可视化的轴元素上,显示音频数据的可视化的音频元素。For example, as shown in FIG. 5A and FIG. 5B, in a web page, a recording control can be loaded, after clicking the recording control, recording is started, and a visual element of the audio data is displayed on the axis element of the visual axis of the time axis. .
在本申请实施例的一个示例中,子步骤S11可以包括如下子步骤:In an example of an embodiment of the present application, the sub-step S11 may include the following sub-steps:
子步骤S111,获取麦克风采集的原始音频流数据;Sub-step S111, acquiring original audio stream data collected by the microphone;
子步骤S112,将所述原始音频流数据传入录音器;Sub-step S112, the original audio stream data is transmitted to the recorder;
子步骤S113,在所述录音器中按照录音参数对所述原始音频流数据进行可视化处理,并将所述原始音频流数据转换为指定格式的音频数据。 Sub-step S113, the original audio stream data is visualized in the recorder according to the recording parameters, and the original audio stream data is converted into audio data of a specified format.
在本示例中,客户端可以通过WebRTC(Web Real-Time Communication,源自网页实时通信)提供的getUserMedia接口,获取麦克风采集到的原始音频流数据。In this example, the client can obtain the original audio stream data collected by the microphone through the getUserMedia interface provided by WebRTC (Web Real-Time Communication).
通过Web Audio API(Application Programming Interface,应用程序编程接口)的createMediaStreamSource方法创建一个音频流源节点,并将该音频源传入录音器Recorder中,进行录音器的初始化。Create an audio stream source node through the createMediaStreamSource method of the Web Programming API (Application Programming Interface), and pass the audio source to the Recorder Recorder for initialization of the recorder.
在录音器中,通过Web Audio API的createScriptProcess方法创建一个脚本处理节点,该处理节点用于使用Javascript处理原始音频流数据。In the recorder, a script processing node is created by the createScriptProcess method of the Web Audio API, which is used to process raw audio stream data using Javascript.
此外,将音频源节点与处理节点相连,处理节点与音频输出节点相连,形成一个完整的处理流程。In addition, the audio source node is connected to the processing node, and the processing node is connected to the audio output node to form a complete processing flow.
处理节点可以通过onaudioprocess方法监听AudioProcessingEvent事件,该事件每隔一定时间从原始音频流数据中获取一定长度的数据进行处理。The processing node can listen to the AudioProcessingEvent event through the onaudioprocess method, and the event acquires a certain length of data from the original audio stream data for processing at regular intervals.
开始录音时,向Web Worker传递录音相关的参数(如开始的位置、选区的大小、选区右侧剩余的宽度),并通知onaudioprocess进行音频处理。When you start recording, pass the recording-related parameters (such as the starting position, the size of the selection, and the remaining width to the right of the selection) to the Web Worker, and notify onaudioprocess for audio processing.
参数设置完毕后,在onaudioprocess中,通过drawAudioWave方法对原始音频流数据进行可视化处理(基于原始音频流数据的频率、波形等属性生成可视化的元素),并将音频数据传递到Web Worker,进行音频的暂存与初步处理。After the parameters are set, in the onaudioprocess, the original audio stream data is visualized by the drawAudioWave method (the visualized elements are generated based on the frequency, waveform and other attributes of the original audio stream data), and the audio data is transmitted to the Web Worker for audio. Temporary and preliminary processing.
暂停录音时,暂停音频处理,并向Web Worker请求WAV等格式文件,Web Worker将已有的原始音频流数据为WAV等格式的音频数据并传回。When the recording is paused, the audio processing is paused, and a format file such as WAV is requested from the Web Worker, and the Web Worker converts the existing original audio stream data into audio data of a format such as WAV and returns it.
在本示例中,由于客户端(如浏览器)的计算能力大多比较有限,而对原始音频流数据的暂存以及处理一般需要很大的计算能力,因此,通过引入Web Worker另开启一个线程,进行原始音频流数据的暂存以及处理,从而保证客户端(如浏览器)的其他处理能够正常进行。In this example, since the computing power of the client (such as a browser) is mostly limited, and the temporary storage and processing of the original audio stream data generally requires a large amount of computing power, another thread is opened by introducing a Web Worker. The temporary storage and processing of the original audio stream data is performed to ensure that other processing of the client (such as a browser) can be performed normally.
在本申请的另一个实施例中,步骤103可以包括如下子步骤:In another embodiment of the present application,
子步骤S21,对所述演示文稿元素输入文本信息;Sub-step S21, inputting text information to the presentation element;
子步骤S22,将所述文本信息转换为音频数据。Sub-step S22, converting the text information into audio data.
在本申请实施例中,若客户端所处的终端未配置麦克风,则可以让用户 对演示文稿元素输入文本信息,可以通过语音合成(The Emperor Waltz,TEW)将文本信息转换为音频数据。In the embodiment of the present application, if the terminal where the client is located is not configured with a microphone, the user may be allowed to By entering text information into the presentation elements, text information can be converted to audio data by speech synthesis (The Emperor Waltz, TEW).
语音合成,又称文语转换(Text to Speech,TTS)技术,通常可以包括如下处理:Speech synthesis, also known as Text to Speech (TTS) technology, can usually include the following:
(1)文本分析(1) Text analysis
对输入的文本信息进行语言学分析(模拟人对自然语言的理解过程),逐句进行词汇的、语法的和语义的分析,以确定句子的低层结构和每个字的音素的组成,包括文本的断句、字词切分、多音字的处理、数字的处理、缩略语的处理等。使计算机对输入的文本能完全理解,并给出后两部分所需要的各种发音提示。Perform linguistic analysis on the input text information (simulating human understanding of natural language), lexical, grammatical and semantic analysis step by sentence to determine the low-level structure of the sentence and the composition of the phoneme of each word, including text Breaking sentences, word segmentation, processing of multi-syllable words, processing of numbers, processing of abbreviations, etc. Make the computer fully understand the input text and give the various pronunciation prompts needed in the last two parts.
(2)韵律建模(2) Prosody modeling
为合成语音规划出音段特征,如音高、音长和音强等,使合成语音能正确表达语意,听起来更加自然。For the synthesis of speech, the characteristics of the segment, such as pitch, length and intensity, are made, so that the synthesized speech can correctly express the semantics and sound more natural.
(3)语音合成(核心模块)(3) Speech synthesis (core module)
根据韵律建模的结果,把处理好的文本所对应的单字或短语的语音基元从语音合成库中提取,利用特定的语音合成技术对语音基元进行韵律特性的调整和修改,最终合成出符合要求的语音数据。According to the result of prosody modeling, the phonetic primitives of the single words or phrases corresponding to the processed text are extracted from the speech synthesis library, and the prosody characteristics of the speech primitives are adjusted and modified by using a specific speech synthesis technique, and finally synthesized. Meet the required voice data.
当然,上述添加音频数据的方式只是作为示例,在实施本申请实施例时,可以根据实际情况设置其他添加音频数据的方式,例如,直接导入已有的音频数据,等等,本申请实施例对此不加以限制。另外,除了上述判断处理方法外,本领域技术人员还可以根据实际需要采用其它添加音频数据的方式,本申请实施例对此也不加以限制。Of course, the manner of adding audio data is only an example. In the implementation of the embodiment of the present application, other manners of adding audio data may be set according to actual conditions, for example, directly importing existing audio data, and the like. This is not limited. In addition, in addition to the above-mentioned judging processing method, those skilled in the art may also adopt other manners of adding audio data according to actual needs, and the embodiment of the present application does not limit this.
在本申请实施例中,当添加音频数据完成时,可以将时间轴上的音频数据上传至服务器。In the embodiment of the present application, when the addition of the audio data is completed, the audio data on the time axis can be uploaded to the server.
在具体实现中,对于录制音频数据的情况,可以从Web Worker取回音频数据,并通过amrnb.js库等方式进行音频文件的压缩,压缩为amr等指定的格式,然后上传至服务器,服务器存储至该演示文稿(以演示文稿标识表示)之下,以便后续进行加载。 In the specific implementation, for recording audio data, the audio data can be retrieved from the Web Worker, and the audio file is compressed by the amrnb.js library, compressed into a specified format such as amr, and then uploaded to the server, the server stores Under the presentation (represented by the presentation ID) for subsequent loading.
步骤104,在所述时间轴上选定目标时间区间。
在本申请实施例中,如果用户对音频数据的部分内容不满意,可以在时间轴上选中不满意区域重新录制,该区域称之为目标时间区间。In the embodiment of the present application, if the user is not satisfied with part of the content of the audio data, the unsatisfactory area may be selected for re-recording on the time axis, and the area is referred to as a target time interval.
在一个示例中,如图6A所示,时间轴在所述web页面上具有可视化的轴元素,在该可视化的轴元素上具有时间刻度,如00:00、00:05、00:10等等。In one example, as shown in FIG. 6A, the timeline has a visual axis element on the web page with a time scale on the axis element of the visualization, such as 00:00, 00:05, 00:10, etc. .
在可视化的轴元素上插入滚动标志条,如图6A所示的具有实心圆点、类似大头针的标志条。A scrolling marker strip is inserted over the visualized axis element, as shown in Figure 6A, with a solid dot, like a pin.
将滚动标志条的起始位置与终结位置之间的区间作为目标时间区间,如图6A所示的在可视化的轴元素的矩形区域。The interval between the start position and the end position of the scroll bar is taken as the target time interval as shown in the rectangular area of the visual axis element as shown in FIG. 6A.
步骤105,对所述目标时间区间重新添加音频数据。
本申请实施例对目标时间区间重新添加音频数据,会自动将未覆盖的区域中的音频数据删除,保证音频数据的连贯性,实现选取重录。When the audio data is re-added to the target time interval, the audio data in the uncovered area is automatically deleted to ensure the consistency of the audio data, and the re-recording is selected.
此外,客户端可以将目标时间区间替换为任意长度的音频数据,即添加的音频数据的时间可以长度超过目标时间区间的长度。In addition, the client can replace the target time interval with audio data of any length, that is, the time of the added audio data can be longer than the length of the target time interval.
例如,如果选择的目标时间区间的长度为10秒,但是新录制的音频数据为2秒,剩下8秒的音频数据没被覆盖,客户端则自动把剩下的8秒的音频数据删除。For example, if the selected target time interval is 10 seconds, but the newly recorded audio data is 2 seconds, the remaining 8 seconds of audio data is not overwritten, and the client automatically deletes the remaining 8 seconds of audio data.
在本申请的一个实施例中,步骤105可以包括如下子步骤:In an embodiment of the present application,
子步骤S31,删除位于所述目标时间区间的原音频数据;Sub-step S31, deleting original audio data located in the target time interval;
子步骤S32,将位于所述目标时间区间之后的原音频数据移动至所述目标时间区域的起始时间;Sub-step S32, moving original audio data located after the target time interval to a start time of the target time zone;
子步骤S33,从所述目标时间区间的起始时间插入新音频数据,并将位于所述目标时间区间之后的原音频数据移动至所述新音频数据的结束时间。Sub-step S33, inserting new audio data from the start time of the target time interval, and moving the original audio data located after the target time interval to the end time of the new audio data.
在本申请实施例中,原音频数据为重新添加音频数据之前的音频数据,新音频数据为重新添加的音频数据。In the embodiment of the present application, the original audio data is audio data before re-adding audio data, and the new audio data is re-added audio data.
时间轴在所述web页面上具有可视化的轴元素,音频数据在可视化的轴元素上具有可视化的音频元素。 The timeline has visual axis elements on the web page, and the audio data has visual audio elements on the visual axis elements.
客户端可以删除位于目标时间区间的原音频数据,并将位于所述目标时间区间之后的原音频数据移动至目标时间区域的起始时间,保证音频数据的连贯性。The client can delete the original audio data located in the target time interval, and move the original audio data located after the target time interval to the start time of the target time zone to ensure the consistency of the audio data.
为了方便用户操作,如图6B所示,可以在可视化的轴元素上删除位于目标时间区间的原音频数据的可视化的音频元素。To facilitate user operation, as shown in FIG. 6B, the visualized audio elements of the original audio data located in the target time interval can be deleted on the visualized axis elements.
将位于目标时间区间之后的原音频数据的可视化的音频元素,移动至目标时间区域的起始时间。The visualized audio element of the original audio data located after the target time interval is moved to the start time of the target time zone.
当重新录制音频数据,则可以从目标时间区间的起始时间插入新音频数据,并将位于目标时间区间之后的原音频数据移动至所述新音频数据的结束时间,保证音频数据的连贯性。When the audio data is re-recorded, new audio data can be inserted from the start time of the target time interval, and the original audio data located after the target time interval is moved to the end time of the new audio data, ensuring continuity of the audio data.
为了方便用户操作,如图6C所示,可以从目标时间区域的起始时间插入新音频数据的可视化的音频元素,并将位于目标时间区间之后的原音频数据的可视化的音频元素移动至新音频数据的可视化的音频元素之后。In order to facilitate the user's operation, as shown in FIG. 6C, the visualized audio element of the new audio data can be inserted from the start time of the target time zone, and the visualized audio element of the original audio data located after the target time zone is moved to the new audio. The data is visualized after the audio element.
目前,某些录音编辑应用可能支持选区重录,但是,一般是删除已覆盖的区域中的音频数据,未覆盖的区域中的音频数据会被保留,需要用户手动进行删除,此外,重新录制的音频数据的长度一般不能超过已覆盖的区域。At present, some recording editing applications may support selection re-recording, but generally delete the audio data in the covered area, the audio data in the uncovered area will be retained, and the user needs to manually delete it. In addition, the re-recorded The length of the audio data generally cannot exceed the area covered.
例如,如果选择的目标时间区间的长度为10秒,但是新录制的音频数据为2秒,剩下8秒的音频数据没被覆盖,这剩下的8秒的音频数据仍然存在,用户需手动选中剩下的8秒的音频数据进行删除,并且,录制的长度并不能超过10秒。For example, if the selected target time interval is 10 seconds, but the newly recorded audio data is 2 seconds, the remaining 8 seconds of audio data is not overwritten, and the remaining 8 seconds of audio data still exists, the user needs to manually The remaining 8 seconds of audio data is selected for deletion, and the length of the recording cannot exceed 10 seconds.
本申请实施例在客户端中加载对演示文稿生成的web页面,并在该web页面中配置演示文稿元素,此外,在时间轴上对演示文稿元素添加音频数据,使得可以按照时间轴播放演示文稿元素时同步播放音频数据,将web页面作为载体制作演示文稿,并配以音频数据,实现在同步播放演示文稿元素及音频数据,可供用户观看演示文稿的内容、收听演示文稿的演讲,一方面,将web元素作为演示文稿元素,相比视频数据,可以大大减少提及,减少存储空间的占用,并且,由于web元素是直接在web页面绘制、加载,无需进行压缩处理,可以保证web元素的清晰度,另一方面,通过对目标时间区间 重新添加音频数据,避免用户手动删除未被覆盖的音频数据,解除了对重新添加的音频数据的长度的限制,提高了制作的效率。The embodiment of the present application loads a web page generated for a presentation in a client, and configures a presentation element in the web page, and further adds audio data to the presentation element on the timeline, so that the presentation can be played according to the timeline The elements are synchronized to play audio data, the web page is used as a carrier to create a presentation, and the audio data is used to synchronize the presentation of the presentation elements and audio data, allowing the user to view the contents of the presentation and listen to the presentation of the presentation. Using the web element as a presentation element, compared to the video data, can greatly reduce the mention, reduce the occupation of the storage space, and, because the web element is directly drawn and loaded on the web page, without compression processing, the web element can be guaranteed. Sharpness, on the other hand, through the target time interval The audio data is re-added to prevent the user from manually deleting the uncovered audio data, thereby removing the limitation on the length of the re-added audio data and improving the efficiency of the production.
对于方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本申请实施例并不受所描述的动作顺序的限制,因为依据本申请实施例,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作并不一定是本申请实施例所必须的。For the method embodiments, for the sake of brevity, they are all described as a series of action combinations, but those skilled in the art should understand that the embodiments of the present application are not limited by the described action sequence, because the embodiment according to the present application Some steps can be performed in other orders or at the same time. In the following, those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and the actions involved are not necessarily required in the embodiments of the present application.
参照图7,示出了根据本申请一个实施例的一种演示文稿的制作装置的结构框图,具体可以包括如下模块:Referring to FIG. 7, a structural block diagram of a device for creating a presentation according to an embodiment of the present application is shown, which may specifically include the following modules:
web页面加载模块701,适于加载对演示文稿生成的web页面;a web
演示文稿元素配置模块702,适于在所述web页面中配置演示文稿元素;a presentation
音频数据添加模块703,适于在时间轴上对所述演示文稿元素添加音频数据,以按照所述时间轴播放所述演示文稿元素时同步播放所述音频数据;An audio
目标时间区间选定模块704,适于在所述时间轴上选定目标时间区间;a target time
音频数据添加模块705,适于对所述目标时间区间重新添加音频数据。The audio
在本申请的一个实施例中,所述音频数据添加模块701包括:In an embodiment of the present application, the audio
录音子模块,适于对所述演示文稿元素调用录音器录制音频数据。A recording sub-module adapted to call the recorder to record audio data to the presentation element.
在本申请的一个实施例中,所述录音子模块包括:In an embodiment of the present application, the recording submodule includes:
原始音频流数据获取单元,适于获取麦克风中采集的原始音频流数据;The original audio stream data acquiring unit is adapted to acquire original audio stream data collected in the microphone;
录音器传入单元,适于将所述原始音频流数据传入录音器;a recorder incoming unit adapted to transmit the raw audio stream data to the recorder;
录音器处理单元,适于在所述录音器中按照录音参数对所述原始音频流数据进行可视化处理,并将所述原始音频流数据转换为指定格式的音频数据。And a recorder processing unit adapted to visualize the original audio stream data in the recorder according to recording parameters, and convert the original audio stream data into audio data of a specified format.
在本申请的另一个实施例中,所述音频数据添加模块701包括:In another embodiment of the present application, the audio
文本信息输入子模块,适于对所述演示文稿元素输入文本信息;a text information input submodule adapted to input text information to the presentation element;
文本信息转换子模块,适于将所述文本信息转换为音频数据。 A text information conversion sub-module adapted to convert the text information into audio data.
在本申请的一个实施例中,所述时间轴在所述web页面上具有可视化的轴元素;In one embodiment of the present application, the timeline has a visual axis element on the web page;
所述目标时间区间选定模块703包括:The target time
滚动标志条插入子模块,适于在所述可视化的轴元素上插入滚动标志条;a scrolling marker strip insertion sub-module adapted to insert a scrolling marker strip on the visualized axis element;
区间选择子模块,适于将所述滚动标志条的起始位置与终结位置之间的区间作为目标时间区间。The interval selection sub-module is adapted to use a section between the start position and the end position of the scroll marker as the target time interval.
在本申请的一个实施例中,所述音频数据添加模块704包括:In an embodiment of the present application, the audio
原音频数据删除子模块,适于删除位于所述目标时间区间的原音频数据;The original audio data deletion submodule is adapted to delete original audio data located in the target time interval;
原音频数据移动子模块,适于将位于所述目标时间区间之后的原音频数据移动至所述目标时间区域的起始时间;An original audio data moving submodule adapted to move original audio data located after the target time interval to a start time of the target time zone;
新音频数据插入子模块,适于从所述目标时间区间的起始时间插入新音频数据,并将位于所述目标时间区间之后的原音频数据移动至所述新音频数据的结束时间。A new audio data insertion sub-module adapted to insert new audio data from a start time of the target time interval and to move original audio data located after the target time interval to an end time of the new audio data.
在本申请的一个实施例中,所述时间轴在所述web页面上具有可视化的轴元素,所述音频数据在所述可视化的轴元素上具有可视化的音频元素;In one embodiment of the present application, the timeline has a visual axis element on the web page, the audio data having a visualized audio element on the visualized axis element;
所述音频数据添加模块704还包括:The audio
音频元素删除子模块,适于在所述可视化的轴元素上删除位于所述目标时间区间的原音频数据的可视化的音频元素;An audio element deletion submodule adapted to delete a visualized audio element of the original audio data located in the target time interval on the visualized axis element;
音频元素移动子模块,适于将位于所述目标时间区间之后的原音频数据的可视化的音频元素,移动至所述目标时间区域的起始时间;An audio element moving submodule adapted to move a visualized audio element of the original audio data located after the target time interval to a start time of the target time zone;
音频元素插入子模块,适于从所述目标时间区域的起始时间插入新音频数据的可视化的音频元素,并将位于所述目标时间区间之后的原音频数据的可视化的音频元素移动至所述新音频数据的可视化的音频元素之后。An audio element insertion sub-module adapted to insert a visualized audio element of the new audio data from a start time of the target time zone and to move the visualized audio element of the original audio data located after the target time interval to the After the audio elements of the new audio data are visualized.
在本申请的一个实施例中,还包括:In an embodiment of the present application, the method further includes:
音频上传模块,适于将所述时间轴上的音频数据上传至服务器。 An audio uploading module adapted to upload audio data on the timeline to a server.
对于装置实施例而言,由于其与方法实施例基本相似,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。For the device embodiment, since it is basically similar to the method embodiment, the description is relatively simple, and the relevant parts can be referred to the description of the method embodiment.
本申请的各个部件实施例可以以硬件实现,或者以在一个或者多个处理器上运行的软件模块实现,或者以它们的组合实现。本领域的技术人员应当理解,可以在实践中使用微处理器或者数字信号处理器(DSP)来实现根据本申请实施例的演示文稿的制作设备中的一些或者全部部件的一些或者全部功能。本申请还可以实现为用于执行这里所描述的方法的一部分或者全部的设备或者装置程序(例如,计算机程序和计算机程序产品)。这样的实现本申请的程序可以存储在计算机可读介质上,或者可以具有一个或者多个信号的形式。这样的信号可以从因特网网站上下载得到,或者在载体信号上提供,或者以任何其他形式提供。The various component embodiments of the present application can be implemented in hardware, or in a software module running on one or more processors, or in a combination thereof. Those skilled in the art will appreciate that a microprocessor or digital signal processor (DSP) may be used in practice to implement some or all of the functionality of some or all of the components of the presentation device in accordance with embodiments of the present application. The application can also be implemented as a device or device program (e.g., a computer program and a computer program product) for performing some or all of the methods described herein. Such a program implementing the present application may be stored on a computer readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
例如,图8示出了可以实现根据本申请的演示文稿的制作的终端设备。该终端设备传统上包括处理器810和以存储器820形式的计算机程序产品或者计算机可读介质。存储器820可以是诸如闪存、EEPROM(电可擦除可编程只读存储器)、EPROM、硬盘或者ROM之类的电子存储器。存储器820具有用于执行上述方法中的任何方法步骤的程序代码831的存储空间830。例如,用于程序代码的存储空间830可以包括分别用于实现上面的方法中的各种步骤的各个程序代码831。这些程序代码可以从一个或者多个计算机程序产品中读出或者写入到这一个或者多个计算机程序产品中。这些计算机程序产品包括诸如硬盘,紧致盘(CD)、存储卡或者软盘之类的程序代码载体。这样的计算机程序产品通常为如参考图9所述的便携式或者固定存储单元。该存储单元可以具有与图8的终端设备中的存储器820类似布置的存储段、存储空间等。程序代码可以例如以适当形式进行压缩。通常,存储单元包括计算机可读代码831’,即可以由例如诸如810之类的处理器读取的代码,这些代码当由终端设备运行时,导致该终端设备执行上面所描述的方法中的各个步骤。For example, FIG. 8 illustrates a terminal device that can implement the production of a presentation according to the present application. The terminal device conventionally includes a
本文中所称的“一个实施例”、“实施例”或者“一个或者多个实施例”意味着,结合实施例描述的特定特征、结构或者特性包括在 本申请的至少一个实施例中。此外,请注意,这里“在一个实施例中”的词语例子不一定全指同一个实施例。The term "one embodiment", "an embodiment" or "one or more embodiments" is used herein to mean that the specific features, structures, or characteristics described in connection with the embodiments are included. In at least one embodiment of the present application. In addition, it is noted that the phrase "in one embodiment" is not necessarily referring to the same embodiment.
在此处所提供的说明书中,说明了大量具体细节。然而,能够理解,本申请的实施例可以在没有这些具体细节的情况下被实践。在一些实例中,并未详细示出公知的方法、结构和技术,以便不模糊对本说明书的理解。In the description provided herein, numerous specific details are set forth. However, it is understood that the embodiments of the present application can be practiced without these specific details. In some instances, well-known methods, structures, and techniques are not shown in detail so as not to obscure the understanding of the description.
应该注意的是上述实施例对本申请进行说明而不是对本申请进行限制,并且本领域技术人员在不脱离所附权利要求的范围的情况下可设计出替换实施例。在权利要求中,不应将位于括号之间的任何参考符号构造成对权利要求的限制。单词“包含”不排除存在未列在权利要求中的元件或步骤。位于元件之前的单词“一”或“一个”不排除存在多个这样的元件。本申请可以借助于包括有若干不同元件的硬件以及借助于适当编程的计算机来实现。在列举了若干装置的单元权利要求中,这些装置中的若干个可以是通过同一个硬件项来具体体现。单词第一、第二、以及第三等的使用不表示任何顺序。可将这些单词解释为名称。It should be noted that the above-described embodiments are illustrative of the present application and are not intended to limit the scope of the application, and those skilled in the art can devise alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as a limitation. The word "comprising" does not exclude the presence of the elements or steps that are not recited in the claims. The word "a" or "an" The application can be implemented by means of hardware comprising several distinct elements and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means can be embodied by the same hardware item. The use of the words first, second, and third does not indicate any order. These words can be interpreted as names.
此外,还应当注意,本说明书中使用的语言主要是为了可读性和教导的目的而选择的,而不是为了解释或者限定本申请的主题而选择的。因此,在不偏离所附权利要求书的范围和精神的情况下,对于本技术领域的普通技术人员来说许多修改和变更都是显而易见的。对于本申请的范围,对本申请所做的公开是说明性的,而非限制性的,本申请的范围由所附权利要求书限定。 In addition, it should be noted that the language used in the specification has been selected for the purpose of readability and teaching, and is not intended to be interpreted or limited. Therefore, many modifications and changes will be apparent to those skilled in the art without departing from the scope of the invention. The disclosure of the present application is intended to be illustrative, and not restrictive, and the scope of the application is defined by the appended claims.
Claims (18)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201611220468.7 | 2016-12-26 | ||
| CN201611220468.7A CN108241598A (en) | 2016-12-26 | 2016-12-26 | Method and device for making a presentation |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2018120819A1 true WO2018120819A1 (en) | 2018-07-05 |
Family
ID=62701870
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2017/094598 Ceased WO2018120819A1 (en) | 2016-12-26 | 2017-07-27 | Method and device for producing presentation |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN108241598A (en) |
| WO (1) | WO2018120819A1 (en) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112533054A (en) * | 2019-09-19 | 2021-03-19 | 腾讯科技(深圳)有限公司 | Online video playing method and device and storage medium |
| CN114398883A (en) * | 2022-01-19 | 2022-04-26 | 平安科技(深圳)有限公司 | Presentation generation method and device, computer readable storage medium and server |
| CN114501106A (en) * | 2020-08-04 | 2022-05-13 | 腾讯科技(深圳)有限公司 | Manuscript display control method and device, electronic equipment and storage medium |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN108958608B (en) * | 2018-07-10 | 2022-07-15 | 广州视源电子科技股份有限公司 | Interface element operation method and device of electronic whiteboard and interactive intelligent equipment |
| CN112115283A (en) * | 2020-08-25 | 2020-12-22 | 天津洪恩完美未来教育科技有限公司 | Method, device and equipment for processing picture book data |
| CN117278802B (en) * | 2023-11-23 | 2024-02-13 | 湖南快乐阳光互动娱乐传媒有限公司 | Video clip trace comparison method and device |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104376001A (en) * | 2013-08-13 | 2015-02-25 | 腾讯科技(深圳)有限公司 | PPT (PowerPoint) playing method and PPT playing device |
| CN104765714A (en) * | 2014-01-08 | 2015-07-08 | 中国移动通信集团浙江有限公司 | Switching method and device for electronic reading and listening |
| CN104994434A (en) * | 2015-07-06 | 2015-10-21 | 天脉聚源(北京)教育科技有限公司 | Video playing method and device |
| CN105530440A (en) * | 2014-09-29 | 2016-04-27 | 北京金山安全软件有限公司 | Video production method and device |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050154679A1 (en) * | 2004-01-08 | 2005-07-14 | Stanley Bielak | System for inserting interactive media within a presentation |
| CN101299250A (en) * | 2007-04-30 | 2008-11-05 | 深圳华飚科技有限公司 | On-line cooperating lantern slide manufacturing service system |
| CN101344883A (en) * | 2007-07-09 | 2009-01-14 | 宇瞻科技股份有限公司 | Method for recording demonstration manuscript |
| US8381086B2 (en) * | 2007-09-18 | 2013-02-19 | Microsoft Corporation | Synchronizing slide show events with audio |
| CN102156613A (en) * | 2011-03-29 | 2011-08-17 | 汉王科技股份有限公司 | Display method and device thereof for presentations |
| JP2015056880A (en) * | 2013-09-13 | 2015-03-23 | 株式会社ネクスウェイ | Presentation providing system, method, and program |
| CN105472406B (en) * | 2015-12-04 | 2019-01-29 | 广东威创视讯科技股份有限公司 | PowerPoint display methods and system |
-
2016
- 2016-12-26 CN CN201611220468.7A patent/CN108241598A/en active Pending
-
2017
- 2017-07-27 WO PCT/CN2017/094598 patent/WO2018120819A1/en not_active Ceased
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104376001A (en) * | 2013-08-13 | 2015-02-25 | 腾讯科技(深圳)有限公司 | PPT (PowerPoint) playing method and PPT playing device |
| CN104765714A (en) * | 2014-01-08 | 2015-07-08 | 中国移动通信集团浙江有限公司 | Switching method and device for electronic reading and listening |
| CN105530440A (en) * | 2014-09-29 | 2016-04-27 | 北京金山安全软件有限公司 | Video production method and device |
| CN104994434A (en) * | 2015-07-06 | 2015-10-21 | 天脉聚源(北京)教育科技有限公司 | Video playing method and device |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112533054A (en) * | 2019-09-19 | 2021-03-19 | 腾讯科技(深圳)有限公司 | Online video playing method and device and storage medium |
| CN114501106A (en) * | 2020-08-04 | 2022-05-13 | 腾讯科技(深圳)有限公司 | Manuscript display control method and device, electronic equipment and storage medium |
| CN114398883A (en) * | 2022-01-19 | 2022-04-26 | 平安科技(深圳)有限公司 | Presentation generation method and device, computer readable storage medium and server |
| CN114398883B (en) * | 2022-01-19 | 2023-07-07 | 平安科技(深圳)有限公司 | Presentation generation method and device, computer readable storage medium and server |
Also Published As
| Publication number | Publication date |
|---|---|
| CN108241598A (en) | 2018-07-03 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2018120819A1 (en) | Method and device for producing presentation | |
| WO2018120821A1 (en) | Method and device for producing presentation | |
| JP5030617B2 (en) | Method, system, and program for RSS content management for rendering RSS content on a digital audio player (RSS content management for rendering RSS content on a digital audio player) | |
| US8937620B1 (en) | System and methods for generation and control of story animation | |
| US8966360B2 (en) | Transcript editor | |
| US8548618B1 (en) | Systems and methods for creating narration audio | |
| JP2023548008A (en) | Text-driven editor for audio and video assembly | |
| US20200058288A1 (en) | Timbre-selectable human voice playback system, playback method thereof and computer-readable recording medium | |
| CN107517323B (en) | An information sharing method, device and storage medium | |
| US20080027726A1 (en) | Text to audio mapping, and animation of the text | |
| WO2012086356A1 (en) | File format, server, view device for digital comic, digital comic generation device | |
| US20180226101A1 (en) | Methods and systems for interactive multimedia creation | |
| US20120177345A1 (en) | Automated Video Creation Techniques | |
| JPH0778074A (en) | Method and apparatus for creation of script of multimedia | |
| JP2007242012A (en) | Method, system, and program for email management for rendering email on a digital audio player (email management for rendering email on a digital audio player) | |
| CN110781328A (en) | Video generation method, system, device and storage medium based on speech recognition | |
| CN111930289B (en) | Method and system for processing pictures and texts | |
| CN108241672A (en) | A method and device for displaying presentations online | |
| WO2018120820A1 (en) | Presentation production method and apparatus | |
| CN114638232A (en) | Method and device for converting text into video, electronic equipment and storage medium | |
| Chi et al. | Synthesis-assisted video prototyping from a document | |
| KR20210050410A (en) | Method and system for suppoting content editing based on real time generation of synthesized sound for video content | |
| CN119110139A (en) | Automatic video generation method, device, equipment and storage medium | |
| CN119299800A (en) | Video generation method, device, computing device, storage medium and program product | |
| CN115695680A (en) | Video editing method, device, electronic device, and computer-readable storage medium |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17886381 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 17886381 Country of ref document: EP Kind code of ref document: A1 |