CN117494672B - Method, device and computer-readable storage medium for generating industry documents - Google Patents
Method, device and computer-readable storage medium for generating industry documents Download PDFInfo
- Publication number
- CN117494672B CN117494672B CN202311506829.4A CN202311506829A CN117494672B CN 117494672 B CN117494672 B CN 117494672B CN 202311506829 A CN202311506829 A CN 202311506829A CN 117494672 B CN117494672 B CN 117494672B
- Authority
- CN
- China
- Prior art keywords
- document
- industry
- generating
- evaluation
- content
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
- G06F40/186—Templates
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/211—Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
本发明涉及文本处理技术领域,公开了行业文档的生成方法、装置及计算机可读存储介质,该方法包括:响应于用户的第一指示,获取相应的规范文档及文档生成需求;基于规范文档及文档生成需求,生成第一行业文档;响应于用户的第二指示,基于预设的行业数据库及规范文档提取评价规则;基于评价规则对第一行业文档进行分析评测,并生成优化方案;响应于用户的确认指示,基于优化方案生成第二行业文档。本发明在通过规范文档及文档生成需求生成第一行业文档之后,可以根据用户的第二指示对第一行业文档进行优化,从而得到第二行业文档,能够提高数据、资料的查找效率的同时,提高行业文档的完整性。
The present invention relates to the field of text processing technology, and discloses a method, device and computer-readable storage medium for generating industry documents, the method comprising: in response to a first instruction from a user, obtaining a corresponding specification document and a document generation requirement; generating a first industry document based on the specification document and the document generation requirement; in response to a second instruction from a user, extracting evaluation rules based on a preset industry database and specification document; analyzing and evaluating the first industry document based on the evaluation rules, and generating an optimization plan; in response to a confirmation instruction from a user, generating a second industry document based on the optimization plan. After generating the first industry document through the specification document and the document generation requirement, the present invention can optimize the first industry document according to the second instruction from the user, thereby obtaining a second industry document, which can improve the efficiency of searching for data and information while improving the integrity of industry documents.
Description
技术领域Technical Field
本发明涉及文本处理技术领域,具体涉及一种行业文档的生成方法、装置、及计算机可读存储介质。The present invention relates to the technical field of text processing, and in particular to a method and device for generating industry documents, and a computer-readable storage medium.
背景技术Background technique
行业文档(例如标书、项目验收文档等)的生成,主要依靠人工进行编写,其不但会耗费大量的时间成本和人力成本,还容易出现编写错误,且传统的人工编写行业文档往往存在样式不统一、标准不规范的问题,其也不利于行业的标准化管理。The generation of industry documents (such as bidding documents, project acceptance documents, etc.) mainly relies on manual writing, which not only consumes a lot of time and manpower costs, but is also prone to writing errors. In addition, traditional manually written industry documents often have problems such as inconsistent styles and non-standard standards, which is not conducive to standardized management of the industry.
随着计算机科学技术的不断发展,基于人工智能的智能文本输出获得了大量关注,其可以实现对文本内容的理解、分析并自动生成人类需要的文本,这对于行业领域中大量的文档处理任务具有较高的应用价值。然而,现在普遍使用的自动生成方案,基本上是描述基于光学字符识别(Optical Character Recognition,OCR)技术的证件识别;基于自然语言处理(Natural Language Processing,NLP)技术的文件解析,帮助识别串标;可能有基于数据库实现标书文档的快速填写,但实际内容仍然需要人为填写,可见,相关技术只是提升了数据、资料的查找效率,并不能实现自动生成完整的行业文档。With the continuous development of computer science and technology, intelligent text output based on artificial intelligence has received a lot of attention. It can understand and analyze text content and automatically generate texts needed by humans, which has high application value for a large number of document processing tasks in the industry. However, the commonly used automatic generation solutions now basically describe document recognition based on optical character recognition (OCR) technology; file parsing based on natural language processing (NLP) technology to help identify bid collusion; there may be database-based rapid filling of tender documents, but the actual content still needs to be filled in manually. It can be seen that the relevant technology only improves the efficiency of searching for data and information, and cannot automatically generate complete industry documents.
发明内容Summary of the invention
有鉴于此,本发明提供了一种行业文档的生成方法、装置及计算机可读存储介质,以解决相关技术中,只是提升了数据、资料的查找效率,并不能实现自动生成完整的行业文档的技术问题。In view of this, the present invention provides a method, device and computer-readable storage medium for generating industry documents to solve the technical problem in the related technology that it only improves the efficiency of searching data and information but cannot automatically generate complete industry documents.
第一方面,本发明提供了一种行业文档的生成方法,该方法包括:响应于用户的第一指示,获取相应的规范文档及文档生成需求;基于规范文档及文档生成需求,生成第一行业文档;响应于用户的第二指示,基于预设的行业数据库及规范文档提取评价规则;基于评价规则对第一行业文档进行分析评测,并生成第一优化方案;响应于用户的确认指示,基于第一优化方案生成第二行业文档。In a first aspect, the present invention provides a method for generating an industry document, the method comprising: in response to a first instruction from a user, obtaining a corresponding specification document and document generation requirements; generating a first industry document based on the specification document and the document generation requirements; in response to a second instruction from a user, extracting evaluation rules based on a preset industry database and specification documents; analyzing and evaluating the first industry document based on the evaluation rules, and generating a first optimization plan; in response to a confirmation instruction from the user, generating a second industry document based on the first optimization plan.
本实施例提供的行业文档的生成方法,在通过规范文档及文档生成需求生成第一行业文档之后,可以根据用户的第二指示对第一行业文档进行优化,从而得到第二行业文档,能够提高数据、资料的查找效率的同时,提高行业文档的完整性。The industry document generation method provided in this embodiment can optimize the first industry document according to the user's second instruction to obtain a second industry document after generating the first industry document through standard documents and document generation requirements, thereby improving the efficiency of searching for data and information while improving the integrity of industry documents.
在一个可选的实施方式中,在响应于用户的确认指示,基于第一优化方案生成行业文档之前,方法还包括:响应于用户确认的辅助评价规则,调整评价规则;基于调整后的评价规则对第一行业文档进行分析评测,并生成第二优化方案。In an optional embodiment, before generating an industry document based on the first optimization plan in response to a user's confirmation instruction, the method also includes: adjusting the evaluation rules in response to the auxiliary evaluation rules confirmed by the user; analyzing and evaluating the first industry document based on the adjusted evaluation rules, and generating a second optimization plan.
本实施例提供的行业文档的生成方法,通过用户确认辅助评价规则,并根据辅助评价规则调整评价规则,使得评价规则能够侧重于用户的需求,从而能够提高行业文档的中标率。The method for generating industry documents provided in this embodiment allows users to confirm auxiliary evaluation rules and adjust evaluation rules based on the auxiliary evaluation rules, so that the evaluation rules can focus on user needs, thereby improving the winning rate of industry documents.
在一个可选的实施方式中,基于规范文档及文档生成需求,生成第一行业文档,包括:通过自然语言处理方法解析规范文档,提取规范文档中的待填信息;基于文档生成需求及预设的行业数据库,提取行业内容生成待填信息中的内容,生成第一行业文档。In an optional embodiment, a first industry document is generated based on a specification document and document generation requirements, including: parsing the specification document through a natural language processing method to extract the information to be filled in the specification document; based on the document generation requirements and a preset industry database, extracting industry content to generate content in the information to be filled in, thereby generating a first industry document.
本实施例提供的行业文档的生成方法,通过自然语言处理方法解析规范文档,并根据文档生成需求和预设的行业数据库提取对应待填信息的数据的方式,能够智能化生成第一行业文档,从而提高第一行业文档生成效率的同时,保证第一行业文档中内容的准确率。The industry document generation method provided in this embodiment parses the standard document through a natural language processing method, and extracts data corresponding to the information to be filled in according to the document generation requirements and a preset industry database, so as to intelligently generate the first industry document, thereby improving the efficiency of the first industry document generation while ensuring the accuracy of the content in the first industry document.
在一个可选的实施方式中,通过自然语言处理方法解析规范文档,提取规范文档中的待填信息,包括:基于自然语言处理数据库对规范文档进行分词处理,提取词语信息;使用命名实体识别,从规范文档中提取文档需求信息;对文档需求信息进行聚类处理,得到文档需求合集;基于评分关键词从规范文档中提取评分内容;使用情感分析方法及语义分析方法从规范文档中提取评分要点;基于评分内容及评分要点生成评分集合;基于格式关键词从规范文档中提取格式描述语句;利用句法分析提取格式描述语句中的各个输出字段;通过语义匹配找到各个输出字段的字段说明,提取字段含义解释;基于文档需求合集、评分集合及字段含义解释生成待填信息。In an optional embodiment, a specification document is parsed by a natural language processing method to extract information to be filled in the specification document, including: performing word segmentation on the specification document based on a natural language processing database to extract word information; using named entity recognition to extract document requirement information from the specification document; clustering the document requirement information to obtain a document requirement collection; extracting scoring content from the specification document based on scoring keywords; extracting scoring points from the specification document using sentiment analysis methods and semantic analysis methods; generating a scoring set based on the scoring content and scoring points; extracting format description statements from the specification document based on format keywords; extracting each output field in the format description statement using syntactic analysis; finding the field description of each output field through semantic matching, and extracting the field meaning explanation; generating information to be filled in based on the document requirement collection, the scoring set, and the field meaning explanation.
本实施例提供的行业文档的生成方法,通过确定的文档需求合集、评分集合及字段含义解释能够提高提取规范文档中的待填信息的效率以及准确率。The method for generating industry documents provided in this embodiment can improve the efficiency and accuracy of extracting the information to be filled in the specification document by determining the document requirement collection, the scoring set and the field meaning interpretation.
在一个可选的实施方式中,基于预设的行业数据库及规范文档提取评价规则,包括:基于预设的行业数据库从规范文档中提取相应的评价参数及评价权重;基于评价参数及评价权重生成评价规则。In an optional implementation, extracting evaluation rules based on a preset industry database and specification documents includes: extracting corresponding evaluation parameters and evaluation weights from the specification documents based on the preset industry database; and generating evaluation rules based on the evaluation parameters and evaluation weights.
本实施例提供的行业文档的生成方法,通过从预设的行业数据库中提取规范文档中评价参数和评价权重的方式,能够准确地确定第一行业文档是否能够达到中标需求。The method for generating industry documents provided in this embodiment can accurately determine whether the first industry document can meet the requirements for winning the bid by extracting evaluation parameters and evaluation weights in the specification document from a preset industry database.
在一个可选的实施方式中,基于评价规则对第一行业文档进行分析评测,并生成优化方案,包括:从第一行业文档中提取评价参数对应的待评测内容;基于评价规则,对待评测内容进行评价,生成评价结果;基于评价结果分析待评测内容中需要进行调整的内容;基于需要进行调整的内容生成第一优化方案。In an optional implementation, the first industry document is analyzed and evaluated based on evaluation rules, and an optimization plan is generated, including: extracting content to be evaluated corresponding to the evaluation parameters from the first industry document; evaluating the content to be evaluated based on the evaluation rules and generating evaluation results; analyzing the content to be evaluated that needs to be adjusted based on the evaluation results; and generating a first optimization plan based on the content that needs to be adjusted.
本实施例提供的行业文档的生成方法,通过评价规则对第一行业文档中待评测内容进行评价,从而能够确定待评测内容中影响较大的内容,进而针对该内容进行调整,使得生成的第二行业文档能够更符合中标需求。The method for generating industry documents provided in this embodiment evaluates the content to be evaluated in the first industry document through evaluation rules, so as to determine the content with greater influence in the content to be evaluated, and then adjust the content so that the generated second industry document can better meet the bidding requirements.
在一个可选的实施方式中,响应于用户确认的辅助评价规则,调整评价规则,包括:基于预设的行业数据库从规范文档中提取与辅助评价规则对应的评价参数及评价权重;基于评价参数及评价权重调整评价规则。In an optional embodiment, in response to the auxiliary evaluation rules confirmed by the user, the evaluation rules are adjusted, including: extracting evaluation parameters and evaluation weights corresponding to the auxiliary evaluation rules from the specification document based on a preset industry database; and adjusting the evaluation rules based on the evaluation parameters and evaluation weights.
本实施例提供的行业文档的生成方法,通过结合用户确定的辅助评价规则,并从预设的行业数据库中提取评价参数和评价权重的方式,使得评价规则符合用户的需求,从而使得生成的第二行业文档符合用户的需求。The method for generating an industry document provided in this embodiment combines the auxiliary evaluation rules determined by the user and extracts the evaluation parameters and evaluation weights from a preset industry database, so that the evaluation rules meet the needs of the user, thereby making the generated second industry document meet the needs of the user.
在一个可选的实施方式中,基于调整后的评价规则对第一行业文档进行分析评测,并生成第二优化方案,包括:从第一行业文档中提取评价参数对应的待评测内容;基于调整后的评价规则,对待评测内容进行评价,生成评价结果;基于评价结果分析待评测内容中需要进行调整的内容;基于需要进行调整的内容生成第二优化方案。In an optional implementation, the first industry document is analyzed and evaluated based on the adjusted evaluation rules, and a second optimization plan is generated, including: extracting the content to be evaluated corresponding to the evaluation parameters from the first industry document; evaluating the content to be evaluated based on the adjusted evaluation rules and generating evaluation results; analyzing the content to be evaluated that needs to be adjusted based on the evaluation results; and generating a second optimization plan based on the content that needs to be adjusted.
本实施例提供的行业文档的生成方法,通过调整后的评价规则对第一行业文档中待评测内容进行评价,从而能够确定待评测内容中影响较大的内容,进而针对该内容进行调整,使得生成的第二行业文档能够更符合中标需求。The method for generating industry documents provided in this embodiment evaluates the content to be evaluated in the first industry document through the adjusted evaluation rules, so as to determine the content with greater influence in the content to be evaluated, and then make adjustments to the content, so that the generated second industry document can better meet the bidding requirements.
第二方面,本发明提供了一种行业文档的生成装置,该装置包括:获取模块,用于响应于用户的第一指示,获取相应的规范文档及文档生成需求;第一生成模块,用于基于规范文档及文档生成需求,生成第一行业文档;提取模块,用于响应于用户的第二指示,基于预设的行业数据库及规范文档提取评价规则;分析生成模块,用于基于评价规则对第一行业文档进行分析评测,并生成优化方案;第二生成模块,用于响应于用户的确认指示,基于优化方案生成行业文档。In a second aspect, the present invention provides an industry document generation device, which includes: an acquisition module for acquiring corresponding specification documents and document generation requirements in response to a first instruction from a user; a first generation module for generating a first industry document based on the specification documents and document generation requirements; an extraction module for extracting evaluation rules based on a preset industry database and specification documents in response to a second instruction from a user; an analysis and generation module for analyzing and evaluating the first industry document based on the evaluation rules and generating an optimization plan; and a second generation module for generating an industry document based on the optimization plan in response to a confirmation instruction from a user.
第三方面,本发明提供了一种计算机设备,包括:存储器和处理器,存储器和处理器之间互相通信连接,存储器中存储有计算机指令,处理器通过执行计算机指令,从而执行上述第一方面或其对应的任一实施方式的行业文档的生成方法。In a third aspect, the present invention provides a computer device, comprising: a memory and a processor, the memory and the processor are communicatively connected to each other, computer instructions are stored in the memory, and the processor executes the method for generating industry documents of the above-mentioned first aspect or any corresponding embodiment thereof by executing the computer instructions.
第四方面,本发明提供了一种计算机可读存储介质,该计算机可读存储介质上存储有计算机指令,计算机指令用于使计算机执行上述第一方面或其对应的任一实施方式的行业文档的生成方法。In a fourth aspect, the present invention provides a computer-readable storage medium having computer instructions stored thereon, the computer instructions being used to enable a computer to execute the method for generating industry documents of the first aspect or any corresponding embodiment thereof.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
为了更清楚地说明本发明具体实施方式或现有技术中的技术方案,下面将对具体实施方式或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本发明的一些实施方式,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the specific implementation methods of the present invention or the technical solutions in the prior art, the drawings required for use in the specific implementation methods or the description of the prior art will be briefly introduced below. Obviously, the drawings described below are some implementation methods of the present invention. For ordinary technicians in this field, other drawings can be obtained based on these drawings without paying creative work.
图1是根据本发明实施例的行业文档的生成方法的流程示意图;FIG1 is a schematic flow chart of a method for generating an industry document according to an embodiment of the present invention;
图2是根据本发明实施例的另一行业文档的生成方法的流程示意图;FIG2 is a schematic flow chart of another method for generating industry documents according to an embodiment of the present invention;
图3是根据本发明实施例的又一行业文档的生成方法的流程示意图;FIG3 is a schematic flow chart of another method for generating industry documents according to an embodiment of the present invention;
图4是根据本发明实施例的行业文档生成方法的架构示意图;FIG4 is a schematic diagram of the architecture of a method for generating industry documents according to an embodiment of the present invention;
图5是根据本发明实施例的再一行业文档生成方法的流程示意图;FIG5 is a flow chart of another method for generating industry documents according to an embodiment of the present invention;
图6是根据本发明实施例的行业文档的生成装置的结构框图;6 is a structural block diagram of a device for generating industry documents according to an embodiment of the present invention;
图7是本发明实施例的计算机设备的硬件结构示意图。FIG. 7 is a schematic diagram of the hardware structure of a computer device according to an embodiment of the present invention.
具体实施方式Detailed ways
为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purpose, technical solution and advantages of the embodiments of the present invention clearer, the technical solution in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without creative work are within the scope of protection of the present invention.
相关技术中,随着计算机科学技术的不断发展,基于人工智能的智能文本输出获得了大量关注,其可以实现对文本内容的理解、分析并自动生成人类需要的文本,这对于行业领域中大量的文档处理任务具有较高的应用价值。In related technologies, with the continuous development of computer science and technology, intelligent text output based on artificial intelligence has received a lot of attention. It can understand and analyze text content and automatically generate texts needed by humans, which has high application value for a large number of document processing tasks in the industry.
然而,现在普遍使用的自动生成方案,基本上是描述基于OCR技术的证件识别;基于NLP技术的文件解析,帮助识别串标;可能有基于数据库实现标书文档的快速填写,但实际内容仍然需要人为填写,可见,相关技术只是提升了数据、资料的查找效率,并不能实现自动生成完整的行业文档。However, the automatic generation solutions commonly used now basically describe document recognition based on OCR technology; file parsing based on NLP technology to help identify bid collusion; there may be database-based rapid filling of bid documents, but the actual content still needs to be filled in manually. It can be seen that the relevant technology only improves the efficiency of searching for data and information, and cannot automatically generate complete industry documents.
基于此,本发明实施例提供了一种行业文档的生成方法,在通过规范文档及文档生成需求生成第一行业文档之后,可以根据用户的第二指示对第一行业文档进行优化,从而得到第二行业文档,能够提高数据、资料的查找效率的同时,提高行业文档的完整性。Based on this, an embodiment of the present invention provides a method for generating industry documents. After generating a first industry document through standard documents and document generation requirements, the first industry document can be optimized according to the user's second instructions to obtain a second industry document, which can improve the efficiency of searching for data and information while improving the integrity of industry documents.
根据本发明实施例,提供了一种行业文档的生成方法实施例,需要说明的是,在附图的流程图示出的步骤可以在诸如一组计算机可执行指令的计算机系统中执行,并且,虽然在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤。According to an embodiment of the present invention, an embodiment of a method for generating an industry document is provided. It should be noted that the steps shown in the flowchart of the accompanying drawings can be executed in a computer system such as a set of computer executable instructions, and although a logical order is shown in the flowchart, in some cases, the steps shown or described can be executed in an order different from that shown here.
在本实施例中提供了一种行业文档的生成方法,可用于计算机设备,如电脑、服务器等,图1是根据本发明实施例的行业文档的生成方法的流程图,如图1所示,该流程包括如下步骤:In this embodiment, a method for generating an industry document is provided, which can be used for computer devices, such as computers, servers, etc. FIG. 1 is a flow chart of a method for generating an industry document according to an embodiment of the present invention. As shown in FIG. 1 , the process includes the following steps:
步骤S101,响应于用户的第一指示,获取相应的规范文档及文档生成需求。Step S101, in response to a first instruction from a user, obtaining a corresponding specification document and document generation requirements.
第一指示可以为用户通过鼠标点击、键盘选择或者触屏所产生的指示。在用户针对上传的文件(如招标文件)发出第一指示时,相应地,计算机设备可以响应用户发出的第一指示,从上传的文档中获取对应的规范文档和文档生成需求。The first instruction may be an instruction generated by a user through a mouse click, keyboard selection, or touch screen. When the user issues a first instruction for an uploaded file (such as a bidding document), the computer device may respond to the first instruction issued by the user and obtain the corresponding specification document and document generation requirements from the uploaded document.
规范文档可以为对应用户上传文件(如投标文件)的内容规定的框架。具体地,规范文档中可以包括、软件著作权数量、资质证书数量、荣誉证书数量等,也可以包括软件的各功能模块、硬件、运维等,在此不做限定。The specification document can be a framework for the content of the corresponding user uploaded file (such as bidding document). Specifically, the specification document can include the number of software copyrights, the number of qualification certificates, the number of honorary certificates, etc., and can also include the various functional modules of the software, hardware, operation and maintenance, etc., which are not limited here.
文档生成需求可以为对应用户上传文件(如投标文件)的要求(如文档生成格式、文档生成内容),在此不做具体限定。例如:上传文件的要求可以为填写近三年的收入数据,则文档生成需求为将近三年的收入数据添加到规范文档。再例如:上传文件的要求可以为增加知识产权内容,文档生成需求可以为将知识产权中具体的内容添加到规范文档中。其中,计算机设备根据文档生成需求可以在知识产权下添加软件著作权数量、资质证书数量、荣誉证书数量等。The document generation requirement may be a requirement for the corresponding user to upload a file (such as a bidding document) (such as the document generation format, document generation content), which is not specifically limited here. For example: the requirement for uploading a file may be to fill in the income data of the past three years, then the document generation requirement is to add the income data of the past three years to the specification document. Another example: the requirement for uploading a file may be to add intellectual property content, and the document generation requirement may be to add specific content in the intellectual property to the specification document. Among them, computer equipment can add the number of software copyrights, the number of qualification certificates, the number of honorary certificates, etc. under intellectual property according to the document generation requirement.
步骤S102,基于规范文档及文档生成需求,生成第一行业文档。Step S102, generating a first industry document based on the specification document and document generation requirements.
第一行业文档可以为根据文档生成需求和规范文档生成的文档。其中,由上文内容可知,规范文档和文档生成需求确定之后,计算机设备可以根据通过文档生成需求将内容添加到规范文档中,生成第一行业文档。The first industry document may be a document generated according to the document generation requirement and the specification document. As can be seen from the above content, after the specification document and the document generation requirement are determined, the computer device may add content to the specification document according to the document generation requirement to generate the first industry document.
需要说明的是,第一行业文档可以由盘古NLP大模型得到。其中,构建盘古NLP大模型的方式可以为构建公司数据库,包含公司过往投标标书库、公司荣誉资质库、公司项目合同信息库、公司案例库、公司产品库等。以公司数据库中的数据为训练集构建盘古NLP大模型。然后将规范文档及文档生成需求作为输入,自动生成符合该项目需求的第一行业文档。It should be noted that the first industry document can be obtained by the Pangu NLP big model. Among them, the way to build the Pangu NLP big model is to build a company database, including the company's past bidding documents library, company honors and qualifications library, company project contract information library, company case library, company product library, etc. The Pangu NLP big model is built with the data in the company database as the training set. Then, the specification document and document generation requirements are used as input to automatically generate the first industry document that meets the project requirements.
步骤S103,响应于用户的第二指示,基于预设的行业数据库及规范文档提取评价规则。Step S103, in response to the second instruction of the user, extracting evaluation rules based on a preset industry database and specification documents.
第二指示可以为用户通过鼠标点击、键盘选择或者触屏所产生的指示。在用户发出第二指示时,相应地,计算机设备可以响应用户发出的第二指示,从预设的行业数据库中提取对应的规范文档的评价规则。其中,评价规则可以为如何对生成的第一行业文档进行评测。具体地评价过程在下文进行详细描述。The second instruction may be an instruction generated by a user through a mouse click, keyboard selection, or touch screen. When the user issues the second instruction, the computer device may respond to the second instruction issued by the user and extract the evaluation rules of the corresponding specification document from the preset industry database. The evaluation rules may be how to evaluate the generated first industry document. The specific evaluation process is described in detail below.
预设的行业数据库可以包括公司的基本信息、产品信息、本公司所属的行业常规解决方案信息等,在此不做具体限定。例如:公司A的名称、生产的产品A、公司B的名称、生成的产品B等。The preset industry database may include basic information of the company, product information, general solution information of the industry to which the company belongs, etc., which are not specifically limited here. For example: the name of company A, product A produced, the name of company B, product B produced, etc.
步骤S104,基于评价规则对第一行业文档进行分析评测,并生成第一优化方案。Step S104: Analyze and evaluate the first industry document based on the evaluation rules, and generate a first optimization solution.
计算机设备在提取评价规则之后,可以通过评价规则对第一行业文档进行分析评测,生成第一优化方案。其中,第一优化方案可以为针对第一行业文档中不符合要求的内容进行调整的优化方案。例如:以投标文件为例,第一行业文档可以为投标文件。其中,第一行业文档进行评测后的分数为80分,并不满足要求,那么需要根据第一行业文档分析出不符合要求的内容,并给出对该内容进行调整的第一优化方案。After extracting the evaluation rules, the computer device can analyze and evaluate the first industry document according to the evaluation rules to generate a first optimization plan. The first optimization plan can be an optimization plan for adjusting the content in the first industry document that does not meet the requirements. For example, taking the bidding document as an example, the first industry document can be a bidding document. The score of the first industry document after evaluation is 80 points, which does not meet the requirements. Therefore, it is necessary to analyze the content that does not meet the requirements according to the first industry document and provide a first optimization plan for adjusting the content.
步骤S105,响应于用户的确认指示,基于第一优化方案生成第二行业文档。Step S105 , in response to a confirmation instruction from the user, generating a second industry document based on the first optimization solution.
确认指示可以为用户通过鼠标点击、键盘选择或者触屏所产生的指示。在用户发出确认指示时,相应地,计算机设备可以响应用户发出的确认指示,通过第一优化方案对第一行业文档进行调整,生成第二行业文档。例如:以投标文件为例,投标文件初步得分为80分。经预设的行业数据库比对,针对以往投标策略,第一行业文档报价过高,建议进一步优化。The confirmation instruction may be an instruction generated by a user through a mouse click, keyboard selection, or touch screen. When the user issues a confirmation instruction, the computer device may respond to the confirmation instruction issued by the user, adjust the first industry document through the first optimization scheme, and generate a second industry document. For example, taking the bidding document as an example, the preliminary score of the bidding document is 80 points. After comparison with the preset industry database, the first industry document price is too high for the previous bidding strategy, and further optimization is recommended.
本实施例提供的行业文档的生成方法,在通过规范文档及文档生成需求生成第一行业文档之后,可以根据用户的第二指示对第一行业文档进行优化,从而得到第二行业文档,能够提高数据、资料的查找效率的同时,提高行业文档的完整性。The industry document generation method provided in this embodiment can optimize the first industry document according to the user's second instruction to obtain a second industry document after generating the first industry document through standard documents and document generation requirements, thereby improving the efficiency of searching for data and information while improving the integrity of industry documents.
在一个可选的实施方式中,为了使第二行业文档更满足于用户的需求,在步骤S105之前,上述方法还包括:In an optional implementation, in order to make the second industry document more satisfying to the needs of the user, before step S105, the method further includes:
步骤a1,响应于用户确认的辅助评价规则,调整评价规则。Step a1: adjusting the evaluation rules in response to the auxiliary evaluation rules confirmed by the user.
辅助评价规则可以为用户通过鼠标点击、键盘选择、语音或者触屏所输入的规则。在用户发出辅助评价规则时,相应地,计算机设备可以响应用户发出的辅助评价规则,通过辅助评价规则对评价规则进行调整。其中,辅助评价规则可以为用户个性化的建议。例如:以投标文件为例,用户想要该投标文件以更低的成本、合适的报价、最高的利润等,中标该项目,那么可以将用户的辅助评价规则发送给计算机设备,以使计算机设备对评价规则进行调整。The auxiliary evaluation rules may be rules entered by the user through mouse clicks, keyboard selections, voice or touch screen. When the user issues an auxiliary evaluation rule, the computer device may respond to the auxiliary evaluation rule issued by the user and adjust the evaluation rule through the auxiliary evaluation rule. The auxiliary evaluation rule may be a personalized suggestion for the user. For example, taking a bidding document as an example, the user wants the bidding document to win the bid for the project with a lower cost, a suitable quotation, the highest profit, etc., then the user's auxiliary evaluation rule may be sent to the computer device so that the computer device adjusts the evaluation rule.
具体地,上述步骤a1包括:Specifically, the above step a1 includes:
步骤a11,基于预设的行业数据库从规范文档中提取与辅助评价规则对应的评价参数及评价权重。Step a11, extracting evaluation parameters and evaluation weights corresponding to the auxiliary evaluation rules from the specification document based on a preset industry database.
辅助评价规则中存在用户针对的侧重点,如更低的成本、最高的利润等,在此不做具体限定,那么可以根据用户的辅助评价规则以及预设的行业数据库,从规范文档中提取与辅助评价规则对应的评价参数及评价权重。The auxiliary evaluation rules contain user-specific focuses, such as lower cost, highest profit, etc., which are not specifically limited here. Therefore, based on the user's auxiliary evaluation rules and the preset industry database, the evaluation parameters and evaluation weights corresponding to the auxiliary evaluation rules can be extracted from the specification document.
步骤a12,基于评价参数及评价权重调整评价规则。Step a12, adjusting the evaluation rules based on the evaluation parameters and evaluation weights.
计算机设备在获取到辅助评价规则对应的评价参数及评价权重后,需要根据评价参数及评价权重对评价规则进一步调整,也即修改评价规则中其他部分的评价权重以及评价参数。After obtaining the evaluation parameters and evaluation weights corresponding to the auxiliary evaluation rules, the computer device needs to further adjust the evaluation rules according to the evaluation parameters and evaluation weights, that is, to modify the evaluation weights and evaluation parameters of other parts of the evaluation rules.
步骤a2,基于调整后的评价规则对第一行业文档进行分析评测,并生成第二优化方案。Step a2: analyzing and evaluating the first industry document based on the adjusted evaluation rules, and generating a second optimization solution.
计算机设备根据调整后的评价规则重新对第一行业文档进行分析评测,也即自动识别第一行业文档和调整后的评价规则相对应的内容,从而确定第一行业文档内影响较大内容。例如:以投标文件为例,第一行业文档可以为投标文件。其中,对第一行业文档进行评测后,建议对展示法人关系图谱参数进行优化。第二优化方案可以为将知识图谱技术实现展示法人关系图谱参数的功能修改为采用数据可视化展示方式实现展示法人关系图谱参数的功能;由此,可降低技术难度,从而降低开发、运维成本,建议报价减少2万元,利润预估仍将提升10%,标书总评分将提升为85分。The computer equipment re-analyzes and evaluates the first industry document according to the adjusted evaluation rules, that is, automatically identifies the content corresponding to the first industry document and the adjusted evaluation rules, thereby determining the content with greater impact in the first industry document. For example: taking the bidding document as an example, the first industry document can be a bidding document. Among them, after evaluating the first industry document, it is recommended to optimize the parameters for displaying the legal person relationship map. The second optimization plan can be to modify the function of displaying the parameters of the legal person relationship map implemented by the knowledge graph technology to the function of displaying the parameters of the legal person relationship map using a data visualization display method; thereby, the technical difficulty can be reduced, thereby reducing the development and operation and maintenance costs. It is recommended to reduce the quotation by 20,000 yuan, and the profit estimate will still increase by 10%, and the total score of the bid will be increased to 85 points.
具体地,上述步骤a2包括:Specifically, the above step a2 includes:
步骤a21,从第一行业文档中提取评价参数对应的待评测内容。Step a21, extracting the content to be evaluated corresponding to the evaluation parameter from the first industry document.
步骤a22,基于调整后的评价规则,对待评测内容进行评价,生成评价结果。Step a22, based on the adjusted evaluation rules, evaluate the content to be evaluated and generate an evaluation result.
计算机设备可以从第一行业文档中提取待评测内容,计算机设备根据调整后的评价规则重新对待评测内容进行评价,生成评价结果。其中,由于调整后的评价规则中评价权重以及评价参数已经改变,那么生成的评价结果就会发生改变。The computer device can extract the content to be evaluated from the first industry document, and re-evaluate the content to be evaluated according to the adjusted evaluation rules to generate an evaluation result. Since the evaluation weight and evaluation parameters in the adjusted evaluation rules have changed, the generated evaluation result will change.
步骤a23,基于评价结果分析待评测内容中需要进行调整的内容。Step a23, analyzing the content to be evaluated that needs to be adjusted based on the evaluation results.
步骤a24,基于需要进行调整的内容生成第二优化方案。Step a24, generating a second optimization solution based on the content that needs to be adjusted.
根据上述评价结果可以分析出待评测内容中哪些内容不符合要求,则需要对不符合要求的内容进行调整,然后根据调整的内容生成第二优化方案。例如:以投标文件为例,第一行业文档可以为投标文件。其中,第一行业文档进行评测后,展示软件著作权数量建议优化。第二优化方案可以为将知识图谱技术实现展示软件著作权数量的功能修改为采用数据可视化展示方式实现展示软件著作权数量的功能;由此可降低技术难度,从而降低开发、运维成本,建议报价减少2万元,利润预估仍将提升20%,标书总评分将提升为90分。According to the above evaluation results, it can be analyzed which contents in the content to be evaluated do not meet the requirements, and it is necessary to adjust the contents that do not meet the requirements, and then generate a second optimization plan based on the adjusted contents. For example: taking the bidding document as an example, the first industry document can be the bidding document. Among them, after the first industry document is evaluated, it is recommended to optimize the display of the number of software copyrights. The second optimization plan can be to modify the function of displaying the number of software copyrights by using knowledge graph technology to use data visualization to display the number of software copyrights; thereby reducing the technical difficulty, thereby reducing the development and operation and maintenance costs, and it is recommended to reduce the quotation by 20,000 yuan, and the profit estimate will still increase by 20%, and the total score of the bid will be increased to 90 points.
本实施例提供的行业文档的生成方法,通过用户确认辅助评价规则,并根据辅助评价规则调整评价规则,使得评价规则能够侧重于用户的需求,从而能够提高行业文档的中标率。The method for generating industry documents provided in this embodiment allows users to confirm auxiliary evaluation rules and adjust evaluation rules based on the auxiliary evaluation rules, so that the evaluation rules can focus on user needs, thereby improving the winning rate of industry documents.
在本实施例中提供了一种行业文档的生成方法,可用于上述计算机设备,如电脑、服务器等,图2是根据本发明实施例的行业文档的生成方法的流程图,如图2所示,该流程包括如下步骤:In this embodiment, a method for generating an industry document is provided, which can be used for the above-mentioned computer equipment, such as a computer, a server, etc. FIG. 2 is a flow chart of the method for generating an industry document according to an embodiment of the present invention. As shown in FIG. 2 , the process includes the following steps:
步骤S201,响应于用户的第一指示,获取相应的规范文档及文档生成需求。详细请参见图1所示实施例的步骤S101,在此不再赘述。Step S201, in response to the first instruction of the user, obtain the corresponding specification document and document generation requirements. Please refer to step S101 of the embodiment shown in FIG1 for details, which will not be repeated here.
步骤S202,基于规范文档及文档生成需求,生成第一行业文档。Step S202: Generate a first industry document based on the specification document and document generation requirements.
具体地,上述步骤S202包括:Specifically, the above step S202 includes:
步骤S2021,通过自然语言处理方法解析规范文档,提取规范文档中的待填信息。Step S2021, parse the specification document through natural language processing method, and extract the information to be filled in the specification document.
自然语言处理方法通过机器学习进行工作。机器学习系统像其他任何形式的数据一样存储单词及其组合方式。短语、句子、有时甚至整本书的内容都被输入机器学习引擎,并在其中使用语法规则或人们的现实语言习惯,或两者兼而有之进行处理。然后,计算机使用这些数据来查找模式并推断出接下来的结果。以翻译软件为例:在法语中,“我要去公园”是“Je vais au parc”,因此机器学习预测“我要去商店”也将以“Je vais au”开头。因此,可以通过自然语言处理方法将规范文档中的待填信息提取出来。Natural language processing methods work through machine learning. Machine learning systems store words and how they are combined, just like any other form of data. Phrases, sentences, and sometimes even entire books are fed into a machine learning engine, where they are processed using grammatical rules or people's real-world language habits, or both. The computer then uses this data to find patterns and infer what will happen next. Take translation software as an example: in French, "I'm going to the park" is "Je vais au parc", so machine learning predicts that "I'm going to the store" will also start with "Je vais au". Therefore, the information to be filled in the specification document can be extracted through natural language processing methods.
具体地,上述步骤S2021包括:Specifically, the above step S2021 includes:
步骤b1,基于自然语言处理数据库对规范文档进行分词处理,提取词语信息。Step b1, perform word segmentation processing on the standard document based on the natural language processing database to extract word information.
自然语言处理数据库指用于存储和查询语言相关数据的数据库。具体地,可以包括HanLP数据库、Jieba数据库等,在此不做具体限定。其中,自然语言处理数据库可以对规范文档进行分词处理,提取词语信息,并对文本进行清理,移除无关内容,如标点、特殊符号等,在此不做具体限定。A natural language processing database refers to a database used to store and query language-related data. Specifically, it may include the HanLP database, Jieba database, etc., which are not specifically limited here. Among them, the natural language processing database can perform word segmentation on the standard document, extract word information, and clean up the text to remove irrelevant content, such as punctuation, special symbols, etc., which are not specifically limited here.
步骤b2,使用命名实体识别,从规范文档中提取文档需求信息。Step b2, using named entity recognition, extracts document requirement information from the specification document.
文档需求信息可以为含有需要、要求等关键词的句子。具体地,可以构建针对需求描述的规则模板,并将含有文档需求信息的句子使用命名实体识别。从规范文档中抽取产品、功能、对象等信息,并使用依存句法分析,并抽取述宾结构表示的需求。其中,依存句法用于描述出各个词语之间的依存关系。Document requirement information can be sentences containing keywords such as need and requirement. Specifically, a rule template for requirement description can be constructed, and sentences containing document requirement information can be recognized using named entity recognition. Information such as products, functions, and objects can be extracted from the specification document, and dependency syntax analysis can be used to extract requirements expressed by the subject-object structure. Dependency syntax is used to describe the dependency relationship between words.
步骤b3,对文档需求信息进行聚类处理,得到文档需求合集。Step b3: cluster the document demand information to obtain a document demand collection.
文档需求合集可以为含有需要、要求等关键词的句子的组合。具体地,通过分析语义相似的句子,并通过聚类处理得到文档需求合集。可选的,聚类方法可以为K-MEANS聚类算法、均值偏移聚类算法、DBSCAN聚类算法等,在此不做具体限定。The document requirement collection may be a combination of sentences containing keywords such as need and requirement. Specifically, the document requirement collection is obtained by analyzing sentences with similar semantics and performing clustering processing. Optionally, the clustering method may be a K-MEANS clustering algorithm, a mean shift clustering algorithm, a DBSCAN clustering algorithm, etc., which are not specifically limited here.
步骤b4,基于评分关键词从规范文档中提取评分内容。Step b4, extracting the scoring content from the specification document based on the scoring keywords.
评分关键词可以为含有重要、必须等评分关键词。具体地,构建针对评分关键词的规则模板,针对评分、分值、权重等词引导的从句进行句法分析,提取评分内容。The scoring keywords may include scoring keywords such as important, necessary, etc. Specifically, a rule template for scoring keywords is constructed, and syntactic analysis is performed on clauses introduced by words such as score, value, weight, etc. to extract scoring content.
步骤b5,使用情感分析方法及语义分析方法从规范文档中提取评分要点。Step b5, using sentiment analysis methods and semantic analysis methods to extract scoring points from the specification document.
情感分析方法为利用自然语言处理和文本挖掘技术,对带有情感色彩的主观性文本进行分析、处理和抽取的过程。具体地,可以为朴素贝叶斯情感分析法、深度学习LSTM、预训练的基于规则的VADER模型等,在此不做具体限定。Sentiment analysis methods are the process of analyzing, processing and extracting emotionally subjective texts using natural language processing and text mining techniques. Specifically, they can be naive Bayes sentiment analysis, deep learning LSTM, pre-trained rule-based VADER models, etc., which are not specifically limited here.
语义分析方法为运用各种机器学习算法学习与理解一段文本所表示的语义内容。Semantic analysis methods use various machine learning algorithms to learn and understand the semantic content represented by a piece of text.
计算机设备可以通过情感分析方法判断句子中蕴含的评分倾向,然后采用语义分析方法从规范文档中对评分要点进行抽取。Computer equipment can use sentiment analysis methods to determine the scoring tendency contained in a sentence, and then use semantic analysis methods to extract scoring points from the standard document.
步骤b6,基于评分内容及评分要点生成评分集合。Step b6, generating a rating set based on the rating content and rating points.
计算机设备在获取到评分内容和评分要点之后,可以将评分内容和评分要点进行合并,生成评分集合。After acquiring the scoring content and the scoring key points, the computer device may merge the scoring content and the scoring key points to generate a scoring set.
步骤b7,基于格式关键词从规范文档中提取格式描述语句。Step b7, extracting format description sentences from the specification document based on format keywords.
格式关键词可以为含有输入、打印等格式关键词,在此不做具体限定。具体地,根据格式关键词识别格式描述语句,并从规范文档中对格式描述语句进行提取。The format keywords may include format keywords such as input, print, etc., which are not specifically limited here. Specifically, the format description sentence is identified according to the format keyword, and the format description sentence is extracted from the specification document.
步骤b8,利用句法分析提取格式描述语句中的各个输出字段。Step b8, extracting each output field in the format description statement by using syntax analysis.
句法分析可以为句子成分分析法、层次分析法、变化分析法等,在此不做具体限定。其中,以句子成分分析法为例,通过句子成分分析法对格式描述语句进行分析,分析时要求一举找出整个结构的两个中心词一名词中心词和动词中心词,以作为该结构的主要成分主语和谓语,让其他成分分别依附于主语和谓语。其分析过程是:先看清整个结构的主要成分,哪个是主语,哪个是谓语;再看谓语的动词是不是及物动词,以决定谓语后面是否有连带成分宾语;最后指出附加在主、宾之前,谓语前后的所有附加成分,然后将各个输出字段进行提取。Syntactic analysis can be sentence component analysis, hierarchical analysis, variation analysis, etc., which are not specifically limited here. Among them, taking sentence component analysis as an example, the format description sentence is analyzed by sentence component analysis. During the analysis, it is required to find the two central words of the entire structure at one go, a noun central word and a verb central word, as the main components of the structure, the subject and the predicate, and let the other components be attached to the subject and the predicate respectively. The analysis process is: first see the main components of the entire structure, which is the subject and which is the predicate; then see whether the verb of the predicate is a transitive verb to determine whether there is a joint component object after the predicate; finally, point out all the additional components attached before the subject and object, and before and after the predicate, and then extract each output field.
步骤b9,通过语义匹配找到各个输出字段的字段说明,提取字段含义解释。Step b9, find the field description of each output field through semantic matching, and extract the field meaning explanation.
在确定出各个输出字段之后,通过语义匹配找到各个输出字段的字段说明,并提取字段含义解释。After each output field is determined, the field description of each output field is found through semantic matching, and the field meaning explanation is extracted.
步骤b10,基于文档需求合集、评分集合及字段含义解释生成待填信息。Step b10, generating information to be filled in based on the document requirement collection, the score collection and the field meaning explanation.
由于待填信息由文档需求合集、评分集合及字段含义组成,那么计算机设备在获取到文档需求合集、评分集合及字段含义解释后就可以生成待填信息。Since the information to be filled in is composed of a document requirement collection, a score collection and field meanings, the computer device can generate the information to be filled in after obtaining the document requirement collection, the score collection and the field meaning interpretation.
步骤S2022,基于文档生成需求及预设的行业数据库,提取行业内容生成待填信息中的内容,生成第一行业文档。Step S2022, based on the document generation requirements and the preset industry database, extract the industry content to generate the content in the information to be filled in, and generate the first industry document.
由上文内容可知,文档生成需求可以为上传文件内的要求(如文档生成格式、文档生成内容),预设的行业数据库预可以包括公司的基本信息、产品信息、本公司所属的行业常规解决方案信息等,那么根据文档生成需求和预设的行业数据库就可以将对应上传文件的行业内容提取出来,从而将行业内容作为待填信息中的内容,并根据待填信息中的内容生成第一行业文档。From the above content, it can be seen that the document generation requirements can be the requirements in the uploaded file (such as document generation format, document generation content), and the preset industry database can include the company's basic information, product information, and general solution information of the industry to which the company belongs, etc. Then, according to the document generation requirements and the preset industry database, the industry content of the corresponding uploaded file can be extracted, so that the industry content can be used as the content in the information to be filled in, and the first industry document can be generated according to the content in the information to be filled in.
步骤S203,响应于用户的第二指示,基于预设的行业数据库及规范文档提取评价规则。详细请参见图1所示实施例的步骤S103,在此不再赘述Step S203, in response to the second instruction of the user, extracting evaluation rules based on the preset industry database and standard documents. Please refer to step S103 of the embodiment shown in FIG. 1 for details, and will not be repeated here.
步骤S204,基于评价规则对第一行业文档进行分析评测,并生成第一优化方案。详细请参见图1所示实施例的步骤S104,在此不再赘述。Step S204: Analyze and evaluate the first industry document based on the evaluation rule, and generate a first optimization solution. Please refer to step S104 of the embodiment shown in FIG1 for details, which will not be described here.
步骤S205,响应于用户的确认指示,基于第一优化方案生成第二行业文档。详细请参见图1所示实施例的步骤S105,在此不再赘述。Step S205: In response to the user's confirmation instruction, generate a second industry document based on the first optimization solution. Please refer to step S105 of the embodiment shown in FIG1 for details, which will not be repeated here.
本实施例提供的行业文档的生成方法,通过自然语言处理方法解析规范文档,并根据文档生成需求和预设的行业数据库提取对应待填信息的数据的方式,能够智能化生成第一行业文档,从而提高第一行业文档生成效率的同时,保证第一行业文档中内容的准确率。The industry document generation method provided in this embodiment parses the standard document through a natural language processing method, and extracts data corresponding to the information to be filled in according to the document generation requirements and a preset industry database, so as to intelligently generate the first industry document, thereby improving the efficiency of the first industry document generation while ensuring the accuracy of the content in the first industry document.
在本实施例中提供了一种行业文档的生成方法,可用于上述的计算机设备,如电脑、服务器等,图3是根据本发明实施例的行业文档的生成方法的流程图,如图3所示,该流程包括如下步骤:In this embodiment, a method for generating an industry document is provided, which can be used for the above-mentioned computer equipment, such as a computer, a server, etc. FIG. 3 is a flow chart of the method for generating an industry document according to an embodiment of the present invention. As shown in FIG. 3 , the process includes the following steps:
步骤S301,响应于用户的第一指示,获取相应的规范文档及文档生成需求。详细请参见图2所示实施例的步骤S201,在此不再赘述。Step S301, in response to the first instruction of the user, obtain the corresponding specification document and document generation requirements. Please refer to step S201 of the embodiment shown in FIG2 for details, which will not be repeated here.
步骤S302,基于规范文档及文档生成需求,生成第一行业文档。详细请参见图2所示实施例的步骤S202,在此不再赘述。Step S302: Generate a first industry document based on the specification document and the document generation requirement. Please refer to step S202 of the embodiment shown in FIG2 for details, which will not be described in detail here.
步骤S303,响应于用户的第二指示,基于预设的行业数据库及规范文档提取评价规则。Step S303, in response to the second instruction of the user, extracting evaluation rules based on a preset industry database and specification documents.
具体地,上述步骤S303包括:Specifically, the above step S303 includes:
步骤S3031,基于预设的行业数据库从规范文档中提取相应的评价参数及评价权重。Step S3031, extracting corresponding evaluation parameters and evaluation weights from the specification document based on a preset industry database.
规范文档包括不同因素对应的评价参数以及评价权重,根据预设的行业数据库能够从规范文档内提取每一个因素对应的评价参数及评价权重。The specification document includes evaluation parameters and evaluation weights corresponding to different factors. The evaluation parameters and evaluation weights corresponding to each factor can be extracted from the specification document according to the preset industry database.
步骤S3032,基于评价参数及评价权重生成评价规则。Step S3032: Generate evaluation rules based on evaluation parameters and evaluation weights.
评分规则:F1、F2、F3、F4分别对应技术参数、商务参数、投标报价参数、优先采购政策参数;A1、A2、A3、A4分别对应技术参数、商务参数、投标报价参数权重。总系数=A1×F1+A2×F2+A3×F3+A4×F4。Scoring rules: F1, F2, F3, and F4 correspond to technical parameters, business parameters, bidding quotation parameters, and priority procurement policy parameters respectively; A1, A2, A3, and A4 correspond to the weights of technical parameters, business parameters, and bidding quotation parameters respectively. Total coefficient = A1×F1+A2×F2+A3×F3+A4×F4.
F1包含子参数:F11、F12、F1n,分别对应上述上传文件(如招标文件)中明确要求的技术要求,如行政区划代码变更模块开发与维护、全员人口数据查询接口升级维护、终端数据加解密功能等;F1 contains sub-parameters: F11, F12, and F1n, which correspond to the technical requirements clearly required in the above-mentioned uploaded documents (such as bidding documents), such as the development and maintenance of the administrative division code change module, the upgrade and maintenance of the full population data query interface, and the terminal data encryption and decryption functions;
F2包含子参数:F21、F22、F23、F24、F2n,分别对应上述上传文件(如招标文件)中明确要求的商务要求,如技术能力、服务能力、类似业绩等。其中,技术能力参数则还可细分为F211、F212、f21n等参数,分别代表软件著作权数量、资质证书数量、荣誉证书数量等;F2 contains sub-parameters: F21, F22, F23, F24, and F2n, which correspond to the business requirements clearly required in the above uploaded documents (such as bidding documents), such as technical capabilities, service capabilities, similar performance, etc. Among them, the technical capability parameter can be further subdivided into parameters such as F211, F212, and f21n, which respectively represent the number of software copyrights, the number of qualification certificates, the number of honorary certificates, etc.;
F3包含子参数:F31、F32、F33、F34、F3n,分别对应了软件各功能模块、硬件、运维等;F3 contains sub-parameters: F31, F32, F33, F34, and F3n, which correspond to software functional modules, hardware, operation and maintenance, etc.
F4包含子参数:F41、F42、F43、F44、F4n,分别对应了招标文件中明确要求的优先采购加分要求,如中小企业、两型产品等;其中,F1、F2、F3、F4系数为各子项系数之和。F4 contains sub-parameters: F41, F42, F43, F44, and F4n, which respectively correspond to the priority procurement bonus requirements clearly required in the bidding documents, such as small and medium-sized enterprises and two-type products; among them, the F1, F2, F3, and F4 coefficients are the sum of the coefficients of each sub-item.
具体地,上述步骤S3032可以包括:Specifically, the above step S3032 may include:
步骤c1,从第一行业文档中提取评价参数对应的待评测内容。Step c1, extracting the content to be evaluated corresponding to the evaluation parameter from the first industry document.
步骤c2,基于评价规则,对待评测内容进行评价,生成评价结果。Step c2: Based on the evaluation rules, the content to be evaluated is evaluated to generate an evaluation result.
计算机设备可以从第一行业文档内提取待评测内容。其中,计算机设备根据调整后的评价规则重新对待评测内容进行评价,生成评价结果。其中,由于调整后的评价规则中评价权重以及评价参数已经改变,那么生成的评价结果就可以发生改变。The computer device may extract the content to be evaluated from the first industry document. The computer device re-evaluates the content to be evaluated according to the adjusted evaluation rules to generate an evaluation result. Since the evaluation weight and evaluation parameters in the adjusted evaluation rules have changed, the generated evaluation result may change.
步骤c3,基于评价结果分析待评测内容中需要进行调整的内容。Step c3: analyzing the content to be evaluated that needs to be adjusted based on the evaluation results.
步骤c4,基于需要进行调整的内容生成第一优化方案。Step c4: generating a first optimization solution based on the content that needs to be adjusted.
根据上述评价结果可以分析出待评测内容中哪些内容不符合要求,则需要对不符合要求的内容进行调整,然后根据调整的内容生成第一优化方案。例如:以投标文件为例,第一行业文档可以为投标文件。其中,第一行业文档进行评测后,展示软件著作权数量建议优化。第一优化方案可以为将知识图谱技术实现展示软件著作权数量的功能修改为采用数据可视化展示方式实现展示软件著作权数量的功能;由此可降低技术难度,从而降低开发、运维成本,建议报价减少2万元,利润预估仍将提升20%,标书总评分将提升为90分。According to the above evaluation results, it can be analyzed which contents in the content to be evaluated do not meet the requirements, and it is necessary to adjust the contents that do not meet the requirements, and then generate the first optimization plan based on the adjusted contents. For example: taking the bidding document as an example, the first industry document can be the bidding document. Among them, after the first industry document is evaluated, it is recommended to optimize the display of the number of software copyrights. The first optimization plan can be to modify the function of displaying the number of software copyrights by using knowledge graph technology to use data visualization to display the number of software copyrights; thereby reducing the technical difficulty, thereby reducing the development and operation and maintenance costs, and it is recommended to reduce the quotation by 20,000 yuan, the profit estimate will still increase by 20%, and the total score of the bid will be increased to 90 points.
步骤S304,基于评价规则对第一行业文档进行分析评测,并生成第一优化方案。详细请参见图2所示实施例的步骤S203,在此不再赘述。Step S304: Analyze and evaluate the first industry document based on the evaluation rule, and generate a first optimization solution. Please refer to step S203 of the embodiment shown in FIG2 for details, which will not be described here.
步骤S305,响应于用户的确认指示,基于第一优化方案生成第二行业文档。详细请参见图2所示实施例的步骤S205,在此不再赘述。Step S305: In response to the user's confirmation instruction, generate a second industry document based on the first optimization solution. Please refer to step S205 of the embodiment shown in FIG2 for details, which will not be repeated here.
本实施例提供的行业文档的生成方法,通过评价规则对第一行业文档中待评测内容进行评价,从而能够确定待评测内容中影响较大的内容,进而针对该内容进行调整,使得生成的第二行业文档能够更符合中标需求。The method for generating industry documents provided in this embodiment evaluates the content to be evaluated in the first industry document through evaluation rules, so as to determine the content with greater influence in the content to be evaluated, and then adjust the content so that the generated second industry document can better meet the bidding requirements.
大模型(如上述盘古NLP大模型)采用多模型协作的方案,模拟第一行业文档以及第二行业文档的撰写流程,并根据输出内容提供自动优化功能。在一个可选的实施方式中,图4示出了一种行业文档生成方法的架构示意图、图5示出了一种行业文档生成方法的流程示意图。The large model (such as the Pangu NLP large model mentioned above) adopts a multi-model collaboration solution to simulate the writing process of the first industry document and the second industry document, and provides automatic optimization functions based on the output content. In an optional implementation, FIG4 shows an architectural diagram of an industry document generation method, and FIG5 shows a process diagram of an industry document generation method.
总体架构分为:协作层和模型层。The overall architecture is divided into: collaboration layer and model layer.
协作层的作用是协助模型层完成任务,也是模型层的底层依赖。具体地,协作层为每个大模型分配角色,具体可以为定义一组角色,例如技术人员、审核人员等,并为每个角色定义一些属性,例如角色名称、目标、约束等,在此不做具体限定。并将每个大语言模型的上下文存入数据库中,供其他大语言模型调用。The role of the collaboration layer is to assist the model layer in completing tasks, and it is also the underlying dependency of the model layer. Specifically, the collaboration layer assigns roles to each large model. Specifically, it can define a set of roles, such as technicians, auditors, etc., and define some attributes for each role, such as role name, goals, constraints, etc., which are not specifically limited here. The context of each large language model is stored in the database for other large language models to call.
模型层的作用是接收用户信息,并输出最终的行业文档。具体地,多个大模型之间通过协作层交换信息,最终实现只需要用户的需求即可完成整个行业文档的输出流程。具体地,模型层可以包括模型A和模型B。其中,模型A:技术人员角色,负责根据规范文档和文档生成需求进行第一行业文档的输出。模型B:审核人员角色,负责理解评分规则,并根据评分规则对模型A的输出进行打分。其中,低于评分标准,则生成一份修改说明书,然后模型A根据该说明书和用户修改意见,进行多次迭代,直到达到评分标准。The role of the model layer is to receive user information and output the final industry document. Specifically, multiple large models exchange information through the collaboration layer, and ultimately the entire industry document output process can be completed based on user needs. Specifically, the model layer can include model A and model B. Model A: The role of a technician, responsible for outputting the first industry document according to the specification document and document generation requirements. Model B: The role of an auditor, responsible for understanding the scoring rules and scoring the output of model A according to the scoring rules. If it is below the scoring standard, a modification instruction is generated, and then model A is iterated multiple times based on the instruction and user modification suggestions until the scoring standard is met.
结合图4所示,图4中的文件可以为投标文件,通过自然语言处理方法(也即图4中的NLP技术)对投标文件进行处理,得到投标需求,评分标准以及招标格式。其中,投标需求可以为文件生成需求、招标格式可以为规范文档、评分标准可以为评价规则。然后在公司投标数据库提取对应的数据给到协作层,然后协作层将数据给到模型A,模型A根据投标需求和招标格式生成初稿(也即第一行业文档),然后通过协作层将第一行业文档发送给模型B,模型B对第一行业文档进行评价,并给出反馈。其中,用户可以增加辅助评分规则,也即图4中我想要一个“XX”的标书,例如:我想要一个更低的成本、合适的报价、最高的利润的标书。然后通过辅助评分规则,再次给出反馈,反馈的内容可以为修改“XX”部分,例如:将知识图谱技术修改为数据可视化展示方式。As shown in FIG. 4 , the file in FIG. 4 may be a bidding document. The bidding document is processed by a natural language processing method (i.e., the NLP technology in FIG. 4 ) to obtain bidding requirements, scoring criteria, and bidding formats. Among them, the bidding requirements may be document generation requirements, the bidding format may be a specification document, and the scoring criteria may be an evaluation rule. Then, the corresponding data is extracted from the company's bidding database and given to the collaboration layer, and then the collaboration layer gives the data to model A. Model A generates a draft (i.e., the first industry document) according to the bidding requirements and the bidding format, and then the first industry document is sent to model B through the collaboration layer. Model B evaluates the first industry document and gives feedback. Among them, the user can add auxiliary scoring rules, i.e., I want a "XX" bid in FIG. 4, for example: I want a bid with lower cost, appropriate quotation, and highest profit. Then, through the auxiliary scoring rules, feedback is given again, and the content of the feedback may be to modify the "XX" part, for example: modify the knowledge graph technology to a data visualization display method.
需要说明的是,协作层可以包括对话上下文以及文档共享。其中,对话上下文可以存储于对话数据库。文档共享用于实现模型A和模型B之间的文档共享。It should be noted that the collaboration layer may include conversation context and document sharing. The conversation context may be stored in a conversation database. Document sharing is used to implement document sharing between model A and model B.
结合图5所示,图5中的投标文件可以为第一行业文档和第二行业文档,招标文件可以为上述上传至计算机服务器的文件,投标需求可以为文件生成需求,招标格式可以为规范文档。其中,用户将招标文件输入至计算机服务器内,计算机服务器通过协作层分析招标文件的投标需求和招标格式,生成第一行业文档,然后通过模型B对第一行业文档进行评价,并给出改进建议,也即第一优化方案。其中,可以增加辅助评分规则,并根据辅助评分规则对评分规则进行修改,并根据修改后的评分规则重新对第一行业文档进行评价,给出具体优化建议,也即第二优化方案。As shown in FIG. 5 , the bidding document in FIG. 5 can be a first industry document and a second industry document, the bidding document can be the above-mentioned file uploaded to the computer server, the bidding requirements can be file generation requirements, and the bidding format can be a standard document. Among them, the user inputs the bidding document into the computer server, and the computer server analyzes the bidding requirements and bidding format of the bidding document through the collaboration layer to generate the first industry document, and then evaluates the first industry document through model B and gives improvement suggestions, that is, the first optimization solution. Among them, auxiliary scoring rules can be added, and the scoring rules can be modified according to the auxiliary scoring rules, and the first industry document can be re-evaluated according to the modified scoring rules, and specific optimization suggestions can be given, that is, the second optimization solution.
在本实施例中还提供了一种行业文档生成装置,该装置用于实现上述实施例及优选实施方式,已经进行过说明的不再赘述。如以下所使用的,术语“模块”可以实现预定功能的软件和/或硬件的组合。尽管以下实施例所描述的装置较佳地以软件来实现,但是硬件,或者软件和硬件的组合的实现也是可能并被构想的。In this embodiment, an industry document generation device is also provided, which is used to implement the above-mentioned embodiments and preferred implementation modes, and the descriptions that have been made will not be repeated. As used below, the term "module" can implement a combination of software and/or hardware of a predetermined function. Although the devices described in the following embodiments are preferably implemented in software, the implementation of hardware, or a combination of software and hardware, is also possible and conceivable.
本实施例提供一种行业文档生成装置,如图6所示,包括:获取模块601,用于响应于用户的第一指示,获取相应的规范文档及文档生成需求;第一生成模块602,用于基于规范文档及文档生成需求,生成第一行业文档;提取模块603,用于响应于用户的第二指示,基于预设的行业数据库及规范文档提取评价规则;分析生成模块604,用于基于评价规则对第一行业文档进行分析评测,并生成优化方案;第二生成模块605,用于响应于用户的确认指示,基于优化方案生成第二行业文档。This embodiment provides an industry document generation device, as shown in Figure 6, including: an acquisition module 601, used to respond to a first instruction from a user to obtain a corresponding specification document and document generation requirements; a first generation module 602, used to generate a first industry document based on the specification document and the document generation requirements; an extraction module 603, used to respond to a second instruction from a user to extract evaluation rules based on a preset industry database and specification documents; an analysis generation module 604, used to analyze and evaluate the first industry document based on the evaluation rules and generate an optimization plan; a second generation module 605, used to respond to a confirmation instruction from the user to generate a second industry document based on the optimization plan.
在一个可选的实施方式中,上述装置还包括:调整模块,用于响应于用户确认的辅助评价规则,调整评价规则;评测生成模块,用于基于调整后的评价规则对第一行业文档进行分析评测,并生成第二优化方案。In an optional embodiment, the above-mentioned device also includes: an adjustment module, which is used to adjust the evaluation rules in response to the auxiliary evaluation rules confirmed by the user; an evaluation generation module, which is used to analyze and evaluate the first industry document based on the adjusted evaluation rules and generate a second optimization plan.
在一个可选的实施方式中,第一生成模块602包括:第一提取单元,用于通过自然语言处理方法解析规范文档,提取规范文档中的待填信息;第二提取单元,用于基于文档生成需求及预设的行业数据库,提取行业内容生成待填信息中的内容,生成第一行业文档。In an optional embodiment, the first generation module 602 includes: a first extraction unit, which is used to parse the specification document through a natural language processing method to extract the information to be filled in the specification document; a second extraction unit, which is used to extract industry content to generate the content in the information to be filled in based on the document generation requirements and a preset industry database, and generate a first industry document.
在一个可选的实施方式中,第一提取单元包括:第一提取子单元,用于基于自然语言处理数据库对规范文档进行分词处理,提取词语信息;第二提取子单元,用于使用命名实体识别,从规范文档中提取文档需求信息;聚类子单元,用于对文档需求信息进行聚类处理,得到文档需求合集;第三提取子单元,用于基于评分关键词从规范文档中提取评分内容;第四提取子单元,用于使用情感分析方法及语义分析方法从规范文档中提取评分要点;第一生成子单元,用于基于评分内容及评分要点生成评分集合;第五提取子单元,用于基于格式关键词从规范文档中提取格式描述语句;第六提取子单元,用于利用句法分析提取格式描述语句中的各个输出字段;第七提取子单元,用于通过语义匹配找到各个输出字段的字段说明,提取字段含义解释;第二生成子单元,用于基于文档需求合集、评分集合及字段含义解释生成待填信息。In an optional embodiment, the first extraction unit includes: a first extraction subunit, which is used to perform word segmentation processing on the standard document based on a natural language processing database to extract word information; a second extraction subunit, which is used to use named entity recognition to extract document requirement information from the standard document; a clustering subunit, which is used to cluster the document requirement information to obtain a document requirement collection; a third extraction subunit, which is used to extract scoring content from the standard document based on scoring keywords; a fourth extraction subunit, which is used to extract scoring points from the standard document using sentiment analysis methods and semantic analysis methods; a first generation subunit, which is used to generate a scoring set based on scoring content and scoring points; a fifth extraction subunit, which is used to extract format description statements from the standard document based on format keywords; a sixth extraction subunit, which is used to extract each output field in the format description statement using syntactic analysis; a seventh extraction subunit, which is used to find the field description of each output field through semantic matching and extract the field meaning explanation; and a second generation subunit, which is used to generate information to be filled in based on the document requirement collection, the scoring set and the field meaning explanation.
在一个可选的实施方式中,提取模块603包括:第三提取单元,用于基于预设的行业数据库从规范文档中提取相应的评价参数及评价权重;第一生成单元,用于基于评价参数及评价权重生成评价规则。In an optional embodiment, the extraction module 603 includes: a third extraction unit, used to extract corresponding evaluation parameters and evaluation weights from the specification document based on a preset industry database; and a first generation unit, used to generate evaluation rules based on the evaluation parameters and evaluation weights.
在一个可选的实施方式中,分析生成模块604包括:第四提取单元,用于从第一行业文档中提取评价参数对应的待评测内容;第二生成单元,用于基于评价规则,对待评测内容进行评价,生成评价结果;第一分析单元,用于基于评价结果分析待评测内容中需要进行调整的内容;第三生成单元,用于基于需要进行调整的内容生成第一优化方案。In an optional embodiment, the analysis generation module 604 includes: a fourth extraction unit, used to extract the content to be evaluated corresponding to the evaluation parameters from the first industry document; a second generation unit, used to evaluate the content to be evaluated based on the evaluation rules and generate an evaluation result; a first analysis unit, used to analyze the content to be evaluated that needs to be adjusted based on the evaluation result; and a third generation unit, used to generate a first optimization plan based on the content that needs to be adjusted.
在一个可选的实施方式中,调整模块包括:第五提取单元,用于基于预设的行业数据库从规范文档中提取与辅助评价规则对应的评价参数及评价权重;调整单元,用于基于评价参数及评价权重调整评价规则。In an optional embodiment, the adjustment module includes: a fifth extraction unit, used to extract evaluation parameters and evaluation weights corresponding to the auxiliary evaluation rules from the specification document based on a preset industry database; and an adjustment unit, used to adjust the evaluation rules based on the evaluation parameters and evaluation weights.
在一个可选的实施方式中,评测生成模块包括:第六提取单元,用于从第一行业文档中提取评价参数对应的待评测内容;评价单元,用于基于调整后的评价规则,对待评测内容进行评价,生成评价结果;第二分析单元,用于基于评价结果分析待评测内容中需要进行调整的内容;第四生成单元,用于基于需要进行调整的内容生成第二优化方案。In an optional embodiment, the evaluation generation module includes: a sixth extraction unit, used to extract the content to be evaluated corresponding to the evaluation parameters from the first industry document; an evaluation unit, used to evaluate the content to be evaluated based on the adjusted evaluation rules and generate an evaluation result; a second analysis unit, used to analyze the content to be evaluated that needs to be adjusted based on the evaluation result; and a fourth generation unit, used to generate a second optimization plan based on the content that needs to be adjusted.
上述各个模块和单元的更进一步的功能描述与上述对应实施例相同,在此不再赘述。The further functional description of each of the above modules and units is the same as that of the above corresponding embodiments and will not be repeated here.
本实施例中的行业文档的生成装置是以功能单元的形式来呈现,这里的单元是指ASIC(Application Specific Integrated Circuit,专用集成电路)电路,执行一个或多个软件或固定程序的处理器和存储器,和/或其他可以提供上述功能的器件。The industry document generation device in this embodiment is presented in the form of a functional unit, where the unit refers to an ASIC (Application Specific Integrated Circuit) circuit, a processor and memory that executes one or more software or fixed programs, and/or other devices that can provide the above functions.
本发明实施例还提供一种计算机设备,具有上述图6所示的行业文档的生成装置。An embodiment of the present invention further provides a computer device having the device for generating the industry document shown in FIG. 6 above.
请参阅图7,图7是本发明可选实施例提供的一种计算机设备的结构示意图,如图7所示,该计算机设备包括:一个或多个处理器10、存储器20,以及用于连接各部件的接口,包括高速接口和低速接口。各个部件利用不同的总线互相通信连接,并且可以被安装在公共主板上或者根据需要以其它方式安装。处理器可以对在计算机设备内执行的指令进行处理,包括存储在存储器中或者存储器上以在外部输入/输出装置(诸如,耦合至接口的显示设备)上显示GUI的图形信息的指令。在一些可选的实施方式中,若需要,可以将多个处理器和/或多条总线与多个存储器和多个存储器一起使用。同样,可以连接多个计算机设备,各个设备提供部分必要的操作(例如,作为服务器阵列、一组刀片式服务器、或者多处理器系统)。图7中以一个处理器10为例。Please refer to FIG. 7, which is a schematic diagram of the structure of a computer device provided by an optional embodiment of the present invention. As shown in FIG. 7, the computer device includes: one or more processors 10, a memory 20, and interfaces for connecting various components, including high-speed interfaces and low-speed interfaces. The various components are connected to each other using different buses for communication, and can be installed on a common motherboard or installed in other ways as needed. The processor can process instructions executed in the computer device, including instructions stored in or on the memory to display graphical information of the GUI on an external input/output device (such as a display device coupled to the interface). In some optional embodiments, if necessary, multiple processors and/or multiple buses can be used together with multiple memories and multiple memories. Similarly, multiple computer devices can be connected, and each device provides some necessary operations (for example, as a server array, a group of blade servers, or a multi-processor system). In FIG. 7, a processor 10 is taken as an example.
处理器10可以是中央处理器,网络处理器或其组合。其中,处理器10还可以进一步包括硬件芯片。上述硬件芯片可以是专用集成电路,可编程逻辑器件或其组合。上述可编程逻辑器件可以是复杂可编程逻辑器件,现场可编程逻辑门阵列,通用阵列逻辑或其任意组合。The processor 10 may be a central processing unit, a network processor or a combination thereof. The processor 10 may further include a hardware chip. The hardware chip may be a dedicated integrated circuit, a programmable logic device or a combination thereof. The programmable logic device may be a complex programmable logic device, a field programmable gate array, a general purpose array logic or any combination thereof.
其中,存储器20存储有可由至少一个处理器10执行的指令,以使至少一个处理器10执行实现上述实施例示出的方法。The memory 20 stores instructions executable by at least one processor 10, so that at least one processor 10 executes the method shown in the above embodiment.
存储器20可以包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需要的应用程序;存储数据区可存储根据计算机设备的使用所创建的数据等。此外,存储器20可以包括高速随机存取存储器,还可以包括非瞬时存储器,例如至少一个磁盘存储器件、闪存器件、或其他非瞬时固态存储器件。在一些可选的实施方式中,存储器20可选包括相对于处理器10远程设置的存储器,这些远程存储器可以通过网络连接至该计算机设备。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 20 may include a program storage area and a data storage area, wherein the program storage area may store an operating system, an application required by at least one function; the data storage area may store data created according to the use of the computer device, etc. In addition, the memory 20 may include a high-speed random access memory, and may also include a non-transient memory, such as at least one disk storage device, a flash memory device, or other non-transient solid-state storage device. In some optional embodiments, the memory 20 may optionally include a memory remotely arranged relative to the processor 10, and these remote memories may be connected to the computer device via a network. Examples of the above-mentioned network include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.
存储器20可以包括易失性存储器,例如,随机存取存储器;存储器也可以包括非易失性存储器,例如,快闪存储器,硬盘或固态硬盘;存储器20还可以包括上述种类的存储器的组合。The memory 20 may include a volatile memory, such as a random access memory; the memory may also include a non-volatile memory, such as a flash memory, a hard disk or a solid state drive; the memory 20 may also include a combination of the above types of memory.
该计算机设备还包括通信接口30,用于该计算机设备与其他设备或通信网络通信。The computer device further comprises a communication interface 30 for the computer device to communicate with other devices or a communication network.
本发明实施例还提供了一种计算机可读存储介质,上述根据本发明实施例的方法可在硬件、固件中实现,或者被实现为可记录在存储介质,或者被实现通过网络下载的原始存储在远程存储介质或非暂时机器可读存储介质中并将被存储在本地存储介质中的计算机代码,从而在此描述的方法可被存储在使用通用计算机、专用处理器或者可编程或专用硬件的存储介质上的这样的软件处理。其中,存储介质可为磁碟、光盘、只读存储记忆体、随机存储记忆体、快闪存储器、硬盘或固态硬盘等;进一步地,存储介质还可以包括上述种类的存储器的组合。可以理解,计算机、处理器、微处理器控制器或可编程硬件包括可存储或接收软件或计算机代码的存储组件,当软件或计算机代码被计算机、处理器或硬件访问且执行时,实现上述实施例示出的方法。The embodiment of the present invention also provides a computer-readable storage medium. The method according to the embodiment of the present invention can be implemented in hardware, firmware, or can be implemented as a computer code that can be recorded in a storage medium, or can be implemented as a computer code that is originally stored in a remote storage medium or a non-temporary machine-readable storage medium and will be stored in a local storage medium through a network download, so that the method described herein can be stored in such software processing on a storage medium using a general-purpose computer, a dedicated processor, or programmable or dedicated hardware. Among them, the storage medium can be a magnetic disk, an optical disk, a read-only storage memory, a random access memory, a flash memory, a hard disk or a solid-state hard disk, etc.; further, the storage medium can also include a combination of the above types of memories. It can be understood that a computer, a processor, a microprocessor controller, or programmable hardware includes a storage component that can store or receive software or computer code. When the software or computer code is accessed and executed by a computer, a processor, or hardware, the method shown in the above embodiment is implemented.
虽然结合附图描述了本发明的实施例,但是本领域技术人员可以在不脱离本发明的精神和范围的情况下做出各种修改和变型,这样的修改和变型均落入由所附权利要求所限定的范围之内。Although the embodiments of the present invention have been described in conjunction with the accompanying drawings, those skilled in the art may make various modifications and variations without departing from the spirit and scope of the present invention, and such modifications and variations are all within the scope defined by the appended claims.
Claims (8)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202311506829.4A CN117494672B (en) | 2023-11-13 | 2023-11-13 | Method, device and computer-readable storage medium for generating industry documents |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202311506829.4A CN117494672B (en) | 2023-11-13 | 2023-11-13 | Method, device and computer-readable storage medium for generating industry documents |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN117494672A CN117494672A (en) | 2024-02-02 |
| CN117494672B true CN117494672B (en) | 2024-07-23 |
Family
ID=89670464
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202311506829.4A Active CN117494672B (en) | 2023-11-13 | 2023-11-13 | Method, device and computer-readable storage medium for generating industry documents |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN117494672B (en) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN118350374A (en) * | 2024-04-17 | 2024-07-16 | 中国邮政储蓄银行股份有限公司 | Long document evaluation and generation method and device and electronic equipment |
| CN119849446A (en) * | 2024-12-23 | 2025-04-18 | 中国工商银行股份有限公司 | Design document generation method, device, medium and electronic equipment |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116402022A (en) * | 2023-03-02 | 2023-07-07 | 中银金融科技有限公司 | Document generation method, device, electronic equipment and storage medium |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103999081A (en) * | 2011-12-12 | 2014-08-20 | 国际商业机器公司 | Generation of natural language processing model for information domain |
| CN110795923B (en) * | 2019-11-01 | 2024-03-22 | 达观数据有限公司 | Automatic generation system and generation method of technical documents based on natural language processing |
| KR20210086849A (en) * | 2019-12-31 | 2021-07-09 | 주식회사 리걸인사이트 | Method for generating document |
| CN115238710B (en) * | 2022-09-22 | 2022-11-22 | 北京弘玑信息技术有限公司 | Intelligent document generation and management method and device |
| CN115688868B (en) * | 2022-12-30 | 2023-10-20 | 荣耀终端有限公司 | Model training method and computing equipment |
| CN116469505A (en) * | 2023-04-18 | 2023-07-21 | 平安科技(深圳)有限公司 | Data processing method, device, computer equipment and readable storage medium |
| CN117008960A (en) * | 2023-07-03 | 2023-11-07 | 深圳派氪司科技有限公司 | API document generation method, device, storage medium and chip |
-
2023
- 2023-11-13 CN CN202311506829.4A patent/CN117494672B/en active Active
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116402022A (en) * | 2023-03-02 | 2023-07-07 | 中银金融科技有限公司 | Document generation method, device, electronic equipment and storage medium |
Also Published As
| Publication number | Publication date |
|---|---|
| CN117494672A (en) | 2024-02-02 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US8972408B1 (en) | Methods, systems, and articles of manufacture for addressing popular topics in a social sphere | |
| CN117494672B (en) | Method, device and computer-readable storage medium for generating industry documents | |
| US20220165257A1 (en) | Neural sentence generator for virtual assistants | |
| US20220414463A1 (en) | Automated troubleshooter | |
| CN108875059B (en) | Method and device for generating document tag, electronic equipment and storage medium | |
| CN111339284A (en) | Product intelligent matching method, device, device and readable storage medium | |
| CN112860893B (en) | Short text classification method and terminal equipment | |
| CN114971730A (en) | Method for extracting file material, device, equipment, medium and product thereof | |
| CN111723207A (en) | Intent recognition method and system | |
| CN110110083A (en) | A kind of sensibility classification method of text, device, equipment and storage medium | |
| CN118313348A (en) | Document format typesetting method, device, computer equipment, storage medium and product | |
| CN119441443A (en) | Automatic prompt construction method based on human-computer dialogue history and semantic retrieval | |
| CN118981526B (en) | Multi-mode zero-code form modeling intelligent question-answering method and related equipment thereof | |
| CN118133946B (en) | Multi-modal knowledge hierarchical recognition and controlled alignment method | |
| CN114580346A (en) | Information generation method and device combining RPA and AI, electronic equipment and storage medium | |
| CN118051621A (en) | A marketing speech generation method and system for automated enterprise publicity | |
| JP2025515551A (en) | Trademark risk management system and method | |
| Raj | Natural language processing for chatbots | |
| CN117688123A (en) | Method and device for generating document structure tree | |
| KR20240023755A (en) | Method, device and computer-readable recording medium for generating document content based on information aggregation for market matching of small business owners | |
| TWI877993B (en) | A system of trademark risk management and the method thereof | |
| TWI880593B (en) | A system of intelligent patent type recommendation and draft generation and the method thereof | |
| TWI861940B (en) | A system of automatically generating patent files by semantic analysis and the execution method thereof | |
| CN119167946B (en) | Method and terminal for embedding exclusive AI interpretation for shared documents | |
| TW202531109A (en) | A system of trademark risk management and the method thereof |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |