[go: up one dir, main page]

WO2016006002A2 - Procédé et système de gestion de processus - Google Patents

Procédé et système de gestion de processus Download PDF

Info

Publication number
WO2016006002A2
WO2016006002A2 PCT/IN2015/050063 IN2015050063W WO2016006002A2 WO 2016006002 A2 WO2016006002 A2 WO 2016006002A2 IN 2015050063 W IN2015050063 W IN 2015050063W WO 2016006002 A2 WO2016006002 A2 WO 2016006002A2
Authority
WO
WIPO (PCT)
Prior art keywords
data
document
electronic document
eat
eats
Prior art date
Application number
PCT/IN2015/050063
Other languages
English (en)
Other versions
WO2016006002A3 (fr
Inventor
Srikant KRISHNAN
S. Narayanan
Srikrishnan P
Sakshee SHARMA
Original Assignee
Dmacq Software Pvt. Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dmacq Software Pvt. Ltd filed Critical Dmacq Software Pvt. Ltd
Publication of WO2016006002A2 publication Critical patent/WO2016006002A2/fr
Publication of WO2016006002A3 publication Critical patent/WO2016006002A3/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling

Definitions

  • the present invention relates to a method and system for workflow management, more particularly the invention relates to a content data platform with user interactive features to enhance the user's ability to directly capture information into a structured form by extracting data from unstructured documents.
  • a method of extracting and arranging data in structured format from an electronic document is performed by selecting a category of the document, selecting a data content in the document, defining associated field of the data content in the document, and storing the data content in a Database system (505).
  • a method of capturing data content from an electronic document is performed by selecting an Entity Associated Tags (EAT), thus allowing a user to extract the document directly into an appropriate document processing system such as MIS (management Information System) process integrated with the system for further processing using associated EATs.
  • EAT Entity Associated Tags
  • a system for document management includes a device to obtain a document in electronic format and transmit the scanned data to a processing module comprising a memory, and a processor (502) coupled to the memory, configured to carry out the steps of: selecting a category of the document; selecting the data content in the document; defining associated field of the data content in the document; storing the data content in a Database system (505); and making available to the integrated data processing system such as MIS (Management Information System).
  • MIS Management Information System
  • Fig. 1 illustrates a flow diagram for Updating Entities in the Master data base.
  • Fig. 2 illustrates a flow diagram for Creating EAT's.
  • Fig. 3 illustrates a flow diagram for Using EAT's.
  • Fig. 4 illustrates a flow diagram for Updating Entity's Property Values Using EAT's.
  • Fig. 5 illustrates a system diagram
  • the invention described herein is directed to perform effective workflow management through content data platform with user interactive features to enhance the user's ability to directly capture information in to a structured form by extracting data and information from unstructured documents.
  • the embodiments herein provide a method and system for workflow management through content data platform with user interactive features to enhance the user's ability to directly capture information in to a structured form by extracting data and information from unstructured documents. Further the embodiments may be easily implemented in various data and information management structures.
  • the method of the invention may also be implemented as application performed by a stand alone or embedded system.
  • references in the specification to "one embodiment” or “an embodiment” means that a particular feature, structure, characteristic, or function described in connection with the embodiment is included in at least one embodiment of the invention.
  • the appearances of the phrase “in one embodiment” in various places in the specification are not necessarily all referring to the same embodiment.
  • the method and system for workflow management through content data platform with user interactive features to enhance the user's ability to directly capture data in to a structured form by extracting data from unstructured documents is in accordance to the user and /or service provider devices compatible format and to benefit across multiple platforms.
  • the present invention may minimize manual intervention and automate text extraction from unstructured documents with the ability to directly capture data into a structured form.
  • the system and the method thereof of will enhance the user's ability to manage and process data for workflow management and execution may be by inverse tagging.
  • the inverse tagging for workflow automation may minimize manual intervention to may be to a great extent and may automate document processing.
  • the system comprise of a plurality of Interactive User Interface, at least one Master Database (5051 ), at least one Tags Database (5052), at least one Document Database (5053), a plurality of Electronic document generation and extraction module (506) wherein at least one Electronic document generation and extraction module (506) comprise of at least one Optical scanner module (5061 ), a Dynamic memory module (5062), at least one coder processor module (504), and at least one processor unit(501 ) wherein the processor unit(501 ) comprise of at least one RISC type processor, configuration register memory module (503).
  • the components of the system are configured to perform the associative functionality enabling the system to execute the overall functionality, the system is configured for.
  • the system and method thereof may automatically identify a plurality of entity associated tags (EAT's) in a plurality of given document.
  • the plurality of EAT's may be constituted of a plurality of tags associated to a plurality of entity in at least one Database (505) - for example, if Customer is an entity in the Database (505), then the customer name, say, "XYZ" is a tag, and both the tag and its associated entity are called an EAT.
  • the at least one entity customer may consist of a plurality of properties such as Name, Id, Address, Agreement Number and others.
  • the plurality of entities may have a plurality of parametric variables associated with it.
  • the plurality of parametric variables associated with at least one entity may receive or may update their values through may be the linking of the plurality of EAT's to the respective plurality of parametric variable occurrence in the document. This may significantly improve the efficiency of data input since it is much faster to allocate values for different properties of an entity than it is to enter these values manually.
  • the method and the process may be described through Figures 1 - 4.
  • Figure 1 describes the process of normal usage of the main application, wherein during the normal usage of the main application, the user creates entities that are stored in the Master Database (5051 ). These entities include all business-related entities such as Customers, Invoices, Orders, and the like. Each of these entities has several properties associated with it - for example, the properties typically associated with the Customer entity are Name, Id, Address, Agreement Number and others. During normal operation of the main application several Customer entities are created each with possibly different values for the properties.
  • Entity Associated Tags may be created may be by the user using at least one user-interface for this purpose.
  • An EAT may consist of at least one tuple (tag, entity) where tag may be the handle for the EAT and the entity may be the corresponding entity in the Master Database (5051 ). All EAT's may be stored in the Tags Database (5052).
  • the present invention deploys a plurality of systems and methods thereof for EAT's identification in electronic document.
  • One of the system and the method thereof to identify EAT's in a given electronic document may make use of Optical Character Recognition (OCR) technology.
  • OCR may identify all words and their locations in the document. These words may be stored in the at least one local memory and then may be checked with the Tags Database (5052). Any matches with the Tags Database (5052) may cause that location to be highlighted.
  • the user may right-click the highlighted location wherein upon right-clicking the highlighted location the corresponding word from the local memory may be retrieved and may be stored in the selected property in the Master Database (5051 ).
  • the system and the method thereof to implement and execute the process for creation of Entity Associated Tags (EAT's), manipulate stored EAT's for addition, deletion, and update, automatically identify all EAT's in a given electronic document, directly update all properties of a plurality of entities associated with the EAT from the document without need for manual updating, automatically associate a plurality of given document with a plurality of respective tags for later retrieval.
  • EAT's Entity Associated Tags
  • the method comprising receiving, for each electronic document of multiple electronic documents from a plurality of document generation and extraction module a plurality of first data indicative of a content pattern of the electronic document.
  • the first data comprises a content pattern associated with the electronic document, wherein the content pattern data indicative of at least one type of the electronic document, a plurality of second data indicative of all words and their location in the document, and a plurality of third data indicative of a plurality of parametric variables associated with the electronic document;
  • the method comprising storing, first data indicative of a content pattern of the electronic document of multiple electronic documents, storing second data indicative of all words and their location in the electronic document of multiple electronic documents, and storing third data indicative of a plurality of parametric variables associated with the electronic document of multiple electronic documents.
  • a request for the for determining and recommending EATs for one or more of the multiple electronic documents is generated, wherein the request comprises the first data, second data and third data for each of the one or more of the multiple electronic
  • the determining and recommending potential Entity Associated Tags (EAT) matches is based at least in part on the first data, the first data i.e. a type of the electronic document. For each electronic document of the multiple electronic documents, determining and recommending potential Entity Associated Tags (EAT) matches is based on at least in part on the second data.
  • the second data content is available for identification and selection by the user through the user interface module (507).
  • the process allows the user to create and update EAT and update all properties of a plurality of entities associated with the at least one created EAT. The user is allowed to identify and select the potential entity associated tags.
  • a plurality of current values of a plurality of properties associated with the selected / determined EAT of the electronic document are provided through display and popup upon identification and selection of the potential entity associated tags by the user.
  • the processor (502) further determines an entity category associated with the data constituting the at least one electronic document, wherein the entity includes all business related entities; and a tag is determined, at least in part, based on the determined entity category associated with the data of the at least one electronic document.
  • the processor (502) determines the EATs of the electronic document based at least in part on the first data and the third data.
  • the processor (502) of a processing unit (501 ) is configured to perform the determination and recommendation of EATs.
  • the configuration register of the processing unit (501 ) provides a reduced instruction hardware configuration set to the processor (502).
  • the configuration register module (503) receives parametric variable data through user interface module (507) from the user and generates reduced instruction hardware configuration set.
  • the processor (502) of the processing unit (501 ) also integrates the document processing systems such as but not limited to MIS and provide the EATs and relative document in electronic format for further processing to the document processing systems such as but not limited to MIS.
  • the method determine, for each of multiple words available through the content of the at least one electronic document, a degree of correspondence between an EAT with the words in the at least one electronic document and their determined locations in the at least one electronic document and rank the multiple EATs based on the determined degrees of correspondence. Further to the ranking the process performs the filtering the ranked multiple EATs to remove any multiple occurrence and least or null degrees of correspondence EATs included in the plurality of EATs determined based on the second data of the at least one electronic document. The highest ranked of the filtered EATs are identified as the determined and recommended EATs.
  • the data indicative of the at least one recommended EAT determined for at least one of the multiple electronic documents is forwarded to the Database (505) for storing the electronic document and its associated entity data and tags data into respective databases.
  • the data is received in response to sending request.
  • the received data is indicative of determined and recommended EATs for each of the one or more of the multiple electronic documents.
  • the determined and recommended EATs for each of the one or more of the multiple electronic documents are available for identification and selection by the user through the user interface module (507).
  • a plurality of current values of a plurality of properties associated with the selected / determined EAT of the electronic document are provided through display and popup upon identification and selection of the potential entity associated tags by the user.
  • the data indicative of the at least one recommended EAT determined for at least one of the multiple electronic documents is forwarded to the Database (505) for storing the electronic document and its associated entity data and tags data into respective databases.
  • the method determines and recommend at least one EATs, to each of the one or more of the multiple electronic documents, and allow the user to extract the electronic document directly in to the appropriate document processing system such as MIS process integrated with the system for further processing using associated EATs.
  • MIS process integrated with the system for further processing using associated EATs.
  • the system comprise of a plurality of Interactive User Interface to facilitate user to provide interactive inputs as well as provide control signals, at least one Master Database (5051 ) to store the entity data, at least one Tags Database (5052) to store the tag data, at least one Document Database (5053) to store the received electronic document as well as the parsed and processed document, a plurality of Electronic document generation and extraction module (506) wherein the at least one Electronic document generation and extraction module (506) comprise of at least one Optical scanner module (5061 ) to optically scan the hard copy of the document to receive a input document in electronic form, a Dynamic memory module (5062) to store the electronic document dynamically, at least one coder processor module (504) to receive document in various electronic format, and at least one processor unit(501 ) to control functionality of the individual modules and execute over all functionality constituting the method to obtain information in to a structured form by extracting data and information from unstructured documents wherein the processor unit(501 ) comprise of at least
  • the present invention may overcome the challenges of the current scenario through the described system and method for workflow management through content data platform with user interactive features to enhance the user's ability to directly capture information in to a structured form by extracting data and information from unstructured documents.

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Educational Administration (AREA)
  • Game Theory and Decision Science (AREA)
  • Development Economics (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Document Processing Apparatus (AREA)

Abstract

La présente invention porte sur un procédé et un système destinés à la gestion de processus au jour le jour. Plus particulièrement, l'invention concerne la capture aisée d'informations sous une forme structurée par extraction de données provenant de documents non structurés. Le système et le procédé associé ci-décrits permettent à l'utilisateur, grâce à une plateforme de données de contenu ayant des fonctions interactives destinées à l'utilisateur, de réaliser plus facilement une capture directe de données sous une forme structurée par extraction de données dans des documents non structurés en fonction du format compatible du dispositif utilisateur et/ou du dispositif du fournisseur de services, et ce sur plusieurs plateformes, ce qui permet d'automatiser les processus ainsi que le traitement de documents, et de réduire au minimum l'intervention manuelle.
PCT/IN2015/050063 2014-07-09 2015-07-09 Procédé et système de gestion de processus WO2016006002A2 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IN2242/MUM/2014 2014-07-09
IN2242MU2014 2014-07-09

Publications (2)

Publication Number Publication Date
WO2016006002A2 true WO2016006002A2 (fr) 2016-01-14
WO2016006002A3 WO2016006002A3 (fr) 2016-03-03

Family

ID=55065058

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IN2015/050063 WO2016006002A2 (fr) 2014-07-09 2015-07-09 Procédé et système de gestion de processus

Country Status (1)

Country Link
WO (1) WO2016006002A2 (fr)

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5664109A (en) * 1995-06-07 1997-09-02 E-Systems, Inc. Method for extracting pre-defined data items from medical service records generated by health care providers
US7444325B2 (en) * 2005-01-14 2008-10-28 Im2, Inc. Method and system for information extraction
US9171048B2 (en) * 2012-12-03 2015-10-27 Wellclub, Llc Goal-based content selection and delivery

Also Published As

Publication number Publication date
WO2016006002A3 (fr) 2016-03-03

Similar Documents

Publication Publication Date Title
US10366123B1 (en) Template-free extraction of data from documents
US9213893B2 (en) Extracting data from semi-structured electronic documents
US12306861B2 (en) Visual presentation of search results
US8140534B2 (en) System and method for sorting attachments in an integrated information management application
US6496838B1 (en) Database reconciliation method and system
US20070244921A1 (en) Method, apparatus and computer-readable medium to provide customized classification of documents in a file management system
US20140108397A1 (en) Computer-Implemented Document Manager Application Enabler System and Method
JP2023032063A (ja) 情報処理装置およびプログラム
US8862609B2 (en) Expanding high level queries
JP2015184723A (ja) 文書作成支援システム
US20220327162A1 (en) Information search system
JP6529254B2 (ja) 情報処理装置、情報処理方法、プログラムおよび記憶媒体
US20190043020A1 (en) Generating and enhancing meeting-related objects based on image data
EP3407210A1 (fr) Appareil et procédé permettant de générer une interrogation de motif à événements multiples
US11366964B2 (en) Visualization of the entities and relations in a document
CN110134920A (zh) 绘文字兼容显示方法、装置、终端及计算机可读存储介质
Ku et al. Service recommendation system for big data analysis
US20200388076A1 (en) Method and system for generating augmented reality interactive content
WO2016006002A2 (fr) Procédé et système de gestion de processus
US20160224918A1 (en) Business influenced part extraction method and business influenced part extraction device based on business variation
US9582782B2 (en) Discovering a reporting model from an existing reporting environment
EP4165564A1 (fr) Procédés et systèmes de mise en correspondance et d'optimisation de solutions de technologie avec des produits d'entreprise demandés
WO2014061303A1 (fr) Dispositif et programme de traitement d'informations
JP2007334670A (ja) 画像処理装置、方法及びプログラム
KR101809362B1 (ko) Ocr 시스템을 이용한 거래정보 관리 시스템과 이를 이용한 전산 거래정보 관리방법

Legal Events

Date Code Title Description
NENP Non-entry into the national phase in:

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15818965

Country of ref document: EP

Kind code of ref document: A2