[go: up one dir, main page]

WO1999059087A1 - Procede permettant d'ameliorer l'execution d'un code faisant appel a un langage html analyse et appareil correspondant - Google Patents

Procede permettant d'ameliorer l'execution d'un code faisant appel a un langage html analyse et appareil correspondant Download PDF

Info

Publication number
WO1999059087A1
WO1999059087A1 PCT/US1999/006676 US9906676W WO9959087A1 WO 1999059087 A1 WO1999059087 A1 WO 1999059087A1 US 9906676 W US9906676 W US 9906676W WO 9959087 A1 WO9959087 A1 WO 9959087A1
Authority
WO
WIPO (PCT)
Prior art keywords
html
records
static
dynamic
record
Prior art date
Application number
PCT/US1999/006676
Other languages
English (en)
Inventor
Michael Giroux
Original Assignee
Bull Hn Information Systems Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Bull Hn Information Systems Inc. filed Critical Bull Hn Information Systems Inc.
Priority to EP99916169A priority Critical patent/EP1125221A4/fr
Publication of WO1999059087A1 publication Critical patent/WO1999059087A1/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Definitions

  • a World Wide Web (WWW) site typically consists of a collection of HyperText Markup Language (HTML) documents.
  • HTML is a text language that provides for hyper-linked graphic display.
  • a user of a Web browser utilizing the World Wide Web (WWW) typically requests that a Web server download HTML text to his Web browser.
  • the Web browser interprets the downloaded HTML text and generates screen images from the HTML text.
  • the HTML text invariably describes hyper-linked hot spots, that cause further downloads when selected.
  • FIG. 2 is a flowchart illustrating construction of parsed HTML file by a parser interpreting an original HTML file, as shown in FIG. 1 ;
  • FIG. 1 is a block diagram of a data processing system, in accordance with the present invention.
  • Original HTML text 40 stored on disk is parsed by a parser 42 into parsed HTML (PHTML) text 44, again stored in a Non- Volatile Storage Medium.
  • the parser sequentially interprets the original HTML text 40. Sequences of static HTML commands are collected into static HTML records, and sequences of active or dynamic HTML commands are collected into dynamic HTML records. These variable length static and dynamic HTML records are then written to the parsed HTML (PHTML) text 44 file stored in a Non-Volatile Storage Medium.
  • variable length record formats Many computer architectures directly support variable length records. In these architectures, it would most often be preferable to utilize a standard variable length record format.
  • SDF System Data Format
  • VAX systems by Digital Equipment Corporation support a similar variable length record format.
  • Count key data records provide a variable record length format on International Business Machines.
  • One alternate embodiment is to store the parsed HTML (PHTML) 44 as variable length records in a database. Instead of interpreting the first couple of bytes in each record to determine whether it contains static or dynamic HTML, a flag can be included in each database record for this purpose.
  • the original HTML file 40 is opened (not shown) and a loop is entered.
  • the next HTML command is read, step 52.
  • a test is then made whether the HTML command just read is of the same type, static or dynamic, as the HTML commands being accumulated, step 54. If the type of the HTML command just read, step 52, is different from the type of HTML commands being accumulated, a record containing the HTML commands being accumulated is completed and written out to the parsed HTML file 44, step 56. A new record is then started to accumulate the type of HTML commands just read, step 58. In any case, the HTML command just read, step 52, is then added to the record under construction, step 60. A test is then made whether there is any more input, step 62.
  • FIG. 3 is a flowchart illustrating operation of a Web server 46 interpreting parsed HTML 44, as shown in FIG. 1.
  • the Web server 46 opens the parsed HTML file 44 and enters a loop. Within the loop, the next record descriptor is read, step 72. A test is then made whether there are more records in the file, step 74. This is typically a test for end-of-file. Most modern computer architectures automalically provide such an indication. If the end-of-file has not been detected, step 76, a test is made whether the record contains dynamic HTML, step 80. In the preferred embodiment, interpreting the start of the first HTML command in the next record does this. However, other types of record formats would require correspondingly different types of testing.
  • step 82 the HTML in the record is processed, step 82. This typically results in database accesses in response to the HTML commands, with the results of the database queries being transmitted to the Web browser on the client computer 48. Otherwise, when static HTML is being processed, the entire record is transmitted to the Web browser executing on the client computer 48 without interpretation by the server computer 46, step 84. In either case, the loop is then repeated, starting with reading the next record descriptor, step 72. When no more records remain to be processed, step 74, end-of-file processing is performed, step 86. A record length of zero may be transmitted at this point to indicate end-of-file to the Web browser on the client system 48.
  • FIG. 4 is a sample sequences of original HTML 40.
  • FIG. 5 is a sample sequence of parsed HTML (PHTML) 44 generated from the sample HTML 40 in FIG. 4.
  • the first section of the original HTML 40 contains static HTML commands.
  • a record length of "0288" bytes is inserted before these static HTML commands. This is followed by an active or dynamic set of HTML commands. These are preceded by a length of "0059" in the parsed HTML 44.
  • a second set of static HTML follows, proceeded by a length of "0281" in the parsed HTML 44.
  • the parsed HTML 44 is terminated by a record length of "0000".
  • FIG. 6 is a block diagram illustrating a General Purpose Computer 20.
  • the General Purpose Computer 20 has a Computer Processor 22, and Memory 24, connected by a Bus 26.
  • Memory 24 is a relatively high speed machine readable medium and includes Volatile Memories such as DRAM, and SRAM, and Non-Volatile Memories such as, ROM, FLASH, EPROM, EEPROM, and bubble memory.
  • Secondary Storage 30, F-xternal Storage 32 Also connected to the Bus are Secondary Storage 30, F-xternal Storage 32, output devices such as a monitor 34, input devices such as a keyboard (with mouse) 36, and printers 38.
  • Secondary Storage 30 includes machine-readable media such as hard disk drives, magnetic drum, and bubble memory.
  • External Storage 32 includes machine-readable media such as floppy disks, removable hard drives, magnetic tape, CD-ROM, and even other computers, possibly connected via a communications line 28.
  • Computer software such as the parser 42 and user programs can be stored in a Computer Software Storage Medium, such as memory 24, Secondary Storage 30, and External Storage 32.
  • Executable versions of computer software 33 can be read from a Non- Volatile Storage Medium such as External Storage 32, Secondary Storage 30, and Non- Volatile Memory and loaded for execution directly into Volatile Memory, executed directly out of Non-Volatile Memory, or stored on the Secondary Storage 30 prior to loading into Volatile Memory for execution.
  • a Non- Volatile Storage Medium such as External Storage 32, Secondary Storage 30, and Non- Volatile Memory

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Un programme d'interprétation (42) commence par analyser des scripts de langage HTML source (40) correspondant à des pages Web actives ou dynamiques, les convertissant en séquences statiques et dynamiques. Ces séquences statiques et dynamiques de HTML sont écrites sous forme d'enregistrements de longueur variable dans un fichier HTML analysé (44), chaque enregistrement étant entièrement constitué de commandes HTML statiques ou dynamiques. Un serveur de base de données (46), en réponse à une demande émanant de l'explorateur Web d'un ordinateur client (48), lit le HTML analysé, transmettant (47) sans interprétation des enregistrements HTML statiques audit explorateur (48) tout en interprétant les enregistrements HTML dynamiques et en exécutant des actions demandées de base de données (45) avant de transmettre (47) les résultats à l'explorateur.
PCT/US1999/006676 1998-05-13 1999-03-26 Procede permettant d'ameliorer l'execution d'un code faisant appel a un langage html analyse et appareil correspondant WO1999059087A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP99916169A EP1125221A4 (fr) 1998-05-13 1999-03-26 Procede permettant d'ameliorer l'execution d'un code faisant appel a un langage html analyse et appareil correspondant

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US7808498A 1998-05-13 1998-05-13
US09/078,084 1998-05-13

Publications (1)

Publication Number Publication Date
WO1999059087A1 true WO1999059087A1 (fr) 1999-11-18

Family

ID=22141815

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1999/006676 WO1999059087A1 (fr) 1998-05-13 1999-03-26 Procede permettant d'ameliorer l'execution d'un code faisant appel a un langage html analyse et appareil correspondant

Country Status (2)

Country Link
EP (1) EP1125221A4 (fr)
WO (1) WO1999059087A1 (fr)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2364406A (en) * 1999-11-01 2002-01-23 Sherwood Internat Group Ltd Multi-user access of a database over a network
EP1180739A1 (fr) * 2000-08-16 2002-02-20 Gavoa Ltd Remise des contenus des pages
EP1288805A1 (fr) * 2001-08-24 2003-03-05 Accenture Global Services GmbH Estimation des risques du commerce électronique
GB2381340A (en) * 2001-10-27 2003-04-30 Hewlett Packard Co Document generation in a distributed information network
WO2002063504A3 (fr) * 2000-11-02 2004-04-08 Citrix Systems Inc Procedes et appareil permettant d'augmenter le code de generation de page pour effectuer la regeneration partielle de page
US7051084B1 (en) 2000-11-02 2006-05-23 Citrix Systems, Inc. Methods and apparatus for regenerating and transmitting a partial page
US7136880B2 (en) * 2000-07-20 2006-11-14 Market Models, Inc. Method and apparatus for compiling business data
US7194743B2 (en) 2000-12-12 2007-03-20 Citrix Systems, Inc. Methods and apparatus for communicating changes between a user interface and an executing application using property paths
US7346842B1 (en) 2000-11-02 2008-03-18 Citrix Systems, Inc. Methods and apparatus for incorporating a partial page on a client
US7596593B2 (en) 1999-09-07 2009-09-29 Citrix Systems, Llc Methods and apparatus for efficiently transmitting interactive application data between a client and server using markup language
US7908269B2 (en) * 2001-04-04 2011-03-15 At&T Intellectual Property I, L.P. Method, system, and software for transmission of information
US8527495B2 (en) * 2002-02-19 2013-09-03 International Business Machines Corporation Plug-in parsers for configuring search engine crawler

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104750866B (zh) * 2015-04-17 2019-06-14 上海斐讯数据通信技术有限公司 基于嵌入式web服务器动态生成菜单栏的方法及系统

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5844392A (en) * 1992-12-02 1998-12-01 Cybernet Systems Corporation Haptic browsing

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR930004950B1 (ko) * 1989-05-15 1993-06-10 인터내셔날 비지네스 머신즈 코포레이션 컴퓨터 엔터티간의 전송을 위한 복합 데이타 구조 작성방법 및 그 장치
WO1997034240A1 (fr) * 1996-03-15 1997-09-18 University Of Massachusetts Arbre compact pour le stockage et l'extraction de documents structures hypermedia
US5715453A (en) * 1996-05-31 1998-02-03 International Business Machines Corporation Web server mechanism for processing function calls for dynamic data queries in a web page

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5844392A (en) * 1992-12-02 1998-12-01 Cybernet Systems Corporation Haptic browsing

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
DOUGLIS ET AL: "HPP: HTML macro pre-processing to support dynamic document caching", PROCEEDINGS OF THE USENIX SYMPOSIUM ON INTERNET TECHNOLOGIES AND SYSTEMS, December 1997 (1997-12-01), pages 83 - 94, XP002921595 *
See also references of EP1125221A4 *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7596593B2 (en) 1999-09-07 2009-09-29 Citrix Systems, Llc Methods and apparatus for efficiently transmitting interactive application data between a client and server using markup language
WO2001033388A3 (fr) * 1999-11-01 2004-05-06 Sherwood Internat Group Ltd Procede et systeme d'acces a une base de donnees, pour utilisateurs multiples, via un reseau et support de stockage lisible par un ordinateur contenant un programme permettant de realiser ce procede
GB2364406A (en) * 1999-11-01 2002-01-23 Sherwood Internat Group Ltd Multi-user access of a database over a network
US7136880B2 (en) * 2000-07-20 2006-11-14 Market Models, Inc. Method and apparatus for compiling business data
EP1180739A1 (fr) * 2000-08-16 2002-02-20 Gavoa Ltd Remise des contenus des pages
US7346842B1 (en) 2000-11-02 2008-03-18 Citrix Systems, Inc. Methods and apparatus for incorporating a partial page on a client
US7051084B1 (en) 2000-11-02 2006-05-23 Citrix Systems, Inc. Methods and apparatus for regenerating and transmitting a partial page
WO2002063504A3 (fr) * 2000-11-02 2004-04-08 Citrix Systems Inc Procedes et appareil permettant d'augmenter le code de generation de page pour effectuer la regeneration partielle de page
US7194743B2 (en) 2000-12-12 2007-03-20 Citrix Systems, Inc. Methods and apparatus for communicating changes between a user interface and an executing application using property paths
US7908269B2 (en) * 2001-04-04 2011-03-15 At&T Intellectual Property I, L.P. Method, system, and software for transmission of information
US8589388B2 (en) * 2001-04-04 2013-11-19 At&T Intellectual Property I, L.P. Method, system, and software for transmission of information
EP1288805A1 (fr) * 2001-08-24 2003-03-05 Accenture Global Services GmbH Estimation des risques du commerce électronique
GB2381340A (en) * 2001-10-27 2003-04-30 Hewlett Packard Co Document generation in a distributed information network
US8527495B2 (en) * 2002-02-19 2013-09-03 International Business Machines Corporation Plug-in parsers for configuring search engine crawler

Also Published As

Publication number Publication date
EP1125221A4 (fr) 2006-04-05
EP1125221A1 (fr) 2001-08-22

Similar Documents

Publication Publication Date Title
US6560772B1 (en) Method, system, and program for accessing data in different environments
US6260044B1 (en) Information storage and retrieval system for storing and retrieving the visual form of information from an application in a database
US6662342B1 (en) Method, system, and program for providing access to objects in a document
US7770180B2 (en) Exposing embedded data in a computer-generated document
US7072983B1 (en) Scheme for systemically registering meta-data with respect to various types of data
US7620665B1 (en) Method and system for a generic metadata-based mechanism to migrate relational data between databases
US6199071B1 (en) Method and apparatus for archiving hypertext documents
CN101183355B (zh) 复制和粘贴处理方法及装置
US7953116B2 (en) Intelligent access within a document package
US20040254935A1 (en) Method and apparatus for automatic consolidation of personalized dynamic data
JP5042693B2 (ja) マークアップ言語ファイルの格納及び送信の最適化
US6421680B1 (en) Method, system and computer program product for case and character-encoding insensitive searching of international databases
US20090313536A1 (en) Dynamically Providing Relevant Browser Content
US20100094822A1 (en) System and method for determining a file save location
WO1999059087A1 (fr) Procede permettant d'ameliorer l'execution d'un code faisant appel a un langage html analyse et appareil correspondant
US7376650B1 (en) Method and system for redirecting a request using redirection patterns
US6351741B1 (en) Method of locating a file linked to a document in a relocated document directory structure
WO1998014896A1 (fr) Integrateur de ressources de donnees/traitement pour serveur web
US6289347B1 (en) Data processing system utilizing web forms
EP1672526A2 (fr) Formats de fichiers, procédés et produits de programme informatique pour représenter des documents
US8805860B2 (en) Processing encoded data elements using an index stored in a file
US7886227B2 (en) Cross-environment context-sensitive help files
US20050131859A1 (en) Method and system for standard bookmark classification of web sites
US7131065B1 (en) Dynamically altering web page content based on a page's title properties
US20050125372A1 (en) System and method for generating a unique, file system independent key from a URI (Universal Resource Identifier) for use in an index-less VoiceXML browser caching mechanism

Legal Events

Date Code Title Description
AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 1999916169

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1999916169

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 1999916169

Country of ref document: EP