GlossaNet: Parsing a web site as a corpus
Fairon, 1998
- Document ID
- 9940903941308025854
- Author
- Fairon C
- Publication year
- 1998
- Publication venue
- Lingvisticae Investigationes
Snippet
GlossaNet is an automated system that monitors Web sites. On dates and at intervals selected by the user, GlossaNet downloads the Web site, converts it to an electronic corpus and uses the INTEX programs (M. Silberztein 1993) and the linguistic resources of the LADL …
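The snippet describes a pipeline: fetch pages, convert them to a text corpus, then apply linguistic queries. The sketch below illustrates only the middle and last steps, under loud assumptions: it is not GlossaNet's code, INTEX is replaced here by a plain regular-expression search, and the names `TextExtractor`, `html_to_corpus`, and `concordance` are hypothetical. Scheduled downloading is left out; the HTML is assumed to be already in hand as a string.

```python
import re
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text from HTML, skipping script and style content."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if self._skip_depth == 0 and text:
            self.parts.append(text)


def html_to_corpus(html):
    """Turn one HTML page into a single plain-text string (hypothetical helper)."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


def concordance(corpus, pattern, width=30):
    """Return (left context, match, right context) for every regex match.

    A regex stands in for a real linguistic query; this is an assumption,
    not how INTEX grammars work.
    """
    hits = []
    for m in re.finditer(pattern, corpus):
        left = corpus[max(0, m.start() - width):m.start()]
        right = corpus[m.end():m.end() + width]
        hits.append((left, m.group(0), right))
    return hits
```

A caller would pass each downloaded page through `html_to_corpus` and then run `concordance(corpus, r"market \w+")` or a similar pattern to list matches with their surrounding context.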
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
- G06F17/30684—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/271—Syntactic parsing, e.g. based on context-free grammar [CFG], unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2795—Thesaurus; Synonyms
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2785—Semantic analysis
- G06F17/279—Discourse representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/274—Grammatical analysis; Style critique
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| US5890103A (en) | Method and apparatus for improved tokenization of natural language text | |
| US6910004B2 (en) | Method and computer system for part-of-speech tagging of incomplete sentences | |
| US6539348B1 (en) | Systems and methods for parsing a natural language sentence | |
| EP0953192B1 (en) | Natural language parser with dictionary-based part-of-speech probabilities | |
| Maynard et al. | Architectural elements of language engineering robustness | |
| US5680628A (en) | Method and apparatus for automated search and retrieval process | |
| US7689411B2 (en) | Concept matching | |
| US20030158723A1 (en) | Syntactic information tagging support system and method | |
| WO1997004405A9 (en) | Method and apparatus for automated search and retrieval processing | |
| JP2002229981A (en) | A system that generates a normalized representation of a string | |
| EP2406731A2 (en) | System and method for automatic semantic labeling of natural language texts | |
| Jabbar et al. | An analytical analysis of text stemming methodologies in information retrieval and natural language processing systems | |
| Fairon | GlossaNet: Parsing a web site as a corpus | |
| Sankaravelayuthan et al. | English to Tamil machine translation system using parallel corpus | |
| JPH07325826A (en) | Japanese language processing system | |
| KR19980078071A (en) | Scheduling from Natural Memo Sentences | |
| Ben-Hazez et al. | Designing tasks of identification of complex linguistic patterns used for text semantic filtering | |
| Kameyama | Information extraction across linguistic barriers | |
| Bernth et al. | Terminology extraction for global content management | |
| Schwall et al. | From METAL to T1: systems and components for machine translation applications | |
| WO2001024053A2 (en) | System and method for automatic context creation for electronic documents | |
| JP3358100B2 (en) | Japanese question message analysis method and device | |
| JPH0973454A (en) | Document creating apparatus and document creating method | |
| JPH06187329A (en) | Natural language processor | |
| JPH10134055A (en) | Sentence creation support device and medium storing sentence creation support program |