Gupta, 2018 - Google Patents
Optimized text data processingGupta, 2018
- Document ID
- 16060311900364626460
- Author
- Gupta S
- Publication year
- Publication venue
- 2018 International Conference on Advances in Computing, Communication Control and Networking (ICACCCN)
External Links
Snippet
As the data generated by the average user continues to rise day by day, the ability to process these large chunks of data seems to be unable to match it. This is mostly caused due to algorithmic inefficiency and time and memory constraints. While numerous algorithms …
- 238000000034 method 0 abstract description 10
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30312—Storage and indexing structures; Management thereof
- G06F17/30321—Indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30613—Indexing
- G06F17/30619—Indexing indexing structures
- G06F17/30625—Trees
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30943—Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type
- G06F17/30964—Querying
- G06F17/30979—Query processing
- G06F17/30985—Query processing by using string matching techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30943—Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type
- G06F17/30946—Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30289—Database design, administration or maintenance
- G06F17/30303—Improving data quality; Data cleansing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30424—Query processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30587—Details of specialised database models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30067—File systems; File servers
- G06F17/30129—Details of further file system functionalities
- G06F17/3015—Redundancy elimination performed by the file system
- G06F17/30156—De-duplication implemented within the file system, e.g. based on file segments
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/22—Arrangements for sorting or merging computer data on continuous record carriers, e.g. tape, drum, disc
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Boytsov | Indexing methods for approximate dictionary searching: Comparative analysis | |
| Lin | Binary search algorithm | |
| Adjeroh et al. | The burrows-wheeler transform: data compression, suffix arrays, and pattern matching | |
| Harman et al. | Inverted Files. | |
| CN101937448B (en) | For the string compression of the order of the maintenance based on dictionary of primary memory row memory storage | |
| CN107153647B (en) | Method, apparatus, system and computer program product for data compression | |
| US10521441B2 (en) | System and method for approximate searching very large data | |
| US20070198566A1 (en) | Method and apparatus for efficient storage of hierarchical signal names | |
| CN109299086B (en) | Optimal sort key compression and index reconstruction | |
| Jiang et al. | Pids: attribute decomposition for improved compression and query performance in columnar storage | |
| JP2018501752A (en) | Lossless data loss by deriving data from basic data elements present in content-associative sheaves | |
| González et al. | Locally compressed suffix arrays | |
| US8548979B2 (en) | Indexing for regular expressions in text-centric applications | |
| Sirén | Burrows-Wheeler transform for terabases | |
| Chien et al. | Geometric BWT: compressed text indexing via sparse suffixes and range searching | |
| Gog et al. | Large-scale pattern search using reduced-space on-disk suffix arrays | |
| Bast et al. | Efficient fuzzy search in large text collections | |
| Zhang et al. | Succinct range filters | |
| Yoon et al. | Two scalable algorithms for associative text classification | |
| JP2018524886A (en) | Perform multi-dimensional search, content associative retrieval, and keyword-based retrieval and retrieval for lossless data using basic data sheaves | |
| Wang et al. | TxtAlign: efficient near-duplicate text alignment search via bottom-k sketches for plagiarism detection | |
| Chen et al. | On the signature tree construction and analysis | |
| Chen | Signature files and signature trees | |
| Ilic et al. | Inverted index search in data mining | |
| Kärkkäinen et al. | Full-text indexes in external memory |