Test, 2019 - Google Patents
Sampling to Maintain ApproximateTest, 2019
- Document ID
- 3905249608775805859
- Author
- Test U
- Publication year
- Publication venue
- Theoretical Computer Science: 37th National Conference, NCTCS 2019, Lanzhou, China, August 2–4, 2019, Revised Selected Papers
External Links
Snippet
In data management center, sometimes it is necessary to provide a subset to show data characteristics, among which probability distribution is an important one. Sampling is a fundamental method to generate data subsets. But how to sample a minimum subset with …
- 238000005070 sampling 0 title abstract description 74
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30587—Details of specialised database models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/3071—Clustering or classification including class or cluster creation or modification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/18—Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6279—Classification techniques relating to the number of classes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20130097125A1 (en) | Automated analysis of unstructured data | |
| Peck et al. | Genetic algorithms as global random search methods: An alternative perspective | |
| EP2625628A2 (en) | Probabilistic data mining model comparison engine | |
| Peddi | Data Pull out and facts unearthing in biological Databases | |
| Florescu et al. | Algorithmically generating new algebraic features of polynomial systems for machine learning | |
| US12175354B1 (en) | Apparatus and method for training a tunable data structure to predict internal ribosome entry site (IRES) activity | |
| Chakradeo et al. | Breast cancer recurrence prediction using machine learning | |
| Bortolussi et al. | Learning model checking and the kernel trick for signal temporal logic on stochastic processes | |
| Yang et al. | A heuristic sampling method for maintaining the probability distribution | |
| CN118363866A (en) | Method and system for generating large language model fuzzy test sample based on coverage index | |
| Hoffmann et al. | Minimising the expected posterior entropy yields optimal summary statistics | |
| Chakraborty et al. | Improving software performance by automatic test cases through genetic algorithm | |
| Massart | A non-asymptotic theory for model selection | |
| Moezi et al. | Fault isolation of analog circuit using an optimized ensemble empirical mode decomposition approach based on multi-objective optimization | |
| Test | Sampling to Maintain Approximate | |
| Hasanin et al. | Experimental studies on the impact of data sampling with severely imbalanced big data | |
| Yang et al. | Sampling to maintain approximate probability distribution under chi-square test | |
| Bakar et al. | Improvement of transformer dissolved gas analysis interpretation using j48 decision tree model | |
| Jabbari et al. | Obtaining accurate probabilistic causal inference by post-processing calibration | |
| Foss et al. | A deterministic hitting-time moment approach to seed-set expansion over a graph | |
| Vrunda et al. | Sentimental analysis of Twitter data and Comparison of covid 19 Cases trend Using Machine learning algorithms | |
| Rabcan et al. | Generation of structure function based on ambiguous and incompletely specified data using fuzzy random forest | |
| Phelps et al. | Towards understanding the bias in decision trees | |
| Hu et al. | Input selection in learning systems: a brief review of some important issues and recent developments | |
| Wang et al. | Ensemble clustering based on evidence theory |