UCI Machine Learning
URL: http://www.ics.uci.edu/~mlearn/MLRepository.html
ODP description: A repository of databases, domain theories and data generators that are used by the machine learning community for the empirical analysis of machine learning algorithms at the University of California at Irvine. [Free]
Page title: UCI Machine Learning Repository

Face recognition dataset
URL: http://www.cs.cmu.edu/afs/cs.cmu.edu/user/avrim/www/ML94/face_homework.html
ODP description: A dataset of face images for face recognition algorithms.
Page title: A NEURAL NETWORK FACE RECOGNITION ASSIGNMENT

TechTC - Technion Repository of Text Categorization Datasets
URL: http://techtc.cs.technion.ac.il
ODP description: Provides a large number of diverse test collections for use in text categorization research.
Page description: Text categorization test collections.

Time Series Data Library
URL: http://www-personal.buseco.monash.edu.au/~hyndman/TSDL/
ODP description: A collection of over 500 time series, maintained by Rob Hyndman. Time series are organized by subject.

DELVE - Data for Evaluating Learning in Valid Experiments
URL: http://www.cs.utoronto.ca/~delve/
ODP description: Data for Evaluating Learning in Valid Experiments: A standardized environment designed to evaluate the performance of methods that learn relationships based primarily on empirical data. Delve makes it possible for users to compare their learning methods with other methods on many datasets.

Learning Relational Concepts from Sensor Data of a Mobile Robot
URL: http://www-ai.cs.uni-dortmund.de/FORSCHUNG/PROJEKTE/BLEARN2/data-sets.html
ODP description: A collection of data sets, each represented in first-order logic. Maintained at the University of Dortmund, Germany.
Page title: University of Dortmund -- Computer Science VIII

NIST Special Database 4
URL: http://www.nist.gov/srd/nistsd4.htm
ODP description: This NIST database of fingerprint images contains 2000 8-bit gray scale fingerprint image pairs.
Page title: NIST Special Database 4 - NIST 8-Bit Gray Scale Images of Fingerprint Image Groups (FIGS)
Page description: This NIST database of fingerprint images contains 2000 8-bit gray scale fingerprint image pairs

National Space Science Data Center
URL: http://nssdc.gsfc.nasa.gov/
ODP description: Provides access to a wide variety of astrophysics, space physics, solar physics, lunar, and planetary data from NASA space flight missions, in addition to selected other data, models, and software.
Page title: Welcome to the NSSDC!
Page description: National Space Science Data Center (NSSDC) Home Page

Bilkent University Function Approximation Repository
URL: http://funapp.cs.bilkent.edu.tr/DataSets/
ODP description: Datasets used for the experimental analysis of function approximation techniques and for training and demonstration by the machine learning and statistics community.

The RCSB Protein Data Bank (PDB)
URL: http://www.rcsb.org/pdb/
ODP description: Archive of experimentally determined, biological macromolecule 3-D structures from the Brookhaven National Laboratory. Also includes newsletters and a description of the PDB file format.
Page title: RCSB Protein Data Bank

Penn Treebank Project
URL: http://www.cis.upenn.edu/~treebank/
ODP description: A corpus of parsed sentences. Used by many researchers for training data-driven parsing algorithms.

Web->KB dataset
URL: http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/wwkb/
ODP description: Web pages partitioned into classes, with hyperlink data. The dataset has been used for text categorization and learning to extract symbolic knowledge from the World Wide Web.
Page title: World Wide Knowledge Base (Web->KB) project

RISE: Repository of Information Sources used in information Extraction tasks
URL: http://www.isi.edu/info-agents/RISE/
ODP description: Repository of online information sources: test domains for information extraction and wrapper generation tools that learn extraction rules (extraction patterns).
Page title: RISE: Repository of information sources used in information extraction tasks (learning extraction rules / extraction patterns).

Dataset generator
URL: http://www.datgen.com/
ODP description: Datgen, formerly SCDS, is a computer program that generates data to systematically test programs that consume data. These synthetic datasets can be used to validate learning algorithms.
Page title: www.datgen.com

HS3D - Homo Sapiens Splice Sites Dataset
URL: http://www.sci.unisannio.it/docenti/rampone/
ODP description: HS3D (Homo Sapiens Splice Sites Dataset) is a database of Homo Sapiens exon, intron and splice regions extracted from GenBank primate sequences, Rel. 123. The aim of this data set is to provide standardized material for training and for assessing the prediction accuracy of computational approaches to gene identification and characterization.

The StatLib Datasets Archive
URL: http://lib.stat.cmu.edu/datasets/
ODP description: A repository of datasets used in statistics and machine learning.
Page title: StatLib---Datasets Archive

Reuters-21578 Text Categorization Corpus
URL: http://www.daviddlewis.com/resources/testcollections/reuters21578/
ODP description: A classic benchmark for text categorization algorithms.
Page title: Reuters-21578 Text Categorization Test Collection

TREC Data
URL: http://trec.nist.gov/data.html
ODP description: Text datasets used in information retrieval and learning in text domains.
Page title: Text REtrieval Conference (TREC) Data

WordSimilarity-353 Test Collection
URL: http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/wordsim353.html
ODP description: Contains 353 English word pairs along with human-assigned similarity judgements.
Page title: The WordSimilarity-353 Test Collection
Page description: Word similarity test collection