jasminecorp.net directory
Datasets

Web Sites

DNA microarray gene expression data
A collection of public gene expression data sources maintained by A. Brazma.
http://www.ebi.ac.uk/~brazma/Data-mining/microarray.html

WIPO patents dataset for automated categorization
The first free collection of patents, in XML format, containing over 75,000 manually classified
patent documents in English from 1998 to 2002.

http://www.wipo.int/ibis/datasets

Reuters-21578 Text Categorization Corpus
A classic benchmark for text categorization algorithms.
http://www.daviddlewis.com/resources/testcollections/reuters21578/

BankSearch Dataset
A freely downloadable, pre-classified dataset of HTML web pages sorted into 10 categories.

http://www.pedal.rdg.ac.uk/banksearchdataset/

RISE: Repository of Information Sources used in information Extraction tasks.
Repository of online information sources: test domains for information extraction and wrapper
generation tools that learn extraction rules (extraction patterns).

http://www.isi.edu/info-agents/RISE/

WordSimilarity-353 Test Collection
Contains 353 English word pairs along with human-assigned similarity judgements.
http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/wordsim353.html

Web->KB dataset
Web pages partitioned into classes, with hyperlink data. The dataset has been used for text
categorization and learning to extract symbolic knowledge from the World Wide Web.

http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/wwkb/

The 20 Newsgroups Data Set
A widely used dataset of postings from 20 newsgroups, for text categorization.
http://www.ai.mit.edu/~jrennie/20_newsgroups/

AdEater data
AdEater is a program that learns to remove Internet advertisements. The machine learning dataset is
available from this page.

http://www.cs.ucd.ie/staff/nick/home/research/ae/

Learning Relational Concepts from Sensor Data of a Mobile Robot
A collection of data sets, each represented in first-order logic. Maintained at the
University of Dortmund, Germany.

http://www-ai.cs.uni-dortmund.de/FORSCHUNG/PROJEKTE/BLEARN2/data-sets.html

HS3D - Homo Sapiens Splice Sites Dataset
HS3D (Homo Sapiens Splice Sites Dataset) is a database of Homo Sapiens Exon, Intron and Splice
regions extracted from GenBank primate sequences Rel.123. The aim of this data set is to give
standardized material to train and to assess the prediction accuracy of computational approaches
for gene identification and characterization.

http://www.sci.unisannio.it/docenti/rampone/

University of Maryland, INFORUM EconData
Several hundred thousand economic time series, produced by the U.S. Government and distributed by
the government in a variety of formats and media, have been put into a standard, highly efficient,
easy-to-use form for personal computers.

http://www.inform.umd.edu/EdRes/Topic/Economics/EconData/Econdata.html

Penn Treebank Project
A corpus of parsed sentences. Used by many researchers for training data-driven parsing algorithms.
http://www.cis.upenn.edu/~treebank/

Time Series Data Library
A collection of over 500 time series, maintained by Rob Hyndman. Time series are organized by
subject.

http://www-personal.buseco.monash.edu.au/~hyndman/TSDL/

Face recognition dataset
A dataset of face images for face recognition algorithms.
http://www.cs.cmu.edu/afs/cs.cmu.edu/user/avrim/www/ML94/face_homework.html

NIST Special Database 4.
This NIST database of fingerprint images contains 2000 8-bit grayscale fingerprint image pairs.
http://www.nist.gov/srd/nistsd4.htm

The StatLib Datasets Archive
A repository of datasets used in statistics and machine learning.
http://lib.stat.cmu.edu/datasets/

National Space Science Data Center
Provides access to a wide variety of astrophysics, space physics, solar physics, lunar and
planetary data from NASA space flight missions, in addition to selected other data and some
models and software.

http://nssdc.gsfc.nasa.gov/

TREC Data
Text datasets used in information retrieval and learning in text domains.
http://trec.nist.gov/data.html

UCI Machine Learning Repository
A repository of databases, domain theories and data generators that are used by the machine
learning community for the empirical analysis of machine learning algorithms.

http://www.ics.uci.edu/~mlearn/MLRepository.html

DELVE - Data for Evaluating Learning in Valid Experiments
Data for Evaluating Learning in Valid Experiments: a standardized environment designed to evaluate the
performance of methods that learn relationships based primarily on empirical data. Delve lets
users compare their learning methods with other methods on many datasets.

http://www.cs.utoronto.ca/~delve/

Bilkent University Function Approximation Repository
Datasets used for the experimental analysis of function approximation techniques and for training
and demonstration by the machine learning and statistics community.

http://funapp.cs.bilkent.edu.tr/DataSets/

Dataset generator
Datgen, formerly SCDS, is a computer program that generates data to systematically test programs
that consume data. These synthetic datasets can be used to validate learning algorithms.

http://www.datgen.com/

The RCSB Protein Data Bank (PDB)
Archive of experimentally determined 3-D structures of biological macromolecules, from the Brookhaven
National Laboratory.

http://www.rcsb.org/pdb/

 


Jasminecorp.net directory is based on the Open Directory and is being modified by Jasminecorp.

©2004 Jasmine Computers Inc.
