
Dataset Resources for Data Mining


Reposted from: http://bbs.w3china.org/blog/more.asp?name=idmer&id=24017

When doing data mining research, people often struggle to find suitable data. KDnuggets has a Datasets section that lists a number of datasets: http://www.kdnuggets.com/datasets/

Another good resource is http://kdd.ics.uci.edu/, which contains the following datasets (grouped by application area):

Direct Marketing
  KDD CUP 1998 Data

GIS
  Forest CoverType

Indexing
  Corel Image Features
  Pseudo Periodic Synthetic Time Series

Intrusion Detection
  KDD CUP 1999 Data

Process Control
  Synthetic Control Chart Time Series

Recommendation Systems
  Entree Chicago Recommendation Data

Robots
  Pioneer-1 Mobile Robot Data
  Robot Execution Failures

Sign Language Recognition
  Australian Sign Language Data
  High-quality Australian Sign Language Data

Text Categorization
  20 Newsgroups Data
  Reuters-21578 Text Categorization Collection
  NSF Research Awards Abstracts 1990-2003

World Wide Web
  Microsoft Anonymous Web Data
  MSNBC Anonymous Web Data
  Syskill Webert Web Data

Reposted from: http://blogger.org.cn/blog/more.asp?name=DMman&id=24043

------------------------------------------------------------------------------------------------------------------------------------

Note from DMman: the links below were collected from the internet. DMman has not tested each one for validity or usefulness.

1. Climate monitoring dataset: http://cdiac.ornl.gov/ftp/ndp026b

2. Several useful sites for downloading test datasets

http://www.cs.toronto.edu/~roweis/data.html
http://kdd.ics.uci.edu/summary.task.type.html
http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-20/www/data/
http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/wwkb/
http://www.phys.uni.torun.pl/~duch/software.html
The Reuters dataset can be found at: http://www.research.att.com/~lewis/reuters21578.html

The following page lists a variety of datasets:
http://kdd.ics.uci.edu/summary.data.type.html

For text classification, another usable dataset is the one that comes with rainbow:
http://www-2.cs.cmu.edu/afs/cs/project/theo-11/www/naive-bayes.html

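3. I have gathered many test datasets. Anyone writing a paper will surely need them, at least to check how well an algorithm performs.
Some of the links may be unreachable, but some should still work: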

Machine learning datasets collected by UCI
ftp://pami.sjtu.edu.cn/
http://www.ics.uci.edu/~mlearn//MLRepository.htm

statlib
http://liama.ia.ac.cn/SCILAB/scilabindexgb.htm
http://lib.stat.cmu.edu/

Sample databases
http://kdd.ics.uci.edu/
http://www.ics.uci.edu/~mlearn/MLRepository.html

A site about data mining for investment funds
http://www.gotofund.com/index.asp

http://lans.ece.utexas.edu/~strehl/

Reuters dataset
http://www.research.att.com/~lewis/reuters21578.html

Various datasets:
http://kdd.ics.uci.edu/summary.data.type.html
http://www.mlnet.org/cgi-bin/mlnetois.pl/?File=datasets.html
http://lib.stat.cmu.edu/datasets/
http://dctc.sjtu.edu.cn/adaptive/datasets/
http://fimi.cs.helsinki.fi/data/
http://www.almaden.ibm.com/software/quest/Resources/index.shtml
http://miles.cnuce.cnr.it/~palmeri/datam/DCI/

Text classification and Web
http://www-2.cs.cmu.edu/afs/cs/project/theo-11/www/naive-bayes.html

http://www.w3.org/TR/WD-logfile-960221.html
http://www.w3.org/Daemon/User/Config/Logging.html#AccessLog
http://www.w3.org/1998/11/05/WC-workshop/Papers/bala2.html
http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/wwkb/
http://www.web-caching.com/traces-logs.html
http://www-2.cs.cmu.edu/webkb
http://www.cs.auc.dk/research/DP/tdb/TimeCenter/TimeCenterPublications/TR-75.pdf
http://www.cs.cornell.edu/projects/kddcup/index.html


Time series data
http://www.stat.wisc.edu/~reinsel/bjr-data/

Test data for the Apriori algorithm
http://www.almaden.ibm.com/cs/quest/syndata.html

Data generators
http://www.cse.cuhk.edu.hk/~kdd/data_collection.html
http://www.almaden.ibm.com/cs/quest/syndata.html


Association:
http://flow.dl.sourceforge.net/sourceforge/weka/regression-datasets.jar
http://www.almaden.ibm.com/software/quest/Resources/datasets/syndata.html#assocSynData

WEKA:
http://flow.dl.sourceforge.net/sourceforge/weka/regression-datasets.jar
1. A jarfile containing 37 classification problems, originally obtained from the UCI repository
http://prdownloads.sourceforge.net/weka/datasets-UCI.jar
2. A jarfile containing 37 regression problems, obtained from various sources
http://prdownloads.sourceforge.net/weka/datasets-numeric.jar
3. A jarfile containing 30 regression datasets collected by Luis Torgo
http://prdownloads.sourceforge.net/weka/regression-datasets.jar
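
A jarfile is an ordinary zip archive, so its contents can be inspected without WEKA. The following is a minimal sketch, assuming the jars above hold datasets as .arff files (WEKA's native format); the helper names and the demo archive are hypothetical and exist only for illustration.

```python
import io
import zipfile
from typing import List


def list_arff_entries(jar_bytes: bytes) -> List[str]:
    """Return the sorted names of .arff members in a jar passed as bytes."""
    # A jar uses the zip container format, so zipfile can read it directly.
    with zipfile.ZipFile(io.BytesIO(jar_bytes)) as archive:
        return sorted(
            name for name in archive.namelist()
            if name.lower().endswith(".arff")
        )


def build_demo_jar() -> bytes:
    """Build a tiny in-memory jar (hypothetical contents) for demonstration."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("META-INF/MANIFEST.MF", "Manifest-Version: 1.0\n")
        archive.writestr("demo/iris.arff", "@relation iris\n")
        archive.writestr("demo/README.txt", "placeholder\n")
    return buffer.getvalue()


if __name__ == "__main__":
    # Only the .arff member of the demo archive is reported.
    print(list_arff_entries(build_demo_jar()))
```

For a jar that has already been downloaded, the same function can be applied to the bytes read from the local file with open(path, "rb").read().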

Cancer genes:
http://www.broad.mit.edu/cgi-bin/cancer/datasets.cgi

Financial data:
http://lisp.vse.cz/pkdd99/Challenge/chall.htm

 

Provided by another contributor:
http://www.cs.toronto.edu/~roweis/data.html
http://kdd.ics.uci.edu/summary.task.type.html
http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-20/www/data/
http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/wwkb/
http://www.phys.uni.torun.pl/~duch/software.html
The Reuters dataset can be found at:
http://www.research.att.com/~lewis/reuters21578.html

The following page lists a variety of datasets:
http://kdd.ics.uci.edu/summary.data.type.html

For text classification, another usable dataset is the one that comes with rainbow:
http://www-2.cs.cmu.edu/afs/cs/project/theo-11/www/naive-bayes.html


Download the Financial Data (~17.5M zipped file, ~67M unzipped data)
Download the Medical Data (~2M zipped file, ~6M unzipped data)
http://lisp.vse.cz/pkdd99/Challenge/chall.htm

Datasets linked from KDnuggets (passing along someone else's work):
http://www.kdnuggets.com/datasets/index.html

You can also visit http://blogger.org.cn/blog/more.asp?name=idmer&id=24017 for a detailed introduction to the KDnuggets dataset resources.

------------------------------------------------------------------------------------------------------------------------------------

Source: material collected from the internet

Data mining competitions and datasets

2005 University of California data mining contest, predicting bad accounts and their churn date using real-world CRM data, deadline June 30, 2005.

  • ILP 2005 Challenge, on the prediction of functional classes of genes.
  • KDD Cup 2005, on classifying internet user search queries, deadline July 8.
  • Data Mining Cup 2005 (Chemnitz, Germany), for students; topic: How data mining can ascertain the risk of loss of payments and reduce this risk.
  • KDD Cup 2004, focuses on data mining for several performance criteria using datasets from bioinformatics and quantum physics.
  • InfoVis 2004 Contest, The History of InfoVis.
  • DATA MINING CUP 2004 (Chemnitz, Germany), for students.
  • InfoVis 2003 Contest: Visualization and Pair Wise Comparison of Trees, results announced Sep 5, 2003.
  • KDD Cup 2003, focuses on problems motivated by network mining and the analysis of usage logs.
  • DATA MINING CUP 2003 (Chemnitz, Germany). The task is to identify spam emails before they reach the user's mailbox.
  • KDD Cup 2002, focuses on data mining in molecular biology.
  • Student Data Mining Cup (2002), Chemnitz University and Prudential Systems.