九色国产,午夜在线视频,新黄色网址,九九色综合,天天做夜夜做久久做狠狠,天天躁夜夜躁狠狠躁2021a,久久不卡一区二区三区

打開APP
userphoto
未登錄

開通VIP,暢享免費電子書等14項超值服

開通VIP
數(shù)據(jù)挖掘數(shù)據(jù)集下載搜集整理版

來自互聯(lián)網(wǎng):


1、氣候監(jiān)測數(shù)據(jù)集http://cdiac.ornl.gov/ftp/ndp026b 


2、幾個實用的測試數(shù)據(jù)集下載的網(wǎng)站


Data for MATLAB hackers (HandwrittenDigits、Faces、Text)


http://www.cs.toronto.edu/~roweis/data.html


3、UCI KDD Archive(各類數(shù)據(jù)集)


http://kdd.ics.uci.edu/summary.task.type.html 


http://kdd.ics.uci.edu/summary.data.type.html 


4、UCI收集的機器學(xué)習(xí)數(shù)據(jù)集


ftp://pami.sjtu.edu.cn/  


http://www.ics.uci.edu/~mlearn//MLRepository.htm 


5、樣本數(shù)據(jù)庫


http://kdd.ics.uci.edu/ 


WWW-pages were manually classified


http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-20/www/data/ 


6、CMU World Wide Knowledge Base(Web->KB) project(classified web pages、relationaldata describing pages and hyperlinks)


http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/wwkb/ 


7、人工智能機器學(xué)習(xí)


http://duch-links.wikispaces.com/ 


8、文本分類,即rainbow的數(shù)據(jù)集


http://www-2.cs.cmu.edu/afs/cs/project/theo-11/www/naive-bayes.html 


9、Statlib 數(shù)理統(tǒng)計相關(guān)程序庫


http://liama.ia.ac.cn/SCILAB/scilabindexgb.htm


http://lib.stat.cmu.edu/ 


http://lib.stat.cmu.edu/datasets/


http://lib.stat.cmu.edu/modules.php?op=modload&name=Downloads&file=index&req=viewdownload&cid=2 


10、癌癥基因:


http://www.broad.mit.edu/cgi-bin/cancer/datasets.cgi


11、金融、醫(yī)藥數(shù)據(jù):


http://lisp.vse.cz/pkdd99/Challenge/chall.htm


12、時間序列數(shù)據(jù)的網(wǎng)址


http://www.stat.wisc.edu/~reinsel/bjr-data/ 


13、kdnuggets 相關(guān)鏈接各種數(shù)據(jù)集:


http://www.kdnuggets.com/datasets/index.html 


14、德國智能分析和信息系統(tǒng)


http://www.mlnet.org/cgi-bin/mlnetois.pl/?File=datasets.html 


http://dctc.sjtu.edu.cn/adaptive/datasets/  


http://fimi.cs.helsinki.fi/data/ 


15、IBM智能信息


http://www-958.ibm.com/software/data/cognos/manyeyes/datasets


http://www.almaden.ibm.com/software/quest/Resources/index.shtml 


16、Frequent Set Counting


http://miles.cnuce.cnr.it/~palmeri/datam/DCI/datasets.php


17、評分數(shù)據(jù)集


    Movielens電影評分數(shù)據(jù)


   基本數(shù)據(jù)描述:包括以下三個數(shù)據(jù)集:

   a.943個用戶對1682個電影的10萬條評分

   b.6040個用戶對3900個電影的1百萬條評分

   c.71567個用戶對10681個電影的1千萬條評分

   http://www.grouplens.org/  


    Book-Crossing書籍評分數(shù)據(jù)


   基本數(shù)據(jù)描述:包含了278,858個用戶對271,379本書籍的1,149,780條評分。該數(shù)據(jù)集由Cai-NicolasZiegler 在2004年8-9月用4周的時間從Book-Crossing社區(qū)用網(wǎng)絡(luò)爬出。

   http://www.informatik.uni-freiburg.de/~cziegler/BX/


    Jester JokeData Set 笑話評分集合 


    來自UCBerkeley的KenGoldberg發(fā)布的一個推薦系統(tǒng)使用的數(shù)據(jù)集。包含關(guān)于100個笑話的73,496名用戶評分的410萬條連續(xù)評分。

   http://www.ieor.berkeley.edu/~goldberg/jester-data/


    Netflix數(shù)據(jù)集


   也是電影評分數(shù)據(jù)集,480,189 個用戶,17,770 部電影,100,480,507 條評分記錄。與它相比,MovieLens數(shù)據(jù)集少了 2 個數(shù)量級。它的位置相信會逐漸被 Netflix 數(shù)據(jù)所替代,這是時代進步的必然結(jié)果。

   說明:以上四個均為用戶評分數(shù)據(jù)


21、GPS軌跡數(shù)據(jù)


GeoLife GPS Trajectories

http://research.microsoft.com/en-us/downloads/b16d359d-d164-469e-9fd4-daa38f2b2e13/default.aspx  


GPS Trajectories with transportation modelabels

http://research.microsoft.com/apps/pubs/?id=141896 


Movebank 動物軌跡

http://www.movebank.org/

 

22、手機WIFI藍牙


A Community Resource for Archiving Wireless Data AtDartmouth

http://crawdad.cs.dartmouth.edu/


crowflow  手機和wifi軌跡

http://crowdflow.net/ 


23、OpenStreetMap Data


planet.openstreetmap.org 或者http://metro.teczno.com/


24、openpath上傳數(shù)據(jù)+API


https://openpaths.cc/  


25、FOURSQUARE


26、GeoTime


http://www.geotime.com/GeoTime(s)/January-2012/Cupid-Strikes-Again--Time-Series---GIS--Together-a.aspx  


27、數(shù)據(jù)堂

http://www.datatang.com/

28、http://www.kdnuggets.com/datasets/

29、http://appsrv.cse.cuhk.edu.hk/~kdd/data_collection.html






 


IBM Almaden Research Center Data MiningProjects


 


Data Sets:


·        Synthetic Data GenerationCode for Associations and Sequential Patterns

·        Synthetic Data GenerationCode for Classification

·        "Dense" Data-Sets (aprioribinary format, 3.2Mb)

·        Enron Email Data Set

Demos:


·        General Visualizations forAssociations

·        Visualization Demo: MarketBasket Analysis 


IBM Intelligent Miner:


·        IBM Intelligent Miner forData

·        Video and image clips fromIBM Data Mining T.V. Ad 


IBM Data Mining Resources:


·        Business IntelligenceSolutions   Our colleagues offering data miningconsultancy and services.

·        Data Abstraction ResearchGroup   Our colleagues in IBM Thomas J. WatsonResearch Center.   Our colleagues in France.

·        Data Mining: Extending theInformation Warehouse Framework   IBM White Paperon Data Mining.

 

在下面的網(wǎng)址可以找到reuters數(shù)據(jù)集


http://www.research.att.com/~lewis/reuters21578.html


關(guān)于基金的數(shù)據(jù)挖掘的網(wǎng)站


http://www.gotofund.com/index.asp


http://lans.ece.utexas.edu/~strehl/


 


reuters數(shù)據(jù)集


http://www.research.att.com/~lewis/reuters21578.html


http://www-2.cs.cmu.edu/webkb


http://www.cs.auc.dk/research/DP/tdb/TimeCenter/TimeCenterPublications/TR-75.pdf


 


關(guān)聯(lián):


http://flow.dl.sourceforge.net/sourceforge/weka/regression-datasets.jar


http://www.phys.uni.torun.pl/~duch/software.html


 


WEKA:


http://flow.dl.sourceforge.net/sourceforge/weka/regression-datasets.jar 


1。A jarfile containing 37 classification problems,originally obtained from the UCI repository


http://prdownloads.sourceforge.net/weka/datasets-UCI.jar  


2。A jarfile containing 37 regression problems,obtained from various sources


http://prdownloads.sourceforge.net/weka/datasets-numeric.jar 


3。A jarfile containing 30 regression datasetscollected by Luis Torgo


http://prdownloads.sourceforge.net/weka/regression-datasets.jar  


 


數(shù)據(jù)挖掘相關(guān)比賽以及數(shù)據(jù)集


u  2005 University of Californiadata mining contest, predicting bad accounts and their churn dateusing real-world CRM data, deadline June 30, 2005.


u  ILP 2005 Challenge, on theprediction of functional classes of genes.


u  KDD Cup 2005, on classifyinginternet user search queries, deadline July 8.


u  Data Mining Cup 2005 (Chemnitz,Germany), for students; topic: How data mining can ascertain therisk of loss of payments and reduce this risk.


u  KDD Cup 2004, focuses ondata-mining for a several performance criteria using datasetsfrombioinformatics and quantum physics.


u  InfoVis 2004 Contest, TheHistory of InfoVis.


u  DATA MINING CUP 2004 (Chemnitz,Germany), for students.


u  InfoVis 2003 Contest:Visualization and Pair Wise Comparison of Trees, results announcedSep 5, 2003.


 


u  KDD CUP 2003


http://www.cs.cornell.edu/projects/kddcup/index.html


u  KDD Cup 2003, focuses onproblems motivated by network mining and the analysis of usagelogs.


u  DATA MINING CUP 2003 (Chemnitz,Germany). The task is to identify spam emails before they reach theuser′s mailbox.


u  KDD Cup 2002, focus on datamining in molecular biology.


u  Student Data Mining Cup (2002),Chemnitz University and Prudential Systems.



再補充請在百度文庫搜索“數(shù)據(jù)集情況介紹”


 


 

本站僅提供存儲服務(wù),所有內(nèi)容均由用戶發(fā)布,如發(fā)現(xiàn)有害或侵權(quán)內(nèi)容,請點擊舉報。
打開APP,閱讀全文并永久保存 查看更多類似文章
猜你喜歡
類似文章
備用的數(shù)據(jù)集,目前用kdd 99
機器學(xué)習(xí)和數(shù)據(jù)科學(xué)的最佳公共數(shù)據(jù)集
力薦!50 個最實用的免費機器學(xué)習(xí)數(shù)據(jù)集
推薦一些機器學(xué)習(xí)相關(guān)的數(shù)據(jù)集
機器學(xué)習(xí)高質(zhì)量數(shù)據(jù)集大合輯
開源數(shù)據(jù)集整理
更多類似文章 >>
生活服務(wù)
熱點新聞
分享 收藏 導(dǎo)長圖 關(guān)注 下載文章
綁定賬號成功
后續(xù)可登錄賬號暢享VIP特權(quán)!
如果VIP功能使用有故障,
可點擊這里聯(lián)系客服!

聯(lián)系客服