Machine Learning and Bioinformatics Lab Alliance


Open dataset source: Infochimps

Posted on 2013-1-23 15:06:13
The best-known source of open datasets seems to be UCI; here is another one, Infochimps.

Example:
Delicious bookmarks, September 2009

A record of all bookmarking activity on delicious.com for a roughly 10-day period in September 2009. The data comes from Arvind Narayanan, a post-doctoral researcher in Computer Science at Stanford University.

Format is JSON, one record per line. There are 1.25 million entries. Download size is 170 MB. Sample record:

{"updated": "Tue, 08 Sep 2009 08:45:00 +0000", "links": [{"href": "http://www.mcfc.co.uk/", "type": "text/html", "rel": "alternate"}], "title": "Home – Manchester City FC", "author": "cainarachi", "comments": "http://delicious.com/url/b7cdad040b7e1d0aec0c93b1b8c4bb41", "guidislink": false, "title_detail": {"base": "http://feeds.delicious.com/v2/rss/recent?min=1&count=100", "type": "text/plain", "language": null, "value": "Home – Manchester City FC"}, "link": "http://www.mcfc.co.uk/", "source": {}, "wfw_commentrss": "http://feeds.delicious.com/v2/rs ... d0aec0c93b1b8c4bb41", "id": "http://delicious.com/url/b7cdad0 ... b8c4bb41#cainarachi", "tags": [{"term": "bonssites", "scheme": "http://delicious.com/cainarachi/", "label": null}, {"term": "html", "scheme": "http://delicious.com/cainarachi/", "label": null}, {"term": "webdesign", "scheme": "http://delicious.com/cainarachi/", "label": null}, {"term": "css", "scheme": "http://delicious.com/cainarachi/", "label": null}]}
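
Since the file holds one JSON record per line, it can be processed as a stream without loading all 170 MB at once. Below is a minimal Python sketch that counts how often each tag occurs. The function name `count_tags` is made up for illustration, and the code assumes every record has the same layout as the sample above (a `tags` list whose items carry a `term` field). Lines that fail to parse are skipped.

```python
import json
from collections import Counter
from typing import Iterable


def count_tags(lines: Iterable[str]) -> Counter:
    """Count tag terms across records given as one JSON object per line.

    Assumes the layout of the sample record: a "tags" list of objects,
    each with a "term" key. Blank or unparseable lines are skipped.
    """
    counts: Counter = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            # Skip damaged lines, e.g. ones with non-standard quote characters.
            continue
        if not isinstance(record, dict):
            continue
        # "tags" may be missing or null, so fall back to an empty list.
        for tag in record.get("tags") or []:
            term = tag.get("term") if isinstance(tag, dict) else None
            if term:
                counts[term] += 1
    return counts


if __name__ == "__main__":
    # Two tiny hand-written records in the same shape as the sample.
    sample = [
        '{"author": "a", "tags": [{"term": "css"}, {"term": "html"}]}',
        '{"author": "b", "tags": [{"term": "css"}]}',
    ]
    for term, n in count_tags(sample).most_common():
        print(term, n)
```

An open file object is itself an iterable of lines, so it can be passed straight to `count_tags` and only one record is held in memory at a time.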

The dataset is still fairly small, though.