|
一个比较有名的貌似是UCI,这里还有一个Infochimps
Example:
Delicious bookmarks, September 2009
A record of all bookmarking activity on delicious.com for a roughly 10-day period in September 2009. The data comes from Arvind Narayanan, a post-doctoral researcher in Computer Science at Stanford University.
Format is JSON, one record per line. There are 1.25 million entries. Download size is 170 MB. Sample record:
{"updated": “Tue, 08 Sep 2009 08:45:00 +0000”, “links”: [{"href": “http://www.mcfc.co.uk/”, “type”: “text/html”, “rel”: "alternate"}], “title”: “Home – Manchester City FC”, “author”: “cainarachi”, “comments”: “http://delicious.com/url/b7cdad040b7e1d0aec0c93b1b8c4bb41”, “guidislink”: false, “title_detail”: {"base": “http://feeds.delicious.com/v2/rss/recent?min=1&count=100”, “type”: “text/plain”, “language”: null, “value”: “Home – Manchester City FC”}, “link”: “http://www.mcfc.co.uk/”, “source”: {}, “wfw_commentrss”: “http://feeds.delicious.com/v2/rs ... d0aec0c93b1b8c4bb41”, “id”: “http://delicious.com/url/b7cdad0 ... b8c4bb41#cainarachi”, “tags”: [{"term": “bonssites”, “scheme”: “http://delicious.com/cainarachi/”, “label”: null}, {"term": “html”, “scheme”: “http://delicious.com/cainarachi/”, “label”: null}, {"term": “webdesign”, “scheme”: “http://delicious.com/cainarachi/”, “label”: null}, {"term": “css”, “scheme”: “http://delicious.com/cainarachi/”, “label”: null}]}
数据集还是比较小。。 |
|