This list is not exhaustive - help expand it!

Social Tagging Systems Research Group Source Year Obtained Availability Contact References
CiteULike Oversity Ltd. Primary Daily Snapshots Via Download after Email (link) Richard Cameron  
Bibsonomy KDE Primary Periodical Snapshots every half year Available after signed license agreement Andreas Hotho [Hotho 2006]
MovieLens GroupLens Primary 2009 Via Download (link) GroupLens Info [Sen 2006]
GiveALink NaN Group Primary Current information via API Via API Filippo Menczer [Markines 2009]
ESP Game Luis von Ahn Primary 2006 Via Download (link) Luis von Ahn [VonAhn 2004]
Delicious DAI Labor Secondary 2007/2008 Via Email Request Alan Said [Wetzker 2006]
Delicious, Stumble Upon & Wikipedia NLP and Information Retrieval Group Secondary 2008/2009 Via Download (link) Arkaitz Zubiaga [Zubiaga 2009a] [Zubiaga 2009b] [Zubiaga 2009c]
Delicious, Flickr,, TAGora Secondary 2006, 2007, 2008 Via Download (link) Vittorio Loreto  
Delicious, Flickr, Diigo, Bibsonomy and others Agents and Social Computation Secondary 2009 Via Email Request Markus Strohmaier [Grahsl 2010]

In case you are aware of other available datasets, please let me know by leaving a comment on a corresponding blog post.

Page updated and maintained by Markus Strohmaier.


[Grahsl 2010] H.P. Grahsl, C. Körner, M. Strohmaier. A Collection of Tagging Datasets Containing Complete Personomies From Heterogeneous Sources. Technical Report, Knowledge Management Institute, Graz University of Technology. To be published in 2010

[Hotho 2006] A. Hotho, R. Jäschke, C. Schmitz, and G. Stumme. BibSonomy: A Social Bookmark and Publication Sharing System. In Aldo de Moor, Simon Polovina, and Harry Delugach, editors, Proceedings of the Conceptual Structures Tool Interoperability Workshop at the 14th International Conference on Conceptual Structures, Aalborg, Denmark

[Markines 2009] B. Markines and F. Menczer. A Scalable, Collaborative Similarity Measure for Social Annotation Systems. Proc. 20th ACM Conf. on Hypertext and Hypermedia (HT).

[Sen 2006] S. Sen, S. K. Lam, A. M. Rashid, D. Cosley, D. Frankowski, J. Osterhouse, F. M. Harper, and J. Riedl. tagging, communities, vocabulary, evolution. In CSCW '06: Proceedings of the 2006 20th Anniversary Conference on Computer Supported Cooperative Work, pages 181-190, New York, NY, USA, 2006. ACM.

[VonAhn 2004] L. von Ahn and L. Dabbish. Labeling Images with a Computer Game. ACM Conference on Human Factors in Computing Systems, CHI 2004. pp 319-326.

[Wetzker 2008] R. Wetzker, C. Zimmermann, and C. Bauckhage. Analyzing Social Bookmarking Systems: A Delicious cookbook. In Mining Social Data (MSoDa) Workshop Proceedings, pp. 26-30. ECAI 2008, (July 2008).

[Zubiaga 2009a] A. Zubiaga, R. Mart穩nez, and V. Fresno. Getting the Most Out of Social Annotations for Web Page Classification. Proceedings of DocEng 2009, the 9th ACM Symposium on Document Engineering, pp. 74-83, Munich, Germany. 2009.

[Zubiaga 2009b] A. Zubiaga, A. P. Garc穩a-Plaza, V. Fresno, and R. Mart穩nez. Content-based Clustering for Tag Cloud Visualization. Proceedings of ASONAM 2009, International Conference on Advances in Social Networks Analysis and Mining. 2009.

[Zubiaga 2009c] A. Zubiaga. Enhancing Navigation on Wikipedia with Social Tags. Wikimania 2009. Buenos Aires, Argentina. 2009.

Last edited on December 7, 2009 (Christian Körner, Markus Strohmaier)


A List of Social Tagging Datasets Made Available for Research的更多相关文章

  1. Install SharePoint 2013 on Windows Server 2012 without a domain

    Any setup of Team Foundation Server is not complete until you have at least tried t work with ShareP ...

  2. paper 118:计算机视觉、模式识别、机器学习常用牛人主页链接

    牛人主页(主页有很多论文代码) Serge Belongie at UC San Diego Antonio Torralba at MIT Alexei Ffros at CMU Ce Liu at ...

  3. Datasets for Data Mining and Data Science KDDCUP-2012官网 ...

  4. (转载)公开的海量数据集 Public Research-Quality Datasets

    转载自: 海量数据数据集 海量数据(又称大数据)已经成为各大互联网企业面临的最大问题,如何处理海量数据,提供更好的解决方 ...

  5. Set up VLAN (802.1q) tagging on a network interface?

    SOLUTION VERIFIED October 13 2015 KB39674 KB741413 environment Red Hat Enterprise Linux 4 Red Hat En ...

  6. 讲座:Influence maximization on big social graph

    Influence maximization on big social graph Fanju PPT链接: social influence booming of online social ne ...

  7. Spark 官方文档(5)——Spark SQL,DataFrames和Datasets 指南

    Spark版本:1.6.2 概览 Spark SQL用于处理结构化数据,与Spark RDD API不同,它提供更多关于数据结构信息和计算任务运行信息的接口,Spark SQL内部使用这些额外的信息完 ...

  8. Call for Papers IEEE/ACM International Conference on Advances in Social Network Analysis and Mining (ASONAM)

    IEEE/ACM International Conference on Advances in Social Network Analysis and Mining (ASONAM) 2014 In ...

  9. 微信小程序想要的是无法监测的流量dark social

    “微信小程序”将带来什么样的变化?就单单的是一个超级Web app?还是只是为了给大家手机节省一些空间?腾讯想要的是高达70%以上的“无法监测的巨大流量”,称之为“dark social”(暗社交). ...


  1. div+css页面右侧底部悬浮层

    效果体验: 效果图: 代码如下: <!DOCTYPE html> <html> <head> ...

  2. MYSQL的常用命令和增删改查语句和数据类型【转】

    连接命令:<a href="" class='replace_word' title="MySQL知识库 ...

  3. mysql 5.7.16 远程连接

    1登录到mysql中,为root进行远程访问的授权,所以主机都可以链接 GRANT ALL PRIVILEGES ON . TO 'root'@'%'IDENTIFIED BY 'mypassword ...

  4. HDU 5687 字典树插入查找删除

    题目: 2016百度之星资格赛C题,直接套用字典树,顺便巩固了一下自己对字典树的理解 #include< ...

  5. Android 开源库和项目

    1.手势解锁 史上最完美的 手势密码解锁 2.数据库操作 Android数据库框架itePal 轻量级数据库:a ...

  6. 淘宝(阿里百川)手机客户端开发日记第八篇 Handler的使用方法

    首先,我们先看下API文档的说明: A Handler allows you to send and process Message and Runnable objects associated w ...

  7. 在windows下使用cmd命令行对java文件进行编译和执行

    windows下利用cmd命令行可以调用jdk里的javac.exe和java.exe对java文件进行编译和执行,前提是jdk已成功安装并正确配置相关环境变量 相关配置链接:java基础学习总结—— ...

  8. 解决maven的“Dynamic Web Module 3.0 requires Java 1.6 or newer.”错误

    需要添加一个插件,在build标签中添加 <plugins> <plugin> <groupId>org.apache.maven.plugins</grou ...

  9. 2012 #5 Gold miner

    Gold miner Time Limit: 4000/2000 MS (Java/Others)    Memory Limit: 32768/32768 K (Java/Others)Total ...

  10. MySQL性能优化的21个最佳实践 和 mysql使用索引

    1. 为查询缓存优化查询 当有很多相同的查询被执行了多次的时候,这些查询结果会被放到一个缓存中,这样,后续的相同的查询就不用操作表而直接访问缓存结果了. 2. EXPLAIN   SELECT 查询 ...