Christian Körner and Markus Strohmaier, from the Graz University of Technology in Austria, have published an article on social tagging datasets in the latest issue of the ACM SIGWEB Newsletter. The article lists existing social tagging datasets, including three of ours: DeliciousT140, Wiki10+ and Social-ODP-2k9.
Have a look at it!
Posted in research, social-bookmarking, social-tagging, tagging.
Tagged with datasets, newsletter, paper, research, sigweb, social-tagging.
Wiki10+, a dataset of 20,764 annotated English Wikipedia articles, is now available for download. Each article comes with its corresponding social tags retrieved from Delicious.
The dataset, including both articles’ content and social tags, can be downloaded from here.
Posted in research, social-bookmarking, social-tagging, tagging.
Tagged with dataset, download, english, social-tagging, wikipedia.
I have recently built a prototype that integrates a tagging system into Wikipedia. I prepared it for my presentation at Wikimania 2009, entitled “Enhancing Navigation on Wikipedia with Social Tags”. The prototype allows us to evaluate whether a social tagging system would improve article navigation and search on Wikipedia. Visit the prototype.
Posted in research, social-tagging, tagging, web2.0.
Tagged with improvement, navigation, prototype, research, search, social-tagging, web2.0, wikimania, wikipedia.
If you are interested in my presentation “Enhancing Navigation on Wikipedia with Social Tags”, given at Wikimania 2009, you can download the slides using the following links:
English
Spanish
Posted in research, social-tagging, web2.0.
Tagged with categorization, folksonomy, navigation, presentation, slides, social-tagging, taxonomy, wikimania, wikipedia.
My paper “Enhancing Navigation on Wikipedia with Social Tags” has been accepted for publication and presentation at Wikimania 2009, the 5th International Conference of the Wikimedia Foundation to be held in Buenos Aires, Argentina, from August 26 to 28, 2009.
Abstract
Social tagging has become an interesting approach to improve search and navigation over the current Web, since it aggregates the tags added by different users to the same resource in a collaborative way. The result is a list of weighted tags describing that resource. Combined with a classical taxonomic classification system such as that of Wikipedia, social tags can enhance document navigation and search. On the one hand, social tags suggest alternative ways of navigating, including pivot-browsing, popularity-driven navigation, and filtering. On the other hand, they provide new metadata, sometimes not covered by the documents’ content, that can substantially improve document search. In this work, the inclusion of an interface to add user-defined tags describing Wikipedia articles is proposed, as a way to improve article navigation and retrieval.
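To make the idea of a weighted tag list concrete, here is a minimal sketch in Python. It is only an illustration, not the code behind the paper or the prototype: the sample annotations, the `weighted_tags` function and the choice of a plain count as the weight are all assumptions made for this example.

```python
from collections import Counter

# Hypothetical sample data: (user, resource, tag) annotations.
annotations = [
    ("alice", "Buenos_Aires", "argentina"),
    ("bob", "Buenos_Aires", "argentina"),
    ("carol", "Buenos_Aires", "travel"),
    ("alice", "Buenos_Aires", "city"),
    ("bob", "Buenos_Aires", "city"),
    ("dave", "Buenos_Aires", "argentina"),
]


def weighted_tags(annotations, resource):
    """Return (tag, weight) pairs for a resource, heaviest first.

    The weight is assumed to be the number of annotations that
    attach the tag to the resource.
    """
    counts = Counter(tag for _, res, tag in annotations if res == resource)
    return counts.most_common()


print(weighted_tags(annotations, "Buenos_Aires"))
```

The heaviest tags in such a list are the natural candidates for popularity-driven navigation, while any single tag can serve as a pivot to other resources sharing it.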
Posted in research, social-bookmarking, social-tagging, tagging, web2.0.
Tagged with categorization, information-retrieval, navigation, paper, research, search, search-engine, social-bookmarking, social-tagging, web2.0, wikipedia.
Our paper “Getting the Most Out of Social Annotations for Web Page Classification” has been accepted for publication and presentation at DocEng 2009, the 9th ACM Symposium on Document Engineering to be held in Munich, Germany, from September 15 to 18, 2009.
Abstract
User-generated annotations on social bookmarking sites can provide interesting and promising metadata for web document management tasks like web page classification. These user-generated annotations include diverse types of information, such as tags and comments. Nonetheless, each kind of annotation has a different nature and popularity level. In this work, we analyze and evaluate the usefulness of each of these social annotations to classify web pages over a taxonomy like that proposed by the Open Directory Project. We compare each of them separately to content-based classification, and also combine the different types of data to improve performance. Our experiments show encouraging results with the use of social annotations for this purpose, and we found that combining these metadata with web page content improves the classifier’s performance even further.
Posted in research, social-bookmarking, social-tagging.
Tagged with annotations, classification, odp, paper, research, social-bookmarking, social-tagging, user-generated.
I’ve recently read the book “Tagging: People-Powered Metadata for the Social Web”, by Gene Smith, an interesting overview of the art of tagging. I would recommend it to anyone interested in discovering what tags mean, and even to experts willing to deal with tagging in depth. Here is a brief summary of the topics covered by the book:
- What is Tagging?: As an introduction, the book offers an interesting overview on tagging, letting you discover what it is and its advantages.
- The Value of Tagging: Why do people tag? Why does a website/intranet need a tagging system?
- Tagging System Architecture: You will learn that a tagging system involves users, resources and tags. The relations between them and their features are also presented.
- Tags, Metadata, and Classification Systems: Using tags as metadata, and how they differ from a classical taxonomic system.
- Navigation and Visualization: Advantages of a tagging system for navigating and visualizing a website’s content, showing newer ideas like tag clouds. Geotagging is also presented in this chapter.
- Interfaces: Some tips on implementing a user-friendly interface for a tagging system. How to make it easy for users to tag a resource, with or without tag recommendations, how to separate tags (spaces, commas, etc.), and much more.
- Technical Design: Some technical tips, such as designing the database for a tagging system, and using the open-source tagging plug-in FreeTag to ease this work.
- Appendix A – Case Study: Social Bookmarking: A brief history and some other ideas on social bookmarking sites.
- Appendix B – Case Study: Media Sharing: Tagging for rich media, such as images and videos.
- Appendix C – Case Study: Personal Information Management: How to manage personal information with tags.
Strongly recommended!
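As a small illustration of two of the topics above (the users/resources/tags model and the question of how to separate tags), here is a minimal sketch in Python. It is my own toy example, not the design given in the book and not FreeTag's API: the `parse_tags` function, the `TaggingSystem` class and the decision to accept both commas and spaces as separators are all assumptions.

```python
import re


def parse_tags(raw):
    """Split raw input on commas and/or whitespace.

    Tags are lowercased and duplicates are dropped, keeping the
    order in which they first appear (an assumed policy).
    """
    tags = []
    for tag in re.split(r"[,\s]+", raw.strip()):
        tag = tag.lower()
        if tag and tag not in tags:
            tags.append(tag)
    return tags


class TaggingSystem:
    """Stores tag assignments as (user, resource, tag) triples."""

    def __init__(self):
        self.assignments = set()

    def tag(self, user, resource, raw):
        """Record every tag found in the raw input string."""
        for t in parse_tags(raw):
            self.assignments.add((user, resource, t))

    def resources_for(self, tag):
        """Return the resources annotated with a tag, sorted by name."""
        return sorted({r for _, r, t in self.assignments if t == tag})


system = TaggingSystem()
system.tag("alice", "http://example.org/a", "Python, web2.0  python")
system.tag("bob", "http://example.org/b", "python tagging")
print(system.resources_for("python"))
```

Accepting both separators is the forgiving choice for users, but it rules out multi-word tags; a real system would have to pick one convention and say so in the interface.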
Posted in social-bookmarking, social-tagging, tagging.
Tagged with book, folksonomy, reading, social-bookmarking, social-tagging, tagging.
Our paper “Clasificación de Páginas Web con Anotaciones Sociales” (“Web Page Classification with Social Annotations”) has been accepted for publication and presentation at SEPLN 2009, the 25th Annual Conference of the Spanish Society for Natural Language Processing, to be held in Donostia-San Sebastián from September 8 to 10, 2009.
Abstract
User-generated annotations on social bookmarking systems can provide interesting and very useful metadata for web page classification. These annotations include diverse types of information, such as tags and comments. Nonetheless, each type of annotation has a different nature and popularity level. In this work, we analyze and evaluate the usefulness of each of these social annotations to classify web pages over a taxonomy like that of the Open Directory Project. We compare each of them separately to content-based classification, and we also combine them. Our experiments show promising results with the use of social annotations for this purpose. They also indicate that combining the annotations with the textual content improves classification performance.
Posted in research, social-bookmarking, social-tagging.
Tagged with classification, paper, research, social-annotations, social-bookmarking, social-tagging, svm.
My colleague Alberto P. García-Plaza and I have recently recorded a radio talk on social networks, entitled “Nuevas tendencias en las tecnologías de la información: Las redes sociales” (“New Trends in Information Technologies: Social Networks”). You can download it in mp3 or ogg format (in Spanish).
Posted in web2.0.
Tagged with radio, social-networks.
Next Thursday, May 7th, I will present in Basque our recent work “Etiketa-lainoen Ikuskera Hobetzeko Multzokatzea” (“Clustering to Improve the Visualization of Tag Clouds”) at Informatikari Euskaldunen Bilkura ’09 (Basque Computer Scientists’ Conference ’09), which will be held in Donostia, Basque Country.
Abstract
Social bookmarks have become an interesting resource for retrieving data previously annotated by users. Websites of this kind show a tag cloud made up of the most frequently used tags as a navigation option. These tag clouds, however, take into account neither the similarity between tags nor the content. In this work, we present a method that uses SOM maps to define relations between tags. To define these relations, we rely on the content of the web documents. The resulting map is therefore made up of different tag clusters, and showing the most significant terms in each cluster improves navigation and visualization. Finally, we analyze the real-world application this methodology could have.
Posted in research.
Tagged with paper, research, social-tagging, tag-cloud.