Journal

Tags and Folksonomy - I need your help

  • Posted on: July 25, 2005
  • In: Design
  • Comments closed

I've got some projects on at the moment which are requiring me to think long and hard about tags, metadata and 'folksonomies' (of the Del.icio.us and Flickr kind). I'm getting nowhere fast in terms of user requirements for tags and was wondering if you could help me out.

For those who are unfamiliar with what I'm talking about here's a brief description from wikipedia:

Folksonomy is a neologism for a practice of collaborative categorization using freely chosen keywords. More colloquially, this refers to a group of people cooperating spontaneously to organize information into categories. In contrast to formal classification methods, this phenomenon typically only arises in non-hierarchical communities, such as public websites, as opposed to multi-level teams. Since the organizers of the information are usually its primary users, advocates of folksonomy believe it produces results that reflect more accurately the population's conceptual model of the information...

I like tags, but that's not to say eveyone does, and I can see tagging content is useful from a content management point of view, but is it really useful from a user's perspective?

From what I can see, from the likes of Flickr, is that content is organised by the user, by keywords. The information structure is then 'created' by this tagging which all very nice. What i'm really struggling with is, does the user then use this structure to navigate? and I'm coming round to the answer being 'no'. Let me explain.

User tagging content is a submission process - If the user's can be bothered or if they have to. User tagging content will of course have problems with users having to choose the same descriptive words for things (maybe the UI could benefit with contextual suggestions here from previously submitted content). So, I've got a few questions and I'm not sure where to get informed answers (what I mean by this is based on research) Here's some questions:

  1. Do users navigate by these keywords?
  2. Does folksonomy tagging facilitate task based navigation?
  3. How does the UI affect this navigation.

I guess the beauty of tagging in this way (and presenting it in a good UI, like a 'tag-cloud') is that it helps facilitate browsing rather than finding, two very different types of navigation. Anyway, I'm really rambling now.

What I'm after is usability reports and the like on metadata and user generated tagging. If you know of any can you point me the right direction? Thanks.

So, what do you think? Are user generated tags good? How can the UI be changed in user generated tags to help facilitate task, rather than browse, based navigation?

Comments

Hmm, I often use tags to navigate through posts on some very large sites, albeit, I don’t often find them any more useful than categories. Except on the sites you mentioned. The likes of flickr, delicious and wists are really made better by tagging.

SteveC's Gravatar

SteveC
Mon 25th Jul 2005
at 10:35 am

Mark - I’m a category freak (probably was a librarian in a former life), but I don’t get much use out of tags. In flickr, they’re handy, but I wouldn’t miss the tags feature.

In del.icio.us I think it’s almost too easy to create new tags for yourself and you run the risk of going overboard. You could waste an awful lot of time micro-managing your link tags!

Gerard McGarry's Gravatar

Gerard McGarry
Mon 25th Jul 2005
at 11:47 am

tags! It’s something we’ve also been wrestling with here at Glamorgan. So far we seem to be using them purely on the CMS side of things. For example, events tag cloud is an admin side view of our tagging of ‘events’. For the user side we tend to replace them with ‘expanded information’, e.g.  A page from our site, below the fold we provide a “you may also be interested in ..” section, which is tag driven.

For pure tag clouds we so far are only experimenting with it on our soon to be released blog portal, and there are certainly some reservations within the team.

Kevin Evans's Gravatar

Kevin Evans
Mon 25th Jul 2005
at 11:47 am

I think the blog post (the link from Kevin) makes a solid point about tagging. It’s a difficult concept when your tagging text.

I think in many situations using solid user research to develop a taxonomy is much more efficient than relying on user supplied meta data about the content.

But it depends of course on the nature of the content. If it’s 100% user supplied then it can have more weight...after all you have no idea what the content IS.

Ryan Nichols's Gravatar

Ryan Nichols
Mon 25th Jul 2005
at 4:46 pm

I think tags are a good idea and we’ve seen some very good examples of how they can be used well, but all the hullabaloo about them far outweighs their actual usefulness.

On Flickr, for example, the reason tags work is because there are so many people doing it that some generally-useful patterns emerge, and because the individual content units aren’t particularly important by themselves.  On a smaller site (e.g., a corporate intranet), that wouldn’t necessarily be the case, because much-needed documents might be poorly tagged by a well-meaning employee (e.g., “tpsreports” instead of “tps").

On projects like that where it’s important that individual content pieces not be lost, and where the audience and content quantity isn’t large enough for patterns to emerge despite widely-varying methodologies, a more formal approach would probably be more appropriate.

It’s hard to give more specific recommendations without knowing more about what you’re doing, but I’d strongly recommend that you check out faceted classification as an option.

James Archer's Gravatar

James Archer
Mon 25th Jul 2005
at 6:14 pm

James - You’re right about tags being dependent on a large group of people submitting, which in turn generates patterns. The projects I’m dealing with wouldn’t necessarily have this turnover of content.

I’ll give faceted classification a look.

Mark Boulton's Gravatar

Mark Boulton
Mon 25th Jul 2005
at 8:47 pm

Two things to consider.

Who is doing the tagging? If the content author does it then the folks aspect of folksonomy is mostly lost and James’ point about quantity is magnified.

Are the tags used as the only subsequent means of access (as is currently the case with delicious) or does the system support search as well (Yahoo MyWeb, RawSugar--where I work)?

Bill Lazar's Gravatar

Bill Lazar
Mon 25th Jul 2005
at 11:39 pm

I’m a PhD student currently studying this whole, messy, area of user-centred categorisation. I think James Archer summed it up well. Folksonomies work when lots of people find that tagging benefits them each individually. Then you can get interesting/useful information from the sheer volume of people tagging for their own benefit. You can then do wonderful things with statistics and probabilities and whatnot.

However, with corporate intranets and knowledge management systems, etc., you often don’t have enough users to make free-tagging useful. You could however, allow tagging from a restricted vocabulary to prevent the synonym-tagging problem (eg. what one person calls ‘tps’ another calls ‘tpsreport’). The tricky thing is getting this restricted vocabulary to match the vocabulary that the users actually use, rather than what the designer thinks the users use, or even what the managers think the users use. Some writers suggest that you allow free-tagging initially, then come back and analyse the data later, and consolidate the vocabulary into a finite list of allowable tags/categories. This may not be practical in all situations however.

J R Sinclair's Gravatar

J R Sinclair
Mon 25th Jul 2005
at 11:55 pm

Bill’s point is good: who’s doing the tagging? One thing that annoys me about folksonomy systems is when I’m not presented (as a user) with a list of tags to choose from (I must come up with my own). I think someone mentioned that above.

I’m thinking about using tags for my next revision of my portfolio. This is because many of my projects have multiple media types or roles. BUT, I’m also going to apply categories to the “entries” - so conceivably users could navigate that way, as well. There’s also the search engine to consider - different users browse differently.

I agree that there should always be multiple paths to the information.

Allan White's Gravatar

Allan White
Tue 26th Jul 2005
at 12:13 am

The librarian within me (and I am actually a librarian) wants to say that tagging will never be as useful as a controlled vocabulary. Yet, I think that user generated descriptors will evolve thru communities such as technorati and Flickr as these systems put more thought into this area. The tag-cloud is a good visualization technique. I think that there are going to be some big developments in the UI for tagging in the coming year. And I must say that I’m noticing more new traffic on my blogs coming via tags than thru search engines. There’s an interesting discussion on tagging taking place at tagsonomy.com

Jeff Barry's Gravatar

Jeff Barry
Tue 26th Jul 2005
at 4:56 am

Considering how early we are in the use of tags there’s no doubt that better UIs will be found to express them and better ways to determine what tags to show at given points in a search path. That’s actually something our company is working hard on at the moment. I think clouds are interesting but when databases get large enough to cover a meaningful portion of the web the quantity of tags may overwhelm their usefulness.

Bill Lazar's Gravatar

Bill Lazar
Tue 26th Jul 2005
at 2:51 pm

My main problem with tagging is that it ‘gives up’ the attempt to provide a universally understood glossary. Multiple different ways to categorise the same ideas occur hence someone searching on just one of them will return a subset of data that doesn’t match what they want to find but matches posts by people who describe things in the same way. An upper ontology, however innaccurate, will at least provide a consistent (if flawed) way of categorising information. Perhaps the tags can be collected and assigned to a hierarchical categorisation structure and synonym lookup information provided, but it would seem to me that getting a simple initial category hierarchy, although more difficult and controversial, would provide better information classification as time progresses (let the users have some influence on the category hierarchies / or faceted classification schemes maybe)

Tim Parkin's Gravatar

Tim Parkin
Wed 27th Jul 2005
at 9:02 am

I think you’re right Tim. My problem with user tagging is the same as yours. Whilst tagging the information is all very well, providing an interface (search, navigation etc) to access the information is always going to be difficult for the reasons you just stated.

One way to perhaps do this is to provide, in part, the tags available to the user. That would all get very difficult to manage though, especially when you’re talking of hundreds of thousands of documents.

Mark Boulton's Gravatar

Mark Boulton
Wed 27th Jul 2005
at 9:20 am

have you looked at wikipaedia tagging? Use the wikipaedia page id’s as your tags… there are very strong definitions for what each ‘term’ means. For instance if I wanted to tag my writings on the python language I would use Python_programming_language as my wikipaedia tag (so it’s definitely not pets or monty).

Tim Parkin's Gravatar

Tim Parkin
Wed 27th Jul 2005
at 9:44 am

I think that the issue of usage will come out in the wash over time as system developers learn to use the data to find connections. There will also be better ways of presenting existing tags so they can be reused rather than retyped. To some degree our system does both already though we’re not claiming anything mroe than a start in the right direction yet.

Bill Lazar's Gravatar

Bill Lazar
Wed 27th Jul 2005
at 4:52 pm

Mark B.: Any thoughts of how one might apply folksonomy approaches to Expression Engine? I’ve been checking out the Keywords plug-in, and I wonder if you could create a UI for pre-set tags (like Tim mentioned above).

I agree that letting users tag however they wish is likely to cause confusion for others searching.

Allan White's Gravatar

Allan White
Wed 27th Jul 2005
at 10:16 pm

Allan, I’m not an ExpressionEngine user but WordPress (my preference) has several tagging plugins available and you might look at those for ideascraping.

Bill Lazar's Gravatar

Bill Lazar
Wed 27th Jul 2005
at 11:22 pm

Allan - Yeah, I’ve been thinking about tagging for this site actually but more from a keywords aspect.

What I’d like to do is turn my ‘of interest’ links into related links which can be linked to from journal entries. I’m a little bit frustrated with the links turning into an archive which is totally flat in terms of categorisation.

So, the thing now is to come up with a suitable UI which shows the relationships between the journal entries and the ‘of interest’ links.

Oh, I’m using the Linkslist plugin for this which has built in support for tags and creating tag clouds. Couple that with the Keywords plugin and you have a complete system for handling it.

The pain is migrating all the data manually…

Mark Boulton's Gravatar

Mark Boulton
Thu 28th Jul 2005
at 7:55 pm

So, the thing now is to come up with a suitable UI which shows the relationships between the journal entries and the ?of interest? links.

That’s an interesting task; what might that look like? A list of tags at the end of a post ("design, typography, Macs") that link to a link list category page? Do I understand your use case properly?

I (and others) are also curious to see how you might solve this in an EE context. There’s a dearth of quality tutorials for that deep CMS, and one reason many from that community come here!

Allan White's Gravatar

Allan White
Fri 29th Jul 2005
at 4:08 pm

[J. Sinclair] However, with corporate intranets and knowledge management systems, etc., you often don?t have enough users to make free-tagging useful. You could however, allow tagging from a restricted vocabulary to prevent the synonym-tagging problem (eg. what one person calls ?tps? another calls ?tpsreport?). The tricky thing is getting this restricted vocabulary to match the vocabulary that the users actually use, rather than what the designer thinks the users use, or even what the managers think the users use. Some writers suggest that you allow free-tagging initially, then come back and analyse the data later, and consolidate the vocabulary into a finite list of allowable tags/categories. This may not be practical in all situations however.

Idea: what if you let users submit a ‘tag request’ in this scenario? If you had a list that was ‘good enough’ for most uses, then let users submit requests for the arcane ones (’tpsreport’) to a human ‘tag curator’ for consideration.

Allan White's Gravatar

Allan White
Fri 29th Jul 2005
at 4:18 pm

Hi,

Prompted by this discussion, I’m trying a slightly different approach based on a combination of hierarchy and folksonomy (or plain metadata). Here are the steps I’m building in : (quick glossary - hcat = hierarchical category, kw = folksonomy tag or keyword)

1) scan post for words that match hcats and show them with tick boxes for selection
2) let the user add their own kws and scan them for related hcats
3) let the user tick appropriate hcats
4) allow the user the option to (i) map any remaining kws to categories (synonyms) or (ii) place the kw in hierarchy (possibly moderated)
5) store the remaining kws in a ‘metadata’ field.

step 5 can be used if people later wish to add a new category and find related items (e.g. I add Cornwall as a kw and a later users decides to create a real category for it and searches in meta data for it).

This may seem long winded but it allows people to just add keywords if they like but it helps them to add (and start to create) hierarchical categories.

Any thoughts would be appreciated.

Tim Parkin's Gravatar

Tim Parkin
Mon 1st Aug 2005
at 8:52 am

Hi Tim,

That seems like an interesting solution to a difficult problem. I’m not sure how the ticking would work within a body of text. One of my major gripes with including lots of links within text is it can hinder the reader’s eye when scanning the copy.

Be nice to see it in action.

Mark Boulton's Gravatar

Mark Boulton
Sat 6th Aug 2005
at 3:45 pm

Thanks Mark,

I did intend for the ticking to be done once the article is finished so the tick boxes would appear at the bottom of the text. This could be done whilst the article was being written (but not efficiently with a large ontology).

Tim

Tim Parkin's Gravatar

Tim Parkin
Sun 7th Aug 2005
at 10:12 am

This is something I am strugling with as well, and the comments many people have made already are helpful.

It sounds like this may be a bit basic for some (it goes ‘back’ as far as hierarchies), but I found the following article a good start: http://wiki.osafoundation.org/bin/view/Journal/HierarchyVersusFacetsVersusTags

David S's Gravatar

David S
Mon 15th Aug 2005
at 5:55 pm

Commenting is not available in this section entry.

A picture of Mark BoultonI'm a graphic designer from near Cardiff in the UK. I've been a designer for over ten years now and primarily work on the web. I'm still partial to a bit of print every now and then though. I used to work for Agency.com in London as an Art Director before working as a Senior Designer for the BBC in sunny Cardiff. This was all before I took leave of my senses and formed my own design consultancy, Mark Boulton Design Ltd.

I've got a thing about grids and typography and occasionally ramble on about them to anyone who will listen.

If you're after simple, clean and effective web design; let me know.