FUTURELAB

Home   -   Services   -   About us   -   Team   -   Business and Games Blog   -   Publications

« Perception Pyramid | Main | Blogging ROI Book to be Issued by Blog Council »

The Future of: Wikipedia

by: Iqbal Mohammed

A recent post by Doc Searls narrating the near-deletion experience of his Wikipedia entry set me thinking about the debate between Wikipedia inclusionists and deletionists.

To paraphrase the debate, the inlcusionists believe that since "Wikipedia is not paper" and has no space constraints, it should contain as many articles as its contributors are willing to produce - no matter how trivial they are. Deletionists on the other hand believe that Wikipedia should follow a more stringent editorial policy and ban articles on trivial subjects - something they believe will make it a credible and trustworthy source of reference.

As the Economist article above goes on to explain, Wikipedians have created a complex quagmire of rules to judge what makes an article trivial or non-trivial. The fate of a Wikipedia article nominated for deletion rests on the application and re-application of these rules, draining deliberations and debates - and if an entry fails this torturous process, it finds itself walking the plank (as it happens, to an afterworld called Deletionpedia.)

It is a sign of Wikipedia's growing importance that a crippling bureaucracy is developing around it - apparently, entries about governance and editorial policies comprise around a quarter of its content. To most observers, this regulation and law-mongering is good news - it will make Wikipedia a bona-fide encyclopaedia, an illegitimate child finally given the legitimacy of the family name (ironically, as Britannica itself crosses over.)

But, as I have argued elsewhere, the problem lies in defining Wikipedia as an encyclopaedia - or at least in comparing it to one. In my opinion, Wikipedia was never an encyclopaedia  - it is (and should remain) a marketplace for information, where buyers and sellers meet and trade information.

Every contributor to Wikipedia brings along a bundle of information - information that either makes an entire entry of its own, or is a cog in a bigger entry. When the contributions of various contributors conflict, Wikipedia's negotiation dance kicks in - the discussion page becomes a hotbed for the deliberations and debates discussed above. Hard as it is to believe, over time this protracted negotiation does result in an unbiased and objective entry.

While this negotation works well to resolve conflicting information, I am not convinced it works as well to decide upon the triviality or non-triviality of an entry.

A long tail marketplace of the kind Wikipedia is, instead, should not decide triviality and non-triviality by itself. It should leave it to the buyers to make that decision for themselves - and strive instead to make the meta-data (the information about the information) transparent to its users.

What that would mean is to append to every page with information about its use - the number of people visiting it, for example. An entry - no matter how detailed and complete - with no visitors is trivial. Correspondingly, a stub with lots of visitors is not. That decision shouldn't at all be an administrator's to make - no matter how stringent and bureaucratic the process aiding and abetting him.

Providing visitor numbers for each entry (in one or more ways - raw numbers, comparative colour-coded gradings, percentile figures, numbers benchmarked against the most popular entry in the category, etc) will also enable users to figure out the probablity of the accuracy of an entry. The more viewers a page has, the more likely that its going to be accurate, thanks to Linus' law - "given enough eyeballs, all bugs are shallow". (I use IMDb extensively, always aware that its Bollywood information can be buggy while everything about mainstream Hollywood fare is near authoritative.)

Finally, this transparency about visitor numbers for each entry sends a valuable signal to the sellers in the marketplace. Those contributors slaving over those biographies of Pokémon characters (more than 500 detailed character biographies at last count) will probably be persuaded to abandon those entries and contribute instead to biographies of the leaders of Poland's Solidarity movement (currently, only a handful of poorly edited entries.)

Or more likely, the numbers will probably convince them of saleability of their wares.

(The likelihood of such a shift towards transparency occuring in Wikipedia's near future seem bleak though. The Economist reports that Wikipedia has not been gathering and disclosing figures about user-activity on the site for more than a year - probably because they reveal unpleasant truths.)

The real battle for Wikipedia's soul does not lie in the include/exclude skirmishes currently taking place at its frontiers. It will be fought - if ever - within the very entry that defines what Wikipedia is. And when the word encyclopaedia is dropped from its definition, Wikipedia will be free to become what it was meant to be.

Original Post: http://www.misentropy.com/2008/10/the-future-of-wikipedia.html

TrackBack

TrackBack URL for this entry:
http://www.futurelab.net/mt/mt-tb.cgi/4604

2 Comments

Richella said:

Wow green-on-green rollovers. How can you have the audacity to write about marketing and strategy when your accessibility is shot to shit. Come on, I want to take you seriously.

Nihiltres said:

A few notes:

  • Deletionpedia is not official in any way; it's merely the biggest endeavour of its kind. Someone with deleted-article access (i.e. an administrator) is running a bot to retrieve deleted articles. Many deleted articles do not go to Deletionpedia, especially those which are copyright violation, libel, et cetera. You can request copies of deleted articles from most Wikipedia administrators, who will do it happily so long as the article's content isn't problematic.

  • Wikipedia does have statistics, just they don't do it themselves, for the obvious reason that they can't spare the server resources. A third-party site tracks page-view statistics and has made them available and easily searchable. Wikipedians would love to have more statistics, and have even added a number of new statistics to the software lately: for example, Special:Statistics includes a new entry "active users" listing the number of unique users to have made at least one action in the past 30 days. I'm particularly familiar with the addition of this feature as I personally added the descriptive text about how it's determined to the interface. Regardless, statistics generally takes a back seat to the running of the rest of the site, and increasing need for servers generally results in some of the more "nice-to-have, but not essential" features getting disabled. If you want to help solve this, donate to the Wikimedia Foundation and encourage others to do the same. You can even specify that your donation go towards statistics-related improvements in hardware and software.

  • The market and trade analogies are difficult to swallow: if anything, Wikipedia is a gift economy, where everyone donates to a common pool of a non-commoditizable resource.

  • While the notability system is contentious and has never quite worked, it does solve many problems with original research, vanity articles, spam, and other issues. The parent policy from which it derives, Wikipedia:Verifiability, has surely greatly increased Wikipedia's evident reliability, and repeated discussions have upheld its usefulness. While it's surely annoying as hell, it's often the only way to justify that something that obviously doesn't belong be excluded. Other methods, such as visitor-number measurement, aren't effective in practice. For example, the visitor-number model fails to distinguish between unworthy topics and obscure topics: is "Geoffrey Chaucer" less important than "Penis" because more immature teenagers view the latter? It's easy to criticize the current system (and the critics are usually correct), but I have yet to see a viable alternative.

Leave a comment