EBlogger

October 31, 2005

“Britannica gets it”

Filed under: web2.0, britannica

CRN columnist/blogger Ed Moltzen writes that “Britannica gets it”, with respect to Web 2.0, although he’s seen very little yet.

New Britannica.com for Halloween

Filed under: britannica

There’s a new Britannica.com up today, with flashy feature articles, quizzes, quotes, a student center, NYT and BBC news headlines, and more. Here’s a screenshot:

Screen shot of Britannica.com Homepage from 31-Oct-2005

October 26, 2005

EB Editorial Board of Advisors Member Among Prospect/Foreign Policy’s Top Ten Public Intellectuals

Filed under: britannica

Indian economist Amartya Sen (also here) was named among the world’s top ten public intellectuals in a poll by The Prospect and Foreign Policy. Fellow Editorial Board of Advisors member author and activist Wole Soyinka (also here) was among the nominees.

October 24, 2005

On Vandalism

(See this post for a little bit of context.)

Vandalism is a problem, Wikipedians are quick to assert, but one that is solved by constant vigilance–Wikipedians are watching recent changes “like hawks”.

“Yes, vandalism is common on Wikipedia,” we read in the recent collaboratively edited press release, “but Wikipedia heals quickly.” After all, “IBM researchers found that most vandalism on Wikipedia was reverted in less than five minutes.”

We see this statement frequently repeated at Wikipedia and elsewhere.

Most vandalism on Wikipedia is reverted in less than five minutes. Let us assume, for the moment, that that statement is true. Does it imply that vandalism is a solved problem for wikipedia? Well, no. Suppose that 99 out of every 100 articles that get vandalized are reverted within 24 hours. Then there is more vandalism in Wikipedia today than there was yesterday. Without knowing the rate that un-corrected vandalism is added to Wikipedia, it is entirely possible that the percentage of vandalized articles is greater today than it was yesterday. The rate at which most vandalism is reverted isn’t the right question to ask, we should be concerned with whether the amount of vandalism is shrinking or growing.

But it gets worse than that. Most vandalism on Wikipedia is reverted in less than five minutes. Is that a meaningful thing to say? In order to know that most vandalism is reverted within minutes, wouldn’t we need to identify all vandalism, at least for a representative sample of Wikipedia articles? At best what we really mean is that most known vandalism is reverted in less than five minutes. Unknown vandalism is, well, unknown.

But wait–there’s more. Most vandalism on Wikipedia is reverted in less than five minutes. Did IBM researchers actually say that? Well, no. As far as I can see, the article to which everyone links seems to have only one paragraph on vandalism, which reads as follows:

“As publicly editable sites, Wikis are vulnerable to vandalism. We’ve examined many pages on Wikipedia that treat controversial topics, and have discovered that most have, in fact, been vandalized at some point in their history. But we’ve also found that vandalism is usually repaired extremely quickly–so quickly that most users will never see its effects. The pictures below tell the story.”

The “pictures below” are:

“Visualizing every saved version of the page on “abortion”, with each version getting equal space. The vertical black interruptions indicate times when a visitor has deleted most of the page.”



and

“Same page on “abortion”, but here horizontal spacing corresponds to time, so that rapid-fire changes show up almost on top of each other. Because vandalism is repaired so quickly, it does not show up in this view of the visualization”



Wait a minute. The IBM tool visualizes (a) the number of lines in the article and (b) who created those lines. It doesn’t give any insight at all into the content of those lines. It seems that they’ve defined “vandalism” as “deleting most of the page”, and that in articles they’ve examined this is usually repaired “extremely quickly”. Wikipedian’s don’t even enumerate “deleting most of the page” on their list of common types of vandalism.

Where’s the “most vandalism” part? Or even the “five minutes” part? What IBM researchers really say is that for the controversial articles they have examined, page-wipes are restored quickly.

It seems that this “IBM researchers found most vandalism on Wikipedia is reverted in less than five minutes” line is a complete myth: IBM researchers didn’t actually make that claim, it’s not a meaninful claim to make, and it doesn’t really tell us anything at all about the volume of vandalism within Wikipedia.

October 21, 2005

Encyclopedia on a stick

Filed under: for-sale, technology, mobile

One of my co-workers has been pushing a similiar idea for a while, but it looks like Brockhaus has beaten us to it: Brockhaus’ 21st edition is now available on a 1 GM USB stick. Great stuff, but are they really charging 1,500 euros for it?

[Via Gizmodo.]

October 20, 2005

Wikipedians on Quality

Filed under: web2.0, wikipedia

In a recent post to a Wikipedia mailing list, Wikipedia co-founder Jimmy Wales described Nick Carr’s post on “The amorality of Web 2.0″ (which I, along with much of the blogosphere previously linked to) as “a valid criticism” and agreed that “the two examples [Carr] puts forward are, quite frankly, a horrific embarassment” and “nearly unreadable crap”.

This sparked several uncharacteristicly self-critical responses from Wikipedians:

Although the raw numbers [of editors] are large, the number of articles is even larger, and so there are not enough editors to go around. […] Where are all the subject-matter experts?

We’d like to think that it’s inevitable we’ll asymptotically approach high quality, as Tony defended with [[Eventualism]]. But I think it’s too simplistc.

In my view, wikipedia has to undergo a paradigm change if it really wants to succeed in creating a good encyclopedia. […] We shouldn’t give up the principle of open editing but we should make clear now from the beginning that we seek good writers and knowledgeable people, not anyone. Yes, anyone can edit an article. But not anyone should edit any article.

If Robert Henry [sic] is right (and judging by a number of fine articles now laying in ruins I suspect he is), then WP, should it desire to get finer control on article quality, needs to modify its “completely open” model a little bit.

[Via Andrew Orlowski at the Register]

October 19, 2005

Britannica’s Newsletters

Filed under: britannica

Pastor Aaron cites EB’s “well done” monthly newsletters. Visit newsletters.britannica.com to see archives of these topically arranged articles, or sign up to have them delivered via email. Thanks for the link love, Aaron!

October 17, 2005

Ping-Pong Diplomacy

I have a couple of search ‘bots that track the use of “Britannica” and related keywords in the blogosphere. These frequently find “spam blogs” created by a robot to target specific ad-sense or YPN keywords. These fake blogs will crib content from other sites that seem to be related to their keywords, in hopes of drawing context-sensitive text ads that offer high rate of return. Since Britannica covers a broad set of topics, they frequently copy content from EB’s site.

Today I stumbled across one of these spam blogs that targets, of all things, the keyword “ping pong” (yes, as in “table tennis“). A search on google for “ping pong” currently shows nine “Sponsored Links”, so perhaps that is not such a funny idea after all. (I’m not going to link to it, as I don’t want to reward the behavior.)

This particular entry cribbed from an interesting article from the Britannica Student Encyclopedia on Ping-Pong diplomacy: “an episode that occurred in 1971, as the United States was just beginning to restore normal relations with the People’s Republic of China after more than 20 years. As a thaw in relations between the two countries was becoming evident, the Chinese government invited the United States table tennis team […] to visit Beijing and play in exhibition matches. […] The American team lost its exhibition matches […] but the Chinese team was invited to visit the United States. China’s government also allowed American and Canadian newspaper and television reporters into the country to cover the event. Within a year, Nixon himself visited China, and normal diplomatic relations were restored within the decade.”

October 13, 2005

More bootleg Britannica

Filed under: britannica, for-sale

Get ‘em while they’re, umm, hot.

If you are interested in the real thing, you might visit the britannica store, amazon.com or your local retailer.

October 11, 2005

Wikipedia is not Open Source

(See this post for a little bit of context.)

Wikipedia and other “open content” initiatives are often lumped together with “open source” projects.

For instance, a Google search on “wikipedia open source” currently finds over 8 million hits. The expression “open source encyclopedia” currently finds more that 12 million. Wikipedians themselves are fond of drawing a comparision to open source projects, invoking Linus’s Law (also here), citing a benevolent dictator, or comparing the project to Linux or the Apache Web Server.

While the Wikipedia is certainly “open” for editing and is made available under a license derived from one used for open source software, it is managed differently than every every open source project on the planet, at least every one I’m aware of.

In an open source software project, one is free use the software, to obtain and examine the software’s source code, to modify it locally, and with various limitations, to redistribute it in binary or source form. One is encouraged, and in some circumstances required, to make his modifications available for others to use. But there is always someone, or a team of someones, who acts as the maintainer of the software. In the case of the Linux kernel, it was for a long time a single individual, and is now that individual and team of trusted lieutenants. In the case of the Apache Web Server, it is the “Project Management Committee”, a group, in principal, of the most meritorious contributors (who approve new members by unanimous vote). While there are many contributors to each project, and many proposed contributions, there is always someone—a maintainer, a gatekeeper, an authority, an expert, that reviews and approves each contribution.

While I’ve never followed the day-to-day Linux development, I can tell you that at the Apache Software Foundation there is an extensive, formal, and documented process to ensure that every contribution is carefully reviewed. The Foundation is legally accountable for certain types of copyright and patent infringement, and prides itself on the quality of the software it produces. Reviews, and the “web-of-trust” that determines who is qualified to do such a review, are an important part of the Apache development process. Presumably it is not a coincidence that this process produces the most popular web server in the world, and one that is remarkably secure, robust and stable.

The absence of gatekeepers is not a new complaint about Wikipedia. The obvious retort, of course, is that other contributors will review changes after the fact. This is sometimes known as a “commit then review” protocol in open source circles. But open source projects only allow commit-then-review contributions from a trusted few. The Wikipedia review process, by allowing arbitrary commit-then-review contributions, assumes (a) that someone is actually reviewing the contribution, and that (b) that someone is capable of performing an informed review of that contribution. It is possible for both of these assumptions to be correct. It is worth noting, however, thus far at least, these are unproven assumptions.

The presence of errors within the Wikipedia (and let’s be honest, the presence of more errors than virtually any “traditional” encyclopedia)–despite its impressive popularity–makes one wonder just how many eyeballs are needed before all bugs become shallow.

Update [11 Oct 2005 20:03 GMT]:

Based on comments here and elsewhere, I seem to have either riled or confused some folks, so perhaps I wasn’t quite clear. Let me restate the above as follows:

1) When people (including Wikipedia contributors) talk about Wikipedia they often appeal to a comparision to open source, and ascribe aspects/virtues of open source intitiatives to Wikipedia.

2) Wikipedia is organized differently than other “open” projects, in the sense that every open source project (as opposed to open content) maintains a gatekeeper in one form or another, while Wikipedia does not.

3) As a result, some the aspects ascribed to Wikipedia via the comparision in point #1 may not apply. Since (among many differences) they follow a different review process, things that are true about Linux or httpd may not be true about Wikipedia.

In other words, the essence of Wikipedia may be different than that of open source projects. (In fact, the essence of Wikipedia is much more like that of Ward’s Wiki than many would seem to like to admit.)

Get free blog up and running in minutes with Blogsome | Theme designs available here