tag:blogger.com,1999:blog-114939542009-02-21T14:15:34.489ZCloud StreetInformation, community and the work of making sense; knowledge as an emergent property of conversation; and various other interesting things that I'm working on, have worked on or would like to work on.Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.comBlogger60125tag:blogger.com,1999:blog-11493954.post-80297354019627003862007-03-28T23:31:00.000+01:002007-03-28T23:34:56.441+01:00Strange clothes of sandThere hasn't been a lot here lately; there hasn't been a huge amount on my home weblog <a href="http://existingactually.blogspot.com/">Actually Existing</a> either, and quite a lot of what I have posted there has been tagged as work-related. So this will be the last post here; I'm merging my weblogs and taking the opportunity to leave Google and go to Wordpress. I'll see you at <a href="http://gapingsilence.wordpress.com/">The Gaping Silence</a>. (True to the name, there's nothing much there now, but I'll put some new stuff up one of these days.)<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-8029735401962700386?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com0tag:blogger.com,1999:blog-11493954.post-32137481489375993402007-02-07T13:20:00.000Z2007-02-12T11:47:20.940ZGreat big bodiesI think the thing that really irritates me about the Long Tail is just how basic the statistical techniques underlying it are. If you've got all that data, why on earth wouldn't you do something more interesting and more informative with it? It's really not hard. (In fact it's so easy that I can't help feeling the Long Tail image must have some other appeal - but more on that later.)<br /><br />As you may have noticed, this weblog hasn't been updated for a while. In fact, when I compared it with the rest of my RSS feed I found it was a bit of an outlier:<br /><br /><a href="http://www.flickr.com/photos/37723210@N00/382695131/" title="Photo Sharing"><img src="http://farm1.static.flickr.com/159/382695131_f7fdb541bd.jpg" width="500" height="354" alt="blogs2" /></a><br /><br />The Y axis is 'number of blogs': two updated today (zero days ago), 11 in the previous 10 days, 1 in the 10-day period before that, and so on until you get to the 71-80 column. Note that each column is a range of values, and that the columns are touching; technically this is a histogram rather than a bar chart.<br /><br />You can do something similar with 'posts in last 100 days':<br /><br /><a href="http://www.flickr.com/photos/37723210@N00/382695129/" title="Photo Sharing"><img src="http://farm1.static.flickr.com/157/382695129_a098940674.jpg" width="500" height="354" alt="blogs1" /></a><br /><br />This shows that the really heavy posters are in the minority in this sample; twelve out of the eighteen have 30 or fewer posts in the last 100 days.<br /><br />So it looks as if I'm reading a lot of reasonably regular but fairly light bloggers, and a few frequent fliers. If you put the two series together you can see the two groups reflected in the way the sample smears out along the X and Y axes without much in the middle:<br /><br /><a href="http://www.flickr.com/photos/37723210@N00/382695132/" title="Photo Sharing"><img src="http://farm1.static.flickr.com/188/382695132_f414e54150.jpg" width="500" height="354" alt="blogs3" /></a><br /><br />My question is this. If you can produce readable and informative charts like this quickly and easily (and I assure you that you can - we're talking an hour from start to finish, and most of that went on counting the posts), what on earth would make you prefer this:<br /><br /><a href="http://www.flickr.com/photos/37723210@N00/382695135/" title="Photo Sharing"><img src="http://farm1.static.flickr.com/161/382695135_73229d060f.jpg" width="500" height="354" alt="blogs5" /></a><br /><br />or this:<br /><br /><a href="http://www.flickr.com/photos/37723210@N00/382695134/" title="Photo Sharing"><img src="http://farm1.static.flickr.com/158/382695134_f292a36d64.jpg" width="500" height="354" alt="blogs4" /></a><br /><br />I can only think of two reasons. One is that it looks kind of like a power law distribution, and that's a cool idea. Except that it <b>isn't</b> a power law distribution, or any kind of distribution - it's a list ranked in descending order, and, er, that's it. The same criticism applies, obviously, to the classic 'power law' graphic ranking weblogs in descending order of inbound links.<br /><br /><b>DIGRESSION</b><br>You can compute a distribution of inbound links across weblogs using very much the techniques I've used here - so many weblogs with one link, so many with two and so forth. Oddly enough, what you end up with then is a curve which falls sharply then tapers off - there are far fewer weblogs with two links than with only one, but not so much of a difference between the '20 links' and '21 links' categories. However, even that isn't a power law distribution, for reasons explained <a href="http://cscs.umich.edu/~crshalizi/weblog/232.html">here</a> and <a href="http://www.cscs.umich.edu/~crshalizi/weblog/390.html">here</a> (reasons which, for the non-mathematician, can be summed up as 'a power law distribution means something specific, and this isn't it').<br /><b>END DIGRESSION</b><br /><br />The other reason - and, I suspect, the main reason - is that the Long Tail privileges ranking: the question it suggests isn't <i>how many of which are doing what?</i> but <i>who's first?</i>. A histogram might give more information, but it wouldn't tell me who's <b>up there</b> in the <b>big head</b>, or how far down the tail I am.<br /><br />People want to be on top; failing that, they want to fantasise about being on top and identify with whoever's up there now. Not everyone, but a lot of people. The popularity of the Long Tail image has a lot in common with the popularity of celebrity gossip magazines.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-3213748148937599340?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com4tag:blogger.com,1999:blog-11493954.post-83947510016471198592006-11-22T10:30:00.000Z2006-11-23T15:08:14.646ZThey don't know about usSome dystopian thoughts on data harvesting, usage tracking, recommendation engines and consumer self-expression. First, here's <a href="http://www.plasticbag.org/archives/2006/11/on_wattson_and_electr">Tom</a>, then <a href="http://www.plasticbag.org/archives/2006/11/on_wattson_and_electr/#comments">me</a>: <blockquote>"This is going to be one of the great benefits of ambient/pervasive computing or everyware - not the tracking of objects but the tracking and collating of you yourself <i>through</i> objects."<br /><br />This sentence works just as well with the word 'benefits' replaced by 'threats'. It all depends who gets to do the tracking and collating, I suppose.</blockquote><br />Now here's <a href="http://money.cnn.com/magazines/fortune/fortune_archive/2006/11/27/8394347/">Max Levchin</a>, formerly of Paypal, and his new toy Slide (via <a href="http://vanderwal.net/random/index.php">Thomas</a>):<blockquote>If Slide is at all familiar, it's as a knockoff of Flickr, the photo-sharing site. Users upload photos, which are displayed on a running ticker or Slide Show, and subscribe to one another's feeds. But photos are just a way to get Slide users communicating, establishing relationships, Levchin explains.<br /><br />The site is beginning to introduce new content into Slide Shows. It culls news feeds from around the Web and gathers real-time information from, say, eBay auctions or Match.com profiles. It drops all of this information onto user desktops and then watches to see how they react.<br /><br />Suppose, for example, there's a user named YankeeDave who sees a Treo 750 scroll by in his Slide Show. He gives it a thumbs-up and forwards it to his buddy" we'll call him Smooth-P. Slide learns from this that both YankeeDave and Smooth-P have an interest in a smartphone and begins delivering competing prices. If YankeeDave buys the item, Slide displays headlines on Treo tips or photos of a leather case. If Smooth-P gives a thumbs-down, Slide gains another valuable piece of data. (Maybe Smooth-P is a BlackBerry guy.) Slide has also established a relationship between YankeeDave and Smooth-P and can begin comparing their ratings, traffic patterns, clicks and networks.<br /><br />Based on all that information, Slide gains an understanding of people who share a taste for Treos, TAG Heuer watches and BMWs. Next, those users might see a Dyson vacuum, a pair of Forzieri wingtips or a single woman with a six-figure income living within a ten-mile radius. In fact, that's where Levchin thinks the first real opportunity lies - hooking up users with like-minded people. "I started out with this idea of finding shoes for my girlfriend and hotties on HotOrNot for me," Levchin says with a wry smile. "It's easy to shift from recommending shoes to humans."<br /><br />If this all sounds vaguely creepy, Levchin is careful to say he's rolling out features slowly and will only go as far as his users will allow. But he sees what many others claim to see: Most consumers seem perfectly willing to trade preference data for insight. "What's fueling this is the desire for self-expression," he says.</blockquote><br /><a href="http://www.roughtype.com/archives/2006/11/selfportrait_in.php">Nick</a>:<blockquote>I'm not sure that I see, in today's self-portraits on MySpace or YouTube or Flickr, or in the fetishistic collecting of virtual tokens of attention, the desire to mark one's place in a professional or social stratum. What they seem to express, more than anything, is a desire to turn oneself into a product, a commodity to be consumed. And since, as I wrote earlier, "self-commoditization is in the end indistinguishable from self-consumption," the new portraiture seems at its core narcissistic. The portraits are advertisements for a commoditized self</blockquote><br />Granny Weatherwax:<blockquote>"And sin, young man, is when you treat people as things. Including yourself. That's what sin is. ... People as things, that's where it starts."</blockquote><br />More precisely, that's where some extraordinarily unequal and dishonest social relationships can start.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-8394751001647119859?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com2tag:blogger.com,1999:blog-11493954.post-79329189826077002822006-11-13T14:34:00.000Z2006-11-16T12:23:23.141ZGot a web between his toesNow that Nick has read the <a href="http://www.roughtype.com/archives/2006/11/welcome_web_30.php">last rites for Web 2.0</a>, perhaps it's safe to return to a question that's never quite been resolved.<br /><br />To wit: what <b>is</b> Web 2.0? (We've established that it's <a href="http://phenomenologic.blogspot.com/2006/04/not-fish-at-all.html">not a snail</a>.) Over at <a href="http://what-i-wrote.blogspot.com/">What I wrote</a>, I've just put up a March 2003 article called "<a href="http://what-i-wrote.blogspot.com/2006/11/in-godzillas-footprint.html">In Godzilla's footprint</a>". In it, I asked similar questions about e-business, taking issue with the standard rhetoric of 'efficiency' and 'empowerment'. I suggested that e-business wasn't - or rather isn't - a phenomenon in its own right, but the product of three much larger trends: <b>standardisation</b>, <b>automation</b> and <b>externalisation of costs</b>. (<a href="http://what-i-wrote.blogspot.com/2006/11/in-godzillas-footprint.html">Read the whole thing.</a>)<br /><br />Assuming for the moment that I called this one correctly - and I find my arguments pretty persuasive - what of Web 2.0? More of the same, only featuring the automation of income generation (AdSense) and the externalisation of payroll costs ('citizen journalism')? Or is there more going on - and if so, what?<br /><br /><b>Update</b> 16/11<br /><br />It would be remiss of me not to give any pointers to my own thinking on Web 2.0. So I'm republishing another column at <a href="http://what-i-wrote.blogspot.com/">What I wrote</a>, this time from February of this year. Most of you will probably have seen it the first time round, when it appeared in <i>iSeries NEWS UK</i>, but I think it's worth giving it another airing. <a href="http://what-i-wrote.blogspot.com/2006/11/everything-new-is-old-again.html">Have a gander</a>.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-7932918982607700282?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com1tag:blogger.com,1999:blog-11493954.post-1162812626479447462006-11-06T11:24:00.000Z2006-11-08T10:10:44.033ZSimplify, reduce, oversimplifyAn interesting post on 'folksonomies' at <a href="http://collinvsblog.net/archives/2006/11/folksonomies.html">Collin Brooke's blog</a> prompted this comment, which I thought deserved a post of its own.<br /><br />I think <a href="http://www.peterme.com/archives/000387.html">Peter Merholz</a>'s coinage 'ethnoclassification' could be useful here. As I've argued <a href="http://phenomenologic.blogspot.com/2005/06/cloud-of-knowing.html">elsewhere</a>, I think we can see all taxonomies (and ultimately all knowledge) as the product of an extended conversation within a given community: in this respect a taxonomy is simply an accredited 'folksonomy'. <br /><br />However, I think there's a dangerous (but interesting) slippage here between what folksonomies could be and what folksonomies <b>are</b>: between the promise of the project of 'folksonomy' (F1) and what's delivered by any identifiable folksonomy (F2). (You can get into very similar arguments about Wikipedia 1 and Wikipedia 2 - sometimes with the same people.) Compared to the complexity and exhaustiveness of any functioning taxonomic scheme, I don't believe that any actually-existing 'folksonomy' is any more than an extremely sketchy work in progress.<br /><br />For this reason <a href="http://phenomenologic.blogspot.com/2005/08/so-say-i.html">(among others)</a>, I believe we need different words for the activity and the endpoint. So we could contrast classification with Peterme's 'ethnoclassification', on one hand, and note that the only real difference between the two is that the former takes place within structured and credentialled communities. On the other hand, we could contrast actual taxonomies with 'folksonomies'. The latter could have very much the same relationship with officially-credentialled taxonomies as classification does with ethnoclassification - but they aren't there yet.<br /><br />The shift from 'folksonomy' to 'ethnoclassification' has two interesting side-effects, which I suspect are both fairly unwelcome to folksonomy boosters (a group in which I don't include Thomas Vander Wal, <a href="http://phenomenologic.blogspot.com/2005/11/this-is-new-stuff.html">ironically enough</a>). On one hand, divorcing process and product reminds us that improvements to one don't necessarily translate as improvements in the other. The activity that goes into producing a 'folksonomy', as distinct from a taxonomy, may give more participants a better experience (more egalitarian, more widely distributed, more chatty, more fun) but you wouldn't necessarily expect the end product to show improvements as a result. (You'd expect it to be a bit scrappy, by and large.) On the other hand, divorcing process from technology reminds us that ethnoclassification didn't start with del.icio.us; the aggregation of informal knowledge clouds is something we've been doing for a long time, perhaps as long as we've been human.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-116281262647944746?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com0tag:blogger.com,1999:blog-11493954.post-1162297299056224242006-10-31T10:42:00.000Z2006-11-08T10:10:43.974ZA taxonomy of terrorI attended part of a very interesting <a href="http://www.socialsciences.manchester.ac.uk/politics/events/cst/default.htm">conference on terrorism</a> last week. The organisers intend to launch a network and a journal devoted to 'critical terrorism studies', a project which I strongly support. As the previous blog entry suggests, I've studied a bit of terrorism in my time - and I'm very much in favour of people being encouraged to approach the phenomenon critically, which is to say without necessarily endorsing the definitions and interpretive frameworks offered by official sources.<br /><br />However, it seems to me that the nature of the object of study still needs to be defined - and defined at once more precisely and more loosely. In other words, I don't believe there's much common ground between someone who thinks of terrorism in terms of gathering intelligence on the IRA, and someone who maintains that George W. Bush is a bigger terrorist than Osama bin Laden; I don't think it's particularly productive to try to find common ground between those two images of terrorism, or to simply allow them to coexist without defining the differences between them. On the other hand, I don't see much mileage in a 'purist' Terrorism Studies which would focus solely on groups akin to the IRA - or in an alternative purism which would concentrate on terror attacks by Western governments.<br /><br />A third approach offers to resolve the gap between these two - although I should say straight away that I don't believe it does so. This approach is that of terrorism as an object of discourse: what is under analysis is not so much an identifiable set of actions, or types of action, as the texts and utterances which purport to analyse and describe terrorism. The effect is to turn the analytical gaze back on the governmental discourse of terrorism, which in turn makes it possible to contrast the official image of the terrorist threat with data from other sources; an interesting example of this approach in practice is Richard Jackson's paper <i>Religion, Politics and Terrorism: A Critical Analysis of Narratives of “Islamic Terrorism”</i> (DOC file available from <a href="http://www.socialsciences.man.ac.uk/politics/research/research_groups/cip/cip_publications.htm">here</a>).<br /><br />I think this is a powerful and constructive approach - my own thesis (as yet unpublished) includes some quite similar work on Italian left-wing armed groups of the 1970s, whose presentation in both the mainstream and the Communist press was heavily shaped by differing ideological assumptions. But I think it should be recognised that it's an approach of a different order from the other two. To combine them would be to mix ontological and epistemological arguments - to say, in other words, <i>That's what is <b>officially labelled</b> terrorism, but this is <b>real</b> terrorism</i>. (Or: <i>That's what <b>they</b> call terrorism, but this is what <b>we</b> know to be the reality of terrorism.</i>) The problem with this is that it implies a commitment to a particular idea of <i><b>real</b> terrorism</i>, without actually suggesting a candidate. At best, this formulation frees the analyst to retain his or her prior commitments, bolstered with added ontological certitude. At worst, it suggests that <i><b>real</b> terrorism</i> is the inverse of <i><b>officially labelled</b> terrorism</i> - or at least that there is no possible overlap between <i><b>officially labelled</b> terrorism</i> and <i><b>real</b> terrorism</i>. This is surely inadequate: a critical approach should be able to do more with the official version than simply reverse it.<br /><br />I believe that the study of terrorism must include all of these elements, and recognise that they may overlap but don't coincide. In other words, it must include the following:<ol><li><b>Organised political violence by non-state actors</b>: 'terrorism' as a political intervention (call it <b>T1</b>)</li><li><b>Indiscriminate large-scale attacks on civilians</b>: terror as a tactic, in warfare or otherwise (<b>T2</b>)</li><li><b>The constructed antagonist of the War on Terror</b>: 'Terrorism' as object of discourse (<b>T3</b>)</li></ol>We can think of it as a three-circle Venn diagram, with areas of intersection between each pair of circles and a triple intersection in the middle. <br /><br /><a href="http://photos1.blogger.com/blogger/206/916/1600/venn.jpg"><img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;" src="http://photos1.blogger.com/blogger/206/916/320/venn.jpg" border="0" alt="" /></a><br /><br />What is immediately apparent about this list is how little of the field of terrorism falls into all three categories. The (white) triple intersect - mass killing of civilians by a non-state political actor, officially labelled (and denounced) as terrorism - is represented by a relatively small number of horrific events, chief among them September 11th. By contrast, much of what students of terrorism - myself included - would like to be able to look at under that name falls into only two categories, or even one. The (red) intersect of <b>T1</b> and <b>T3</b>, most obviously, is represented by those acts by armed groups which are officially denounced but don't involve mass killing of civilians: the 'execution' of Aldo Moro and the IRA's Brighton bomb, for example. The use of terror tactics by non-governmental death squads, such as the Nicaraguan Contras and the Salvadorean ORDEN militia, falls into the blue intersect of <b>T1</b> and <b>T2</b>. The use of state terror by official enemies and 'rogue states' - such as the Syrian Hama massacre or Saddam Hussein's gassing of the people of Halabja - falls into the green intersect of <b>T2</b> and <b>T3</b>. And this is without considering all those activities which fall into only one category: <b>T1</b> (magenta) alone, activities by armed groups which fall below the radar of the discourse of 'terrorism' (a large and interesting category); <b>T2</b> (cyan) alone, terror tactics used by states and not denounced as terrorism; and <b>T3</b> (yellow) alone, officially-denounced 'terrorism' which involves neither an organised armed group nor a mass attack on civilians.<br /><br />I don't, myself, see any problem with studying all three of these categories - or rather, all seven. I hope the remit of the new Critical Terrorism Studies is broad enough to encompass all of these without imposing an artificial unity on them. Paramilitary fundraising in Northern Ireland cannot be studied in the same way as the attack on Fallujah or press reporting of the 'ricin plot'; each of these deserves to be studied, however, and the different approaches appropriate to studying them can only strengthen the field.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-116229729905622424?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com0tag:blogger.com,1999:blog-11493954.post-1158583421433319622006-09-18T11:40:00.000+01:002006-11-08T10:10:36.465ZThe people with the answers<a href="http://www.roughtype.com/archives/2006/09/sanger_forks_wi.php">Nick</a>:<blockquote>Larry Sanger, the controversial online encyclopedia's cofounder and leading apostate, announced yesterday, at a conference in Berlin, that he is spearheading the launch of a competitor to Wikipedia called <a href="http://www.citizendium.org/">The Citizendium</a>. Sanger describes it as "an experimental new wiki project that combines public participation with gentle expert guidance."<br /><br />The Citizendium will begin as a "fork" of Wikipedia, taking all of Wikipedia's current articles and then editing them under a new model that differs substantially from the model used by what Sanger calls the "arguably dysfunctional" Wikipedia community. "First," says Sanger, in explaining the primary differences, "the project will invite experts to serve as editors, who will be able to make content decisions in their areas of specialization, but otherwise working shoulder-to-shoulder with ordinary authors. Second, the project will require that contributors be logged in under their own real names, and work according to a community charter. Third, the project will halt and actually reverse some of the 'feature creep' that has developed in Wikipedia."</blockquote><br />I've been thinking about Wikipedia, and about what makes a bad Wikipedia article so bad, for some time - <a href="http://phenomenologic.blogspot.com/2005/03/greetings-and-salutations-and-anomie.html">this</a> March 2005 post took off from some earlier remarks by Larry Sanger. I'm not attempting to pass judgment on Wikipedia as a whole - there are plenty of good Wikipedia articles out there, and some of them are very good indeed. But some of them are <b>bad</b>. Picking on an old favourite of mine, here's the first paragraph of the Wikipedia article on the <a href="http://en.wikipedia.org/wiki/Red_Brigades">Red Brigades</a>, with my comments.<br /><br /><i>The Red Brigades (Brigate Rosse in Italian, often abbreviated as BR) are</i><br /><br />The word is 'were'. The BR dissolved in 1981; its last successor group gave up the ghost in 1988. There's a small and highly violent group out there somewhere which calls itself "<i>Nuove Brigate Rosse</i>" - the New Red Brigades - but its continuity with the original BR is zero. This is a significant disagreement, to put it mildly.<br /><br /><i>a militant leftist group located in Italy. Formed in 1970, the Marxist Red Brigades</i><br /><br />'Marxist' is a bizarre choice of epithet. Most of the Italian radical left was Marxist, and almost all of it declined to follow the BR's lead. Come to that, the Italian Communist Party (one of the BR's staunchest enemies) was Marxist. Terry Eagleton's a Marxist; Jeremy Hardy's a Marxist; I'm a Marxist myself, pretty much. The BR had a highly unusual set of political beliefs, somewhere between Maoism, old-school Stalinism and pro-Tupamaro insurrectionism. 'Maoist' would do for a one-word summary. 'Marxist' is both over-broad and misleading.<br /><br /><i>sought to create a revolutionary state through armed struggle</i><br /><br />Well, yes. And no. I mean, I don't think it's possible to make any sense of the BR without acknowledging that, while they did have a famous slogan about <i>portare l'attacco al cuore dello stato</i> ('attacking at the heart of the state'), their anti-state actions were only a fairly small element of what they did. To begin with they were a factory-based group, who took action against foremen and personnel managers; in their later years - which were also their peak years - the BR, like other armed groups, got drawn into what was effectively a vendetta with the police, prioritising revenge attacks over any kind of 'revolutionary' programme. You could say that the BR were a revolutionary organisation & consequently had a revolutionary programme throughout, even if their actions didn't always match it - but how useful would this be?<br /><br /><i>and to separate Italy from the Western Alliance</i><br /><br />Whoa. I don't think the BR were particularly in favour of Italy's NATO membership, but the idea that this was one of their key goals is absurd. If the BR had been a catspaw for the KGB, intent on fomenting subversion so as to destabilise Italy, then this probably would have been high on their list. But they weren't, and it wasn't.<br /><br /><i>In 1978, they kidnapped and killed former Prime Minister Aldo Moro under obscure circumstances.</i><br /><br />Remarkably well-documented circumstances, I'd have said.<br /><br /><i>After 1984's scission</i><br /><br />This is just wrong - following growing and unresolvable factionalism, the BR formally dissolved in October 1981. <br /><br /><i>Red Brigades managed with difficulty to survive the official end of the Cold War in 1989</i><br /><br />This is both confused and wrong. Given that there was a split, how would the BR have survived beyond 1981 (or 1984), let alone 1989? As for the BR's successor groups, the last one to pack it in was last heard from in 1988.<br /><br /><i>even though it is now a fragile group with no original members.</i><br /><br />Or rather, even though the name is now used by a small group about which very little is know, but which is not believed to have any connection to the original group (whose members are after all knocking on a bit by now).<br /><br /><i>Throughout the 1970’s the Red Brigades were credited with 14,000 acts of violence.</i><br /><br />Good grief. Credited by whom? According to the sources I've seen, between 1970 and 1981 Italian armed struggle groups were responsible for a total of 3,258 actions, including 110 killings; the BR's share of the total came to 472 actions, including 58 killings. (Most 'actions' consisted of criminal damage and did not involve personal violence.) I'd be the first to admit that the precision of these figures is almost certainly spurious, but even if we doubled that figure of 472 we'd be an awful long way short of 14,000. <br /><br />I'm not even going to look at the body of the article.<br /><br />I think there are two main problems here; the good news is that Larry's proposals for the neo-Wikipedia (Nupedia? maybe not) would address both of them.<br /><br />Firstly, <b>first mover advantage</b>. The structure of Wikipedia creates an odd imbalance between writers and editors. Writing a new article is easy: the writer can use whatever framework he or she chooses, in terms both of categories used to structure the entry and of the overall argument of the piece. Making minor edits to an article is easy: mutter <i>1984? no way, it was 1981!</i>, log on, a bit of typing and it's done. But making major edits is hard - you can see from the comments above just how much work would be needed to make that BR article acceptable, <b>starting from what's there now</b>. It would literally be easier to write a new article. What's more, making edits stick is hard; I deleted one particularly ignorant falsehood from the BR article myself a few months ago, only to find my edit reverted the next day. (Of course, I re-reverted it. So <b>there</b>!)<br /><br />Larry's suggestion of getting experts on board is very much to the point here. Slap my face and call me a credentialled academic, but I don't believe that everyone is equally qualified to write an encyclopedia article about their favourite topic - and I do think it matters who gets the first go.<br /><br />Secondly, <b>gaming the system</b>. Wikipedia is a community as well as an encyclopedia. I'll pass over Larry's suggestion that Wikipedia is dysfunctional <b>as a community</b>, but I do think it's arguable that some behaviours which work well for Wikipedia-the-community are dysfunctional for Wikipedia-the-resource. It's been suggested, for instance, that what really makes Wikipedia special is the 'history' pages, which take the lid off the debate behind the encyclopedia and let us see knowledge in the process of formation. It follows from this that to show the world a single, 'definitive' version of an article on a subject would actually be a step backwards: <i>The discussion tab on Wikipedia is a great place to point to your favorite version ... Does the world need a Wikipedia for stick-in-the-muds?</i> <a href="http://www.straypackets.com/2006/09/17/can-the-citizendium-sanction-the-wrong-and-the-abusers/">W. A. Gerrard</a> objects:<blockquote>Of what value is publicly documenting the change history of an encyclopedia entry? How can something that purports to be authoritative allow the creation of alternative versions which readers can adopt as favorites?<br /><br />If an attempt to craft a wiki that strives for accuracy, even via a flawed model, is considered something for “stick-in-the-muds”, then it’s apparent that many of Wikipedia’s supporters value the dynamics of its community more than the credibility of the product they deliver.</blockquote><br />I think this is exactly right: the history pages are worth much more to members of the Wikipedia community than to Wikipedia users. People like to form communities and communities like to chat - and edits and votes are the currency of Wikipedia chat. And gaming the system is fun (hence the word 'game'). <a href="http://www.aaronsw.com/weblog/whowritescomments">Aaron Swartz</a> quotes comments about Wikipedia regulars who <i>delete your newly[-]create[d] article without hesitation, or revert your changes and accuse you of vandalis[m] without even checking the changes you made</i>, or who <i>"edited" thousands of articles ... [mostly] to remove material that they found unsuitable</i>. This clearly suggest the emergence of behaviours which are driven more by social expectations than by a concern for Wikipedia. The second writer quoted above continues: <i>Indeed, some of the people-history pages contained little "awards" that people gave each other -- for removing content from Wikipedia.</i><br /><br />Now, all systems can be gamed, and all communities chat. The question is whether the chatting and the gaming can be harnessed for the good of the encyclopedia - or, failing that, minimised. I'm not optimistic about the first possibility, and I suspect Larry Sanger isn't either. Larry does, however, suggest a very simple hack which would help with the second: get everyone to use their real name. This would, among other things, make it obvious when a writer had authority in a given area. I don't entirely agree with Aaron's conclusion:<blockquote>Larry Sanger famously suggested that Wikipedia must jettison its anti-elitism so that experts could feel more comfortable contributing. I think the real solution is the opposite: Wikipedians must jettison <i>their</i> elitism and welcome the newbie masses as genuine contributors to the project, as people to respect, not filter out.</blockquote><br />This is half right: Wikipedia-the-community has produced an elite of 'regulars', whose influence over Wikipedia-the-resource derives from their standing in the community rather than from any kind of claim to expertise. I agree with Aaron that this is an unhealthy situation, but I think Larry was right as well. The artificial elitism of the Wikipedia community doesn't only marginalise the 'masses' who contribute most of the original content; it also sidelines the subject-area experts who, within certain limited domains, have a genuine claim to be regarded as an elite.<br /><br />I don't know if the Citizendium is going to address these problems in practice; I don't know if the Citizendium is going anywhere full stop. But I think Larry Sanger is asking the right questions. It's increasingly clear that Wikipedia isn't just facing in two directions at once, it's actually two different things - and what's good for Wikipedia-the-community isn't necessarily good for Wikipedia-the-resource.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-115858342143331962?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com6tag:blogger.com,1999:blog-11493954.post-1157451862793225782006-09-05T11:23:00.000+01:002006-11-08T10:10:36.406ZBack in the garage<blockquote>I have begun to see what I think is a promising trend in the publishing world that may just transform the industry for good.</blockquote><br /><a href="http://many.corante.com/archives/2006/09/02/social_publishing.php">Paul Hartzog</a>'s Many-to-Many post on publishing draws some interesting conclusions from the success of Charlie Stross's <a href="http://www.accelerando.org/"><i>Accelerando</i></a> (nice one, Charlie). but makes me a bit nervous, partly because of the <b>liberal use</b> of <b>excitable bolding</b>.<br /><blockquote>What I am suggesting is happening is the reversal of traditional publishing, i.e. the transformation of the system in which authors create and distribute their work. In the old system, it is assumed that the publishing process acts as a quality control filter ... but it ends up merely being a profit-capturing filter.<br />[...]<br />Conversely, in the new system, the works are made available, and it is up to the community-at-large to pass judgement on their quality. In the emerging system, <strong>authors create and distribute their work, and readers, individually and collectively, including fans as well as editors and peers, review, comment, rank, and tag, everything</strong>.</blockquote><br />Setting aside the formatting - and the evangelistic tone, something which never fails to set my teeth on edge - this is all interesting stuff. My problem is that I'm not sure about the economics of it. It's not so much that writers won't write if they don't get paid - writers will write, full stop - as that writers won't <b>eat</b> if they don't get paid: some money has to change hands some time. If the kind of development Paul is talking about takes hold, I can imagine a range of more-or-less unintended consequences, all with different overtones but few of them, to this jaundiced eye, particularly desirable:<br /><ol><li>Mass amateurisation means that nobody pays for anything, which in turn means that nobody makes a living from writing; this is essentially the RIAA/BPI anti-filesharing nightmare scenario, transposed to literature</li><li>Mass amateurisation doesn't touch the Dan Brown/Katie Price market, but gains traction in specialist areas of literature to the point where nobody can make a living from writing unless they're writing for the mass market; this is Charlie Gillett's argument for keeping CDs expensive (and the line the BPI would use against filesharing if they had any sense)<br /></li><li>Downloads like <i>Accelerando</i> function essentially as tasters and people end up buying just as many actual books, if not more; this scenario will also be familiar from filesharing arguments, as it's the line generally used to counter the previous two<br /></li><li>Mass amateur production becomes a new sphere of economic activity, linked in with and subordinate to the major mainstream operators: this is the MySpace scenario (at least, the <a href="http://www.guardian.co.uk/uk_news/story/0,,1865120,00.html">MySpace makes money for Murdoch</a> scenario)</li><li>Mass amateur production becomes a new sphere of <b>non</b>-economic activity, with a few star authors subsidised by publishing companies for the sake of the cachet they bring: the open source scenario</li><li>Mass amateur production becomes a new sphere of economic activity, existing on the margins and in the shadows, out of the reach of the major mainstream operators: the punk scenario (or, for older readers, the hippie scenario)<br /></li></ol>We can dismiss the first, RIAA-nightmare scenario. The third ('tasters') would be bearable, although it wouldn't go halfway to justifying Paul's argument. Most of the rest look pretty ghastly to me. Perhaps Paul is thinking in terms of the last scenario or something like it - but in that case I'd have to say that his optimism is just as misplaced, for different but related reasons, as the pessimism of the first scenario (although a new wave of <a href="http://www.angelfire.com/on/clash/gi.html#Garageland">garage literature</a> would be a fine thing to see).<br /><br />The trouble with making your own history is that you don't do it in circumstances of your own choosing. The participatory buzz of Web 2.0 tends to eat away at the structural and procedural walls that stop people getting their hands on stuff - but that can just mean that only the strongest and highest walls are left standing. Besides, walls can be useful, particularly if you want to keep a roof over your head.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-115745186279322578?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com0tag:blogger.com,1999:blog-11493954.post-1157017804893712642006-08-31T09:49:00.000+01:002006-11-08T10:10:36.348ZWe're all together now, dancing in time<a href="http://www.thinkvitamin.com/features/webapps/why-i-dont-use-social-softwareprint/">Ryan Carson</a>:<br /><blockquote>I’d love to add friends to my Flickr account, add my links to del.icio.us, browse digg for the latest big stories, customise the content of my Netvibes home page and build a MySpace page. But you know what? I don’t have time and you don’t either...</blockquote><br />Read the whole thing. What's particularly interesting is a small straw poll at the end of the article, where Ryan asks people who actually work on this stuff what social software apps they use on a day-to-day basis. Six people made 30 nominations in all; Ryan had five of his own for a total of 35.<br /><br />Here are the apps which got more than one vote:<br /><br />Flickr (four votes)<br />Upcoming (two)<br />Wikipedia (two)<br /><br />And, er, that's it.<br /><br />Social software looks like very big news indeed from some perspectives, but when it's held to the standard of actually helping people get stuff done, it fades into insignificance. I think there are three reasons for this apparent contradiction. First, there's the crowd effect - and, since you need a certain number of users before network effects start taking off, any halfway-successful social software application has a crowd behind it. It can easily look as if <b>everyone</b>'s doing it, even if the relevant definition of 'everyone' looks like a pretty small group to you and me.<br /><br />Then there's the domain effect: tagging and user-rating are genuinely useful and constructive, in some not very surprising ways, within pre-defined domains. (Think of a corporate intranet app, where there is no need for anyone to specify that 'Dunstable' means one of the company's offices, 'Barrett' means the company's main competitor and 'Monkey' means the payroll system.) For anyone who is getting work done with tagging, in other words, tagging is going to look pretty good - and, thanks to the crowd effect, it's going to look like a good thing that <b>everyone</b>'s using.<br /><br />Thirdly, social software is new, different, interesting and fun, as something to play with. It's a natural for geeks with time to play with stuff and for commentators who like writing about new and interesting stuff - let alone geek commentators. The hype generates itself; it's the kind of development that's guaranteed to look bigger than it is.<br /><br />Put it all together - and introduce feedback effects, as the community of geek commentators starts to find social software apps genuinely useful within <b>its</b> specialised domain - and social software begins to look like a Tardis in reverse: much, much bigger on the outside than it is on the inside.<br /><br />That's not to say that social software isn't interesting, or that it isn't useful. But I think that in the longer term those two facets will move apart: useful and productive applications of tagging will be happening under the commentator radar, often behind organisational firewalls, while the stuff that's interesting and fun to play with will remain... interesting and fun to play with.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-115701780489371264?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com6tag:blogger.com,1999:blog-11493954.post-1154608551839907672006-08-03T12:45:00.000+01:002006-11-08T10:10:36.292ZSo much that hides<a href="http://www.agwright.com/blog/archives/001062.html">Alex</a> points to <a href="http://www.rashmisinha.com/archives/06_07/tag-findability.html">this</a> piece by Rashmi Sinha on 'Findability with tags': the vexed question of using tags to find the material that you've tagged, rather than as an elaborate way of building a mind-map. <br /><br />I should stress, parenthetically, that that last bit wasn't meant as a putdown - it actually describes my own use of <a href="http://www.simpy.com">Simpy</a>. I regularly tag pages, but almost never use tags to actually retrieve them. Sometimes - quite rarely - I do pull up all the pages I've tagged with a generic "write something about this" tag. Apart from that, I only ever ask Simpy two questions: one is "what was that page I tagged the other day?" (for which, obviously, meaningful tags aren't required); the other is "what does my tag cloud look like?".<br /><br />Now, you could say that the answer to the second question isn't strictly speaking information; it's certainly not information I <b>use</b>, unless you count the time I spend grooming the cloud by splitting, merging and deleting stray tags. I like tag clouds and don't agree with Jeffrey Zeldman's <a href="http://www.zeldman.com/daily/0405d.shtml">anathema</a>, but I do agree with Alex that they're not the last word in retrieving information from tags. Which is where Rashmi's article comes in.<br /><br />Rashmi identifies three ways of layering additional information on top of the basic item/tag pairing, all of which hinge on partitioning the tag universe in different ways. This is most obvious in the case of <b>faceted</b> tagging: here, the field of information is partitioned before any tags are applied. Rashmi cites the familiar example of wine, where a 'region' tag would carry a different kind of information from 'grape variety', 'price' or for that matter 'taste'. Similar distinctions can be made in other areas: a news story tagged 'New Labour', 'racism' and 'to blog about' is implicitly carrying information in the domains 'subject (political philosophy)', 'subject (social issue)' and 'action to take'.<br /><br />There are two related problems here. A unique tag, in this model, can only exist within one dimension: if I want separate tags for New Labour (the people) and New Labour (the philosophy), I'll either have to make an artificial distinction between the two (New_Labour vs New_Labour_philosophy) or add a dimension layer to my tags (political_party.New_Labour vs political_philosophy.New_Labour). Both solutions are pretty horrible. More broadly, you can't invoke a taxonomist's standby like the wine example without setting folksonomic backs up, and with some reason: part of the appeal of tagging is precisely that you start with a blank sheet and let the domains of knowledge emerge as they may.<br /><br /><b>Clustered</b> tagging (a new one on me) addresses both of these problems, as well as answering the much-evaded question of how those domains are supposed to emerge. A tag cluster - as seen on Flickr - consists of a group of tags which consistently appear together, suggesting an implicit 'domain'. Crucially, a single tag can occur in multiple clusters. The clusters for the Flickr 'election' tag, for example, are easy to interpret:<br /><br /><b>vote, politics, kerry, bush, voting, ballot, poster, cameraphone, democrat, president <br /><br />wahl, germany, deutschland, berlin, cdu, spd, bundestagswahl<br /><br />canada, ndp, liberal, toronto, jacklayton, federalelection</b><br /><br />and, rather anticlimactically,<br /><br /><b>england, uk</b><br /><br />Clustering, I'd argue, represents a pretty good stab at building emergent domains. The downside is that it only becomes possible when there are huge numbers of tagging operations.<br /><br />The third enhancement to tagging Rashmi describes is the use of tags as <b>pivots</b>:<blockquote>When everything (tag, username, number of people who have bookmarked an item) is a link, you can use any of those links to look around you. You can change direction at any moment.</blockquote><br />Lurking behind this, I think, is <a href="http://www.vanderwal.net/random/entrysel.php?blog=1750">Thomas</a>'s original tripartite definition of 'folksonomy':<blockquote>the three needed data points in a folksonomy tool [are]: 1) the person tagging; 2) the object being tagged as its own entity; and 3) the tag being used on that object. Flattening the three layers in a tool in any way makes that tool far less valuable for finding information. But keeping the three data elements you can use two of the elements to find a third element, which has value. If you know the object (in del.icio.us it is the web page being tagged) and the tag you can find other individuals who use the same tag on that object, which may lead (if a little more investigation) to somebody who has the same interest and vocabulary as you do. That person can become a filter for items on which they use that tag.</blockquote><br />This, I think, is pivoting in action: from the object and its tags, to the person tagging and the tags they use, to the person using particular tags and the objects they tag. (There's a more concrete description <a href="http://www.zylstra.org/blog/archives/2006/07/social_software.html">here</a>.)<br /><br />Alex suggests that using tags as pivots <i>could also be considered a subset of faceted browsing</i>. I'd go further, and suggest that facets, clusters and pivots are all subsets of a larger set of solutions, which we can call domain-based tagging. If you use facets, the domains are imposed: this approach is a good fit to relatively closed domains of knowledge and finite groups of taggers. If you've got an epistemological blank sheet and a limitless supply of taggers, you can allow the domains to emerge: this is where clusters come into their own. And if what you're primarily interested in is people - and, specifically, <b>who</b>'s saying <b>what</b> about <b>what</b> - then you don't want multiple content-based domains but only the information which derives directly from human activity: the objects and their taggers. Or rather, you want the objects and the taggers, plus the ability to pivot into a kind of multi-dimensional space: instead of tags existing within domains, each tag is a domain in its own right, and what you can find within each tag-domain is the objects and their taggers.<br /><br />What all of this suggests is that, unsurprisingly, there is no 'one size fits all' solution. I suggested <a href="http://phenomenologic.blogspot.com/2005/06/cloud-of-knowing.html">some time ago</a> that<blockquote>If 'cloudiness' is a universal condition, del.icio.us and Flickr and tag clouds and so forth don't enable us to do anything new; what they are giving us is a live demonstration of how the social mind works.</blockquote><br />All knowledge is cloudy; all knowledge is constructed through conversation; conversation is a way of dealing with cloudiness and building usable clouds; social software lets us see knowledge clouds form in real time. I think that's fine as far as it goes; what it doesn't say is that, as well as having conversations about different things, we're having different <b>kinds</b> of conversations and dealing with the cloud of knowing in different ways. Ontology is not, necessarily, overrated; neither is folksonomy.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-115460855183990767?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com0tag:blogger.com,1999:blog-11493954.post-1152096485774066252006-07-05T11:04:00.000+01:002006-11-08T10:10:36.235ZThe users geeks don't see<a href="http://www.roughtype.com/archives/2006/07/the_web_20_nich.php">Nick</a> writes, provocatively as ever, about the recent 'community-oriented' redesign of the netscape.com portal:<br /><blockquote>A few days ago, Netscape turned its traditional portal home page into a knockoff of the popular geek news site Digg. Like Digg, Netscape is now a "news aggregator" that allows users to vote on which stories they think are interesting or important. The votes determine the stories' placement on the home page. Netscape's hope, it seems, is to bring Digg's hip Web 2.0 model of social media into the mainstream. There's just one problem. Normal people seem to think the entire concept is ludicrous.</blockquote><br />Nick cites a post titled <a href="http://www.readwriteweb.com/archives/netscape_commun.php">Netscape Community Backlash</a>, from which this line leapt out at me:<br /><blockquote>while a lot of us geeks and 2.0 types are addicted to our own technology (and our own voices, to be honest), it's pretty darn obvious that A LOT of people want to stick with the status quo</blockquote><br />This reminded me of a minor revelation I had the other day, when I was looking for the Java-based OWL reasoner 'pellet'. I googled for<br />pellet owl<br />- just like that, no quotes - expecting to find a 'pellet' link at the bottom of forty or fifty hits related to, well, owls and their pellets. In fact, the top hit was "Pellet OWL Reasoner". (To be fair, if you google<br />owl pellet<br />you do get the fifty pages of owl pellets first.) <br /><br />I think it's fair to say that the pellet OWL reasoner isn't big news even in the Web-using software development community; I'd be surprised if everyone reading this post even knows what an OWL reasoner is (or has any reason to care). But there's enough activity on the Web around pellet to push it, in certain circumstances, to the top of the Google rankings (<a href="http://tinyurl.com/nvql4">see for yourself</a>).<br /><br />Hence the revelation: <i>it's still a geek Web</i>. Or rather, <b>there's</b> still a geek Web, and it's still making a lot of the running. When I first started using the Internet, about ten years ago, there was a geek Web, a hobbyist Web, an academic Web (small), a corporate Web (very small) and a commercial Web (minute) - and the geek Web was by far the most active. Since then the first four sectors have grown incrementally, but the commercial Web has exploded, along with a new sixth sector - the Web-for-everyone of AOL and MSN and MySpace and LiveJournal (and blogs), whose users vastly outnumber those of the other five. But the geek Web is still where a lot of the new interesting stuff is being created, posted, discussed and judged to be interesting and new.<br /><br />Add social software to the mix - starting, naturally, within the geek Web, as that's where it came from - and what do you get? You get a myth which diverges radically from the reality. The myth is that this is where the Web-for-everyone comes into its own, where millions of users of what was built as a broadcast Web with walled-garden interactive features start talking back to the broadcasters and breaking out of their walled gardens. The reality is that the voices of the geeks are heard even more loudly - and even more disproportionately - than before. Have a look at the 'popular' tags on del.icio.us: as I write, six of the top ten (including all of the top five) relate directly to programmers, and only to programmers. (Number eight reads: "LinuxBIOS - aims to replace the normal BIOS found on PCs, Alphas, and other machines with a Linux kernel". The unglossed reference to Alphas says it all.) Of the other four, one's a political video, two are photosets and one is a full-screen animation of a cartoon cat dancing, rendered entirely in ASCII art. (Make that <b>seven</b> of the top ten.)<br /><br />I'm not a sceptic about social software: ranking, tagging, search-term-aggregation and the other tools of what I persist in calling ethnoclassification are both new and powerful. But they're most powerful within a delimited domain: a user coming to del.icio.us for the first time should be looking for the 'faceted search' option straight away ("OK, so that's the geek cloud, how do I get it to show me the cloud for European history/ceramics/<i>Big Brother</i>?") The fact that there is no 'faceted search' option is closely related, I'd argue, to the fact that there is no discernible tag cloud for European history or ceramics or <i>Big Brother</i>: we're all in the geek Web. (Even Nick Carr.) (Photography is an interesting exception - although even there the only tags popular enough to make the del.icio.us tag cloud are 'photography', 'photo' and 'photos'. There are 40 programming-related tags, from ajax to xml.)<br /><br />Social software wasn't built for the users of the Web-for-everyone. Reaction to the Netscape redesign tells us (or reminds us) that there's no reason to assume they'll embrace it.<br /><br /><b>Update</b> Have a look at <a href="http://www.esztersblog.com/2006/06/14/what-do-college-students-do-online/">Eszter Hargittai</a>'s survey of Web usage among 1,300 American college students, conducted in February and March 2006. MySpace is huge, and Facebook's even huger, but Web 2.0 as we know it? It's not there. 1.9% use Flickr; 1.6% use Digg; 0.7% use del.icio.us. Answering a slightly different question, 1.5% have <b>ever</b> visited Boingboing, and 1% Technorati. By contrast, 62% have visited CNN.com and 21% bbc.co.uk. It's still, very largely, a broadcast Web with walled-garden interactivity. Comparing results like these with the prophecies of tagging replacing hierarchy, Long Tail production and mashups all round, I feel like invoking the story of the blind men and the elephant - except that I'm not even sure we've all got the same elephant.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-115209648577406625?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com1tag:blogger.com,1999:blog-11493954.post-1150110474029293192006-06-12T11:46:00.000+01:002006-11-08T10:10:36.176ZWe hear the sound of machinesSooner or later, the Internet will need to be saved from Google. Because Google - which appears to be an integral part of the <i>information-wants-to-be-free</i> Net dream, the search engine which gives life to the hyperlinked digital nervous system of a kind of massively-distributed Xanadu project - is nothing of the sort. Google is a private company; Google's business isn't even search. Google's business is advertising - and, whatever we think about how well search goes together with tagging and folksonomic stumbling-upon, search absolutely doesn't go with advertising. (<b>Update</b> 15th June: <a href="http://www.guardian.co.uk/uk_news/story/0,,1797445,00.html">this</a> is a timely reminder that Google is a business, and its business is advertising. Mass personalisation, online communities, interactive rating and ranking, it's all there - and it's <b>all</b> about the advertising.)<br /><br />I had thought that, in the context of plain vanilla Web search, Google actually had this cracked - that the prominence of 'sponsored links', displayed separately from search results, allowed them to deliver an unpolluted service and still make money. I hadn't reckoned with AdSense. AdSense doesn't in itself pollute Google's search results. What it does is far worse: it encourages other people to pollute the Net. Which will mean, ultimately, that Google will paint (or choke) itself into a corner - but that, if we're not careful, an awful lot of users will be stuck in that corner with them.<br /><br />For a much fuller and more cogent version of this argument, read <a href="http://www.fool.com/news/commentary/2006/commentary06060927.htm">Seth Jayson</a> (via <a href="http://publishing2.com/2006/06/10/popping-the-google-hype-bubble/">Scott</a>). One point in particular stood out: <i>Google (Nasdaq: GOOG) insiders are continuing to drop shares on the public at a rate that boggles the mind</i>. It's true. Over the last year, as far as published records show, Sun insiders have sold $50,000 worth of shares, net. In the same period, IBM insiders have sold $6,500,000; Microsoft insiders have sold $1,500,000,000; and Google insiders have sold $5,000,000,000. <a href="http://www.form4oracle.com/company?cik=0001288776&ticker=goog">See for yourself</a>. That's a lot of shares.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-115011047402929319?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com6tag:blogger.com,1999:blog-11493954.post-1149506787156553692006-06-05T11:28:00.000+01:002006-11-08T10:10:36.120ZI couldn't make it any simplerI hate to say this - I've always loathed VR boosters and been highly sceptical about the people they boost - but Jaron Lanier's a bright bloke. His essay <a href="http://www.edge.org/documents/archive/edge183.html">Digital Maoism</a> doesn't quite live up to the title, but it's well worth reading (thanks, Thomas).<br /><br />I don't think he quite gets to the heart of the current 'wisdom of the crowds' myth, though. It's not Maoism so much as Revivalism: there's a tight feedback loop between membership of the collective, collective activity and (crucially) celebration of the activity of the collective. Or: celebration of process rather than end-result - because the process incarnates the collective.<br /><br />Put it this way. Say that (for example) the Wikipedia page on the Red Brigades is wildly wrong or wildly inadequate (which is just as bad); say that the tag cloud for an authoritative Red Brigades resource is dominated by misleading tags ('kgb', 'ussr', 'mitrokhin'...). Would a wikipedian or a 'folksonomy' advocate see this situation as a major problem? Not being either I can't give an authoritative answer, but I strongly suspect the answer would be No: it's all part of the process, it's all part of the collective self-expression of wikipedians and the growth of the folksonomy, and if the subject experts don't like it they should just get their feet wet and start tagging and editing themselves. And if, in practice, the experts don't join in - perhaps, in the case of Wikipedia, because they don't have the stomach for the kind of 'editing' process which saw Jaron Lanier's own <a href="http://en.wikipedia.org/w/index.php?title=Jaron_Lanier&oldid=53931333">corrections</a> get reverted? Again, I don't know for sure, but I suspect the answer would be another shrug: the wiki's open to all - and tagspace couldn't be <b>more</b> open - so who's to blame, if you can't make your voice heard, but you? There's nothing inherently wrong with the process, except that you're not helping to improve it. There's nothing inherently wrong with the collective, except that you haven't joined it yet.<br /><br />Two quotes to clarify (hopefully) the connection between collective and process. <a href="http://www.nettakeaway.com/tp/article/175/i-continue-to-despise-tagging">Michael Wexler</a>:<blockquote>our understanding of things changes and so do the terms we use to describe them. How do I solve that in this open system? Do I have to go back and change all my tags? What about other people’s tags? Do I have to keep in mind all the variations on tags that reflect people’s different understanding of the topics?<br /><br />The social connected model implies that the connections are the important part, so that all you need is one tag, one key, to flow from place to place and discover all you need to know. But the only people who appear to have time to do that are folks like Clay Shirky. The rest of us need to have information sorted and organized since we actually have better things to do than re-digest it.<br /><...><br />What tagging does is attempt to recreate the flow of discovery. That’s fine… but what taxonomy does is recreate the structure of knowledge that you’ve already discovered. Sometimes, I like flowing around and stumbling on things. And sometimes, that’s a real pita. More often than not, the tag approach involves lots of stumbling around and sidetracks.<br /><...><br />It's like Family Feud [a.k.a. Family Fortunes - PJE]. You have to think not of what you might say to a question, you have to guess what the survey of US citizens might say in answer to a question. And that’s really a distraction if you are trying to just answer the damn question.</blockquote><br />And our man Lanier:<blockquote>there's a demonstrative ritual often presented to incoming students at business schools. In one version of the ritual, a large jar of jellybeans is placed in the front of a classroom. Each student guesses how many beans there are. While the guesses vary widely, the average is usually accurate to an uncanny degree.<br /><br />This is an example of the special kind of intelligence offered by a collective. It is that peculiar trait that has been celebrated as the "Wisdom of Crowds,"<br /><...><br />The phenomenon is real, and immensely useful. But it is not infinitely useful. The collective can be stupid, too. Witness tulip crazes and stock bubbles. Hysteria over fictitious satanic cult child abductions. Y2K mania. The reason the collective can be valuable is precisely that its peaks of intelligence and stupidity are not the same as the ones usually displayed by individuals. Both kinds of intelligence are essential.<br /><br />What makes a market work, for instance, is the marriage of collective and individual intelligence. A marketplace can't exist only on the basis of having prices determined by competition. It also needs entrepreneurs to come up with the products that are competing in the first place. In other words, clever individuals, the heroes of the marketplace, ask the questions which are answered by collective behavior. They put the jellybeans in the jar.</blockquote><br />To illustrate this, once more (just the once) with the Italian terrorists. There are tens of thousands of people, at a conservative estimate, who have read enough about the Red Brigades to write that Wikipedia entry: there are a lot of ill-informed or partially-informed or tendentious books about terrorism out there, and some of them sell by the bucketload. There are probably only a few hundred people who have read Gian Carlo Caselli and Donatella della Porta's long article "The History of the Red Brigades: Organizational structures and Strategies of Action (1970-82)" - and I doubt there are twenty who know the source materials as well as the authors do. (I'm one of the first group, obviously, but certainly not the second.) Once the work's been done anyone can discover it, but discovery isn't knowledge: the knowledge is in the words on the pages, and ultimately in the individuals who wrote them. <i>They put the jellybeans in the jar.</i><br /><br />This is why (an academic writes) the academy matters, and why academic elitism is - or at least can be - both valid and useful. Jaron:<blockquote>The balancing of influence between people and collectives is the heart of the design of democracies, scientific communities, and many other long-standing projects. There's a lot of experience out there to work with. A few of these old ideas provide interesting new ways to approach the question of how to best use the hive mind.<br /><...><br />Scientific communities ... achieve quality through a cooperative process that includes checks and balances, and ultimately rests on a foundation of goodwill and "blind" elitism — blind in the sense that ideally anyone can gain entry, but only on the basis of a meritocracy. The tenure system and many other aspects of the academy are designed to support the idea that individual scholars matter, not just the process or the collective.</blockquote><br />I'd go further, if anything. Academic conversations may present the appearance of a collective, but it's a collective where individual contributions are preserved and celebrated ("Building on Smith's celebrated critique of Jones, I would suggest that Smith's own analysis is vulnerable to the criticisms advanced by Evans in another context..."). That is, academic discourse <b>looks like a conversation</b> - which wikis certainly can do, although Wikipedia emphatically doesn't.<br /><br />The problem isn't the technology, in other words: both wikis and tagging could be ways of making conversation visible, which inevitably means visualising debate and disagreement. The problem is the drive to efface any possibility of conflict, effectively repressing the appearance of debate in the interest of presenting an evolving consensus. (Or, I could say, the problem is the tendency of people to bow and pray to the neon god they've made, but that would be a bit over the top - and besides, Simon and Garfunkel quotes are far too obvious.)<br /><br /><b>Update</b> 13th June<br /><br />I wrote (above): <i>It's not Maoism so much as Revivalism: there's a tight feedback loop between membership of the collective, collective activity and (crucially) celebration of the activity of the collective. Or: celebration of process rather than end-result - because the process incarnates the collective.</i><br /><br />Here's <a href="http://www.edge.org/discourse/digital_maoism.html#doctorow">Cory Doctorow</a>, responding to Lanier:<br /><blockquote>Wikipedia isn't great because it's like the Britannica. The Britannica is great at being authoritative, edited, expensive, and monolithic. Wikipedia is great at being free, brawling, universal, and instantaneous.<br /><...><br />If you suffice yourself with the actual Wikipedia entries, they can be a little papery, sure. But that's like reading a mailing-list by examining nothing but the headers. Wikipedia entries are nothing but the emergent effect of all the angry thrashing going on below the surface. No, if you want to really navigate the truth via Wikipedia, you have to dig into those "history" and "discuss" pages hanging off of every entry. That's where the real action is, the tidily organized palimpsest of the flamewar that lurks beneath any definition of "truth." The Britannica tells you what dead white men agreed upon, Wikipedia tells you what live Internet users are fighting over.<br /><br />The Britannica truth is an illusion, anyway. There's more than one approach to any issue, and being able to see multiple versions of them, organized with argument and counter-argument, will do a better job of equipping you to figure out which truth suits you best.</blockquote><br />Quoting myself again, <i>There's nothing inherently wrong with the process, except that you're not helping to improve it. There's nothing inherently wrong with the collective, except that you haven't joined it yet.</i><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-114950678715655369?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com1tag:blogger.com,1999:blog-11493954.post-1148552574209731582006-05-25T10:36:00.000+01:002006-11-08T10:10:36.064ZWhen there is no outsideNick Carr's hyperbolically-titled <a href="http://www.roughtype.com/archives/2006/05/the_death_of_wi.php">The Death of Wikipedia</a> has received a couple of endorsements and some fairly vigorous disagreement, unsurprisingly. I think it's as much a question of tone as anything else. When Nick reads the line<blockquote>certain pages with a history of vandalism and other problems may be semi-protected on a pre-emptive, continuous basis.</blockquote><br />it clearly sets alarm bells ringing for him, as indeed it does for me ("Ideals always expire in clotted, bureaucratic prose", Nick comments). Several of his commenters, on the other hand, sincerely fail to see what the big deal might be: it's only a handful of pages, it's only <b>semi</b>-protection, it's not that onerous, it's part of the continuing development of Wikipedia editing policies, Wikipedia never claimed to be a totally open wiki, there's no such thing as a totally open wiki anyway...<br /><br />I think the reactions are as instructive as the original post. No, what Nick's pointing to isn't really a qualitative change, let alone the death of anything. But yes, it's a genuine problem, and a genuine embarrassment to anyone who takes the Wikipedian rhetoric seriously. Wikipedia ("the free encyclopedia that anyone can edit") routinely gets hailed for its openness and its authority, only not both at the same time - indeed, maximising one can always be used to justify limits on the other. As here. But there's another level to this discussion, which is to do with Wikipedia's resolution of the openness/authority balancing-act. What happens in practice is that the contributions of active Wikipedians take precedence over both random vandals and passing experts. In effect, both openness and authority are vested in the group.<br /><br />In some areas this works well enough, but in others it's a huge problem. I use Wikipedia myself, and occasionally drop in an edit if I see something that's crying out for correction. Sometimes, though, I see a Wikipedia article that's just wrong from top to bottom - or rather, an article where verifiable facts and sustainable assertions alternate with errors and misconceptions, or are set in an overall argument which is based on bad assumptions. In short, sometimes I see a Wikipedia article which doesn't need the odd correction, it needs to be pulled and rewritten. I'm not alone in having this experience: here's <a href="http://www.plasticbag.org/archives/2005/09/links_for_20050920.shtml">Tom Coates on 'penis envy'</a> and <a href="http://www.vanderwal.net/random/entrysel.php?blog=1750">Thomas Vander Wal (!) on 'folksonomy'</a>, as well as <a href="http://phenomenologic.blogspot.com/2005/03/greetings-and-salutations-and-anomie.html">me on 'anomie'</a>.<br /><br />It's not just a problem with philosophical concepts, either - I had a similar reaction more recently to the Wikipedia page on the Red Brigades. On the basis of the reading I did for my doctorate, I could rewrite that page from start to finish, leaving in place only a few proper names and one or two of the dates. But writing this kind of thing is hard and time-consuming work - and I've got quite enough of that to do already. So it doesn't get done. <br /><br />I don't think this is an insurmountable problem. A while ago I floated a <a href="http://phenomenologic.blogspot.com/2005/09/if-i-drew-detailed-map.html">cunning plan</a> for fixing pages like this, using PledgeBank to mobilise external reserves of peer-pressure; it might work, and if only somebody else would actually get it rolling I might even sign up. But I do think it's a problem, and one that's inherent to the Wikipedia model.<br /><br />To reiterate, both openness and authority are vested in the group. Openness: sure, Wikipedia is as open to me as any other registered editor d00d, but in practice the openness of Wikipedia is graduated according to the amount of time you can afford to spend on it. As for authority, I'm not one, but (like Debord) I have read <a href="http://www.users.zetnet.co.uk/amroth/scritti/debord3.htm">several good books</a> - better books, to be blunt, than those relied on by the author[s] of the current Red Brigades article. But what would that matter unless I was prepared to defend what I wrote against bulk edits by people who disagreed - such as, for example, the author[s] of the current article? On the other hand, if I <b>was</b> prepared to stick it out through the edit wars, what would it matter whether I knew my stuff or not? This isn't just random bleating. When I first saw that Red Brigades article I couldn't resist one edit, deleting the completely spurious assertion that the group Prima Linea was a Red Brigades offshoot. When I looked at the page again the next day, my edit had been reverted.<br /><br />Ultimately Wikipedia isn't about either openness or authority: it's about the collective activity of editing Wikipedia and being a Wikipedian. From that, all else follows.<br /><br /><b>Update</b> 2/6/06 (in response to David, in comments)<br /><br />There are two obvious problems with the Wikipedia page on the Brigate Rosse, and one that's larger but more diffuse. The first problem is that it's written in the present tense; it's extremely dubious that there's any continuity between the historic Brigate Rosse and the gang who shot Biagi, let alone that they're simply, unproblematically the same group. This alone calls for a major rewrite. Secondly, the article is written very much from a police/security-service/conspiracist stance, with a focus on question like whether the BR was assisted by the Czech security services or penetrated by NATO. But this tends to reinforce an image of the BR as a weird alien force which popped up out of nowhere, rather than an extreme but consistent expression of broader social movements (all of which has been documented).<br /><br />The broader problem - which relates to both of the specific points - goes back to a problem with the amateur-encyclopedia format itself: Wikipedia implicitly asks what a given topic <b>is</b>, which prompts contributors to think of their topic as having a core, essential meaning (I wrote about this <a href="http://phenomenologic.blogspot.com/2005/03/greetings-and-salutations-and-anomie.html">last year</a>). The same problem can arise in a 'proper' encyclopedia, but there it's generally mitigated by expertise: somebody who's spent several years studying the broad Italian armed struggle scene is going to be motivated to relate the BR back to that scene, rather than presenting it as an utterly separate thing. The motivation will be still greater if the expert on the BR has also been asked to contribute articles on Prima Linea, the NAP, etc. This, again, is something that happens (and <b>works</b>, for all concerned) in the kind of restricted conversations that characterise academia, but isn't incentivised by the Wikipedia conversation - because the Wikipedia conversation doesn't go anywhere else. Doing Wikipedia is all about doing Wikipedia.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-114855257420973158?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com2tag:blogger.com,1999:blog-11493954.post-1147688813556879412006-05-15T10:58:00.000+01:002006-11-08T10:10:36.003ZWho's there?At Many-to-Many, <a href="http://many.corante.com/archives/2006/05/12/social_science_and_design_questions.php">Ross Mayfield</a> reports that Clay Shirky and danah boyd have been thinking about "the lingering questions in our field", viz. the field of social software. I was a bit surprised to see that<p><i>How can communities support veterans going off topic together and newcomers seeking topical information and connections?</i><p>still qualifies as a 'lingering question'; I distinctly remember being involved in thrashing this one out, together with Clay, the best part of <a href="http://groups.google.com/group/alt.folklore.urban/msg/988c2147e81d3492?">nine years ago</a>. But this was the one that really caught my eye, if you'll pardon the expression:<p><i>What level of visual representation of the body is necessary to trigger mirror neurons?</i><p>Uh-oh. <a href="http://www.lrb.co.uk/v28/n08/print/turk01_.html">Sherry Turkle</a> (subscription-only link):<blockquote>a woman in a nursing home outside Boston is sad. Her son has broken off his relationship with her. Her nursing home is taking part in a study I am conducting on robotics for the elderly. I am recording the woman’s reactions as she sits with the robot Paro, a seal-like creature advertised as the first ‘therapeutic robot’ for its ostensibly positive effects on the ill, the elderly and the emotionally troubled. Paro is able to make eye contact by sensing the direction a human voice is coming from; it is sensitive to touch, and has ‘states of mind’ that are affected by how it is treated – for example, it can sense whether it is being stroked gently or more aggressively. In this session with Paro, the woman, depressed because of her son’s abandonment, comes to believe that the robot is depressed as well. She turns to Paro, strokes him and says: ‘Yes, you’re sad, aren’t you. It’s tough out there. Yes, it’s hard.’ And then she pets the robot once again, attempting to provide it with comfort. And in so doing, she tries to comfort herself.<br /><br />What are we to make of this transaction? When I talk to others about it, their first associations are usually with their pets and the comfort they provide. I don’t know whether a pet could feel or smell or intuit some understanding of what it might mean to be with an old woman whose son has chosen not to see her anymore. But I do know that Paro understood nothing. The woman’s sense of being understood was based on the ability of computational objects like Paro – ‘relational artefacts’, I call them – to convince their users that they are in a relationship by pushing certain ‘Darwinian’ buttons (making eye contact, for example) that cause people to respond as though they were in relationship.</blockquote><br />Further reading: see <a href="http://headrush.typepad.com/creating_passionate_users/2006/04/angrynegative_p.html">Kathy Sierra</a> on mirror neurons and the contagion of negativity. See also <a href="http://weblog.burningbird.net/2006/04/18/human-heat-sinks/">Shelley</a>'s critique of Kathy's argument, and of attempts to enforce 'positive' feelings by manipulating mood. And see the sidebar at Many-to-Many, which currently reads as follows:<blockquote>Recent Comments<br /><br />viagra on Sanger on Seigenthaler’s criticism of Wikipedia<br /><br />hydrocodone cheap on Sanger on Seigenthaler’s criticism of Wikipedia<br /><br />viagra on Sanger on Seigenthaler’s criticism of Wikipedia<br /><br />alprazolam online on Sanger on Seigenthaler’s criticism of Wikipedia<br /><br />Timur on Sanger on Seigenthaler’s criticism of Wikipedia<br /><br />Timur on Sanger on Seigenthaler’s criticism of Wikipedia<br /><br />Recent Trackbacks<br /><br />roulette: roulette<br /><br />jouer casino: jouer casino<br /><br />casinos on line: casinos on line<br /><br />roulette en ligne: roulette en ligne<br /><br />jeux casino: jeux casino<br /><br />casinos on line: casinos on line</blockquote><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-114768881355687941?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com1tag:blogger.com,1999:blog-11493954.post-1147170408527783142006-05-09T10:46:00.000+01:002006-11-08T10:10:35.942ZSome day this will all be yours<a href="http://publishing2.com/2006/05/07/what-if-no-one-will-pay-for-content/">Scott Karp</a>:<blockquote>What if dollars have no place in the new economics of content?<br />...<br />In media 1.0, brands paid for the attention that media companies gathered by offering people news and entertainment (e.g. TV) in exchange for their attention. In media 2.0, people are more likely to give their attention in exchange for OTHER PEOPLE’S ATTENTION. This is why MySpace can’t effectively monetize its 70 million users through advertising — people use MySpace not to GIVE their attention to something that is entertaining or informative (which could thus be sold to advertisers) but rather to GET attention from other users.<br />...<br />MySpace can’t sell attention to advertisers because the site itself HAS NONE. Nobody pays attention to MySpace — users pay attention to each other, and compete for each other’s attention — it’s as if the site itself doesn’t exist.<br /><br />You see the same phenomenon in blogging — blogging is not a business in the traditional sense because most people do it for the attention, not because they believe there’s any financial reward. What if the economics of media in the 21st century begin to look like the economics of poetry in the 20th century? — Lots of people do it for their own personal gratification, but nobody makes any money from it.</blockquote><br />Pedantry first: it's inconceivable that we'll reach a point where <b>nobody</b> makes any money from the media, at least this side of the classless society. Even the hard case of blogging doesn't really stand up - I could name half a dozen bloggers who have made money or are making money from their blogs, without pausing to think.<br /><br />It's a small point, but it's symptomatic of the enthusiastic looseness of Karp's argument. So I welcomed Nicholas Carr's <a href="http://www.roughtype.com/archives/2006/05/no_direction_ho.php">counterblast</a>, which puts Karp together with some recent comments by Esther Dyson:<blockquote>"Most users are not trying to turn attention into anything else. They are seeking it for itself. For sure, the attention economy will not replace the financial economy. But it is more than just a subset of the financial economy we know and love."</blockquote><br />Here's Carr:<blockquote>I fear that to view the attention economy as "more than just a subset of the financial economy" is to misread it, to project on it a yearning for an escape (if only a temporary one) from the consumer culture. There's no such escape online. When we communicate to promote ourselves, to gain attention, all we are doing is turning ourselves into goods and our communications into advertising. We become salesmen of ourselves, hucksters of the "I." In peddling our interests, moreover, we also peddle the commodities that give those interests form: songs, videos, and other saleable products. And in tying our interests to our identities, we give marketers the information they need to control those interests and, in the end, those identities. Karp's wrong to say that MySpace is resistant to advertising. MySpace is nothing but advertising.</blockquote><br />Now, this is good, bracing stuff, but I think Carr bends the stick a bit too far the other way. I know from my own experience that there's a part of my life labelled Online Stuff, and that most of my reward for doing Online Stuff is attention from other people doing Online Stuff. Real-world payoffs - money, work or just making new real-world friends - are nice to get, but they're not what it's all about.<br /><br />The real trouble is that Karp has it backwards. Usenet - where I started doing Online Stuff, ten years ago - is a model of open-ended mutual <a href="http://en.wikipedia.org/wiki/Whuffie">whuffie</a> exchange. (A very imperfect model, given the tendency of social groups to develop boundaries and hierarchies, but at least an unmonetised one.) Systematised whuffie trading came along later. The model case here is eBay, where there's a weird disconnect between meaning and value. Positive feedback doesn't really mean that you think the other person is a "great ebayer" - it doesn't really <b>mean</b> anything, any more than "A+++++" means something distinct from "A++++" or "A++++++". What it does convey is value: it makes it that much easier for the other person to make money. It also has attention-value, making the other person feel good for no particular real-world reason, but even this is quantifiable ("48! I'm up to 48!").<br /><br />Ultimately Dyson and Carr are both right. The 'attention economy' of Online Stuff is new, absorbing and unlike anything that went before - not least because the way in which it gratifies fantasies of being truly appreciated, understood, attended to. But, to the extent that the operative model is eBay rather than Usenet, it is nothing other than <i>a subset of the financial economy</i>. Karp may be right about the specific case of MySpace, but I can't help distrusting his <a href="http://tinyurl.com/q7bxe">exuberance</a> - not least because, in my experience, the suffix '2.0' is strongly associated with a search for new ways to cash in.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-114717040852778314?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com0tag:blogger.com,1999:blog-11493954.post-1146152140276093812006-04-27T16:17:00.000+01:002006-11-08T10:10:35.886ZNot a fish at allOn the subject of broadcast vs broadband, <a href="http://www.plasticbag.org/archives/2006/04/is_the_pace_of_change_really_such_a_shock.shtml">Tom</a> writes:<blockquote>There's nothing rapid about this transition at all. It's been happening in the background for fifteen years. So let me rephrase it in ways that <i>I</i> understand. <i>Shock revelation! A new set of technologies has started to displace older technologies and will continue to do so at a fairly slow rate over the next ten to thirty years!</i><br />...<br />My sense of these media organisations that use this argument of incredibly rapid technology change is that they're screaming that they're being pursued by a snail and yet they cannot get away! <i>'The snail! The snail!'</i>, they cry. <i>'How can we possibly escape!?'</i>. The problem being that the snail's been moving closer for the last twenty years one way or another and they just weren't paying attention.</blockquote><br />In comments, <a href="http://www.potlatch.org.uk/">Will</a> writes:<blockquote>If one person is claiming that the world is moving fairly slowly, and has some sound advice on what this might look like (as you are doing here), and another person is claiming that the world is moving extraordinarily quickly, but offers some quickfire measures through which to cope with this, the sense of emergency will win purely because it is present. From here, it almost becomes *risky* not to then adopt the quickfire measures suggested by the second person. Panic becomes a safer strategy than calmness. Which explains management consultancy...</blockquote><br />and John asks:<blockquote>does web2.0 count as a snail too?</blockquote><br />But Web 2.0 is not a snail.<br /><br />Web 2.0 is the people pointing and shouting <i>'The snail! The snail!'</i><br /><br />Web 2.0 is also the people who overhear the first group and join in, shouting <i>'The whale! The whale!'</i> and pointing vaguely upwards and towards the nearest ocean.<br /><br />Web 2.0 is also the people who hear the second group and panic about the approaching whale, or is it a <b>land</b>-whale? what is a land-whale anyway? whatever it is, there's one coming and we'd all better... well, we'd better tell someone about it, anyway - I mean, there's a <b>land-whale</b> coming, how often does something like that happen?<br /><br />Web 2.0 is also the people who hear the third group and improvise a land-whale parade, with floats and dancers and drummers and at its centre a giant paper land-whale held aloft by fifteen people, because, I don't know, but everyone was talking about land-whales and it just seemed like a good idea, you know?<br /><br />And Web 2.0 is the people who come along halfway through the parade and sell the roadside spectators standing-room tickets.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-114615214027609381?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com5tag:blogger.com,1999:blog-11493954.post-1146134560209182122006-04-27T11:25:00.000+01:002006-11-08T10:10:35.827ZCloudbuilding (3)By way of background to <a href="http://phenomenologic.blogspot.com/2006/03/cloudbuilding-1.html">this</a> post - and because I think it's quite interesting in itself - here's a short paper I gave last year at <a href="http://www.asc.org.uk/Events/Sep05/Programme.htm">this</a> conference (great company, shame about the catering). It was co-written with my colleagues Judith Aldridge and Karen Clarke. I don't stand by everything in it - as I've got deeper into the project I've moved further away from Clay's scepticism and closer towards people like Carole Goble and Keith Cole - but I think it still sets out an argument worth having.<br /><br /><b><i>Mind the gap: Metadata in e-social science</i></b><br /><br /><b>1. Towards the final turtle</b><br /><br />It’s said that Bertrand Russell once gave a public lecture on astronomy. He described how the earth orbits around the sun and how the sun, in turn, orbits around the centre of our galaxy. At the end of the lecture, a little old lady at the back of the room got up and said: “What you have told us is rubbish. The world is really a flat plate supported on the back of a giant tortoise.” <br /><br />Russell smiled and replied, “What is the tortoise standing on?” <br /><br />“You’re very clever, young man, very clever,” said the old lady. “But it’s turtles all the way down.”<br /><br />The Russell story is emblematic of the logical fallacy of infinite regress: proposing an explanation which is just as much in need of explanation as the original fact being explained. The solution, for philosophers (and astronomers), is to find a foundation on which the entire argument can be built: a body of known facts, or a set of acceptable assumptions, from which the argument can follow.<br /><br />But what if infinite regress is a problem for people who want to build systems as well as arguments? What if we find we’re dealing with a tower of turtles, not when we’re working backwards to a foundation, but when we’re working forwards to a solution?<br /><blockquote>WSDL [Web Services Description Language] lets a provider describe a service in XML [Extensible Markup Language]. [...] to get a particular provider’s WSDL document, you must know where to find them. Enter another layer in the stack, Universal Description, Discovery, and Integration (UDDI), which is meant to aggregate WSDL documents. But UDDI does nothing more than register existing capabilities [...] there is no guarantee that an entity looking for a Web Service will be able to specify its needs clearly enough that its inquiry will match the descriptions in the UDDI database. Even the UDDI layer does not ensure that the two parties are in sync. Shared context has to come from somewhere, it can’t simply be defined into existence. [...] This attempt to define the problem at successively higher layers is doomed to fail because it’s turtles all the way up: there will always be another layer above whatever can be described, a layer which contains the ambiguity of two-party communication that can never be entirely defined away. No matter how carefully a language is described, the range of askable questions and offerable answers make it impossible to create an ontology that’s at once rich enough to express even a large subset of possible interests while also being restricted enough to ensure interoperability between any two arbitrary parties.<br />(<a href="http://webservices.xml.com/lpt/a/ws/2001/10/03/webservices.html">Clay Shirky</a>)</blockquote><br />Clay Shirky is a longstanding critic of the Semantic Web project, an initiative which aims to extend Web technology to encompass machine-readable semantic content. The ultimate goal is the codification of meaning, to the point where understanding can be automated. In commercial terms, this suggests software agents capable of conducting a transaction with all the flexibility of a human being. In terms of research, it offers the prospect of a search engine which understands the searches it is asked to run and is capable of pulling in further relevant material unprompted.<br /><br />This type of development is fundamental to e-social science: a set of initiatives aiming to enable social scientists to access large and widely-distributed databases using ‘grid computing’ techniques.<br /><blockquote>A Computational Grid performs the illusion of a single virtual computer, created and maintained dynamically in the absence of predetermined service agreements or centralised control. A Data Grid performs the illusion of a single virtual database. Hence, a Knowledge Grid should perform the illusion of a single virtual knowledge base to better enable computers and people to work in cooperation.<br />(<a href="http://www.esrc.ac.uk/esrccontent/DownloadDocs/Colereport.pdf">Keith Cole et al</a>)</blockquote><br />Is Shirky’s final turtle a valid critique of the visions of the Semantic Web and the Knowledge Grid? Alternatively, is the final turtle really a Babel fish — an instantaneous universal translator — and hence (excuse the mixed metaphors) a straw person: is Shirky setting the bar impossibly high, posing goals which no ‘semantic’ project could ever achieve? To answer these questions, it’s worth reviewing the promise of automated semantic processing, and setting this in the broader context of programming and rule-governed behaviour.<br /><br /><b>2. Words and rules</b><br /><br />We can identify five levels of rule-governed behaviour. In <i>rule-driven</i> behaviour, firstly, ‘everything that is not compulsory is forbidden’: the only actions which can be taken are those dictated by a rule. In practice, this means that instructions must be framed in precise and non-contradictory terms, with thresholds and limits explicitly laid down to cover all situations which can be anticipated. This is the type of behaviour represented by conventional task-oriented computer programming.<br /><br />A higher level of autonomy is given by <i>rule-bound</i> behaviour: rules must be followed, but there is some latitude in how they are applied. A set of discrete and potentially contradictory rules is applied to whatever situation is encountered. Higher-order rules or instructions are used to determine the relative priority of different rules and resolve any contradiction.<br /><br /><i>Rule-modifying</i> behaviour builds on this level of autonomy, by making it possible to ‘learn’ how and when different rules should be applied. In practice, this means that priority between different rules is decided using relative weightings rather than absolute definitions, and that these weightings can be modified over time, depending on the quality of the results obtained. Neither rule-bound nor rule-modifying behaviour poses any fundamental problems in terms of automation.<br /><br /><i>Rule-discovering</i> behaviour, in addition, allows the existing body of rules to be extended in the light of previously unknown regularities which are encountered in practice (“it turns out that many Xs are also Y; when looking for Xs, it is appropriate to extend the search to include Ys”). This level of autonomy — combining rule observance with reflexive feedback — is fairly difficult to envisage in the context of artificial intelligence, but not impossible.<br /><br />The level of autonomy assumed by human agents, however, is still higher, consisting of <i>rule-interpreting</i> behaviour. Rule-discovery allows us to develop an internalised body of rules which corresponds ever more closely to the shape of the data surrounding us. Rule-interpreting behaviour, however, enables us to continually and provisionally reshape that body of rules, highlighting or downgrading particular rules according to the demands of different situations. This is the type of behaviour which tells us whether a ban is worth challenging, whether a sales pitch is to be taken literally, whether a supplier is worth doing business with, whether a survey’s results are likely to be useful to us. This, in short, is the level of Shirky’s situational “shared context” — and of the final turtle.<br /><br />We believe that there is a genuine semantic gap between the visions of Semantic Web advocates and the most basic applications of rule-interpreting human intelligence. Situational information is always local, experiential and contingent; consequently, the data of the social sciences require interpretation as well as measurement. Any purely technical solution to the problem of matching one body of social data to another is liable to suppress or exclude much of the information which makes it valuable.<br /><br />We cannot endorse comments from e-social science advocates such as this:<blockquote>variable A and variable B might both be tagged as indicating the sex of the respondent where sex of the respondent is a well defined concept in a separate classification. If Grid-hosted datasets were to be tagged according to an agreed classification of social science concepts this would make the identification of comparable resources extremely easy.<br />(<a href="http://www.esrc.ac.uk/esrccontent/DownloadDocs/Colereport.pdf">Keith Cole et al</a>)</blockquote><br />Or this:<br /><blockquote>work has been undertaken to assert the meaning of Web resources in a common data model (RDF) using consensually agreed ontologies expressed in a common language [...] Efforts have concentrated on the languages and software infrastructure needed for the metadata and ontologies, and these technologies <b>are ready to be adopted</b>.<br />(<a href="http:// www.semanticgrid.org/docs/ECAISemanticGrid/ECAISemanticGridFinal.pdf">Carole Goble and David de Roure</a>; emphasis added)</blockquote><br />Statements like these suggest that semantics are being treated as a technical or administrative matter, rather than a problem in its own right; in short, that meaning is being treated as an add-on.<br /><br /><b>3. Google with Craig</b><br /><br />To clarify these reservations, let’s look at a ‘semantic’ success story.<br /><blockquote>The service, called “Craigslist-GoogleMaps combo site” by its creator, Paul Rademacher, marries the innovative Google Maps interface with the classifieds of Craigslist to produce what is an amazing look into the properties available for rent or purchase in a given area. [...] This is the future….this is exactly the type of thing that the Semantic Web promised<br />(<a href="http://bokardo.com/archives/holy-amazing-interface-batman/">Joshua Porter</a>)</blockquote><br />‘This’ is is an application which calculates the location of properties advertised on the ‘Craigslist’ site and then displays them on a map generated from Google Maps. In other words, it takes two sources of public-domain information and matches them up, automatically and reliably.<br /><br />That’s certainly intelligent. But it’s also highly specialised, and there are reasons to be sceptical about how far this approach can be generalised. On one hand, the geographical base of the application obviates the issue of granularity. Granularity is the question of the ‘level’ at which an observation is taken: a town, an age cohort, a household, a family, an individual? a longitudinal study, a series of observations, a single survey? These issues are less problematic in a geographical context: in geography, nobody asks what the meaning of ‘is’ is. A parliamentary constituency; a census enumeration district; a health authority area; the distribution area of a free newspaper; a parliamentary constituency (1832 boundaries) — these are different ways of defining space, but they are all reducible to a collection of identifiable physical locations. Matching one to another, as in the CONVERTGRID application (<a href="http://www.esrc.ac.uk/esrccontent/DownloadDocs/Colereport.pdf">Keith Cole et al</a>) — or mapping any one onto a uniform geographical representation — is a finite and rule-bound task. At this level, geography is a physical rather than a social science.<br /><br />The issue of trust is also potentially problematic. The Craigslist element of the Rademacher application brings the social element to bear, but does so in a way which minimises the risks of error (unintentional or intentional). There is a twofold verification mechanism at work. On one hand, advertisers — particularly content-heavy advertisers, like those who use the ‘classifieds’ and Craigslist — are motivated to provide a (reasonably) accurate description of what they are offering, and to use terms which match the terms used by would be buyers. On the other hand, offering living space over Craigslist is not like offering video games over eBay: Craigslist users are not likely to rely on the accuracy of listings, but will subject them to in-person verification. In many disciplines, there is no possibility of this kind of ‘real-world’ verification; nor is there necessarily any motivation for a writer to use researchers’ vocabularies, or conform to their standards of accuracy.<br /><br />In practice, the issues of granularity and trust both pose problems for social science researchers using multiple data sources, as concepts, classifications and units differ between datasets. This is not just an accident that could have been prevented with more careful planning; it is inherent in the nature of social science concepts, which are often inextricably contingent on social practice and cannot unproblematically be recorded as ‘facts’. The broad range covered by a concept like ‘anti-social behaviour’ means that coming up with a single definition would be highly problematic — and would ultimately be counter-productive, as in practice the concept would continue to be used to cover a broad range. On the other hand, concepts such as ‘anti-social behaviour’ cannot simply be discarded, as they are clearly produced within real — and continuing — social practices.<br /><br />The meaning of a concept like this — and consequently the meaning of a fact such as the recorded incidence of anti-social behaviour — cannot be established by rule-bound or even rule-discovering behaviour. The challenge is to record both social ‘facts’ and the circumstances of their production, tracing recorded data back to its underlying topic area; to the claims and interactions which produced the data; and to the associations and exclusions which were effectively written into it.<br /><br /><b>4. Even better than the real thing</b><br /><br />As an approach to this problem, we propose a repository of content-oriented metadata on social science datasets. The repository will encompass two distinct types of classification. Firstly, those used within the sources themselves; following Barney Glaser, we refer to these as ‘In Vivo Concepts’. Secondly, those brought to the data by researchers (including ourselves); we refer to these as ‘Organising Concepts’. The repository will include:<br /><br />• relationships between Organising Concepts<br /> ‘theft from the person’ is a type of ‘theft’<br /><br />• associations between In-Vivo Concepts and data sources<br /> the classification of ‘Mugging’ appears in ‘British Crime Survey 2003’<br /><br />• relationships between In-Vivo Concepts<br /> ‘Snatch theft’ is a subtype of the classification of ‘Mugging’<br /><br />• relationships between Organising Concepts and In-Vivo Concepts<br /> the classification of ‘Snatch theft’ corresponds to the concept of ‘theft from the person’<br /><br />The combination of these relationships will make it possible to represent, within a database structure, a statement such as<br /><br />Sources of information on <b>Theft from the person</b> include editions of the <i>British Crime Survey</i> between <i>1996</i> and <i>the present</i>; headings under which it is recorded in this source include <b>Snatch theft</b>, which is a subtype of <b>Mugging</b><br /><br />The structure of the proposed repository has three significant features. Firstly, while the relationships between concepts are hierarchical, they are also multiple. In English law, the crime of Robbery implies assault (if there is no physical contact, the crime is recorded as Theft). The In-Vivo Concept of Robbery would therefore correspond both to the Organising Concept of Theft from the person and that of Personal violence. Since different sources may share categories but classify them differently, multiple relationships between In-Vivo Concepts will also be supported. Secondly, relationships between concepts will be meaningful: it will be possible to record that two concepts are associated as synonyms or antonyms, for example, as well as recording one as a sub-type of the other. Thirdly, the repository will not be delivered as an immutable finished product, but as an open and extensible framework. We shall investigate ways to enable qualified users to modify both the developed hierarchy of Organising Concepts and the relationships between these and In-Vivo Concepts.<br /><br />In the context of the earlier discussion of semantic processing and rule-governed behaviour, this repository will demonstrate the ubiquity of rule-interpreting behaviour in the social world by exposing and ‘freezing’ the data which it produces. In other words, the repository will encode shifting patterns of correspondence, equivalence, negation and exclusion, demonstrating how the apparently rule-bound process of constructing meaning is continually determined by ‘shared context’.<br /><br />The repository will thus expose and map the ways in which social data is structured by patterns of situational information. The extensible and modifiable structure of the repository will facilitate further work along these lines: the further development of the repository will itself be an example of rule-interpreting behaviour. The repository will not — and cannot — provide a seamless technological bridge over the semantic gap; it can and will facilitate the work of bridging the gap, but without substituting for the role of applied human intelligence.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-114613456020918212?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com1tag:blogger.com,1999:blog-11493954.post-1146063193752975902006-04-26T15:48:00.000+01:002006-11-08T10:10:35.771ZSeldom a dreadIt's been quiet around here for a while, and probably will be for a while yet. For now, a small question. Is anyone reading this? More specifically, is anyone reading this in Britain? Even more specifically, is anyone reading this who is in Britain and knows about academic funding, in particular how to obtain and where from? (I've got a few ideas, but more is generally better.) Drop me a comment if so.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-114606319375297590?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com2tag:blogger.com,1999:blog-11493954.post-1144160794865631802006-04-04T15:09:00.000+01:002006-11-08T10:10:35.712ZSearching low and high<b>Update</b> 14th June: it's fixed. The search I describe below now returns 91 results on both Google and Yahoo!. (And one on <a href="http://www.majestic12.co.uk/">MJ12</a> (thanks Paulie), but it's early days.)<br /><br />Help - Google's broken.<br /><br />Google's 'exact phrase' search, to be precise. Earlier today I was looking for an English counterpart to the French phrase 'basse police' (<a href="http://existingactually.blogspot.com/2006/02/rich-mans-militia.html">elsewhere</a> I've rendered it as 'low policing', following J-P. Brodeur, but the idiomatic content of the phrase gets lost that way). If in doubt, Google - so I googled<br /><br />"basse police" definition<br /><br />secure in the knowledge that Google would find the French 'définition' as well as the unaccented English word. And it's true, I didn't need to worry about that; the word 'definition' was present and correct, with and without accent. The only trouble was, only 19 of the pages Google brought back (1-87 of about 67,000) also included the phrase 'basse police'; in particular, none of the first 66 results displayed included the phrase, although some included the word 'basse' and others the word 'police'.<br /><br />It gets worse (for Google). I tried the same query on Yahoo and got results 1-64 of about 114 (<i>about</i> 114?). Here are the first few, minus a couple of duplicates:<br /><br />qu'elle participe de la définition de ses fins et qu'elle n'est pas dénuée ... l'ordre semble d'abord relever de la basse police<br /><br />entre " haute " police et " basse " police, entre surveillance d'un territoire et surveillance ... des services secrets sont, par définition, opaques<br /><br />Ces méthodes de basse police ont déjà eu lieu à Genève avec les persécutions du Parti Communiste ... Ta définition de "stalinien" est fausse<br /><br />utiliser (pertinemment) les expressions "haute police " et " basse police ... je veux voir le mot et sa définition<br /><br />And so it goes on. You see what they've done there? Yahoo has brought back pages containing both the word 'definition' and the phrase 'basse police', and <b>only</b> those pages. Fiendish.<br /><br />To be fair to Google, this is a problem I've only noticed in the last couple of days. To revert to being hard on Google, it's a major, major, service-vitiating-if-not-actually-disabling problem, and I would like to know what on earth they were thinking of to allow it to happen. (And I'd like it fixed, obviously.)<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-114416079486563180?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com0tag:blogger.com,1999:blog-11493954.post-1143459487839869332006-03-27T11:46:00.000+01:002006-11-08T10:10:35.648ZWe are bored in the city<i>Et la piscine de la rue des Fillettes. Et le commissariat de police de la rue du Rendez-Vous. La clinique médico-chirurgicale et le bureau de placement gratuit du quai des Orfèvres. Les fleurs artificielles de la rue du Soleil. L'hôtel des Caves du Château, le bar de l'Océan et le café du Va et Vient. L'hôtel de l'Epoque.<br /><br />Et l'étrange statue du Docteur Philippe Pinel, bienfaiteur des aliénés, dans les derniers soirs de l'été. Explorer Paris.</i><br /><br />The early situationists, following <a href="http://library.nothingness.org/articles/SI/en/display/1">Chtcheglov</a>'s lead, turned urban wandering into a form of political/psychological exploration, a group encounter with the city mediated only by alcohol. At a less exalted level, I've long been fascinated by the kind of odd urban poetry evoked here, in Manchester as much as Paris, and by the changing articulation of city space: established cities are a slow-motion example of Marx's dictum about how we make our lives within conditions we have inherited. So it's easy to see how well <a href="http://socialight.com/">this</a> could work:<blockquote>Socialight lets you put virtual "sticky" notes called StickyShadows anywhere in the real world. Share pictures, notes and more using your cell phone.</blockquote>But - for all that the site says about restricting access to Groups and Contacts - it's also easy to see how very <b>badly</b> it could work.<blockquote> * I leave a note for all my friends at the mall to let them know where I'm hanging out. All my friends in the area see it.<br /> * A woman shows all her close friends the tree under which she had her first kiss.<br /> * An entire neighborhood gets together and documents all the unwanted litter they find in an effort to share ownership of a community problem.<br /> * A food-lover uses Socialight to share her thoughts on the amazing vanilla milkshakes at a new shop.<br /> * The neighborhood historian creates her own walking tour for others to follow.<br /> * A group of friends create their own scavenger hunt.<br /> * A tourist takes place-based notes about stores in a shopping district, only for himself, for a time when he returns to the same city.<br /> * A small business places StickyShadows that its customers would be interested in finding.<br /> * A band promotes an upcoming show by leaving a StickyShadow outside the venue.</blockquote>It was all going so well (although I did wonder why that entire neighbourhood couldn't just <b>pick up</b> the litter) right up to the last two. Advertising - yep, that's just what we all want more of in our urban lives. Lots of nice intrusive advertising.<br /><br /><a href="http://www.purselipsquarejaw.org/2006/03/materialising-information-enriching.php">Anne:</a><blockquote>The worst thing about taking-for-granted that our experiences with the city and each other will be "enriched" by more data, by more information, by making the invisible visible, etc., is that we never have to account for or be accountable to <i>how</i>.</blockquote>More specifically, there's a huge difference between enabling conversation and enabling people to be informed - in other words, between talking-with and being-talked-at. Social software is all about conversation - about enabling people to talk together. Moreover, any conversation is defined as much by what it shuts out as what it includes; it's hard to listen to the people you want to talk with when you're being talked at. Even setting aside the information-overload potential of all those overlapping groups (do I need to know where so-and-so had her first kiss? do I need to know <b>now</b>?), it's clear that Socialight is trying to serve two ends which are not only incompatible but opposed - and only one of which pays money. Which is probably why, even though the technology is still in beta, I already feel that using it constructively would be going against the grain.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-114345948783986933?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com0tag:blogger.com,1999:blog-11493954.post-1141404369016003032006-03-03T16:30:00.000Z2006-11-08T10:10:35.586ZCloudbuilding (2)Here's a problem I ran into, halfway through building my first ontology, and some thoughts on what the solution might be.<br /><br />Question 47 of the Mixmag survey reads:<br /><br />Have you ever had an instance[sic] where your drug use caused you to:<br />Get arrested?<br />Lose a job?<br />Fail an exam?<br />Crash a car/bike?<br />Be kicked out of a club?<br /><br />What this tells us is that one of the things the Mixmag questionnaire is 'about' - one of the <i>in vivo</i> concepts (or groups of <i>in vivo</i> concepts) that we need to record - is misadventures consequent on drug use. The question is how we define this concept logically - and this isn't just an abstract question, as the way that we define it will affect how people can access the information. There are three main possibilities.<br /><br /><b>1. Model the world</b><br />We could say that to have a job is to be a party to a contract of employment, which is a type of agreement between two parties, which is agreed on a set occasion and covers a set timespan. Hence to lose a job is to cease to be a party to a previously-agreed contract of employment; this may occur as a consequence of drug use (defined, in the Mixmag context, as the use of a psychoactive substance other than alcohol and tobacco).<br /><br />This is all highly logical and would make it explicit that the Mixmag data contains some information on terminations of contracts of employment (as well as on drug-related stuff). However, the Mixmag survey isn't actually <b>about</b> contracts of employment, and doesn't mandate the definitional assumptions I made above. So this isn't really legitimate. (It would also be incredibly laborious, particularly when we turn our attention away from the relatively succinct Mixmag survey and look at more typical social survey data: surveys of physical capacity, for example, routinely ask people whether they can (a) walk to the shops (b) walk to the Post Office (c) walk to the nearest bus stop, and so on down to (j) or (k). All, in theory, capable of being modelled logically - but perhaps only in theory.)<br /><br /><b>2. Stick to the theme</b><br />Alternatively, we could begin by taking a view as to the key concepts which a data source is about - in this case, psychoactive consumption, feelings about psychoactive consumption, consequences of psychoactive consumption, and sexual behaviour - and draw the line at anything beyond those concepts. On this assumption the fact that the survey covers misadventures consequent on drug use would be within scope, but the list of misadventures given above wouldn't be: that's part of the data that researchers will find when they look at the data source itself, not part of the conceptual 'catalogue' that we're building. The advantage of this is that it's conceptually very 'clean' and makes it that much clearer what a source is about; the disadvantage is obviously that it cuts off some ways in to the data and hides some information.<br /><br /><b>3. Include black boxes</b><br />What I've got at the moment - following the principle of using the definitions supplied by the source - is an ontology in which some concepts are defined and others are undefined (black boxes). For instance, I've got a concept of <i>Job loss</i>, but all that OWL 'knows' about it is that it's a type of <i>Misadventure</i> (which may be consequent on <i>drug use</i>) - which is in turn a type of <i>Life event</i>, (which is a type of <i>event</i> that happens to one <i>person</i>). This would allow anyone searching for events consequent on drug use to get to job loss as a type of misadventure, but wouldn't let them get to drug-related misadventure from job loss - unless they happened to enter the exact name of the 'job loss' concept. I'm coming to believe that this is unsatisfactory: we should define the model in terms of what a data source is about. This means that we've got to either take a narrow, domain-specific view or take the view that each source gives us one piece of a much larger picture - in which case we're inevitably committed to modelling the world. But the 'black box' option isn't really sustainable.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-114140436901600303?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com0tag:blogger.com,1999:blog-11493954.post-1141403381030107342006-03-03T16:09:00.000Z2006-11-08T10:10:35.528ZCloudbuilding (1)This one's about work.<br /><br />I'm currently documenting the concepts underlying the 2005 <a href="http://www.mixmag.uklinux.net/">Mixmag Drug Survey</a> using <a href="http://protege.stanford.edu/">Protege</a>. Here's why:<br /><br />The documentation of social science datasets on a conceptual level, so as to make multiple datasets comprehensible within a shared conceptual framework, is inherently problematic: the concepts on which the data of the social sciences are constructed are imprecise, contested and mutable, with key concepts defined differently by different sources. When a major survey release is published, for example, the accompanying metadata often includes not only a definition of key terms, but discussion of how and why the definitions have changed since the previous release. This information is of crucial importance to the social scientist, both as a framework for understanding statistical data and as a body of social data in its own right.<br /><br />It follows that we cannot think in terms of ironing out inconsistencies between social science datasets and resolving ambiguities. Rather, documenting the datasets must include documenting the definitions of the conceptual framework on which the datasets are built, however imprecise or <a href="http://impossiblist.blogspot.com/2006/03/work-matters.html">inappropriate</a> these concepts might appear in retrospect. This will also involve preserving - and exposing - the variations between different sources, or successive releases from a single source.<br /><br />There are currently two main approaches to conceptually-oriented data documentation. A ‘top down’ approach is exemplified by the European Language Social Sciences Thesaurus (ELSST). The <a href="http://www.madiera.net">Madiera</a> portal allows researchers to explore ELSST and access European survey data which has been linked to ELSST keywords. The limitations of the top-down approach can be gauged from ELSST’s concepts relating to drug use. <i>Drug Abuse</i>, <i>Drug Addiction</i>, <i>Illegal Drugs</i> and <i>Drug Effects</i> are all 'leaf' concepts - headings which have no subheadings under them. However, they are in different parts of the overall ELSST tree: for example, <i>Drug Abuse</i> is under <i>Social Problems->Abuse</i>, while <i>Drug Effects</i> is under <i>Biology->Pharmacology</i>. Although the hierarchy is augmented by a list of 'related' concepts, to some extent facilitating horizontal as well as vertical navigation, the hierarchy inevitably makes some types of enquiry easier than others. Anyone using the ELSST 'tree' will be visually reminded of the affinities identified by ELSST’s authors between <i>Pharmacology</i> and <i>Physiology</i>, or between <i>Drug Abuse</i> and <i>Child Abuse</i>. These problems follow from the initial design choice of a single conceptual hierarchy.<br /><br />This approach to classification has recently come under criticism. <a href="http://www.shirky.com/writings/ontology_overrated.html">Advocates</a> of 'bottom-up' approaches argue that top-down taxonomies like the Dewey Decimal System or ELSST are an artificial imposition on the world of knowledge, which is better represented as a set of individual acts of labelling or ‘tagging’. It is argued that the 'trees' of hierarchical taxonomies can be replaced with a <a href="http://www.hyperorg.com/blogger/misc/taxonomies_and_tags.html">pile of 'leaves'</a>.<br /><br />One successful 'bottom-up' approach is the framework for documenting survey data developed by the <a href="http://www.icpsr.umich.edu/DDI/">Data Documentation Initiative</a> (DDI). The DDI standard makes it possible to search on keywords associated with surveys, sections of surveys and individual questions; the short text of individual questions is also searchable. Searches of DDI metadata can also be run from the Madiera portal: a search on ‘marijuana’, for instance, brings back short text items including the following:<br /><br />CONSUMED HASHISH,MARIJUANA<br />- Health Behaviour in School-Aged Children (Switzerland, 1990) <br /><br />Smoking cannabis should be legal? Q2.31<br />- Scottish Social Attitudes Survey (Scotland, 2001)<br /><br />Q92C DRUGS EV B OFFERED - MARIJUANA<br />- Eurobarometer 37.0 (EU-wide, 1992)<br /><br />Clearly, this way in to the data makes it easy for a well-prepared researcher to track the use of particular concepts 'in the wild' (<i>in vivo</i> concepts). However, this gain comes at the cost of some information. There is wide variation both in the terminology used in the surveys and in the concepts to which they refer. In one survey smoking cannabis might be a type of petty crime; in others it might figure as a type of leisure activity or a potential health risk. These conceptual differences are reflected in the vocabulary used by data sources - and by researchers. Depending on context, three researchers using 'marijuana', 'hashish' and 'cannabis' as search terms may be asking for the same data or for three different sets of data. <br /><br />Neither the 'top-down' nor the 'bottom-up' approach articulates the conceptual assumptions which underlie the construction of a dataset - assumptions expressed both in the definition of <i>in vivo</i> concepts and in relationships between them. Rather than leaving much of this conceptual information undocumented (the DDI approach) or encoding one 'correct' set of assumptions while excluding or sidelining others (the ELSST approach), we propose to offer a coherent hierarchy of <i>in vivo</i> concepts for each individual source, based on the definitions (explicit and implicit) used in each source. Comparing the <i>in vivo</i> conceptual hierarchies used in multiple datasets will enable researchers both to see where concepts are directly comparable and to see where - and how - their definitions diverge and overlap.<br /><br />To document hierarchies of <i>in vivo</i> concepts, we shall use description logic and the Semantic Web language OWL-DL (Web Ontology Language - Description Logic). OWL-DL makes it possible to formulate a precise logical specification of concepts such as<br /><br />- use of cannabis (either marijuana or hashish) in the month prior to the survey<br />- use of either Valium or temazepam, at any time<br />- seizures of Class A drugs by HM Customs in the financial year 2004/5<br /><br />At least, that's the idea. Now wait for part 2...<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-114140338103010734?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com0tag:blogger.com,1999:blog-11493954.post-1141307914688435302006-03-02T13:07:00.000Z2006-11-08T10:10:35.464ZNor mine, nowI nearly installed <a href="http://www.hyperwords.net/">Hyperwords</a> this morning; the only reason I didn't is that I haven't moved to Firefox 1.5 yet (and don't intend to until I'm confident it won't break any of the extensions I'm already using). And, in principle, it looks great:<blockquote>With the Hyperwords Firefox Extension installed just select any text and a menu appears. You can search major search engines, look things up in reference sites, check dictionary definitions, translate, email quickly and much more.</blockquote>So why does the thought of actually using it give me the creeps? <a href="http://www.agwright.com/blog/archives/001029.html">Alex</a> is similarly ambivalent:<blockquote>In principle, it's a handy tool. But I would have to overcome a few personal adoption barriers before I started using it on a regular basis. As a consumer, I can see the appeal of opening up texts to interact with the rest of the Web; but as a writer, I instinctively bristle at the idea of giving up that kind of control. I suspect that disposition colors the way I read things on the Web; I like my documents to feel fixed, not fluid. And the Web feels squishy enough as it is. That, and somehow the premise of cracking open someone else's document with a toolbox of Web services feels like a kind of violation. This is undoubtedly my own personal neurotic hangup.</blockquote>Well, if it is, it's mine too. <a href="http://markbernstein.org/Feb0601/Genericlinksandhyperwords.html">Mark Bernstein</a> gets some of it:<blockquote>In the very early days of hypertext research, people worried a lot about hand-crafted links. "How will we ever afford to put in all those links?" We also worried about how we'd ever manage to afford to digitize stuff for the Web, not to mention paying people to create original Web pages. Overnight, we discovered that we'd got the sign wrong: people would pay for the privilege of making Web sites. The problem isn't the 'tyranny' of the links, and replacing it with the tyranny of the link server might not be a great solution.</blockquote>and<blockquote> Authors don't offer navigation options to be "useful"; thoughtful writers use links to express ideas. Argumentation seeks understanding, not merely access.</blockquote>Let's put some of that together: <i>cracking open someone else's document with a toolbox of Web services</i>; <i>the tyranny of the link server</i>; <i>thoughtful writers use links to express ideas</i>. In other words, Hyperwords doesn't extend existing hyperlink practice but undermines it. In the Hyperwords world you'll no longer read a document, you'll mine it for information - or rather, <a href="http://www.answers.com/main/ntquery?s=mine&gwp=13">mine</a> it for <a href="http://maps.google.com/maps?f=q&sll=37.0625,-95.677068&sspn=32.197599,59.941406&hl=en&q=jumping">jumping-off</a> <a href="http://www.points.com/">points</a> for <a href="http://images.google.com/images?svnum=100&hl=en&q=retriever&btnG=Search">retrieving</a> <a href="http://www.infoplease.com/">information</a> from <a href="http://www.technorati.com">authoritative</a> <a href="http://www.bible.com/">sources</a>. (Or <a href="http://images.google.com/images?svnum=100&hl=en&q=retriever&btnG=Search">retrieving</a> <a href="http://www.amazon.co.uk">whatever</a> <a href="http://www.ebay.co.uk">other</a> <a href="http://en.wikipedia.org">stuff</a> <a href="http://www.lovefilm.com">you</a> <a href="http://www.kelkoo.co.uk">may</a> <a href="http://www.abebooks.com">want</a> <a href="http://babelfish.altavista.com/babelfish/tr?urltext=to&lp=en_it">to</a> <a href="http://images.google.com/images?svnum=100&hl=en&q=retriever&btnG=Search">retrieve</a>.) <br /><br />Alex mentioned <a href="http://xanadu.com/xuTheModel/index.html">Xanadu</a>, but I don't think Hyperwords is a step in that direction. If anything, it's a step backwards. (One of Xanadu's key words is "author-based".) Hyperlinks and the Web of dialogic, socially-produced content go together just fine; as Mark says, mass amateurism is already providing an answer to the question of where all those links are going to come from. It's messy and incomplete, but it's here - and it's, well, <b>ours</b> (<i>as a writer, I instinctively bristle at the idea of giving up that kind of control</i>). You can see two visions of the Web here: the mass amateurisation of writing as against the 'consumer'-oriented, authority-led, broadcast Web. Hyperwords ostensibly enhances horizontal, transverse linkage, but its effect would be to pull the Web further towards broadcast mode - albeit an 'empowered', roll-your-own broadcast mode.<br /><br /><i>Can't keep quiet for long - I'm a human being!<br />Can't help singing this song - I'm a human being!<br />You won't listen to me,<br />I'm not an authority...</i><br />- Steve Mason, "Eclipse"<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-114130791468843530?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com3tag:blogger.com,1999:blog-11493954.post-1141132326407793002006-02-28T12:30:00.000Z2006-11-08T10:10:35.399ZAll the things I could doBut (for new readers, this is point 2; point 1 is <a href="http://phenomenologic.blogspot.com/2006/02/talk-to-my-machine.html">here</a>, and you should go and <a href="http://phenomenologic.blogspot.com/2006/02/talk-to-my-machine.html">read</a> <a href="http://phenomenologic.blogspot.com/2006/02/talk-to-my-machine.html">it</a> <a href="http://phenomenologic.blogspot.com/2006/02/talk-to-my-machine.html">immediately</a>), it's becoming clear that Web 2.0 is all about the <a href="http://phenomenologic.blogspot.com/2005/11/which-side-of-table.html">walled gardens</a>. As I wrote in that post, <i>In the context of social software, when I use a word like 'enclose' - or a word like 'monetise' - it means something quite specific and entirely negative: it's a red-flag word.</i> Which means that, oddly, when I started reading Russell Beattie's <a href="http://www.russellbeattie.com/notebook/1008838.html">WTF 2.0</a> I found a lot to agree with.<blockquote>The worst thing about all the Web 2.0 hype is the complete loss of business perspective. There’s a few companies out there that seem to get it but just about every other new website I’ve seen lately is nothing but features parading as businesses. Sure, these guys get to be entered in the “Flip It Quick Acquisition Lottery”, but beyond that, none seem to be creating anything of any real value.</blockquote>"Features masquerading as businesses", the "Flip It Quick Acquisition Lottery" - all good stuff. Except that Russell's objections aren't quite the same as mine.<blockquote>You can create a new website, fill it with all the goodness in the world, be good to your users, and be a good netizen and use every open standard there is while you’re at it, if at the end of the day your users didn’t put money into your bank account, it’s a useless waste of time for everyone involved. I mean, hey, if you want to create the next non-profit service like Wikipedia, all the more power too you. But if you want to get VC cash, an office in downtown Palo Alto, do a bunch of development, attract lots of users and pretend you’re a business? Then act like one, create something of real value and make some real money from it.</blockquote>"Real value", "real money". You don't have to be a Marxist to suspect that those aren't necessarily the same thing (although, to be honest, it does help). In the next paragraph Russell draws a hazy distinction between the two himself:<blockquote>look at the Weblog federations for example. They’re making money like people have done for a hundred years or so: hire writers, sell some ads, publish using standard technologies. Nothing too innovative, but they’re making money and I totally dig that. Then again, those writers are generating real value, IMHO, so there’s something there to make money from.</blockquote>Russell commends the Weblog federations, whoever they are (didn't they have trouble with the spice routes a while back?), for making money. He then stresses that they're also creating <i>real value</i>, which means <i>there’s something there to make money from</i> - but 'real value' is qualified rather worryingly with 'IMHO', suggesting that it may or may not <b>be</b> real. At the end of the day the money's real, though, and Russell digs that.<br /><br />Russell then reminds us that things are different in the 'mobile world'. (If your immediate reaction to this sentence was "Damn right, things are obscenely expensive in the mobile world", or words to that effect, you're ahead of me already.)<blockquote>I deal with companies every day who have no qualms about charging 25 cents to send 160 characters of data from one person to another, or who have no problems charging $3.00 for a 10kb .gif image or a bad .midi version of a popular song, or even up to $10.00 for a small Java clone of Tetris - a 20 year old game. Unlike the web world, the mobile world is accustomed to charging for every thing that has the slightest bit of value. The difference between the markets couldn’t be more drastic. I know of a mobile chat site that’s on many carrier decks that’s a great example of this. To use it, you need to sign up to a subscription for $3.00 a month, and in return you get a URL which links to a very basic WAP based chat. This would be okay in my mind if there was some sort of extra special functionality, but there’s not.</blockquote>Follow this reasoning. Money is being charged; in Russell's mind this would be okay if there was 'extra special functionality' involved; but there isn't. So, by implication, it's not okay. The money is real, the value isn't. An equally poor service which was free would be better. A better service which was free would be better still. Right? Well...<blockquote>But don’t get me wrong, it’s not that this is a bad service or a rip off - they are providing a chat app as promised and it works. It’s just the fact that this particular app could be written by any developer in the Valley in less than an hour, and yet they easily have thousands if not millions of paying subscribers world wide.</blockquote>The part about how the value isn't real and it's not okay? Forget that. The value <b>is</b> real, obviously, because <i>they are providing a chat app as promised and it works</i>. In other words, the measure of the value of a service is the fact that people are willing to pay for it. And if people aren't paying for a service that has value to them (because it does stuff that they want it to do), then that's just <b>wrong</b> and we shouldn't encourage them.<blockquote>Why will people gladly pay $3.00 for a basic mobile chat site and not pay anything for a decent web service? I think it’s mostly because of expectations, and honestly, the naivete of many of the people trying to start “businesses” on the web today.</blockquote>Really, the hype around Web 2.0 has got to stop until all concerned stop acting like a bunch of hippies and start concentrating on what really matters, which is of course money:<blockquote>I really do think there should be a litmus test for new web apps launched from now on - something very basic and if they don’t pass, they don’t qualify for any buzz or linkage. It’s a simple test: Will they take my credit card? That’s it. I don’t care if they have advertisers or sponsors or god knows what else, all I want to see is a place where I can type in my credit card for some service.</blockquote>Money: that's what Russell wants. Or rather, that's what he wants to be <b>charged</b>. After all, if you're giving it away, it can't have very much value.<br /><br />Ultimately, for Russell, there are two very simple questions which software developers need to be able to answer if they're going to have any hope of jumping the Web 2.0 train. <i>Do you want to get VC cash and an office in downtown Palo Alto</i>, or not? And if not, WTF is wrong with you?<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/11493954-114113232640779300?l=phenomenologic.blogspot.com'/></div>Philhttp://www.blogger.com/profile/07009879034507926661noreply@blogger.com0