Saturday, February 05, 2005
The Google Query Bomb
I first learned about Google a few years back in a New Yorker article. The article immediately caught my attention because at the time I was trying to figure out what the best search engine was. I was under the impression for some reason that it was Northernlights.com, but I could never make sense of their results page. The New Yorker article answered my question.

But what I found most intriguing was a statement in passing that Google saves all its queries. What value did a string of 3 or 4 words have? No one knew. So why did they do it? Probably because they could.

I don't think the significance of this immediately struck me. But I see it now as a marker of one of the major epochal fault lines in human history -- the technological problem of saving massive amounts of information has become almost trivial.

The value of those short query strings, I have come to realize, is not so trivial. For one thing, there is probably more information saved with each query than just the 3 or 4 words fed into the query field. There's also whatever information can be extracted from your browser (e.g., your IP address). There is all the other information people put out there on the web identifying themselves. And then there are the algorithms that can mine the vast oceans of data being gathered and divine all those interesting and incriminating patterns.

What I imagine will eventually happen is this: with the advancement of data-mining algorithms and other esoteric pattern-recognition techniques, Google (or some other service) eventually will be able to tie every query you ever made, however filthy or disgusting, back to you. They'll probably be able to identify every site you've ever visited. And thus it will be revealed, for instance, that on September 12, 2001, only one day after the most important tragedy in the history of the world, while a nation mourned, Tomohiro Idokoro searched for "hairy slut beaver shots" on his desktop computer.

But then we'll find out that a lot of other people did, too. So it won't turn out to be as big a deal as it seems to be right now.