<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="wordpress/2.3.3" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>The Future of the Internet—And How to Stop It</title>
	<link>http://yupnet.org/zittrain</link>
	<description>Jonathan L. Zittrain</description>
	<pubDate>Sat, 10 May 2008 14:31:10 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.3.3</generator>
	<language>en</language>
			<item>
		<title>Conclusion</title>
		<link>http://yupnet.org/zittrain/archives/21</link>
		<comments>http://yupnet.org/zittrain/archives/21#comments</comments>
		<pubDate>Tue, 18 Mar 2008 02:16:53 +0000</pubDate>
		<dc:creator>The Editors</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://yupnet.org/zittrain/archives/21</guid>
		<description><![CDATA[Nicholas Negroponte, former director of the MIT Media Lab, announced the One Laptop Per Child (OLPC) project at the beginning of 2005. The project aims to give one hundred million hardy, portable computers to children in the developing world. The laptops, called XOs, are priced around $100, and they are to be purchased by governments [...]]]></description>
			<content:encoded><![CDATA[<p>Nicholas Negroponte, former director of the MIT Media Lab, announced the One Laptop Per Child (OLPC) project at the beginning of 2005. The project aims to give one hundred million hardy, portable computers to children in the developing world. The laptops, called XOs, are priced around $100, and they are to be purchased by governments and given to children through their schools.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-1')">1</a></sup> As of this writing Brazil, Libya, Mexico, Nigeria, Peru, Rwanda, and Uruguay have committed to a pilot run that will have the XO’s assembly lines ramping up to five million machines per month and totaling approximately 20 percent of all laptop manufacturing in the world.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-2')">2</a></sup></p>
<p>The pitch to governments footing the bill emphasizes alignment with existing schoolhouse curricula and practices. A laptop can be a cost-effective way to distribute textbooks, because it can contain so much data in a small space and can be updated after it has been distributed. Says Negroponte: “The hundred-dollar laptop is an education project. It’s not a laptop project.”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-3')">3</a></sup></p>
<p>Yet OLPC is about revolution rather than evolution, and it embodies both the promise and challenge of generativity. The project’s intellectual pedigree and structure reveal an enterprise of breathtaking theoretical and logistical ambition. The education Negroponte refers to is not the rote learning represented by the typical textbook and the three R’s that form the basis of most developing and developed country curricula. Rather, the XO is shaped to reflect the theories of fellow Media Lab visionary Seymour Papert. Alternately known as constructionism or constructivism, Papert’s vision of education downplays drills in hard facts and abstract skills in favor of a model that teaches students how to learn by asking them to undertake projects that they find relevant to their everyday lives.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-4')">4</a></sup></p>
<p>A modest incarnation of the OLPC project would distribute PCs as electronic workbooks. The PCs would run the consumer operating systems and applications prevailing in the industrialized world—the better to groom students for work in call centers and other outsourced IT-based industries. Microsoft, under competition from free operating systems, has shown a willingness to greatly reduce the prices for its products in areas where wallets are smaller, so such a strategy is not necessarily out of reach, and in any case the XO machine could run one of the more consumer-friendly versions of free Linux without much modification.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-5')">5</a></sup></p>
<p>But the XO completely redesigns today’s user interfaces from the ground up. Current PC users who encounter an XO have a lot to unlearn. For example, the arrow pointer serves a different purpose: moving the XO’s arrow toward the center of the screen indicates options that apply only to that computer; moving the pointer toward any edge indicates interaction with nearby computers or the community at large. </p>
<p>The XO envisions students who are able to hack their own machines: to reprogram them even as they are learning to read and write—and to do so largely on their own initiative. The XO dissemination plan is remarkably light on both student and teacher training. There are a handful of trainers to cover the thousands of schools that will serve as distribution points, and the training function is more to ensure installation and functioning of the servers rather than true mastery of the machines. Students are expected to rely on each other and on trial-and-error to acquire most of the skills needed to use and reprogram the machines. </p>
<p>Content also seems a calculated afterthought. The XO project wiki—haphazardly organized, as wikis tend to be—featured a “call for content” in late 2006, mere months before millions of machines were to be placed in children’s hands, for “content creators, collectors, and archivists, to suggest educational content for inclusion with the laptops, to be made available to millions of children in the developing the world, most of whom do not have access to learning materials.”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-6')">6</a></sup> Determining exactly what would be bundled on the machines, what would repose on servers at schools, and what would be available on the XO Web site for remote access was very much a work in progress even as deployment dates neared. </p>
<p>In other words, XO has embraced the procrastination principle that is woven through generative technologies. To the chagrin and discomfort of most educational academics following the project, there is little focus on specific educational outcomes or metrics.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-7')">7</a></sup> There are no firm plans to measure usage of the laptops, or to correlate changes in test scores with their deployment and use. Instead, the idea is to create an infrastructure that is both simple and generative, stand back, and see what happens, fixing most major substantive problems only as they arise, rather than anticipating them from the start. </p>
<p>Thus as much as Negroponte insists that the project is not a technology play, the lion’s share of the effort has gone into just that, and is calculated to promote a very special agenda of experimentation. Central to the XO’s philosophy is that each machine should belong to a single child, rather than being found in a typical computer lab or children’s cyber café. That partially explains the XO’s radical design, both in physical form and in software. It features especially small keys so that adults cannot easily use it if they should steal it from a child, and it has no moving parts within. There is no hard drive to suffer from a fall; the screen is designed to be viewable in direct sunlight; and it consumes little enough power that it can be recharged with a crank or other physical motion in the absence of a source of electricity. The machines automatically form mesh networks with one another so that children can share programs and data with each other or connect to a school’s data repository in the absence of any ISPs. It is a rediscovery of the principles behind FIDOnet, the ad hoc network of bulletin boards programmed on PCs that called each other using modems before PC users could connect to the Internet. </p>
<p>One bundled application, TamTam, lets a child use the machine to generate music and drumbeats, and nearby machines can be coordinated through their mesh networks so that each one represents a different instrument in a symphony the group can compose and play. Just as some students might develop and express talents at the technical layer, reprogramming the machines, others might be inspired to develop musical talents through the rough tools of Tam- Tam at the content layer. </p>
<p>Constructionism counts on curiosity and intellectual passion of self- or informally taught individuals as its primary engine, exactly the wellspring tapped by generative systems. From XO’s founders we see an attempt to reprise the spirit that illuminated the original personal computer, Internet, and Web. They believe that it is less important to provide content than to provide a means of making it and passing it along, just as an Internet without any plan for content ended up offering far more than the proprietary walled gardens that had so carefully sponsored and groomed their offerings. There is a leap of faith that a machine given entirely to a child’s custody, twenty-four hours a day, will not promptly be lost, stolen, or broken. Instead, children are predicted to treat these boxes as dear possessions, and some among them will learn to program, designing and then sharing new applications that in turn support new kinds of content and interaction that may not have been invented in the developed world. </p>
<p>Yet the makers of the XO are aware that it is not the dawn of the networked era. We have experienced market boom and wildly successful applications, but also bust, viruses, and spam. The sheer scale and public profile of the XO project make it difficult fully to embrace an experimentalist spirit, whether at the technical, content, or social layers. The sui generis modified Linux-based operating systems within the XO machines give them an initial immunity to the worms and viruses that plague the machines of the developed world, so that should they choose to surf the world’s Web they will not be immediately overcome by the malware that otherwise requires constantly updated firewalls. They can breathe the digital air directly, without the need for the expensive antivirus “clean suits” that other PCs must have. XO’s director of security has further implemented a security architecture for the machines that keeps programs from being able to communicate with each other, in order to preemptively quarantine any attack in one part of the machine.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-8')">8</a></sup> This means that a word processor cannot talk directly to a music program, and an Internet program cannot talk to a drawing program. This protects the machine from hypothetical viruses, but it also adds a layer of inflexibility and complexity to an operating system that children are supposed to be able to understand and modify. </p>
<p>The XO thus combines its generative foundation with elements of a tethered appliance. XO staff have vowed never to accede to requests for content filtering<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-9')">9</a></sup>— yet they have built a kill switch into the machines so that stolen models can be selectively disabled,<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-10')">10</a></sup> and such a switch opens the door to later remote control. Thus, XOs are both independent as they can form mesh networks, and tethered as they can be updated, monitored, and turned off from afar, so long as they are connected to the Internet. They are generative in spirit and architecture, and they are also appliances, painstakingly designed to be reliable to and usable by someone who cannot read or write. They combine the hope of the early Internet era with the hard lessons of its second phase. They represent the confusion of the interregnum between the unbridled explosion of cheap and flexible processors, networks, and sensors, and the tightening up that comes as their true power is appreciated—and abused. </p>
<p>Perhaps the audience of schoolchildren in developing countries is remote and uninteresting enough to those who want to control or compromise today’s information technology that it will be helpfully overlooked during the critical time period in which backwater status helps to foster generative development. Just as domain names were originally first-come, first-served, and no major companies reserved their own names or foresaw a trademark problem, poor schoolchildren may not be deemed enough of an economic market to be worth vying for—either in attracting their eyeballs to show them advertising, or in preventing them from exchanging bits that could be copyrighted. There are no preexisting CD sales among them to dent. </p>
<p>XO is but the most prominent and well-funded of a series of enterprises to attempt to bridge the digital divide. Other efforts, such as the Volkscomputer in Brazil, the VillagePDA, and the Ink have fared poorly, stuck at some phase of development or production.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-11')">11</a></sup> Negroponte’s impatience with tentative initial steps, and with the kind of planning and study that firm-based ventures usually require, has worried many in the international development community. They fear that a prominent failure of the project could unduly tarnish other attempts to deploy technology in the developing world. The Indian government announced in 2006 that it would not sign up to buy any XO machines, in part due to difficulties encountered with the Simputer, a for-profit project begun in 1998 to deliver handheld technology to India’s rural population, which is made up mostly of farmers and laborers—many of whom are illiterate and speak regional dialects. In 2001, Bruce Sterling lionized the Simputer as “computing as it would have looked if Gandhi had invented it, then used Steve Jobs for his ad campaign.”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-12')">12</a></sup> It never took off. Instead India appears to be placing its bets on the Novantium Nova or a similar device, non-generative machines fully tethered to a subscription server for both software and content.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-13')">13</a></sup></p>
<p>Will XO fail like the others? Development experts view it as skeptically as education experts do, seeing XO as yet another risky heaving of hardware at problems that are actually political, social, and economic in nature. Debates on the XO wiki wonder whether teching-up an entire generation of millions of children will be good or bad for those already online. Some worry that the already- formidable sources of Nigerian “419” spam soliciting business deals will grow and diversify. There is even musing that guerrilla fighters could use the laptops’ mesh networking capabilities to communicate secretly on the battlefield.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-14')">14</a></sup> (Depending on which side one supports in that battle, that could be good, although it is a far cry from the notion of laptops as educational gateways for children.) </p>
<p>As computer scientist Gene Spafford wrote: </p>
<blockquote><p>We can’t defend against the threats we are facing now. If these mass computer giveaways succeed, shortly we will have another billion users online who are being raised in environments of poverty, with little or no education about proper IT use, and often in countries where there is little history of tolerance (and considerable history of religious, ethnic and tribal strife). Access to eBay and YouTube isn’t going to give them clean water and freedom from disease. But it may help breed resentment and discontent where it hasn’t been before.</p></blockquote>
<blockquote><p>Gee, I can barely wait. The metaphor that comes to mind is that if we were in the ramp-up to the Black Plague in the middle ages, these groups would be trying to find ways to subsidize the purchase of pet rats.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-15')">15</a></sup></p></blockquote>
<p>Spafford appears to recognize the delicate condition of today’s Net, and he believes that a pause in expansion is needed—a bit of time to digest the problems that beset it. The easier and more risk-averse path is to distribute mobile phones and other basic Net appliances to the developing world just as those devices are becoming more central in the developed one, bridging the digital divide in one sense—providing useful technology—while leaving out the generative elements most important to the digital space’s success: the integration of people as participants in it rather than only consumers of it. </p>
<p>But a project like OLPC offers an opportunity to demonstrate fixes to the Net’s problems among audiences that have yet to encounter it. Its XO represents a new path of continued if cautious generativity as the developed world’s technology is beginning to ossify under the weight of its own success. It represents a faith not only that students can learn to reprogram their computers, but that what they direct them to do will be, on balance, good if disruptive. </p>
<p>The story of the XO is the story of the generative pattern. The pattern begins with the ambitious planting of a flag for a new enterprise in an overlooked backwater. The procrastination principle gives license for the idea’s technical and social blueprints to be incomplete. Contribution is welcome from outsiders, and if the project takes off, the results may prove completely unexpected. </p>
<p>The XO’s skeptics have much in common with generativity’s skeptics. They can convincingly put forward the very real risks attendant to a project only partially planned, without extensive layers of measurement, control, and accountability. These risks are most obviously grouped under the rubric of security, but the label is far too narrow either to capture the problem or to point us to the most promising solutions—just as the story of badware on PCs is not simply a story about security worries on the Internet, narrowly defined. </p>
<p>Rather, the limits of an open PC and Net, and the fears for the XO, are much more general case studies of what happens within systems that are built with a peculiar and extraordinary openness to contribution and innovation and that succeed because of it. They challenge us to understand and meet the problems arising from success in a way that does not neuter what made the original success possible. </p>
<p>The puzzle of PC security is fundamentally the same as the puzzle of keeping Wikipedia honest and true—and perhaps giving birth to its version 2.0 successor— now that Wikipedia has entered the prime-time glare, attracting participants who are ignorant or scornful of its ideals. It is the puzzle of empowering people to share and trade stories, photos, and recommendations without losing their identities as they become not only the creators of distributed scrutiny and judgment, but also their subjects. </p>
<p>It is the puzzle of Herdict, the application designed to run on netizens’ PCs to generate and share a collective map of vital signs, that can produce distributed judgments about good code and bad. One of the first questions asked about Herdict is whether makers of badware will simply hijack lots of PCs and compel them to report to Herdict that they are happy, when in fact they are not. One answer acknowledges the problem and then seeks, from day one, to forestall it while it is still on the drawing board, with attendant complication, investment, and elaboration. An alternative answer says: The point at which Herdict is worth the effort of bad people to game it is a milestone of success. It is a token of movement from the primordial soup that begins the generative pattern to the mainstream impact that attracts the next round of problems. </p>
<p>Imagine planning but not yet executing Wikipedia: “Won’t people come along and vandalize it?” One response to that question, and to the others like it that arise for an idea as crazy as Wikipedia, would be to abandon the idea—to transform it so much in anticipation of the problems that it is unrecognizable from its original generative blueprint. The response instead was to deem the question reasonable but premature. The generativity that makes it vulnerable also facilitates the tools and relationships through which people can meet the problems when first-round success causes them to materialize. </p>
<p>The animating spirit: “Ready, fire, aim.” This ethos is a major ingredient of Google’s secret sauce as a company, a willingness to deploy big ideas that remain labeled “beta” for months even as they become wildly popular, as Google News was. It lies behind the scanning of all the world’s books, despite the logistical burdens and legal uncertainties. To the amazement of those of us who work for universities and could not possibly persuade our general counsels to risk their clients’ endowments on such a venture, Google simply started doing it. The litigation continues as this book goes to press, and so does the scanning of the books and the indexing of their contents, available to hundreds of millions of people who would otherwise never know of them, at books.google.com. </p>
<p>How we choose to approach generative puzzles animates the struggle between the models of the Net and of television, of the insurgent and the incumbent. Traditional cyberlaw frameworks tend to see the Net as an intriguing force for chaos that might as well have popped out of nowhere. It is too easy to then shift attention to the “issues raised” by the Net, usually by those threatened by it—whether incumbent technical-layer competitors like traditional telephony providers, or content-layer firms like record companies whose business models (and, to be sure, legally protected interests) are disrupted by it. Then the name of the game is seen to be coming up with the right law or policy by a government actor to address the issues. Such approaches can lead to useful, hard-nosed insights and suggestions, but they are structured to overlook the fact that the Net is quite literally what we make it. </p>
<p>The traditional approaches lead us in the direction of intergovernmental organizations and diplomatically styled talk-shop initiatives like the World Summit on the Information Society and its successor, the Internet Governance Forum, where “stakeholders” gather to express their views about Internet governance, which is now more fashionably known as “the creation of multi-stakeholder regimes.” Such efforts import from professional diplomacy the notion of process and unanimity above all. Their solution for the difficulties of individual state enforcement on the Net is a kind of negotiated intellectual harmony among participants at a self-conscious summit—complex regimes to be mapped out in a dialogue taking place at an endlessly long table, with a role for all to play. Such dialogues end either in bland consensus pronouncements or in final documents that are agreed upon only because the range of participants has been narrowed. </p>
<p>It is no surprise that this approach rarely gets to the nuts and bolts of designing new tools or grassroots initiatives to take on the problems it identifies. The Net and its issues sail blithely on regardless of the carefully worded communiqués that emerge from a parade of meetings and consultations. Stylized gatherings of concerned stakeholders are not inherently bad—much can come of dialogue among parties whose interests interconnect. Indeed, earlier in this book I called for a latter-day Manhattan Project to take on the most pressing problems facing the generative Internet. But the types of people that such a project requires are missing from the current round of “stakeholder governance” structures. Missing are the computer scientists and geeks who would rather be coding than attending receptions in Geneva and Tunis. Without them we too easily neglect the prospect that we could code new tools and protocols to facilitate social solutions—the way that the robots.txt of Chapter Nine has so far headed off what otherwise would have been yet another cyberlaw issue. </p>
<p>To be sure, from the earliest days of the Internet the people who designed its protocols acceded to some formality and diplomacy. Recall that they published “RFCs,” requests for comments designed to write up their ideas, creating institutional structure and memory as the project became bigger than just a few researchers in a room. The author of the first one—RFC 1—recalls: “We parceled out the work and wrote the initial batch of memos. In addition to participating in the technical design, I took on the administrative function of setting up a simple scheme for numbering and distributing the notes. Mindful that our group was informal, junior and unchartered, I wanted to emphasize these notes were the beginning of a dialog and not an assertion of control.”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-16')">16</a></sup></p>
<p>Informal, junior, and unchartered, yet collaborative and at least partially structured: this includes people who are eager to take on a parcel of work and build. It represents the ingredients found in the generative soil of Wikipedia, Pledgebank, Meetup, CouchSurfing.com, and other countless innovations that abound on the Net, themselves made possible because the Net’s soil is made of the same stuff. The way to secure it and the innovations built upon it is to empower rank-and-file users to contribute, rather than to impose security models that count on a handful of trusted people for control. We need tools that cure the autistic nature of today’s Net experience: PC users unaware of their digital environments and therefore unable to act on social cues, whether of danger or of encouragement. </p>
<p>If history is a guide, these tools can just as likely come from one or two researchers as from hackers, and the properly executed Manhattan Project to bolster the Net for another round of success will not be marked by centralization so much as by focus: the application of money and encouragement to those who step forward to help fix the most important and endemic problems that can no longer tolerate any procrastination. </p>
<p>Just as the XO’s technology platform seeks to cultivate such contributions as routine rather than as obscure or special, by placing generative technologies into as many children’s hands as possible, the educational systems in the developed world could be geared to encourage and reward such behavior, whether at the technical, content, or social layers. </p>
<p>Unfortunately, the initial reaction by many educators to digital participatory enterprises—ones that indeed may be subverted by their users—is fear. Many teachers are decrying the ways in which the Net has made it easy to plagiarize outright or to draw from dubious sources.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-17')">17</a></sup> Some schools and universities have banned the citation of Wikipedia in student papers,<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-18')">18</a></sup> while signing up for plagiarism detection services like TurnitIn.com and automatic essay-grading tools like SAGrader.com, which “uses computational intelligence strategies to grade students [<em>sic</em>] essays in seconds and respond with detailed, topic-specific feedback.”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-19')">19</a></sup></p>
<p>Instead of being subject to technology that automates and reinforces the worst aspects of contemporary education—emphasizing regurgitation and summarization of content from an oracular source, followed by impersonal grading within a conceptual echo chamber—our children ought to be encouraged to accept the participatory invitation of the Net and that which has recursively emerged at its upper layers from its open technologies below. Wikipedia’s conceded weakness as a source is an invitation to improve it, and the act of improving it can be as meaningful to the contributor as to those who come after. Our students can be given assignments that matter—reading with a critical eye the sources that the rest of the online world consults, and improving them as best they know how, using tools of research and argument and intellectual honesty that our schools can teach. Instead of turning in a report for a single teacher to read, they can put their skills into work that everyone can read. The millions of students doing make-work around the world can come to learn instead that what they do can have consequences—and that if they do not contribute, it is not necessarily true that others will. In other words, we can use our generative technologies to teach our children to take responsibility for the way the world works rather than to be merely borne along by its currents. This will work best if our teachers are on board. Without people to whom others can apprentice, to learn technical skills and social practice, generative technologies can flounder. This is the XO’s vulnerability, too—if it fails, it may in large part be because the technology was too difficult to master and share, and its possibilities not hinted at enough to entice learners to persist in their attention to it. </p>
<p>Like the XO, generativity itself is, at its core, not a technology project. It is an education project, an exercise in intellect and community, the founding concepts of the university. Our universities are in a position to take a leadership role in the Net’s future. They were the Net’s original cradle, along with the self-taught hobbyists who advanced the PC from its earliest days. Business and commerce followed in their wake, refining and expanding the opportunities developed through largely nonmarket process and ethos. The Internet and attached generative platforms are invitations to code and to build. Universities— and not just their computer science departments—should see those invitations as central to their missions of teaching their students and bringing knowledge to the world. </p>
<p>As countries and groups in the developing world incline to brand new generative technologies, those in the developed world must fight to retain theirs. There is not a simple pendulum swinging from generative to non-generative and back again; we cannot count on the fact that screws tightened too far can become stripped. Any comprehensive redesign of the Internet at this late stage would draw the attention of regulators and other parties who will push for ways to prevent abuse before it can even happen. Instead, we must piecemeal refine and temper the PC and the Net so that they can continue to serve as engines of innovation and contribution while mitigating the most pressing concerns of those harmed by them. We must appreciate the connection between generative technology and generative content. </p>
<p>Today’s consumer information technology is careening at breakneck pace, and most see no need to begin steering it. Our technologists are complacent because the ongoing success of the generative Net has taken place without central tending—the payoffs of the procrastination principle. Rank-and-file Internet users enjoy its benefits while seeing its operation as a mystery, something they could not possibly hope to affect. They boot their PCs each day and expect them more or less to work, and they access Wikipedia and expect it more or less to be accurate. </p>
<p>But our Net technologies are experiencing the first true shock waves from their generative successes. The state of the hacking arts is advancing. Web sites can be compromised in an instant, and many visitors will then come away with an infected PC simply for having surfed there. Without a new cadre of good hackers unafraid to take ownership of the challenges posed by their malicious siblings and create the tools needed to help nonhackers keep the Net on a constructive trajectory, the most direct solutions will be lockdown that cuts short the Net experiment, deprives us of its fruits, and facilitates a form of governmental control that upends a balance between citizen and sovereign. These ripples can be followed recursively up the Net’s layers. Our generative technologies need technically skilled people of goodwill to keep them going, and the fledgling generative activities above—blogging, wikis, social networks—need artistically and intellectually skilled people of goodwill to serve as true alternatives to a centralized, industrialized information economy that asks us to identify only as consumers of meaning rather than as makers of it. Peer production alone does not guarantee collaborative meaning making. Services like Inno- Centive place five-figure bounties on difficult but modular scientific problems, and ask the public at large to offer solutions.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-20')">20</a></sup> But the solutions tendered then become the full property of the institutional bounty hunter.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-21')">21</a></sup> Amazon’s Mechanical Turk has created a marketplace for the solving of so-called human intelligence tasks on the other side of the scale: trivial, repetitive tasks like tracing lines around the faces in photographs for a firm that has some reason to need them traced.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-conclusion#note-22')">22</a></sup> If five years from now children with XOs were using them for hours each day primarily to trace lines at half a penny per trace, it could be a useful economic engine to some and a sweatshop to others—but either way it would not be an activity that is generative at the content layer. </p>
<p>The deciding factor in whether our current infrastructure can endure will be the sum of the perceptions and actions of its users. There are roles for traditional state sovereigns, pan-state organizations, and formal multistakeholder regimes to play. They can help reinforce the conditions necessary for generative blossoming, and they can also step in—with all the confusion and difficulty that notoriously attends regulation of a generative space—when mere generosity of spirit among people of goodwill cannot resolve conflict. But such generosity of spirit is a society’s powerful first line of moderation. </p>
<p>Our fortuitous starting point is a generative device in tens of millions of hands on a neutral Net. To maintain it, the users of those devices must experience the Net as something with which they identify and belong. We must use the generativity of the Net to engage a constituency that will protect and nurture it. That constituency may be drawn from the ranks of a new generation able to see that technology is not simply a video game designed by someone else, and that content is not simply what is provided through a TiVo or iPhone.</p>
]]></content:encoded>
			<wfw:commentRss>http://yupnet.org/zittrain/archives/21/feed</wfw:commentRss>
		</item>
		<item>
		<title>Chapter 9: Meeting the Risks of Generativity: Privacy 2.0</title>
		<link>http://yupnet.org/zittrain/archives/20</link>
		<comments>http://yupnet.org/zittrain/archives/20#comments</comments>
		<pubDate>Sun, 16 Mar 2008 23:18:08 +0000</pubDate>
		<dc:creator>The Editors</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://yupnet.org/zittrain/archives/20</guid>
		<description><![CDATA[So far this book has explored generative successes and the problems they cause at the technical and content layers of the Internet. This chapter takes up a case study of a problem at the social layer: privacy. Privacy showcases issues that can worry individuals who are not concerned about some of the other problems discussed [...]]]></description>
			<content:encoded><![CDATA[<p>So far this book has explored generative successes and the problems they cause at the technical and content layers of the Internet. This chapter takes up a case study of a problem at the social layer: privacy. Privacy showcases issues that can worry individuals who are not concerned about some of the other problems discussed in this book, like copyright infringement, and it demonstrates how generativity puts old problems into new and perhaps unexpected configurations, calling for creative solutions. Once again, we test the notion that solutions that might solve the generative problems at one layer—solutions that go light on law, and instead depend on the cooperative use of code to cultivate and express norms—might also work at another. </p>
<p>The heart of the next-generation privacy problem arises from the similar but uncoordinated actions of individuals that can be combined in new ways thanks to the generative Net. Indeed, the Net enables individuals in many cases to compromise privacy more thoroughly than the government and commercial institutions traditionally targeted for scrutiny and regulation. The standard approaches that have been developed to analyze and limit institutional actors do not work well for this new breed of problem, which goes far beyond the compromise of sensitive information. </p>
<p><strong>PRIVACY 1.0</strong></p>
<p>In 1973, a blue-ribbon panel reported to the U.S. Secretary of Health, Education, and Welfare (HEW) on computers and privacy. The report could have been written today: </p>
<blockquote><p>It is no wonder that people have come to distrust computer-based record-keeping operations. Even in non-governmental settings, an individual’s control over the personal information that he gives to an organization, or that an organization obtains about him, is lessening as the relationship between the giver and receiver of personal data grows more attenuated, impersonal, and diffused. There was a time when information about an individual tended to be elicited in face-to-face contacts involving personal trust and a certain symmetry, or balance, between giver and receiver. Nowadays an individual must increasingly give information about himself to large and relatively faceless institutions, for handling and use by strangers—unknown, unseen and, all too frequently, unresponsive. Sometimes the individual does not even know that an organization maintains a record about him. Often he may not see it, much less contest its accuracy, control its dissemination, or challenge its use by others.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-1')">1</a></sup></p></blockquote>
<p>The report pinpointed troubles arising not simply from powerful computing technology that could be used both for good and ill, but also from its impersonal quality: the sterile computer processed one’s warm, three-dimensional life into data handled and maintained by faraway faceless institutions, viewed at will by strangers. The worries of that era are not obsolete. We are still concerned about databases with too much information that are too readily accessed; databases with inaccurate information; and having the data from databases built for reasonable purposes diverted to less noble if not outright immoral uses.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-2')">2</a></sup></p>
<p>Government databases remain of particular concern, because of the unique strength and power of the state to amass information and use it for life-altering purposes. The day-to-day workings of the government rely on numerous databases, including those used for the calculation and provision of government benefits, decisions about law enforcement, and inclusion in various licensing regimes.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-3')">3</a></sup> Private institutional databases also continue to raise privacy issues, particularly in the realms of consumer credit reporting, health records, and financial data. </p>
<p>Due to political momentum generated by the HEW report and the growing controversy over President Richard Nixon’s use of government power to investigate political enemies, the U.S. Congress enacted comprehensive privacy legislation shortly after the report’s release. The Privacy Act of 1974 mandated a set of fair information practices, including disclosure of private information only with an individual’s consent (with exceptions for law enforcement, archiving, and routine uses), and established the right of the subject to know what was recorded about her and to offer corrections. While it was originally intended to apply to a broad range of public and private databases to parallel the HEW report, the Act was amended before passage to apply only to government agencies’ records.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-4')">4</a></sup> Congress never enacted a comparable comprehensive regulatory scheme for private databases. Instead, private databases are regulated only in narrow areas of sensitivity such as credit reports (addressed by a complex scheme passed in 1970 affecting the handful of credit reporting agencies)<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-5')">5</a></sup> and video rental data,<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-6')">6</a></sup> which has been protected since Supreme Court nominee Robert Bork’s video rental history was leaked to a newspaper during his confirmation process in 1987.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-7')">7</a></sup></p>
<p>The HEW report expresses a basic template for dealing with the informational privacy problem: first, a sensitivity is identified at some stage of the information production process—the gathering, storage, or dissemination of one’s private information—and then a legal regime is proposed to restrict these activities to legitimate ends. This template has informed analysis for the past thirty years, guiding battles over privacy both between individuals and government and between individuals and “large and faceless” corporations. Of course, a functional theory does not necessarily translate into successful practice. Pressures to gather and use personal data in commerce and law enforcement have increased, and technological tools to facilitate such data processing have matured without correspondingly aggressive privacy protections.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-8')">8</a></sup> (Consider Chapter Five’s description of the novel uses of tethered appliances to conduct surveillance.) In 1999, Scott McNealey, CEO of Sun Microsystems, was asked whether a new Sun technology to link consumer devices had any built-in privacy protection. “You have zero privacy anyway,” he replied. “Get over it.”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-9')">9</a></sup></p>
<p>McNealey’s words raised some ire at the time; one privacy advocate called them “a declaration of war.”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-10')">10</a></sup> McNealey has since indicated that he believes his answer was misunderstood.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-11')">11</a></sup> But the plain meaning of “getting over it” seems to have been heeded: while poll after poll indicates that the public is concerned about privacy,<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-12')">12</a></sup> the public’s actions frequently belie these claims. Apart from momentary spikes in privacy concerns that typically arise in the wake of highprofile scandals—such as Watergate or the disclosure of Judge Bork’s video rentals—we routinely part with personal information and at least passively consent to its use, whether by surfing the Internet, entering sweepstakes, or using a supermarket discount card. </p>
<p>Current scholarly work on privacy tries to reconcile people’s nonchalant behavior with their seemingly heartfelt concerns about privacy. It sometimes calls for industry self-regulation rather than direct governmental regulation as a way to vindicate privacy interests, perhaps because such regulation is seen as more efficient or just, or because direct governmental intervention is understood to be politically difficult to achieve. Privacy scholarship also looks to the latest advances in specific technologies that could further weaken day-to-day informational privacy.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-13')">13</a></sup> One example is the increasing use of radio frequency identifiers (RFIDs) in consumer items, allowing goods to be scanned and tracked at a short distance. One promise of RFID is that a shopper could wheel her shopping cart under an arch at a grocery store and obtain an immediate tally of the price of its contents; one peril is that a stranger could drive by a house with an RFID scanner and instantly inventory its contents, from diapers to bacon to flat-screen TVs, immediately discerning the sort of people who live within. </p>
<p>This work on privacy generally hews to the original analytic template of 1973: both the analysis and suggested solutions talk in terms of institutions gathering data, and of developing ways to pressure institutions to better respect their customers’ and clients’ privacy. This approach is evident in discussions about electronic commerce on the Internet. Privacy advocates and scholars have sought ways to ensure that Web sites disclose to people what they are learning about consumers as they browse and buy. The notion of “privacy policies” has arisen from this debate. Through a combination of regulatory suasion and industry best practices, such policies are now found on many Web sites, comprising little-read boilerplate answering questions about what information a Web site gathers about a user and what it does with the information. Frequently the answers are, respectively, “as much as it can” and “whatever it wants”—but, to some, this is progress. It allows scholars and companies alike to say that the user has been put on notice of privacy practices. </p>
<p>Personal information security is another area of inquiry, and there have been some valuable policy innovations in this sphere. For example, a 2003 California law requires firms that unintentionally expose their customers’ private data to others to alert the customers to the security breach.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-14')">14</a></sup> This has led to a rash of well-known banks sending bashful letters to millions of their customers, gently telling them that, say, a package containing tapes with their credit card and social security numbers has been lost en route from one processing center to another.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-15')">15</a></sup> Bank of America lost such a backup tape with 1.2 million cus- tomer records in <sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-16')">2005.16</a></sup> That same year, a MasterCard International security breach exposed information of more than 40 million credit card holders.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-17')">17</a></sup> Boston College lost 120,000 alumni records to hackers as a result of a breach.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-18')">18</a></sup> The number of incidents shows little sign of decreasing,<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-19')">19</a></sup> despite the incentives provided by the embarrassment of disclosure and the existence of obvious ways to improve security practices. For minimal cost, firms could minimize some types of privacy risks to consumers—for example, by encrypting their backup tapes before shipping them anywhere, making them worthless to anyone without a closely held digital key. </p>
<p>Addressing Web site privacy and security has led to elaborations on the traditional informational privacy framework. Some particularly fascinating issues in this framework are still unfolding: is it fair, for example, for an online retailer like Amazon to record the average number of nanoseconds each user spends contemplating an item before clicking to buy it? Such data could be used by Amazon to charge impulse buyers more, capitalizing on the likelihood that this group of consumers does not pause long enough to absorb the listed price of the item they just bought. A brief experiment by Amazon in differential pricing resulted in bad publicity and a hasty retreat as some buyers noticed that they could save as much as $10 on a DVD by deleting browser cookies that indicated to Amazon that they had visited the site before.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-20')">20</a></sup> As this example suggests, forthrightly charging one price to one person and another price to someone else can generate resistance. Offering individualized discounts, however, can amount to the same thing for the vendor while appearing much more palatable to the buyer. Who would complain about receiving a coupon for $10 off the listed price of an item, even if the coupon were not transferable to any other Amazon user? (The answer may be “someone who did not get the coupon,” but to most people the second scenario is less troubling than the one in which different prices were charged from the start.)<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-21')">21</a></sup></p>
<p>If data mining could facilitate price discrimination for Amazon or other online retailers, it could operate in the tangible world as well. As a shopper uses a loyal-customer card, certain discounts are offered at the register personalized to that customer. Soon, the price of a loaf of bread at the store becomes indeterminate: there is a sticker price, but when the shopper takes the bread up front, the store can announce a special individualized discount based on her relationship with the store. The sticker price then becomes only that, providing little indication of the price that shoppers are actually paying. Merchants can also vary service. Customer cards augmented with RFID tags can serve to identify those undesirable customers who visit a home improvement store, monopolize the attention of the attendants, and exit without having bought so much as a single nail. With these kinds of cards, the store would be able to discern the “good” (profitable) customers from the “bad” (not profitable) ones and appropriately alert the staff to flee from bad customers and approach good ones. </p>
<p><strong>PRIVACY 2.0</strong></p>
<p>While privacy issues associated with government and corporate databases remain important, they are increasingly dwarfed by threats to privacy that do not fit the standard analytical template for addressing privacy threats. These new threats fit the generative pattern also found in the technical layers for Internet and PC security, and in the content layer for ventures such as Wikipedia. The emerging threats to privacy serve as an example of generativity’s downsides on the social layer, where contributions from remote amateurs can enable vulnerability and abuse that calls for intervention. Ideally such intervention would not unduly dampen the underlying generativity. Effective solutions for the problems of Privacy 2.0 may have more in common with solutions to other generative problems than with the remedies associated with the decades-old analytic template for Privacy 1.0. </p>
<p><strong>The Era of Cheap Sensors</strong></p>
<p>We can identify three successive shifts in technology from the early 1970s: cheap processors, cheap networks, and cheap sensors.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-22')">22</a></sup> The third shift has, with the help of the first two, opened the doors to new and formidable privacy invasions. </p>
<p>The first shift was cheap processors. Moore’s Law tells us that processing power doubles every eighteen months or so.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-23')">23</a></sup> A corollary is that existing processing power gets cheaper. The cheap processors available since the 1970s have allowed Bill Gates’s vision of a “computer on every desk” to move forward. Cheap processors also underlie information appliances: thanks to Moore’s Law, there is now sophisticated microprocessor circuitry in cars, coffeemakers, and singing greeting cards. </p>
<p>Cheap networks soon followed. The pay-per-minute proprietary dial-up networks gave way to an Internet of increasing bandwidth and dropping price. The all-you-can-eat models of measurement meant that, once established, idle network connections were no cheaper than well-used ones, and a Web page in New York cost no more to access from London than one in Paris. Lacking gatekeepers, these inexpensive processors and networks have been fertile soil for whimsical invention to take place and become mainstream. This generativity has occurred in part because the ancillary costs to experiment—both for software authors and software users—have been so low. </p>
<p>The most recent technological shift has been the availability of cheap sensors. Sensors that are small, accurate, and inexpensive are now found in cameras, microphones, scanners, and global positioning systems. These characteristics have made sensors much easier to deploy—and then network—in places where previously it would have been impractical to have them. </p>
<p>The proliferation of cheap surveillance cameras has empowered the central authorities found within the traditional privacy equation. A 2002 working paper estimated that the British government had spent several hundred million dollars on closed-circuit television systems, with many networked to central law enforcement stations for monitoring.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-24')">24</a></sup> Such advances, and the analysis that follows them, fit the template of Privacy 1.0: governments have access to more information thanks to more widely deployed monitoring technologies, and rules and practices are suggested to prevent whatever our notions might be of abuse.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-25')">25</a></sup> To see how cheap processors, networks, and sensors create an entirely new form of the problem, we must look to the excitement surrounding the participatory technologies suggested by one meaning of “Web 2.0.” In academic circles, this meaning of Web 2.0 has become known as “peer production.” </p>
<p><strong>The Dynamics of Peer Production</strong></p>
<p>The aggregation of small contributions of individual work can make oncedifficult tasks seem easy. For example, Yochai Benkler has approvingly described the National Aeronautics and Space Administration’s (NASA’s) use of public volunteers, or “clickworkers.”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-26')">26</a></sup> NASA had a tedious job involving pictures of craters from the moon and Mars. These were standard bitmap images, and they wanted the craters to be vectorized: in other words, they wanted people to draw circles around the circles they saw in the photos. Writing some custom software and deploying it online, NASA asked Internet users at large to undertake the task. Much to NASA’s pleasant surprise, the clickworkers accomplished in a week what a single graduate student would have needed a year to complete.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-27')">27</a></sup> Cheap networks and PCs, coupled with the generative ability to costlessly offer new code for others to run, meant that those who wanted to pitch in to help NASA could do so. </p>
<p>The near-costless aggregation of far-flung work can be applied in contexts other than the drawing of circles around craters—or the production of a free encyclopedia like Wikipedia. Computer scientist Luis von Ahn, after noting that over nine billion person-hours were spent playing Windows Solitaire in a single year, devised the online “ESP” game, in which two remote players are randomly paired and shown an image. They are asked to guess the word that best describes the image, and when they each guess the same word they win points.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-28')">28</a></sup> Their actions also provide input to a database that reliably labels images for use in graphical search engines—improving the ability of image search engines to identify images. In real time, then, people are building and participating in a collective, organic, worldwide computer to perform tasks that real computers cannot easily do themselves.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-29')">29</a></sup></p>
<p>These kinds of grid applications produce (or at least encourage) certain kinds of public activity by combining small, individual private actions. Benkler calls this phenomenon “coordinate coexistence producing information.”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-30')">30</a></sup> Benkler points out that the same idea helps us find what we are looking for on the Internet, even if we do not go out of our way to play the ESP game; search engines commonly aggregate the artifacts of individual Internet activity, such as webmasters’ choices about where to link, to produce relevant search results. Search engines also track which links are most often clicked on in ordered search results in order, and then more prominently feature those links in future searches.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-31')">31</a></sup> The value of this human-derived wisdom has been noted by spammers, who create “link farms” of fake Web sites containing fragments of text drawn at random from elsewhere on the Web (“word salad”) that link back to the spammers’ sites in an attempt to boost their search engine rankings. The most useful links are ones placed on genuinely popular Web sites, though, and the piles of word salad do not qualify. </p>
<p>As a result, spammers have turned to leaving comments on popular blogs that ignore the original entry to which they are attached and instead simply provide links back to their own Web sites. In response, the authors of blogging software have incorporated so-called captcha boxes that must be navigated before anyone can leave a comment on a blog. Captchas—now used on many mainstream Web sites including Ticketmaster.com—ask users to prove that they are human by typing in, say, a distorted nonsense word displayed in a small graphic.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-32')">32</a></sup> Computers can start with a word and make a distorted image in a heartbeat, but they cannot easily reverse engineer the distorted image back to the word. This need for human intervention was intended to force spammers to abandon automated robots to place their blog comment spam. For a while they did, reportedly setting up captcha sweatshops that paid people to solve captchas from blog comment prompts all day long.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-33')">33</a></sup> (In 2003, the going rate was $2.50/hour for such work.)<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-34')">34</a></sup> But spammers have continued to explore more efficient solutions. A spammer can write a program to fill in all the information but the captcha, and when it gets to the captcha it places it in front of a real person trying to get to a piece of information—say on a page a user might get after clicking a link that says, “You’ve just won $1000! Click here!”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-35')">35</a></sup>—or perhaps a pornographic photo.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-36')">36</a></sup> The captcha had been copied that instant from a blog where a spammer’s robot was waiting to leave a comment, and then pasted into the prompt for the human wanting to see the next page. The human’s answer to the captcha was then instantly ported back over to the blog site in order to solve the captcha and leave the spammed comment.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-37')">37</a></sup> Predictably, companies have also sprung up to meet this demand, providing custom software to thwart captchas on a contract basis of $100 to $5,000 per project.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-38')">38</a></sup> Generative indeed: the ability to remix different pieces of the Web, and to deploy new code without gatekeepers, is crucial to the spammers’ work. Other uses of captchas are more benign but equally subtle: a project called reCAPTCHA provides an open API to substitute for regular captchas where a Web site might want to test to see if it is a human visiting.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-39')">39</a></sup> reCAPTCHA creates an image that pairs a standard, automatically generated test word image with an image of a word from an old book that a computer has been unable to properly scan and translate. When the user solves the captcha by entering both words, the first word is used to validate that the user is indeed human, and the second is used to put the human’s computing power to work to identify one more word of one more book that otherwise would be unscannable. </p>
<p><center>* * *</center></p>
<p>What do captchas have to do with privacy? New generative uses of the Internet have made the solutions proposed for Privacy 1.0 largely inapplicable. Fears about “mass dataveillance”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-40')">40</a></sup> are not misplaced, but they recognize only part of the problem, and one that represents an increasingly smaller slice of the pie. Solutions such as disclosure<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-41')">41</a></sup> or encryption<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-42')">42</a></sup> still work for Privacy 1.0, but new approaches are needed to meet the challenge of Privacy 2.0, in which sensitive data is collected and exchanged peer-to-peer in configurations as unusual as that of the spammers’ system for bypassing captchas. </p>
<p>The power of centralized databases feared in 1973 is now being replicated and amplified through generative uses of individual data and activity. For example, cheap sensors have allowed various gunshot-detecting technologies to operate through microphones in public spaces.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-43')">43</a></sup> If a shot is fired, sensors associated with the microphones triangulate the shot’s location and summon the police. To avoid false alarms, the system can be augmented with help from the public at large, minimizing the need for understaffed police to make the initial assessment about what is going on when a suspicious sound is heard. Interested citizens can review camera feeds near a reported shot and press a button if they see something strange happening on their computer monitors. Should a citizen do so, other citizens can be asked for verification. If the answer is yes, the police can be sent. </p>
<p>In November of 2006, the state of Texas spent $210,000 to set up eight webcams along the Mexico border as part of a pilot program to solicit the public’s help in reducing illegal immigration.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-44')">44</a></sup> Webcam feeds were sent to a public Web site, and people were invited to alert the police if they thought they saw suspicious activity. During the month-long trial the Web site took in just under twenty-eight million hits. No doubt many were from the curious rather than the helpful, but those wanting to volunteer came forward, too. The site registered over 220,000 users, and those users sent 13,000 e-mails to report suspicious activity. At three o’clock in the morning one woman at her PC saw someone signal a pickup truck on the webcam. She alerted police, who seized over four hundred pounds of marijuana from the truck’s occupants after a highspeed chase. In separate incidents, a stolen car was recovered, and twelve undocumented immigrants were stopped. To some—especially state officials— this was a success beyond any expectation;<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-45')">45</a></sup> to others it was a paltry result for so much investment.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-46')">46</a></sup></p>
<p>Beyond any first-order success of stopping crime, some observers welcome involvement by members of the public as a check on law enforcement surveillance.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-47')">47</a></sup> Science fiction author David Brin foresaw increased use of cameras and other sensors by the government and adopted an if-you-can’t-beat-themjoin- them approach to dealing with the privacy threat. He suggested allowing ubiquitous surveillance so long as the watchers themselves were watched: live cameras could be installed in police cars, station houses, and jails. According to Brin, everyone watching everywhere would lessen the likelihood of unobserved government abuse. What the Rodney King video did for a single incident<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-48')">48</a></sup>— one that surely would have passed without major public notice but for the amateur video capturing what looked like excessive force by arresting officers— Brin’s proposal could do for nearly all state activities. Of course, Brin’s calculus does not adequately account for the invasions of privacy that would take place whenever random members of the public could watch—and perhaps record— every interaction between citizens and authorities, especially since many of those interactions take place at sensitive moments for the citizens. And ubiquitous surveillance can lead to other problems. The Sheriff’s Office of Anderson County, Tennessee, introduced one of the first live “jailcams” in the country, covering a little area in the jail where jailors sit and keep an eye on everything— the center of the panopticon.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-49')">49</a></sup> The Anderson County webcam was very Web 2.0: the Web site included a chat room where visitors could meet other viewers, there was a guestbook to sign, and a link to syndicated advertising to help fund the webcam. However, some began using the webcam to make crank calls to jailors at key moments and even, it is claimed, to coordinate the delivery of contraband.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-50')">50</a></sup> The webcam was shut down. </p>
<p>This example suggests a critical difference between Privacy 1.0 and 2.0. If the government is controlling the observation, then the government can pull the plug on such webcams if it thinks they are not helpful, balancing whatever policy factors it chooses.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-51')">51</a></sup> Many scholars have considered the privacy problems posed by cheap sensors and networks, but they focus on the situations where the sensors serve only government or corporate masters. Daniel Solove, for instance, has written extensively on emergent privacy concerns, but he has focused on the danger of “digital dossiers” created by businesses and governments.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-52')">52</a></sup> Likewise, Jerry Kang and Dana Cuff have written about how small sensors will lead to “pervasive computing,” but they worry that the technology will be abused by coordinated entities like shopping malls, and their prescriptions thus follow the pattern established by Privacy 1.0.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-53')">53</a></sup> Their concerns are not misplaced, but they represent an increasingly smaller part of the total picture. The essence of Privacy 2.0 is that government or corporations, or other intermediaries, need not be the source of the surveillance. Peer-to-peer technologies can eliminate points of control and gatekeeping from the transfer of personal data and information just as they can for movies and music. The intellectual property conflicts raised by the generative Internet, where people can still copy large amounts of copyrighted music without fear of repercussion, are rehearsals for the problems of Privacy 2.0.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-54')">54</a></sup></p>
<p>The Rodney King beating was filmed not by a public camera, but by a private one, and its novel use in 1991 is now commonplace. Many private cameras, including camera-equipped mobile phones, fit the generative mold as devices purchased for one purpose but frequently used for another. The Rodney King video, however, required news network attention to gain salience. Videos depicting similar events today gain attention without the prior approval of an intermediary.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-55')">55</a></sup> With cheap sensors, processors, and networks, citizens can quickly distribute to anywhere in the world what they capture in their backyard. Therefore, any activity is subject to recording and broadcast. Perform a search on a video aggregation site like YouTube for “angry teacher” or “road rage” and hundreds of videos turn up. The presence of documentary evidence not only makes such incidents reviewable by the public at large, but for, say, angry teachers it also creates the possibility of getting fired or disciplined where there had not been one before. Perhaps this is good: teachers are on notice that they must account for their behavior the way that police officers must take responsibility for their own actions. </p>
<p>If so, it is not just officers and teachers: we are all on notice. The famed “Bus Uncle” of Hong Kong upbraided a fellow bus passenger who politely asked him to speak more quietly on his mobile phone.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-56')">56</a></sup> The mobile phone user learned an important lesson in etiquette when a third person captured the argument and then uploaded it to the Internet, where 1.3 million people have viewed one version of the exchange.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-57')">57</a></sup> (Others have since created derivative versions of the exchange, including karaoke and a ringtone.) Weeks after the video was posted, the Bus Uncle was beaten up in a targeted attack at the restaurant where he worked.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-58')">58</a></sup> In a similar incident, a woman’s dog defecated on the floor of a South Korean subway. She refused to clean it up, even when offered a tissue—though she cleaned the dog—and left the subway car at the next stop. The incident was captured on a mobile phone camera and posted to the Internet, where the poster issued an all points bulletin seeking information about the dog owner and her relatives, and about where she worked. She was identified by others who had previously seen her and the dog, and the resulting firestorm of criticism apparently caused her to quit her job.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-59')">59</a></sup></p>
<p>The summed outrage of many unrelated people viewing a disembodied video may be disproportionate to whatever social norm or law is violated within that video. Lives can be ruined after momentary wrongs, even if merely misdemeanors. Recall verkeersbordvrij theory from Chapter Six: it suggests that too many road signs and driving rules change people into automatons, causing them to trade in common sense and judgment for mere hewing to exactly what the rules provide, no more and no less. In the same way, too much scrutiny can also turn us into automatons. Teacher behavior in a classroom, for example, is largely a matter of standards and norms rather than rules and laws, but the presence of scrutiny, should anything unusual happen, can halt desirable pedagogical risks if there is a chance those risks could be taken out of context, misconstrued, or become the subject of pillory by those with perfect hindsight. </p>
<p>These phenomena affect students as well as teachers, regular citizens rather than just those in authority. And ridicule or mere celebrity can be as chilling as outright disapprobation. In November 2002 a Canadian teenager used his high school’s video camera to record himself swinging a golf ball retriever as though it were a light saber from <em>Star Wars</em>.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-60')">60</a></sup> By all accounts he was doing it for his own amusement. The tape was not erased, and it was found the following spring by someone else who shared it, first with friends and then with the Internet at large. Although individuals want privacy for themselves, they will line up to see the follies of others, and by 2006 the “Star Wars Kid” was estimated to be the most popular word-of-mouth video on the Internet, with over nine hundred million cumulative views.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-61')">61</a></sup> It has spawned several parodies, including ones shown on prime time television. This is a consummately generative event: a repurposing of something made for completely different reasons, taking off beyond any expectation, and triggering further works, elaborations, and commentaries— both by other amateurs and by Hollywood.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-62')">62</a></sup> It is also clearly a privacy story. The student who made the video has been reported to have been traumatized by its circulation, and in no way did he seek to capitalize on his celebrity. </p>
<p>In this hyperscrutinized reality, people may moderate themselves instead of expressing their true opinions. To be sure, people have always balanced between public and private expression. As Mark Twain observed: “We are discreet sheep; we wait to see how the drove is going, and then go with the drove. We have two opinions: one private, which we are afraid to express; and another one—the one we use—which we force ourselves to wear to please Mrs. Grundy, until habit makes us comfortable in it, and the custom of defending it presently makes us love it, adore it, and forget how pitifully we came by it. Look at it in politics.”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-63')">63</a></sup></p>
<p>Today we are all becoming politicians. People in power, whether at parliamentary debates or press conferences, have learned to stick to carefully planned talking points, accepting the drawbacks of appearing stilted and saying little of substance in exchange for the benefits of predictability and stability.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-64')">64</a></sup> Ubiquitous sensors threaten to push everyone toward treating each public encounter as if it were a press conference, creating fewer spaces in which citizens can express their private selves. </p>
<p>Even the use of “public” and “private” to describe our selves and spaces is not subtle enough to express the kind of privacy we might want. By one definition they mean who manages the space: a federal post office is public; a home is private. A typical restaurant or inn is thus also private, yet it is also a place where the public gathers and mingles: someone there is “in public.” But while activities in private establishments open to the public are technically in the public eye,<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-65')">65</a></sup> what transpires there is usually limited to a handful of eyewitnesses— likely strangers—and the activity is ephemeral. No more, thanks to cheap sensors and cheap networks to disseminate what they glean. As our previously <em>private</em> public spaces, like classrooms and restaurants, turn into <em>public</em> public spaces, the pressure will rise for us to be on press conference behavior. </p>
<p>There are both significant costs and benefits inherent in expanding the use of our public selves into more facets of daily life. Our public face may be kinder, and the expansion may cause us to rethink our private prejudices and excesses as we publicly profess more mainstream standards and, as Twain says, “habit makes us comfortable in it.” On the other hand, as law professors Eric Posner and Cass Sunstein point out, strong normative pressure can prevent outlying behavior of any kind, and group baselines can themselves be prejudiced. Outlying behavior is the generative spark found at the social layer, the cultural innovation out of left field that can later become mainstream. Just as our information technology environment has benefited immeasurably from experimentation by a variety of people with different aims, motives, and skills, so too is our cultural environment bettered when commonly held—and therefore sometimes rarely revisited—views can be challenged.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-66')">66</a></sup></p>
<p>The framers of the U.S. Constitution embraced anonymous speech in the political sphere as a way of being able to express unpopular opinions without having to experience personal disapprobation.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-67')">67</a></sup> No defense of a similar principle was needed for keeping private conversations in public spaces from becoming public broadcasts—disapprobation that begins with small “test” groups but somehow becomes society-wide—since there were no means by which to perform that transformation. Now that the means are there, a defense is called for lest we run the risk of letting our social system become metaphorically more appliancized: open to change only by those few radicals so disconnected from existing norms as to not fear their imposition at all. </p>
<p>Privacy 2.0 is about more than those who are famous or those who become involuntary “welebrities.” For those who happen to be captured doing particularly fascinating or embarrassing things, like Star Wars Kid or an angry teacher, a utilitarian might say that nine hundred million views is first-order evidence of a public benefit far exceeding the cost to the student who made the video. It might even be pointed out that the Star Wars Kid failed to erase the tape, so he can be said to bear some responsibility for its circulation. But the next-generation privacy problem cannot be written off as affecting only a few unlucky victims. Neither can it be said to affect only genuine celebrities who must now face constant exposure not only to a handful of professional paparazzi but also to hordes of sensor-equipped amateurs. (Celebrities must now contend with the consequences of cell phone videos of their slightest aberrations—such as one in which a mildly testy exchange with a valet parker is quickly circulated and exaggerated online<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-68')">68</a></sup>—or more comprehensive peer-produced sites like Gawker Stalker,<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-69')">69</a></sup> where people send in local sightings of celebrities as they happen. Gawker strives to relay the sightings within fifteen minutes and place them on a Google map, so that if Jack Nicholson is at Starbucks, one can arrive in time to stand awkwardly near him before he finishes his latte.) </p>
<p>Cybervisionary David Weinberger’s twist on Andy Warhol’s famous quotation is the central issue for the rest of us: “On the Web, everyone will be famous to fifteen people.”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-70')">70</a></sup> Although Weinberger made his observation in the context of online expression, explaining that microaudiences are worthy audiences, it has further application. Just as cheap networks made it possible for businesses to satisfy the “long tail,” serving the needs of obscure interests every bit as much as popular ones<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-71')">71</a></sup> (Amazon is able to stock a selection of books virtually far beyond the best sellers found in a physical bookstore), peer-produced databases can be configured to track the people who are of interest only to a few others. </p>
<p>How will the next-generation privacy problem affect average citizens? Early photo aggregation sites like Flickr were premised on a seemingly dubious assumption that turned out to be true: not only would people want an online repository for their photos, but they would often be pleased to share them with the public at large. Such sites now boast hundreds of millions of photos,<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-72')">72</a></sup> many of which are also sorted and categorized thanks to the same distributed energy that got Mars’s craters promptly mapped. Proponents of Web 2.0 sing the praises of “folksonomies” rather than taxonomies—bottom-up tagging done by strangers rather than expert-designed and -applied canonical classifications like the Dewey Decimal System or the Library of Congress schemes for sorting books.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-73')">73</a></sup> Metadata describing the contents of pictures makes images far more useful and searchable. Combining user-generated tags with automatically generated data makes pictures even more accessible. Camera makers now routinely build cameras that use global positioning systems to mark exactly where on the planet each picture it snaps was taken and, of course, to time- and datestamp them. Web sites like Riya, Polar Rose, and MyHeritage are perfecting facial recognition technologies so that once photos of a particular person are tagged a few times with his or her name, their computers can then automatically label all future photos that include the person—even if their image appears in the background. In August 2006 Google announced the acquisition of Neven Vision, a company working on photo recognition, and in May 2007 Google added a feature to its image search so that only images of people could be returned (to be sure, still short of identifying which image is which).<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-74')">74</a></sup> Massachusetts officials have used such technology to compare mug shots in “Wanted” posters to driver’s license photos, leading to arrests.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-75')">75</a></sup> Mash together these technologies and functionalities through the kind of generative mixing allowed by their open APIs and it becomes trivial to receive answers to questions like: Where was Jonathan Zittrain last year on the fourteenth of February?, or, Who could be found near the entrance to the local Planned Parenthood clinic in the past six months? The answers need not come from government or corporate cameras, which are at least partially secured against abuse through well-considered privacy policies from Privacy 1.0. Instead, the answers come from a more powerful, generative source: an army of the world’s photographers, including tourists sharing their photos online without firm (or legitimate) expectations of how they might next be used and reused. </p>
<p>As generativity would predict, those uses may be surprising or even offensive to those who create the new tools or provide the underlying data. The Christian Gallery News Service was started by antiabortion activist Neal Horsley in the mid 1990s. Part of its activities included the Nuremberg Files Web site, where the public was solicited for as much information as possible about the identities, lives, and families of physicians who performed abortions, as well as about clinic owners and workers.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-76')">76</a></sup> When a provider was killed, a line would be drawn through his or her name. (The site was rarely updated with new information, and it became entangled in a larger lawsuit lodged under the U.S. Freedom of Access to Clinic Entrances Act.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-77')">77</a></sup> The site remains accessible.) An associated venture solicits the public to take pictures of women arriving at clinics, including the cars in which they arrive (and corresponding license plates), and posts the pictures in order to deter people from nearing clinics.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-78')">78</a></sup></p>
<p>With image recognition technology mash-ups, photos taken as people enter clinics or participate in protests can be instantly cross-referenced with their names. One can easily pair this type of data with Google Maps to provide finegrained satellite imagery of the homes and neighborhoods of these individuals, similar to the “subversive books” maps created by computer consultant and tinkerer Tom Owad, tracking wish lists on Amazon.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-79')">79</a></sup></p>
<p>This intrusion can reach places that the governments of liberal democracies refuse to go. In early 2007, a federal court overseeing the settlement of a class action lawsuit over New York City police surveillance of public activities held that routine police videotaping of public events was in violation of the settlement: “The authority . . . conferred upon the NYPD ‘to visit any place and attend any event that is open to the public, on the same terms and conditions of the public generally,’ cannot be stretched to authorize police officers to videotape everyone at a public gathering just because a visiting little old lady from Dubuque . . . could do so. There is a quantum difference between a police officer and the little old lady (or other tourist or private citizen) videotaping or photographing a public event.”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-80')">80</a></sup></p>
<p>The court expressed concern about a chilling of speech and political activities if authorities were videotaping public events. But police surveillance becomes moot when an army of little old ladies from Dubuque is naturally videotaping and sharing nearly everything—protests, scenes inside a mall (such that amateur video exists of a random shootout in a Salt Lake City, Utah, mall),<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-81')">81</a></sup> or picnics in the park. Peer-leveraging technologies are overstepping the boundaries that laws and norms have defined as public and private, even as they are also facilitating beneficial innovation. Cheap processors, networks, and sensors enable a new form of beneficial information flow as citizen reporters can provide footage and frontline analysis of newsworthy events as they happen.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-82')">82</a></sup> For example, OhmyNews is a wildly popular online newspaper in South Korea with citizen-written articles and reports. (Such writers provide editors with their names and national identity numbers so articles are not anonymous.) Similarly, those who might commit atrocities within war zones can now be surveilled and recorded by civilians so that their actions may be watched and ultimately punished, a potential sea change for the protection of human rights.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-83')">83</a></sup></p>
<p>For privacy, peer-leveraging technologies might make for a much more constrained world rather than the more chaotic one that they have wrought for intellectual property. More precisely, a world where bits can be recorded, manipulated, and transmitted without limitation means, in copyright, a free-for-all for the public and constraint upon firms (and perhaps upstream artists) with content to protect. For privacy, the public is variously creator, beneficiary, and victim of the free-for-all. The constraints—in the form of privacy invasion that Jeffrey Rosen crystallizes as an “unwanted gaze”—now come not only from the well-organized governments or firms of Privacy 1.0, but from a few people generatively drawing upon the labors of many to greatly impact rights otherwise guaranteed by a legal system. </p>
<p><strong>Privacy and Reputation</strong></p>
<p>At each layer where a generative pattern can be discerned, this book has asked whether there is a way to sift out what we might judge to be bad generative results from the good ones without unduly damaging the system’s overall generativity. This is the question raised at the technical layer for network security, at the content layer for falsehoods in Wikipedia and failures of intellectual property protection, and now at the social layer for privacy. Can we preserve generative innovations without giving up our core privacy values? Before turning to answers, it is helpful to explore a final piece of the Privacy 2.0 mosaic: the impact of emerging reputation systems. This is both because such systems can greatly impact our privacy and because this book has suggested reputational tools as a way to solve the generative sifting problem at other layers. </p>
<p>Search is central to a functioning Web,<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-84')">84</a></sup> and reputation has become central to search. If people already know exactly what they are looking for, a network needs only a way of registering and indexing specific sites. Thus, IP addresses are attached to computers, and domain names to IP addresses, so that we can ask for www.drudgereport.com and go straight to Matt Drudge’s site. But much of the time we want help in finding something without knowing the exact online destination. Search engines help us navigate the petabytes of publicly posted information online, and for them to work well they must do more than simply identify all pages containing the search terms that we specify. They must rank them in relevance. There are many ways to identify what sites are most relevant. A handful of search engines auction off the top-ranked slots in search results on given terms and determine relevance on the basis of how much the site operators would pay to put their sites in front of searchers.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-85')">85</a></sup> These search engines are not widely used. Most have instead turned to some proxy for reputation. As mentioned earlier, a site popular with others—with lots of inbound links—is considered worthier of a high rank than an unpopular one, and thus search engines can draw upon the behavior of millions of other Web sites as they sort their search results.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-86')">86</a></sup> Sites like Amazon deploy a different form of ranking, using the “mouse droppings” of customer purchasing and browsing behavior to make recommendations—so they can tell customers that “people who like the Beatles also like the Rolling Stones.” Search engines can also more explicitly invite the public to express its views on the items it ranks, so that users can decide what to view or buy on the basis of others’ opinions. Amazon users can rate and review the items for sale, and subsequent users then rate the first users’ reviews. Sites like Digg and Reddit invite users to vote for stories and articles they like, and tech news site Slashdot employs a rating system so complex that it attracts much academic attention.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-87')">87</a></sup></p>
<p>eBay uses reputation to help shoppers find trustworthy sellers. eBay users rate each others’ transactions, and this trail of ratings then informs future buyers how much to trust repeat sellers. These rating systems are crude but powerful. Malicious sellers can abandon poorly rated eBay accounts and sign up for new ones, but fresh accounts with little track record are often viewed skeptically by buyers, especially for proposed transactions involving expensive items. One study confirmed that established identities fare better than new ones, with buyers willing to pay, on average, over 8 percent more for items sold by highly regarded, established sellers.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-88')">88</a></sup> Reputation systems have many pitfalls and can be gamed, but the scholarship seems to indicate that they work reasonably well.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-89')">89</a></sup> There are many ways reputation systems might be improved, but at their core they rely on the number of people rating each other in good faith well exceeding the number of people seeking to game the system—and a way to exclude robots working for the latter. For example, eBay’s rating system has been threatened by the rise of “1-cent eBooks” with no shipping charges; sellers can create alter egos to bid on these nonitems and then have the phantom users highly rate the transaction.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-90')">90</a></sup> One such “feedback farm” earned a seller a thousand positive reviews over four days. eBay intervenes to some extent to eliminate such gaming, just as Google reserves the right to exact the “Google death penalty” by de-listing any Web site that it believes is unduly gaming its chances of a high search engine rating.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-91')">91</a></sup></p>
<p>These reputation systems now stand to expand beyond evaluating people’s behavior in discrete transactions or making recommendations on products or content, into rating people more generally. This could happen as an extension of current services—as one’s eBay rating is used to determine trustworthiness on, say, another peer-to-peer service. Or, it could come directly from social networking: Cyworld is a social networking site that has twenty million subscribers; it is one of the most popular Internet services in the world, largely thanks to interest in South Korea.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-92')">92</a></sup> The site has its own economy, with $100 million worth of “acorns,” the world’s currency, sold in 2006.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-93')">93</a></sup></p>
<p>Not only does Cyworld have a financial market, but it also has a market for reputation. Cyworld includes behavior monitoring and rating systems that make it so that users can see a constantly updated score for “sexiness,” “fame,” “friendliness,” “karma,” and “kindness.” As people interact with each other, they try to maximize the kinds of behaviors that augment their ratings in the same way that many Web sites try to figure out how best to optimize their presentation for a high Google ranking.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-94')">94</a></sup> People’s worth is defined and measured precisely, if not accurately, by the reactions of others. That trend is increasing as social networking takes off, partly due to the extension of online social networks beyond the people users already know personally as they “befriend” their friends’ friends’ friends. </p>
<p>The whole-person ratings of social networks like Cyworld will eventually be available in the real world. Similar real-world reputation systems already exist in embryonic form. Law professor Lior Strahilevitz has written a fascinating monograph on the effectiveness of “How’s My Driving” programs, where commercial vehicles are emblazoned with bumper stickers encouraging other drivers to report poor driving.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-95')">95</a></sup> He notes that such programs have resulted in significant accident reductions, and analyzes what might happen if the program were extended to all drivers. A technologically sophisticated version of the scheme dispenses with the need to note a phone number and file a report; one could instead install transponders in every vehicle and distribute TiVo-like remote controls to drivers, cyclists, and pedestrians. If someone acts politely, say by allowing you to switch lanes, you can acknowledge it with a digital thumbsup that is recorded on that driver’s record. Cutting someone off in traffic earns a thumbs-down from the victim and other witnesses. Strahilevitz is supportive of such a scheme, and he surmises it could be even more effective than eBay’s ratings for online transactions since vehicles are registered by the government, making it far more difficult escape poor ratings tied to one’s vehicle. He acknowledges some worries: people could give thumbs-down to each other for reasons unrelated to their driving—racism, for example. Perhaps a bumper sticker expressing support for Republicans would earn a thumbs-down in a blue state. Strahilevitz counters that the reputation system could be made to eliminate “outliers”—so presumably only well-ensconced racism across many drivers would end up affecting one’s ratings. According to Strahilevitz, this system of peer judgment would pass constitutional muster if challenged, even if the program is run by the state, because driving does not implicate one’s core rights. “How’s My Driving?” systems are too minor to warrant extensive judicial review. But driving is only the tip of the iceberg. </p>
<p>Imagine entering a café in Paris with one’s personal digital assistant or mobile phone, and being able to query: “Is there anyone on my buddy list within 100 yards? Are any of the ten closest friends of my ten closest friends within 100 yards?” Although this may sound fanciful, it could quickly become mainstream. With reputation systems already advising us on what to buy, why not have them also help us make the first cut on whom to meet, to date, to befriend? These are not difficult services to offer, and there are precursors today.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-96')">96</a></sup> These systems can indicate who has not offered evidence that he or she is safe to meet—as is currently solicited by some online dating sites—or it may use Amazon-style matching to tell us which of the strangers who have just entered the café is a good match for people who have the kinds of friends we do. People can rate their interactions with each other (and change their votes later, so they can show their companion a thumbs-up at the time of the meeting and tell the truth later on), and those ratings will inform future suggested acquaintances. With enough people adopting the system, the act of entering a café can be different from one person to the next: for some, the patrons may shrink away, burying their heads deeper in their books and newspapers. For others, the entire café may perk up upon entrance, not knowing who it is but having a lead that this is someone worth knowing. Those who do not participate in the scheme at all will be as suspect as brand new buyers or sellers on eBay. </p>
<p>Increasingly, difficult-to-shed indicators of our identity will be recorded and captured as we go about our daily lives and enter into routine transactions— our fingerprints may be used to log in to our computers or verify our bank accounts, our photo may be snapped and tagged many times a day, or our license plate may be tracked as people judge our driving habits. The more our identity is associated with our daily actions, the greater opportunities others will have to offer judgments about those actions. A government-run system like the one Strahilevitz recommends for assessing driving is the easy case. If the state is the record keeper, it is possible to structure the system so that citizens can know the basis of their ratings—where (if not by whom) various thumbs-down clicks came from—and the state can give a chance for drivers to offer an explanation or excuse, or to follow up. The state’s formula for meting out fines or other penalties to poor drivers would be known (“three strikes and you’re out,” for whatever other problems it has, is an eminently transparent scheme), and it could be adjusted through accountable processes, just as legislatures already determine what constitutes an illegal act, and what range of punishment it should earn. </p>
<p>Generatively grown but comprehensively popular unregulated systems are a much trickier case. The more that we rely upon the judgments offered by these private systems, the more harmful that mistakes can be.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-97')">97</a></sup> Correcting or identifying mistakes can be difficult if the systems are operated entirely by private parties and their ratings formulas are closely held trade secrets. Search engines are notoriously resistant to discussing how their rankings work, in part to avoid gaming—a form of security through obscurity.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-98')">98</a></sup> The most popular engines reserve the right to intervene in their automatic rankings processes—to administer the Google death penalty, for example—but otherwise suggest that they do not centrally adjust results. Hence a search in Google for “Jew” returns an anti- Semitic Web site as one of its top hits,<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-99')">99</a></sup> as well as a separate sponsored advertisement from Google itself explaining that its rankings are automatic.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-100')">100</a></sup> But while the observance of such policies could limit worries of bias to search algorithm design rather than to the case-by-case prejudices of search engine operators, it does not address user-specific bias that may emerge from personalized judgments. </p>
<p>Amazon’s automatic recommendations also make mistakes; for a period of time the <em>Official Lego Creator Activity Book</em> was paired with a “perfect partner” suggestion: <em>American Jihad: The Terrorists Living Among Us Today</em>. If such mismatched pairings happen when discussing people rather than products, rare mismatches could have worse effects while being less noticeable since they are not universal. The kinds of search systems that say which people are worth getting to know and which should be avoided, tailored to the users querying the system, present a set of due process problems far more complicated than a stateoperated system or, for that matter, any system operated by a single party. The generative capacity to share data and to create mash-ups means that ratings and rankings can be far more emergent—and far more inscrutable. </p>
<p><strong>SOLVING THE PROBLEMS OF PRIVACY 2.0</strong></p>
<p>Cheap sensors generatively wired to cheap networks with cheap processors are transforming the nature of privacy. How can we respond to the notion that nearly anything we do outside our homes can be monitored and shared? How do we deal with systems that offer judgments about what to read or buy, and whom to meet, when they are not channeled through a public authority or through something as suable, and therefore as accountable, as Google? </p>
<p>The central problem is that the organizations creating, maintaining, using, and disseminating records of identifiable personal data are no longer just “organizations”— they are people who take pictures and stream them online, who blog about their reactions to a lecture or a class or a meal, and who share on social sites rich descriptions of their friends and interactions. These databases are becoming as powerful as the ones large institutions populate and centrally define. Yet the sorts of administrative burdens we can reasonably place on established firms exceed those we can place on individuals—at some point, the burden of compliance becomes so great that the administrative burdens are tantamount to an outright ban. That is one reason why so few radio stations are operated by individuals: it need not be capital intensive to set up a radio broadcasting tower—a low-power neighborhood system could easily fit in someone’s attic—but the administrative burdens of complying with telecommunications law are well beyond the abilities of a regular citizen. Similarly, we could create a privacy regime so complicated as to frustrate generative developments by individual users. </p>
<p>The 1973 U.S. government report on privacy crystallized the template for Privacy 1.0, suggesting five elements of a code of fair information practice:</p>
<ul>
<li>There must be no personal data record-keeping systems whose very existence is secret. </li>
<li>There must be a way for an individual to find out what information about him is in a record and how it is used. </li>
<li>There must be a way for an individual to prevent information about him that was obtained for one purpose from being used or made available for other purposes without his consent. </li>
<li>There must be a way for an individual to correct or amend a record of identifiable information about him. </li>
<li>Any organization creating, maintaining, using, or disseminating records of identifiable personal data must assure the reliability of the data for their intended use and must take precautions to prevent misuse of the data.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-101')">101</a></sup></li>
</ul>
<p>These recommendations present a tall order for distributed, generative systems. It may seem clear that the existence of personal data record-keeping systems ought not to be kept secret, but this issue was easier to address in 1973, when such systems were typically large consumer credit databases or government dossiers about citizens, which could more readily be disclosed and advertised by the relevant parties. It is harder to apply the antisecrecy maxim to distributed personal information databases. When many of us maintain records or record fragments on one another, and through peer-produced social networking services like Facebook or MySpace share these records with thousands of others, or allow them to be indexed to create powerful mosaics of personal data, then exactly what the database <em>is</em> changes from one moment to the next—not simply in terms of its contents, but its very structure and scope. Such databases may be generally unknown while not truly “secret.”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-102')">102</a></sup></p>
<p>Further, these databases are ours. It is one thing to ask a corporation to disclose the personal data and records it maintains; it is far more intrusive to demand such a thing of private citizens. Such disclosure may itself constitute an intrusive search upon the citizen maintaining the records. Similarly, the idea of mandating that an individual be able to find out what an information gatherer knows—much less to correct or amend the information—is categorically more difficult to implement when what is known is distributed across millions of people’s technological outposts. To be sure, we can Google ourselves, but this does not capture those databases open only to “friends of friends”—a category that may not include us but may include thousands of others. At the same time, we may have minimal recourse when the information we thought we were circulating within social networking sites merely for fun and, say, only among fellow college students, ends up leaking to the world at large.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-103')">103</a></sup></p>
<p>What to do? There is a combination of steps drawn from the solutions sketched in the previous two chapters that might ameliorate the worst of Privacy 2.0’s problems, and even provide a framework in which to implement some of the Privacy 1.0 solutions without rejecting the generative framework that gives rise to Privacy 2.0 in the first place. </p>
<p><strong>The Power of Code-Backed Norms</strong></p>
<p>The Web is disaggregated. Its pieces are bound together into a single virtual database by private search engines like Google. Google and other search engines assign digital robots to crawl the Web as if they were peripatetic Web surfers, clicking on one link after another, recording the results, and placing them into a concordance that can then be used for search.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-104')">104</a></sup></p>
<p>Early on, some wanted to be able to publish material to the Web without it appearing in search engines. In the way a conversation at a pub is a private matter unfolding in a public (but not publicly owned) space, these people wanted their sites to be private but not secret. The law offers one approach to vindicate this desire for privacy but not secrecy. It could establish a framework delineating the scope and nature of a right in one’s Web site being indexed, and providing for penalties for those who infringe that right. An approach of this sort has well-known pitfalls. For example, it would be difficult to harmonize such doctrine across various jurisdictions around the world,<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-105')">105</a></sup> and there would be technical questions as to how a Web site owner could signal his or her choice to would-be robot indexers visiting the site. </p>
<p>The Internet community, however, fixed most of the problem before it could become intractable or even noticeable to mainstream audiences. A software engineer named Martijn Koster was among those discussing the issue of robot signaling on a public mailing list in 1993 and 1994. Participants, including “a majority of robot authors and other people with an interest in robots,” converged on a standard for “robots.txt,” a file that Web site authors could create that would be inconspicuous to Web surfers but in plain sight to indexing robots.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-106')">106</a></sup> Through robots.txt, site owners can indicate preferences about what parts of the site ought to be crawled and by whom. Consensus among some influential Web programmers on a mailing list was the only blessing this standard received: “It is not an official standard backed by a standards body, or owned by any commercial organisation. It is not enforced by anybody, and there [<em>sic</em>] no guarantee that all current and future robots will use it. Consider it a common facility the majority of robot authors offer the WWW community to protect WWW server [<em>sic</em>] against unwanted accesses by their robots.”<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-107')">107</a></sup></p>
<p>Today, nearly all Web programmers know robots.txt is the way in which sites can signal their intentions to robots, and these intentions are respected by every major search engine across differing cultures and legal jurisdictions.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-108')">108</a></sup> On this potentially contentious topic—search engines might well be more valuable if they indexed everything, <em>especially</em> content marked as something to avoid— harmony was reached without any application of law. The robots.txt standard did not address the legalities of search engines and robots; it merely provided a way to defuse many conflicts before they could even begin. The apparent legal vulnerabilities of robots.txt—its lack of ownership or backing of a large private standards setting organization, and the absence of private enforcement devices— may in fact be essential to its success.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-109')">109</a></sup> Law professor Jody Freeman and others have written about the increasingly important role played by private organizations in the formation of standards across a wide range of disciplines and the ways in which some organizations incorporate governmental notions of due process in their activities.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-110')">110</a></sup> Many Internet standards have been forged much less legalistically but still cooperatively.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-111')">111</a></sup></p>
<p>The questions not preempted or settled by such cooperation tend to be clashes between firms with some income stream in dispute—and where the law has then partially weighed in. For example, eBay sued data aggregator Bidder’s Edge for using robots to scrape its site even after eBay clearly objected both in person and through robots.txt. eBay won in a case that has made it singularly into most cyberlaw casebooks and even into a few general property casebooks— a testament to how rarely such disputes enter the legal system.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-112')">112</a></sup></p>
<p>Similarly, the safe harbors of the U.S. Digital Millennium Copyright Act of 1998 give some protection to search engines that point customers to material that infringes copyright,<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-113')">113</a></sup> but they do not shield the actions required to create the search database in the first place. The act of creating a search engine, like the act of surfing itself, is something so commonplace that it would be difficult to imagine deeming it illegal—but this is not to say that search engines rest on any stronger of a legal basis than the practice of using robots.txt to determine when it is and is not appropriate to copy and archive a Web site.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-114')">114</a></sup> Only recently, with Google’s book scanning project, have copyright holders really begun to test this kind of question.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-115')">115</a></sup> That challenge has arisen over the scanning of paper books, not Web sites, as Google prepares to make them searchable in the same way Google has indexed the Web.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-116')">116</a></sup> The long-standing practice of Web site copying, guided by robots.txt, made that kind of indexing uncontroversial even as it is, in theory, legally cloudy. </p>
<p>The lasting lesson from robots.txt is that a simple, basic standard created by people of good faith can go a long way toward resolving or forestalling a problem containing strong ethical or legal dimensions. The founders of Creative Commons created an analogous set of standards to allow content creators to indicate how they would like their works to be used or reused. Creative Commons licenses purport to have the force of law behind them—one ignores them at the peril of infringing copyright—but the main force of Creative Commons as a movement has not been in the courts, but in cultural mindshare: alerting authors to basic but heretofore hidden options they have for allowing use of the photos, songs, books, or blog entries they create, and alerting those who make use of the materials to the general orientation of the author. </p>
<p>Creative Commons is robots.txt generalized. Again, the legal underpinnings of this standard are not particularly strong. For example, one Creative Commons option is “noncommercial,” which allows authors to indicate that their material can be reused without risk of infringement so long as the use is noncommercial. But the definition of noncommercial is a model of vagueness, the sort of definition that could easily launch a case like <em>eBay v. Bidder’s Edge</em>.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-117')">117</a></sup> If one aggregates others’ blogs on a page that has banner ads, is that a commercial use? There have been only a handful of cases over Creative Commons licenses, and none testing the meaning of noncommercial.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-118')">118</a></sup> Rather, people seem to know a commercial (or derivative) use when they see it: the real power of the license may have less to do with a threat of legal enforcement and more to do with the way it signals one’s intentions and asks that they be respected. Reliable empirical data is absent, but the sense among many of those using Creative Commons licenses is that their wishes have been respected.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-119')">119</a></sup></p>
<p><strong>Applying Code-Backed Norms to Privacy: Data Genealogy</strong></p>
<p>As people put data on the Internet for others to use or reuse—data that might be about other people as well as themselves—there are no tools to allow those who provide the data to express their preferences about how the data ought to be indexed or used. There is no Privacy Commons license to request basic limits on how one’s photographs ought to be reproduced from a social networking site. There ought to be. Intellectual property law professor Pamela Samuelson has proposed that in response to the technical simplicity of collecting substantial amounts of personal information in cyberspace, a person should have a protectable right to control this personal data. She notes that a property-based legal framework is more difficult to impose when one takes into account the multiple interests a person might have in her personal data, and suggests a move to a contractual approach to protecting information privacy based in part on enforcement of Web site privacy policies.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-120')">120</a></sup> Before turning to law directly, we can develop tools to register and convey authors’ privacy-related preferences unobtrusively. </p>
<p>On today’s Internet, the copying and pasting of information takes place with no sense of metadata.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-121')">121</a></sup> It is difficult enough to make sure that a Creative Commons license follows the photograph, sound, or text to which it is related as those items circulate on the Web. But there is no standard at all to pass along for a given work and who recorded it, with what devices,<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-122')">122</a></sup> and most important, what the subject is comfortable having others do with it. If there were, links could become two-way. Those who place information on the Web could more readily canvas the public uses to which that information had been put and by whom. In turn, those who wish to reuse information would have a way of getting in touch with its original source to request permission. Some Web 2.0 outposts have generated promising rudimentary methods for this. Facebook, for example, offers tools to label the photographs one submits and to indicate what groups of people can and cannot see them. Once a photo is copied beyond the Facebook environment, however, these attributes are lost.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-123')">123</a></sup></p>
<p>The Web is a complex social phenomenon with information contributed not only by institutional sources like <em>Britannica</em>, CNN, and others that place large amounts of structured information on it, but also by amateurs like Wikipedians, Flickr contributors, and bloggers. Yet a Google search intentionally smoothes over this complexity; each linked search result is placed into a standard format to give the act of searching structure and order. Search engines and other aggregators can and should do more to enrich users’ understanding of where the information they see is coming from. This approach would shadow the way that Theodor Nelson, coiner of the word “hypertext,” envisioned “transclusion”—a means not to simply copy text, but also to reference it to its original source.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-124')">124</a></sup> Nelson’s vision was drastic in its simplicity: information would repose primarily at its source, and any quotes to it would simply frame that source. If it were deleted from the original source, it would disappear from its subsequent uses. If it were changed at the source, downstream uses would change with it. This is a strong version of the genealogy idea, since the metadata about an item’s origin would actually be the item itself. It is data as service, and insofar as it leaves too much control with the data’s originator, it suffers from many of the drawbacks of software as service described in Chapter Five. For the purposes of privacy, we do not need such a radical reworking of the copy-and-paste culture of the Web. Rather, we need ways for people to signal whether they would like to remain associated with the data they place on the Web, and to be consulted about unusual uses. </p>
<p>This weaker signaling-based version of Nelson’s vision does not answer the legal question of what would happen if the originator of the data could not come to an agreement with someone who wanted to use it. But as with robots .txt and Creative Commons licenses, it could forestall many of the conflicts that will arise in the absence of any standard at all.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-125')">125</a></sup> Most importantly, it would help signal authorial intention not only to end users but also to the intermediaries whose indices provide the engines for invasions of privacy in the first place. One could indicate that photos were okay to index by tag but not by facial recognition, for example. If search engines of today are any indication, such restrictions could be respected even without a definitive answer as to the extent of their legal enforceability. Indeed, by attaching online identity—if not physical identity—to the various bits of data that are constantly mashed up as people copy and paste what they like around the Web, it becomes possible for people to get in touch with one another more readily to express thanks, suggest collaboration, or otherwise interact as people in communities do. Similarly, projects like reCAPTCHA could seek to alert people to the extra good their solving of captchas is doing—and even let them opt out of solving the second word in the image, the one that is not testing whether they are human but instead is being used to perform work for someone else. Just as <em>Moore v. Regents of the University of California</em> struggled with the issue of whether a patient whose tumor was removed should be consulted before the tumor is used for medical research,<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-126')">126</a></sup> we will face the question of when people ought to be informed when their online behaviors are used for ulterior purposes—including beneficial ones. </p>
<p>Respect for robots.txt, Creative Commons licenses, and privacy “tags,” and an opportunity to alert people and allow them to opt in to helpful ventures with their routine online behavior like captcha-solving, both requires and promotes a sense of community. Harnessing some version of Nelson’s vision is a self-reinforcing community-building exercise—bringing people closer together while engendering further respect for people’s privacy choices. It should be no surprise that people tend to act less charitably in today’s online environment than they would act in the physical world.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-127')">127</a></sup> Recall the discussion of verkeersbordvrij in Chapter Six, where the elimination of most traffic signs can counterintuitively reduce accidents. Today’s online environment is only half of the verkeersbordvrij system: there are few perceived rules, but there are also few ways to receive, and therefore respect, cues from those whose content or data someone might be using.<sup class="footnote"><a href="javascript:popUp('http://yupnet.org/zittrain/notes-chapter-9#note-128')">128</a></sup> Verkeersbordvrij depends not simply on eliminating most legal rules 