The Google Book Search Settlement: Difference between revisions

From The Internet: Issues at the Frontier (course wiki)
Jump to navigation Jump to search
m (Update some broken links using perma.cc)
 
(374 intermediate revisions by 6 users not shown)
Line 7: Line 7:
= Concrete Questions of the Week =
= Concrete Questions of the Week =


How will the Google book digitization project affect various interests, including those who were parties to the settlement between Google and the Authors' guild and those who were not?  In particular, what changes will libraries (public, private, and university), publishers, and readers face going forward, and how should they respond?
How will [http://en.wikipedia.org/wiki/Google_book_search the Google book digitization project] affect various interests, including those who were parties to the [http://www.googlebooksettlement.com/intl/en/Settlement-Agreement.pdf settlement] between Google and the Authors Guild/American Association of Publishers, and those who were not?  In particular, what changes will libraries (public, private, and university) and readers face going forward, and how should they respond?


= Precis =
= Brief Overview of the Google Book Search Settlement =


What does the [http://www.authorsguild.org/advocacy/articles/settlement-resources.html recent settlement] between Google and the Authors Guild/American Association of Publishers regarding online accessibility of digitalized books mean?  Many have hailed it for both improving access to knowledge by creating [http://www.nytimes.com/2008/11/30/opinion/30gleick.html?em "the long dreamed of universal library"] and for avoiding a judicial resolution that might have exposed antiquated aspects of US copyright lawBut is this settlement optimal for all interested groups?  The Google book digitization project will bring with it many changes, and not just for the Google corporation and the authors' representatives who signed the deal. Readers, publishers, and libraries will all be affected by this settlement, and each group will likely face a different set of benefits and problems. With a focus on publishers (both private and academic) and libraries (public, private, and academic), we aim to identify the main challenges that the Google digitization project will entail for non-parties and to suggest creative solutions for adapting to these changes.
== History of the Settlement ==
In 2004, [http://www.google.com/ Google] began to [http://www.google.com/press/pressrel/print_library.html digitally scan books] from the university library collections of Harvard, Stanford, Oxford, and the University of Michigan, as well as from the New York Public Library system.  Google provided a service, [http://books.google.com/ Google Book Search (GBS)], whereby users could search within these digitized books.  The service allowed users to search and view the entire content of books in the [http://en.wikipedia.org/wiki/Public_domain public domain], but it also permitted searching copyrighted books, providing short [http://books.google.com/googlebooks/screenshots.html#snippetview "snippets"] containing the search term and some surrounding text.


= Guests =
Crying foul, the [http://www.authorsguild.org/ Authors Guild], a writers' advocacy group, initiated a [http://www.authorsguild.org/advocacy/articles/settlement-resources.attachment/authors-guild-v-google/Authors%20Guild%20v%20Google%2009202005.pdf class action law suit] on September 20, 2005 in the U.S. District Court for the Southern District of New York, claiming [http://en.wikipedia.org/wiki/United_States_copyright_law#Infringement copyright infringement].  The [http://www.publishers.org/ Association of American Publishers (AAP)] followed with a [http://www.authorsguild.org/advocacy/articles/settlement-resources.attachment/mcgraw-hill/McGraw-Hill%20v.%20Google%2010192005.pdf suit of its own] on October 19, 2005.  Google [http://online.wsj.com/article/SB112958982689471238.html responded] that its actions were lawful under the doctrine of [http://en.wikipedia.org/wiki/Fair_use fair use.]  Rather than fight that battle, however, Google decided to settle with the Authors Guild and the AAP.
== Confirmed Guests ==
* Professor John Palfrey


== Invited Guests ==
The parties announced a [http://www.googlebooksettlement.com/intl/en/Settlement-Agreement.pdf settlement] on October 28, 2008.  [http://en.wikipedia.org/wiki/Denny_Chin Judge Denny Chin] is supervising the settlement process.  Under the [http://www.scribd.com/doc/14741799/SDNY-Order-Extending-Deadline-to-September-4 current schedule], authors have until September 4, 2009 to opt out of the settlement, and the court will hold a final hearing on October 7, 2009, after which it will accept or reject the settlement.  On April 14, 2009, Professor [http://en.wikipedia.org/wiki/Charles_Nesson Charles Nesson] filed a [http://www.scribd.com/doc/14227449/Letter-to-Request-Intervention-in-Authors-Guild-v-Google motion to intervene] in the lawsuit as counsel on behalf of two professors and the [http://openaccesstrust.org/ Open Access Trust], and on April 17, 2009, the [http://www.archive.org/index.php Internet Archive], an online library led by [http://en.wikipedia.org/wiki/Brewster_Kahle Brewster Kahle], also [http://www.opencontentalliance.org/2009/04/17/internet-archive-files-intervention-request/ sought to intervene.]  Judge Chin [http://docs.justia.com/cases/federal/district-courts/new-york/nysdce/1:2005cv08136/273913/92/ denied both of these requests.]  On May 4, 2009, a number of library associations [http://wo.ala.org/gbs/wp-content/uploads/2009/05/googlebrieffinal.pdf filed a brief] requesting that the court "vigorously" supervise implementation of the settlement, should it be approved.  Meanwhile, the Justice Department has [http://www.nytimes.com/2009/04/29/technology/internet/29google.html?_r=3&hp reportedly commenced a review] of the settlement's legality under [http://en.wikipedia.org/wiki/United_States_antitrust_law antitrust law.]
* Prue Adler (Associate Executive Director, ARL)
* Professor Robert Darnton (Head of Harvard Libraries) -- unable to attend
* Jonathan Hulbert (member of Harvard's Office of General Counsel)
* Corey Williams (Associate Director, Office of Government Relations, ALA)


== Discussion of Guests ==
== Content of the Settlement ==
* Google Book Search settlement
The [http://www.googlebooksettlement.com/intl/en/Settlement-Agreement.pdf settlement] creates an entirely new legal regime for book digitization.  Under the settlement, Google will continue to offer GBS, but it will now have to pay authors and publishers for books still under copyright from the revenue it makes from advertising and selling access to the books. To facilitate payment, the settlement creates a new entity called the  [http://en.wikipedia.org/wiki/Book_Rights_Registry Book Rights Registry (BRR)], an independent body chaired by an equal number of author and publisher representatives and charged with maintaining a database of book copyrights and implementing the terms of the settlement.  Google will be able to continue to digitize books, free from fear of litigation (at least from those copyright owners who do not opt out of the settlement and who published a book prior to January 5, 2009).  In return, 63% of revenue from advertising and book sales will be channeled to the BRR for distribution to rights owners.  The BRR also possesses other powers.  For instance, the BRR has some discretion over dividing revenues between publishers and authors, has some approval power over GBS’s security standards, will help set book prices, and can even license copyrights to third parties besides Google.  The settlement is thus non-exclusive to Google, though it contains a [http://dltj.org/article/first-formal-gbs-objections/ “most favored nation clause”] that requires the BRR to give Google at least as good terms as any other third-party for the 10 years after the settlement’s approval.
** Google - Alex MacGillivray (chief in-house counsel for IP)
** Authors
*** [http://www.bonizack.com/ Michael Boni or Joanne Zack] (primary negotiators of the settlement for authors (including the Authors' Guild))
** Publishing groups - Jeffrey Cunard (Debevoise, Berkman, one of the primary negotiators of the settlement for publishers (including the AAP))
** Libraries
*** ALA - Corey Williams (Associate Director, Office of Government Relations)
*** ARL - Prue Adler (Associate Executive Director)
*** Harvard Libraries - Robert Darnton (director), John Palfrey, Jonathan Hulbert (lawyer in Harvard's Office of the General Counsel involved in the negotiations)
** Other Commentators
*** Lessig? (he is probably more useful for a different topic)


* Amazon Kindle people
Users of GBS will continue to be able to search the contents of books, but instead of returning "snippets" for copyrighted books, search results will depend on the type of book.  The settlement places each book in one of three categories.  First are public domain books, which users will continue to be able to view in their entirety.  It is estimated that about 20% of GBS books are in the public domain.  Second are books that are in-copyright and commercially available, which means that they are available for sale new through a “customary channel of trade” (e.g. it is available new at Amazon). For these books, GBS will display only bibliographic information and "front material" (copyright page, table of contents, index, etc.).  About 10% of GBS books are estimated to be in-copyright and commercially available.  Third are books that are in-copyright but not commercially available, which account for an estimated 70% of GBS books, and thus make up the bulk of what is covered by the settlement.  For these books, users will be able to view up to 20% of the book (with some restrictions).  Rightsholders may choose to deviate from these default settings and individually set the amount of each of their books available for users to view.  Institutions and individual users will also have the option of paying for permanent online access to the full content of digitized books.  The initial price of the institutional subscription will be set with reference to the prices of products and services “comparable” to GBS and will vary based on the type of institution (e.g. whether it is a corporation, a library, or a government office) as well as how many people are members of the institution.
* People from publishing companies doing offering innovative services, products, or editing processes involving the internet. (Does anybody know of such companies?)
* Someone who has studied self publication on the internet (names?)
* Someone who has studied reading habits in conjunction with the shift away from printed media (names?)


:'''The Big Think team might be able help secure some of these folks -- hit me up at peter@bigthink.com if you'd like some assistance making contact. [[User:PeterH|PeterH]] 07:11, 25 December 2008 (UTC)
Libraries are also specifically addressed by the settlement.  Each public library will be [http://dltj.org/article/gbs-settlement-public-access/ provided a single GBS terminal] that will display the entire content of the Institutional Subscription Database (ISD), essentially a database comprised of all books that are in-copyright but not commercially available.  Academic libraries will be allowed to have multiple terminals with such access, based on the number of full-time equivalent students enrolled at their respective schools. Institutions may also purchase subscriptions to the ISD. "Fully participating libraries" are given digital copies of any book scanned from their collections, as well as digital copies of books that are in their collection but were scanned from another library, provided that a sufficient proportion of their own collection has been digitized.


= Readings =
== Implications of the Settlement ==
What does this [http://www.googlebooksettlement.com/intl/en/Settlement-Agreement.pdf settlement] mean for libraries and the reading public?  Many have hailed it both for improving access to knowledge by creating [http://www.nytimes.com/2008/11/30/opinion/30gleick.html?em "the long dreamed of universal library"] (or at least a [http://paulcourant.net/2008/10/28/the-google-settlement-from-the-universal-library-to-the-universal-bookstore/ "universal bookstore"]) and for [http://lessig.org/blog/2008/10/on_the_google_book_search_agre.html providing more access] than may have been permitted under fair use if the case had gone to trial.  But is this settlement optimal for all interested groups?  The ramifications of the settlement will affect not only those parties who participated in the negotiations (Google, authors, and publishers), but also libraries and the reading public, neither of whom had a direct voice in the settlement-drafting process.  Each of these groups will likely face a different set of benefits and problems.  With a focus on libraries (public, private, and academic), we aim to identify the main challenges that the Google digitization project will entail and develop ways of working within the bounds of the settlement to mitigate these concerns.


== Assigned Readings ==
'''Substantive concerns generally fall within one of the following categories:'''
* ''Access'': Are the permitted free uses of copyrighted books sufficient?  Is the number of public terminals provided to libraries adequate?  Are subscription costs for individuals, and especially institutions, going to be too high?  Could more be done to increase access to the wealth of knowledge in GBS?  These questions get to the heart of the debate over GBS, and are discussed at length in the assigned [http://www.nybooks.com/articles/22281 Darnton] and [http://paulcourant.net/2009/02/04/google-robert-darnton-and-the-digital-republic-of-letters/ Courant] pieces.
* ''Antitrust and Monopoly'': Does Google's de facto monopoly of digitized book search violate antitrust law?  Even if not, are there protections that could be implemented to ensure that Google does not misuse its monopoly?  And what about the BRR's monopoly control over licensing digitized works?  Will this harm authors or would-be competitors of GBS?  How does the most favored nation clause impact the ability of competitors to GBS to emerge?  The assigned [https://perma.cc/2QTZ-JE3U Grimmelmann piece] provides a good discussion of these issues.
* ''Copyright'': What are the substantive implications of the settlement for copyright law?  Was Google [http://lessig.org/blog/2008/10/on_the_google_book_search_agre.html right to settle] rather than [http://balkin.blogspot.com/2008/10/google-book-search-settlement.html pursue its fair use defense?]  Does Google's settlement make it more difficult for competitors to rely on a fair use defense?  And what consequences will the settlement have for orphan works?  Are Google and the BRR reaping profits from orphan works [http://radar.oreilly.com/2009/04/legally-speaking-the-dead-soul.html  illegally or unfairly?]  And how will the BRR's copyright database help future licensees track down copyright owners?  Will the database be openly available and readily updated?
* ''International Implications'':  What impact will the settlement have on foreign works in the U.S.?  Will [http://blog.librarylaw.com/librarylaw/2009/04/google-book-settlement-orphan-works-and-foreign-works.html foreign authors be exploited?]  Individuals using GBS outside the U.S. are [http://books.google.com/googlebooks/agreement/#2 not covered by the settlement.]  Could the settlement's coverage be expanded so that they too will have the increased access the settlement provides?  Is there a problem with the disconnect that foreign works available in the U.S. are covered by the settlement and will create revenue for Google and the BRR whereas international users will not have increased access to GBS?
* ''Privacy and Security'':  Will GBS [http://booksquare.com/google-book-search-and-reader-privacy-a-consideration-and-call-to-action/ monitor users' reading habits?]  If not, what security will it provide so that others are not able to acquire this information?  If so, what protections will Google provide readers so that the information is not used or disseminated in harmful ways?  And will Google [http://www.wired.com/epicenter/2009/05/libraries-warn-of-censorship-privacy-cost-in-googles-digital-library/ censor books in GBS?]  If so, what measures will be taken to ensure the censorship is limited to appropriate situations (e.g. preventing minors from viewing obscenity)? 
* ''User Interface'': Could the settlement provide more [http://blogs.lib.berkeley.edu/shimenawa.php/2009/01/26/a-fire-on-the-plain imaginative options for readers], such as better annotation or modification features?  Does GBS's scanning [http://www.nybooks.com/articles/21514 fragment works] in harmful ways?  And how will Google choose which books and which editions to show at the top of GBS searches?  Could the settlement have done more to resolve these issues?


* The [http://www.googlebooksettlement.com/intl/en/Settlement-Agreement.pdf settlement agreement], especially Article VII on the obligations and rights of participating libraries.
= Outline of Class Plan =
* A [http://www.arl.org/pp/ppcopyright/google/ summary] of the Google Books settlement provided by the Association of Research Libraries and the American Library Association.
* A report on the settlement's implications that will be issued by the American Library Association, Association of Research Libraries, and Association of College and Research Libraries after their closed discussion on Feb. 9.
* An [http://www.thecrimson.com/article.aspx?ref=524989 article] from the Harvard Crimson on Harvard's withdrawal from the Google Book project.
* A [http://www.authorsguild.org/advocacy/articles/settlement-resources.attachment/joint-press/Joint%20Press%20Release.pdf statement] from some public university libraries in support of the project.
* A [http://laboratorium.net/archive/2008/11/08/principles_and_recommendations_for_the_google_book#P2 blog post] outlining potential problems with the settlement and proposing some revisions to it.


== Discussion of Readings ==
We will be working from the assumption that the current proposed settlement will be approved by the court.  Within the framework that this settlement establishes, then, how can we mitigate the concerns that have been raised by libraries and the reading public?  We hope to explore the contours of these problems and produce an array of possible solutions.


* Harvard [http://www.thecrimson.com/article.aspx?ref=524989 withdrew] from the Google Book project because it found the terms of the settlement agreement unpalatable. [[User:Cooper|Cooper]] 15:51, 12 January 2009 (UTC)
'''Class time will be segmented as follows:'''
* The class will begin with a short overview of the settlement itself, provided by the student presenters responsible for this class session. 
* Then, each of our three [http://cyber.law.harvard.edu/iif/The_Google_Book_Search_Settlement#Guests guests] will be given ten minutes to present his views on the settlement and begin to answer the question posed above. 
* After our guests have introduced their perspectives on the settlement, we will follow up with questions we have prepared for them.  These questions are designed to ensure that all important aspects of the settlement and its expected effects are brought up.
* Finally, we will open the session to class discussion, guided by the Berkman Question Tool (see [http://cyber.law.harvard.edu/iif/The_Google_Book_Search_Settlement#Class_Participation Class Participation]).  The discussion will be focused on normative suggestions for how institutions and readers should respond to the settlement in the wake of its (presumed) approval by the court.


* I just remembered a potentially relevant book I read as an undergrad, called Scrolling Forward.  I'll peruse it over break looking for useful excerpts.  [[User:Gwen|Gwen]] 22:03, 18 December 2008 (UTC)  Actually, I don't think this book will be helpful.  It's a bit too philosophical and vague for our purposes; there isn't a particular chapter focused on one concrete problem. [[User:Gwen|Gwen]] 19:49, 2 January 2009 (UTC)
= Guests =


* The Association of Research Libraries and the American Library Association have released a neutral [http://www.arl.org/pp/ppcopyright/google/ summary] of the settlement that highlights the most relevant parts for libraries. [[User:Lbaker|Lbaker]] 17:51, 23 December 2008 (UTC)
Three guests will be participating in our class session.  Two of them will be joining us live, and one will join us over videoconference.  All three guests will be present for the duration of the class session.  


* [http://www.authorsguild.org/advocacy/articles/settlement-resources.attachment/joint-press/Joint%20Press%20Release.pdf Here] is an interesting statement from some of the public university libraries about their support for the agreement and what they view as its benefits.  [[User:Cooper|Cooper]] 20:13, 23 December 2008 (UTC)
* [http://www.debevoise.com/Attorneys/detail.aspx?id=541bb1af-ea41-4f18-b528-d40bdefabbfb&type=showfullbio Jeffrey P. Cunard] (via videoconference), managing partner of the Washington, D.C. office of the law firm [http://www.debevoise.com/ Debevoise & Plimpton LLP].
* [http://www.law.harvard.edu/faculty/directory/index.html?id=486 John G. Palfrey], Henry N. Ess III Professor of Law at Harvard Law School, Vice Dean of Library and Information Resources at Harvard Law School, and [http://cyber.law.harvard.edu/people/jpalfrey Faculty Co-Director] of the [http://cyber.law.harvard.edu/ Berkman Center for Internet and Society](Blog located [http://blogs.law.harvard.edu/palfrey/ here].)
* Allan A. Ryan, Director of Intellectual Property at [http://harvardbusiness.org/ Harvard Business Publishing].


* Less specifically related to libraries, but [http://laboratorium.net/archive/2008/11/08/principles_and_recommendations_for_the_google_book#P2 this blog post] outlines some potential problems with the settlement and proposes some tweaks that would better serve the public.
= Class Participation =


* On February 9, the [http://www.arl.org/ ARL], [http://www.ala.org/ala/mgrps/divs/acrl/index.cfm ACRL] and [http://www.ala.org/ ALA] are hosting a session on the Google settlement agreementIt will not be webcast or archived (or even open to the public), but its conclusions regarding the implications of the agreement will be released in a report. The report should be drafted and available in time to be read before our session.[[User:Gwen|Gwen]] 21:50, 18 January 2009 (UTC)
We will be using our old friend, [http://cyber.law.harvard.edu/questions/chooser.php the Berkman Question Tool]!  (The instance name for our class session is IIFGBS, and it is located [http://cyber.law.harvard.edu/questions/IIFGBS here].)  Before the day of our class session, we will seed the question tool with some questions that we feel are particularly relevant and topical.  We encourage everyone to take a look at these questions prior to the class and to make comments, add more questions, and vote up the questions you find most interestingWe will be using this list of questions to guide the class discussion, especially in the final portion of the class.


= Class Participation/Use of Tech =
= Readings =
 
== Assigned Readings ==


Our subject will revolve around the particular challenges that the settlement, as it stands, presents for libraries, as well as the missed opportunities for the reading public. Given the "brain trust" present in the members of this seminar, as well as the experience of the guests who will be joining us, we believe we could collectively draft a list of recommendations for incorporation into a final, court-approved settlement, similar in format to the Laboratorium post (in the list of potential readings), but focused on libraries (and the reading public). This may then be sent to the court (if this is feasible or there is some mechanism for public input in the settlement-approval process) or just nailed to the proverbial church door (blog, Berkman site, wherever).
* [https://perma.cc/CL4K-YVR2 This summary of the settlement], prepared by [http://www.policybandwidth.com/ Jonathan Band] on behalf of [http://www.arl.org/ the Association of Research Libraries] and [http://www.ala.org/ the American Library Association].


This list of recommendations will be projected, drafted, and edited dynamically throughout the session. Ideally, some technology that would allow for simultaneous editing, as opposed to the one-at-a-time restrictions of the wiki format, would be employed. There would be an opportunity for further editing of the list of recommendations after the class, for perhaps a week or two, and then deposited online and/or brought to the court's attention after a last round of "polishing" edits.
* [http://cyber.law.harvard.edu/iif/sites/iif/images/Settlement_Selections.doc These selected provisions] from the settlement agreement.


:: As for technology for simultaneous document editing, check out gobby (google it). [[User:Mchua|Mchua]] 22:55, 12 January 2009 (UTC)
* [https://perma.cc/2QTZ-JE3U This article] by [http://james.grimmelmann.net/ James Grimmelmann], outlining potential problems with and revisions to the settlement.


We will be using Gobby as a means of collaborating, as suggested by Mel (thanks Mel!).
* [http://www.nybooks.com/articles/22281 This New York Review of Books article] by [http://history.fas.harvard.edu/people/faculty/darnton.php Robert Darnton] criticizing the settlement. (Or, for those who prefer to listen rather than read, [http://www.npr.org/templates/story/story.php?storyId=100969810 hear Darnton's criticisms] on [http://www.npr.org/ National Public Radio].)
*For Windows users, download it [http://gobby.0x539.de/trac/wiki/Download here]
*For Mac users, follow the instructions [http://www.divvun.no/doc/tools/gobby.html#Mac+OS+X here]
*For Unix users, the instructions are [http://www.divvun.no/doc/tools/gobby.html#Linux here]


= General Discussion =
* [http://paulcourant.net/2009/02/04/google-robert-darnton-and-the-digital-republic-of-letters/ This letter in response to Darnton] by [http://www-personal.umich.edu/~pnc/ Paul Courant] (head of the University of Michigan's library, a participant in Google Book Search), challenging Darnton's criticism of the settlement.


The internet has completely changed the meaning of publication, and the relationship between print and digital media is continually evolving.  The advent of the personal computer and the internet have changed the way information is assembled, distributed, managed, and digested in ways at least as dramatic and consequential as the advent of the printing press. How are traditional publishers coping with these changes?  What new forms of publishing are made possible by the internet, and what challenges do they entail? --[[User:Gwen|Gwen]] 16:34, 1 December 2008 (EST)
* [http://www.nybooks.com/articles/22496 This New York Review of Books piece] by Darnton in response to Courant's challenges. (Skip the published version of Courant's letter here, which is less detailed than the one posted on his blog at the link above, and just read Darnton's response at the bottom of the page.)


:'''Might be worth coordinating with the people doing media/press day -- could make them back to back and have them relate in some way. [[User:JZ|JZ]] 04:52, 16 December 2008 (UTC)
== Optional Readings ==


Since the "Future of News" group is focusing on news providers and how they are adapting to the disruption caused by the internet (and, hopefully, harnessing the advantages the internet provides), we could extend that discussion by looking at how other groups are handling that transition (ie. the open access publishing, effects on non-news-providing publishers).  Alternatively, it would be a nice parallel if we focused on self-publishing - that is, part of the disruptive process that is causing the collapse of the traditional newspaper business model (depending on how much the other group covers this, and whether we'll have enough info on that topic to make a session out of it...I guess the question here would be how to commercialize/monetize it - or are there other ideas for good questions?).
*Experiment with the current incarnation of [http://books.google.com/ Google Book Search.]


In general, I think our first step is to decide whether we want to focus more on the open access/self-publishing aspect, or the Google Book Search/"death of paper" aspect. Which topic seems most promising/interesting/etc. to you guys? (directed at Gwen and Jon) [[User:Lbaker|Lbaker]] 17:18, 18 December 2008 (UTC)
*Skim the [https://web.archive.org/web/20091209165819/https://www.googlebooksettlement.com/intl/en/Settlement-Agreement.pdf settlement agreement], focusing on Article 4 (Economic Terms for Google Book Search) and Article 7 (Obligations and Rights of Participating Libraries).


I agree that that decision makes sense as a first step (unless we can split the session into two parts, perhaps with two different virtual visitors? but that would definitely risk sacrificing quality for quantity in trying to cover too much). Personally, I think I am a bit more interested in the Google Book Search / death of paper topic. I kind of like that that goes in a different direction from the news people, because it would add more variety to the class as a whole and carry less risk of redundancy. --[[User:Gwen|Gwen]] 22:01, 18 December 2008 (UTC)
*For the highly ambitious, [http://waltcrawford.name/ Walt Crawford] of [http://citesandinsights.info/ Cites and Insights] has compiled [http://citesandinsights.info/civ9i4.pdf this 30-page newsletter], which includes excerpts from an enormous number of blog posts, accompanied by his own commentary.  It covers the landscape up until late February of 2009.


== The Relationship Between Print and Digital Media ==
*[http://www.economist.com/science/tq/displaystory.cfm?story_id=13174399 This Economist piece] introduces [http://en.wikipedia.org/wiki/Brewster_Kahle Brewster Kahle] and his [http://www.archive.org/index.php Internet Archive] as an alternative or supplement to Google Book Search.


=== Google Book Search ===
== Further Research ==


What does the [http://www.googlebooksettlement.com/index.html recent settlement] between Google and the Authors Guild/American Association of Publishers regarding online accessibility of digitalized books mean?  Many have hailed it for both improving access to knowledge by creating [http://www.nytimes.com/2008/11/30/opinion/30gleick.html?em "the long dreamed of universal library"] and for avoiding a judicial resolution that might have exposed antiquated aspects of US copyright law.  But there may also be troubling aspects of having access to such a large and unique collection of content controlled by a single for-profit company (the agreement is non-exclusive to Google, but it may be  difficult for a legitimate competitor to emerge, given Google's sizable first mover advantage).
*The full settlement, including all of its attachments, is available [https://web.archive.org/web/20091209165819/https://www.googlebooksettlement.com/intl/en/Settlement-Agreement.pdf here].


Is this settlement optimal for all interested groups?  Presumably it is for Google and the Authors Guild/AAP, but what about externalities for non-parties, such as the reading public?  Is some sort of government intervention appropriate to ensure access to this "universal library"?  What difference does it make, if any, that this "universal library" is operated by a private company reliant on many [http://www.google.com/googlebooks/partners.html public university libraries?]
*[http://pureinformation.org/about/ Timothy Vollmer] of [http://pureinformation.org/ pureinformation.org] has compiled a comprehensive and regularly updated [https://web.archive.org/web/20090523074107/http://pureinformation.org/archives/2008/11/25/google-book-settlement-link-dump-awesomeness/ list of links] to articles and blog posts about the settlement.


We could also look at the costs/difficulties libraries (and hosts for the Research Corpus) would face under the deal, including the provided punishments, and the feasibility of implementation/problems libraries might have implementing them.  To me, one part that seems particularly problematic is how Research Corpus would implement a means to restrict researchers to "non-consumptive research". [[User:Lbaker|Lbaker]] 17:21, 18 December 2008 (UTC)
= Class Session Recap =


:'''Once again Berkman Center alums and affiliates are all over this -- Alex MacGillivray for Google and Jeff Cunard for the publishersWe could pull together a great session on the deal, although it takes some time to absorb its parameters -- before getting to a place where we can have a cutting-edge discussion about it. [[User:JZ|JZ]] 04:52, 16 December 2008 (UTC)
The class session was dominated largely by discussion among our three guests, punctuated by questions from the class.  What follows below is a thematic summary of the views that were expressed during this discussionFor a more detailed account of the class session (including diachronic notes on the guests' presentations, the live discussion, and the online discussion, as well as lists of our prepared questions), see our [http://cyber.law.harvard.edu/iif/sites/iif/images/Class_Notes_03.30.09.pdf edited class notes].


=== The Shifting Role of Publishing Companies ===
''A Private Settlement With Benefits for Everyone''


As noted above under "Self Publication," the internet makes it very easy for individuals to make their work widely availableHowever, actually garnering a sizable audience or realizing a profit from one's work remains a greater challenge; it appears to be with respect to this step that the services of traditional publishers appear to retain some value.  After all, publishing companies offer marketing channels and name recognition in addition to simply machines that print a books.  Are traditional publishing companies threatened by the new forms of publishing that the internet makes possible?  Are publishers better off battling the internet (for example, by emphasizing the superiority and reliability of their traditional services) or embracing it (for example, by offering digital and internet-based publication services)? --[[User:Gwen|Gwen]] 16:16, 1 December 2008 (EST) Should the latter services and items -- such as ebooks, audiobooks in mp3 format, and Amazon Kindle -- be replacements for or compliments to printed books?  --[[User:Gwen|Gwen]] 07:32, 2 December 2008 (EST)
Despite the many concerns (e.g. [http://www.nybooks.com/articles/22281 Darnton's]) and settlement amendment proposals (e.g. [https://perma.cc/2QTZ-JE3U Grimmelmann's]) raised in newspapers and the blogosphere, it is important to remember that this settlement is the (presumptive) result of actual litigation between specific parties. The parties have taken care to incorporate other interests into the negotiation process, but at its core the settlement is designed to satisfy the concerns of publishers and authors that led them to sue Google in the first place. Nonetheless, major libraries had a presence during the negotiations, participating both directly and through Google. The public interest was also taken into account in a variety of ways, culminating in the [http://dltj.org/article/gbs-settlement-public-access/ provision of public access terminals] to all public and academic libraries and substantial [http://findarticles.com/p/articles/mi_m0EIN/is_2008_Oct_31/ai_n30957631/?tag=content;col1 access provisions for the visually impaired]. It should also be noted that copyright holders, as members of the reading public themselves, had a stake in ensuring that the public interest was not ignored during the negotiations. All three guests agreed that the settlement represents a substantial improvement over the status quo for everyone affected. Professor Fisher, however, suggested that "an ideal dream scenario" would be a more useful basis of comparison than the status quo, and once such an ideal was described, the settlement was found to fall short of the ideal.


=== The Fate of Printed Materials ===
''Publishers and Authors:  Upholding the Copyright Regime''


Will the internet cause the use of printed materials to decline to the point that printed materials become obsolete?  Obsolescence is reality in my own experience with The ''Harvard Journal of Law and Technology'' (''JOLT''). ''JOLT'' publishes its articles online on its [http://jolt.law.harvard.edu/ website], and it also publishes shorter and more timely posts online in its companion, the [http://jolt.law.harvard.edu/digest/ JOLT Digest].  In addition to being available directly to any internet user, all ''JOLT'' articles are made available through legal research databases, including Westlaw and Lexis. Each semester, we order from our publisher (Hein) enormous boxes of the new issue in print, but we have no idea what to do with them.  Even after giving away copies to our parents, there are still stacks and stacks of unwanted and unneeded paper copies, and a lighthearted dialogue about what to do with them has steadily taken over the dry erase board in our office. These printed copies of our journal are literally useless. --[[User:Gwen|Gwen]] 16:32, 1 December 2008 (EST)
From the publishers’ perspective, the litigation was essentially concerned with ensuring that copyright remains a permissions-based regime. When Google decided to start digitizing books on a massive scale, it did so without attempting to secure permission from rights holders; this move was a bold challenge to the existing regime. Publishers were also concerned about potential repercussions from Google’s practice of providing libraries with digital copies of the books scanned from their collections. Library users have [http://fairuse.stanford.edu/Copyright_and_Fair_Use_Overview/chapter7/7-d.html special exemptions] under the Copyright Act to copy books in a library’s collection, and these exemptions were carved out during an era where photocopying a book, page by page, was the only means of producing such a copy. In today’s digital era, this exemption for library users, coupled with the ease of online library "use" resulting from mass digitization, could lead to much more widespread copying of books in library collections without any revenue being generated for the rights holders. It was these concerns that motivated publishers and authors to bring suit against Google.  


The way that readers encounter and digest information is vastly different in the context of printed materials and in the context of digital and online materials. These differences have consequences for both academic researchers and regular citizens in terms of both the kind of information an individual is exposed to and the way that the individual approaches those sources.  If a dramatic shift away from printed media is happening, what other shifts does that entail for the way that people learn, synthesize, and evaluate information? --[[User:Gwen|Gwen]] 16:45, 1 December 2008 (EST)
''Libraries: Concerned About Resource Strain and the Proprietary Database''


We talked about an [http://www.theatlantic.com/doc/200807/google interesting article] relating to the topic of how digital media and the internet are affecting the way in which people read in JZ's 1L reading group. The article relates more to how the presentation of written material on the 'net (short and skimmable, links galore, etc.) is affecting the way we process information and our ability to read "long" pieces (ie. more than a page or so) without becoming distracted. It is a bit tangential to the specific discussion of the movement of print media onto digital form (since it mostly discusses the ''differences'' between the format of media in each of the forms), but is interesting regardless. [[User:Lbaker|Lbaker]] 08:55, 2 December 2008 (EST)
Although the settlement would result in a net increase in public access to information, there are causes for concern from the perspective of libraries. The provision of a free public access terminal to every library represents a boon to small-town local libraries, but it likely would act as a strain on a larger libraries with more user demand, such as the New York Public Library system. The digital book database that Google is amassing would be an invaluable corpus on which to perform various types of research, and research libraries are rightfully concerned about restrictions on the use of this information. Potential problems also arise from the concentration of this corpus in the hands of a single (or small handful of) powerful private actor(s). The issue of concentrated private power over vast information is particularly alarming because the settlement effectively precludes any potential competitors from offering a comparable service in the near future.  


=== Distribution Channels ===
''The Settlement Mechanism as a Problem-Solving Tool''


How is the internet changing the way printed materials are distributed?  [http://www.amazon.com Amazon.com] appears to be taking over the role of brick-and-mortar bookstores by offering a cheaper and more convenient way to purchase new printed books; their "look inside" feature makes the online shopping experience even more similar to being in a live bookstoreSimilarly, [http://www.abebooks.com Abebooks.com] and similar websites have made it possible for individuals to locate and purchase used, out-of-print, and rare books from one another without requiring the research services of specialized booksellers.  Even if hard copy printed materials remain in demand, might bookstores become obsolete? --[[User:Gwen|Gwen]] 19:41, 4 December 2008 (EST)
Additionally, considerations of institutional competence were raised.  The effects of the settlement on copyright doctrine -- and in particular, the [http://www.nytimes.com/2009/04/04/technology/internet/04books.html fate of orphan works] -- are uncertain but have the potential to be far-reachingAre broad legal issues that so fundamentally affect the public interest best decided through private litigation, or should Congress step in? Is it wise to establish such a rigid, comprehensive system for the digital book realm before we, as a society, have a better idea of how we want to structure our digital world?


== The Publication Process ==
''Concrete Normative Suggestions''


=== Open Access Publishing ===
* Professor Nesson rejected our premise that the settlement will be accepted in its current form.  He views the threats posed by this settlement to digital freedom as urgently severe and alarming; thus, he advocates active intervention in the settlement process itself.  Indeed, following our class, Professor Nesson [http://www.scribd.com/doc/14227449/Letter-to-Request-Intervention-in-Authors-Guild-v-Google moved to intervene] as counsel on behalf of professors [http://www.lewishyde.com/index.html Lewis Hyde] and [http://www.eecs.harvard.edu/~lewis/ Harry Lewis] and the [http://openaccesstrust.org/ Open Access Trust], although the request was [http://docs.justia.com/cases/federal/district-courts/new-york/nysdce/1:2005cv08136/273913/92/ denied].
* Professor Zittrain sees the "golden copies" of digitized books that Google provides to participating libraries as holding great potential as instruments of change.  He suggests that libraries cooperate in a large-scale creative use of these copies.
* Professor Fisher emphasized the importance of understanding the problem on the big-picture level of legal theory and social goals before electing a particular course of action by a particular institution.
* Professor Palfrey suggests three generalized improvements to the settlement that would begin to address many of the concerns that have been raised:
** Ensure the possibility of a meaningful competitive landscape, such that second-comers are not barred from success.
** Establish a means by which the public can have a meaningful level of control over the workings of the Book Rights Registry.
** Create a system of periodic review for the settlement terms. This system would not need to involve periodic wholesale review of the entire settlement by the courts; it could instead merely involve libraries negotiating sunset provisions for individual works with publishers, authors, and other rights holders.


Addressing whether there actually seems to be a movement toward this model, and away from traditional science/tech publishing.  What effects movement toward this model might have on quality, oversight, etc. of published articles.  Also, discussion of business models/funding, problems with open access models, etc.  And any copyright issues (to tie things back to law).
= Teacher's Guide =


This can relate both to open access of full articles (as with [http://www.plos.org/ PLoS]) or single experiments/results (including [http://sciencecommons.org/ Science Commons] and like projects to both make the data available, and, perhaps more importantly, the technologies to make it available in usable form)
This section is intended to provide guidance to anyone interested in teaching about the Google Book Search settlement in the future.  Anyone who has taught on this topic is encouraged to use this wiki page to share additional wisdom gained from the experience.  The development of our plan for this class session (including our selection of the Google Books Settlement as a focus within the broader topic of the internet and publication, as well as our decisions about guests, readings, and technology use) is documented on [http://cyber.law.harvard.edu/iif/Talk:The_Google_Book_Search_Settlement the discussion page].


Would "open review" (instead of "peer review") work? Are there any models around? What about a Slashdot-style system of moderation and meta-moderation?
== Evaluation of the Class ==


Yes, there is at least one example that I can think of.  Lawrence Lessig published the first edition of his book Code in 1999.  It came out in paper and ink.  Several years later, in order to "translate" (his word) the book into a second edition, Lessig persuaded the publisher (Basic Books) to allow him to post the entire text of the first edition of the book on a wiki hosted by Jotspot.  (The Wiki text was licensed under a Creative Commons  Attribution-ShareAlike 2.5 License.)  Lessig explains, "a team of 'chapter captains' helped facilitate a conversation about the text.  There were some edits to the text itself, and many more valuable comments and criticisms.  I then took that text as of the end of 2005 and added my own edits to produce this book." (Preface to ''Code version 2.0'', x.)  ''Code version 2.0'' is the result of this collaborative editing process.  It is available for purchase in paper and ink, for free as a [http://pdf.codev2.cc/Lessig-Codev2.pdf PDF download], and also on a [http://www.socialtext.net/codev2/index.cgi wiki] hosted by Socialtext. --[[User:Gwen|Gwen]] 15:45, 1 December 2008 (EST)
''Topic Breadth and Time Constraints''


[http://openaccess.eprints.org/ Stevan Harnad] put a suitable question on the [http://www.eprints.org/openaccess/ ePrints] site:
The Google Book Settlement is a large, diverse, and contentious topic. Even with our efforts to limit the scope of our discussion by (a) assuming the settlement would be approved and (b) focusing primarily on concerns raised by libraries and the reading public, two hours was enough time only to scratch the surface. One way to help alleviate this difficulty would be to focus the topic even more narrowly (for instance, by choosing to discuss only one or two of the [http://cyber.law.harvard.edu/iif/The_Google_Book_Search_Settlement#Implications_of_the_Settlement major substantive concerns] we have identified), albeit at the cost of precluding discussion on subjects that students might find more interesting.  The most significant factor contributing to the palpable time constraint, however, is the sheer complexity of the settlement itself. At over 140 pages plus appendices, the settlement contains a lot of material that must be digested before potential concerns may be formulated and discussed. [http://www.arl.org/bm~doc/google-settlement-13nov08.pdf Band's summary] does a good job of explaining the most relevant portions of the settlement, and [https://perma.cc/2QTZ-JE3U Grimmelmann's article] explains major concerns in the context of the settlement, but neither piece addresses all the relevant details. As a result, significant time was spent with Mr. Cunard answering specific questions on the terms of the settlement before the class could proceed to a more high-level discussion about concerns and potential solutions. Additionally, while each of our guests brought his own unique and insightful perspective to the settlement, we found two hours to be insufficient to appropriately accommodate three guests.


:"Why did 34,000 researchers sign a threat in 2000 to boycott their journals unless those journals agreed to provide open access to their articles - when the researchers themselves could provide open access (OA) to their own articles by self-archiving them on their own institutional websites?"
''Reading Assignments''


More specifically, we can look at the pros/cons of open access journals (or open access controlled/granted via publishing companies) vs. self-archiving (ie. open access by academics themselves) if this is still a hot/open debateAlso, as the first commenter pointed out on [http://pubfrontier.com/2007/12/11/putting-science-into-science-publishing/ this] blog post, "open access doesn’t mean easy access." So perhaps we could also address the question of how to make open access publications as accessible as non-OA forms.
The amount of reading we assigned seemed proportional to a two-hour class session, and for the most part, the content of the readings was relevant to our discussion.  We sought to provide a descriptive introduction of what the settlement encompasses and what issues it raises, as well as an introduction to the normative implications of the settlement.  For descriptive pieces, we chose not to assign the entire settlement given its length and technical legal language (which would have been particularly difficult for the non-law students in our class).  Instead, we assigned [http://www.arl.org/bm~doc/google-settlement-13nov08.pdf Band's piece] to provide a readable summary of the settlement, some [http://cyber.law.harvard.edu/iif/sites/iif/images/Settlement_Selections.doc selected examples] of provisions in the settlement to give a taste of the actual settlement, and [https://perma.cc/2QTZ-JE3U Grimmelmann's primer] on the chief issues raised by the settlementAs to normative pieces, we assigned what we felt were the most provocative (and the most discussed) pieces on the settlement: the [http://www.nybooks.com/articles/22281 Darnton] and [http://paulcourant.net/2009/02/04/google-robert-darnton-and-the-digital-republic-of-letters/ Courant] articles, plus [http://www.nybooks.com/articles/22496 Darnton's response] to Courant. We felt the readings all served their purpose well during the class (the students seemed relatively well informed of the descriptive aspects of the settlement and its normative implications), with the sole exception being the actual settlement provisions that we assigned but whose language we did not examine closely during classWe felt that it would be useful to have students--especially law students--glimpse what the settlement's actual language looked like, but given our time constraints, we were unable to delve into the particular language and whether it would implement the settlement in the manner that the various commentators expect it to.


A further thought - does ownership/copyright of published articles pass to the journal in most cases? (I think this was the case for the Journal of Pharmaceutical and Biomedical Analysis, the only journal I've had direct experience with).  If so, does that give the journal the power to determine where and to what degree the article could be published?  That is, is this a (potential or real) barrier to self-archiving, and, if so, what can be done about it?  Academics likely could not change this part of the publishing agreement unless they reach critical mass, especially for the better-known journals.
''Discussion Framework and Presence of Guests''


ePrints is apparently the leading software for academic self-archiving (according to the [http://en.wikipedia.org/wiki/Eprints Wikipedia] page), and Stevan Harnad (''long'' interview [http://poynder.blogspot.com/2007/07/oa-interviews-stevan-harnad.html here]) is apparently one of the academics who has been leading the charge in open access to (scientific) academic journals/publications.
A substantial portion of the class discussion focused on the question of whether the pre-settlement status quo was the appropriate comparison against which the settlement should be judged. One specific alternative “dream scenario” was suggested as a potential foil to the settlement, but it seems that more could have been done with this approach. A more lively discussion of alternatives to the settlement might have emerged had the guests only been present for the first hour of the class. Planning for a portion of class discussion to take place outside the presence of the guests would have prevented monopolization of the conversation by the guests, thereby allowing for more student input and a greater diversity of ideas and proposals. In particular, by removing the “settlement expert” after a specified period of time, the amount of time spent asking detailed descriptive questions about the settlement could have been reduced, allowing more time for an open-ended normative discussion of concerns with the settlement and potential solutions. Of course, if fundamental questions regarding the mechanics of the settlement remained or arose after the guest's departure, an informed discussion of concerns and solutions would become more difficult.  In order for a closed class discussion to be productive, those leading the discussion would need to have a firm grasp of the major points of the settlement and be able to act as a fall-back experts on descriptive matters once the guests have left.


[http://en.wikipedia.org/wiki/Peter_Suber Peter Suber] writes what is apparently "the most authoritative [http://www.earlham.edu/~peters/fos/fosblog.html blog]...on open access".
''Use of Technology''
[[User:Lbaker|Lbaker]] 18:24, 18 December 2008 (UTC)


:'''Harvard has been taking a leadership role on open access. We could definitely do a session on this -- Stuart Shieber, Terry Fisher, and John Palfrey would be natural guests or people to talk to to narrow down the questions. [[User:JZ|JZ]] 04:52, 16 December 2008 (UTC)
This class was served well by a minimal use of technology. The Berkman Question Tool was useful in providing a means for backchannel discussion while not being too distracting. It also provided fall-back questions to use if there was a lull in the discussion, with student voting (presumably) signaling which questions and topics the class found most interesting.  Videoconferencing was an effective way of communicating with a remote speaker. However, the camera placement was not ideal, as it gave the remote guest a view of students' backs.


=== Collaborative and Customized Textbooks ===
== Suggestions for Future Iterations ==


Maybe also Harvard's new open access policy for academic work?
''Topic Management''
(note that the Harvard Free Culture group is working on the matter - see [http://wiki.freeculture.org/Open_University_Campaign The Wheeler Declaration])


JZ described an innovative publication option with which Foundation Press seems willing to experimentessentially, individual chapters are available independently from one another, giving instructors the freedom to custom build a text book that contains exactly their desired materials (no more, and no less), in the desired sequenceAssuming this model is technologically, legally, and financially feasible, what benefits and drawbacks does it entail?  Possible risks might include a lack of completeness and/or organization in the materials ultimately acquired by students as well as the possibility that pedagogical emphasis is dictated by sociologically driven group trends rather than deliberately thoughtful decision making. --[[User:Gwen|Gwen]] 15:57, 1 December 2008 (EST)
Even though our topic was narrowed to addressing the concerns of libraries and the reading public under the assumption that the proposed settlement agreement will take effect, we were able only to scratch the surface of this topic in the two hours allotted.  Structuring the class around a more focused topic (perhaps looking at only one or two of the [http://cyber.law.harvard.edu/iif/The_Google_Book_Search_Settlement#Implications_of_the_Settlement major substantive concerns] we identified) would likely allow for a deeper and more substantial discussion. As the topic becomes more focused, however, it may be more difficult to ensure that all students are interested and feel competent to contribute to the discussionA class specifically addressing antitrust concerns, for example, would likely have alienated many of the non-law students in the class.


Stumbled across a fledgling project that seems to be similar (in some respects) to this issue [http://w.cali.org/eLangdell here].  Doesn't look like it's actually operational at the moment, though. [[User:Lbaker|Lbaker]] 20:28, 15 December 2008 (UTC)
''Readings Management''


=== Self Publication ===
Future iterations of this class would do well to use the readings we assigned, as they are comprehensive, yet readable.  Furthermore, they are currently the pieces most mentioned by those discussing the settlement, so knowledge of them is crucial for being able to participate in the discussion.  The one exception is the actual settlement provisions that we assigned. Careful thought should be put into assigning portions of the settlement agreement itself as reading.  Any such settlement excerpts should be selected deliberately with an eye to how they will be used in class.  Parsing the language of settlement terms might alienate any non-law students in a class, so the composition of student backgrounds should be taken into account in deciding whether to assign reading from the settlement agreement.  Although a glimpse of what the actual settlement looks like can be educational, it could probably be done without.  One further caution is that the settlement is part of a constantly changing landscape (see our  [http://cyber.law.harvard.edu/iif/The_Google_Book_Search_Settlement#History_of_the_Settlement summary of the settlement's current status]), so future classes on the subject will need to make sure they are up to date on any recent developments -- for instance, whether the settlement has been modified, accepted, or rejected.  If a major change does occur, then substitute readings may be needed.


One of the biggest and most obvious changes wrought by the advent of the internet and PCs the ability of individuals to self-publish; it is now cheap, quick, and easy to reach a mass audience with one's own text, images, and sounds.  The rise of blogging, Youtube, and other developments have further increased the ease of self-publication.  I know that several scholars have studied the rise and impact of self publication opportunities, but I'm not sure what conclusions they've drawn or which of them might be interesting to bring in as a guest.  Suggestions? --[[User:Gwen|Gwen]] 16:09, 1 December 2008 (EST)
''Guest Management''


:'''Interested to hear what people might find. :) [[User:JZ|JZ]] 04:52, 16 December 2008 (UTC)
Potential ideas of ways to help foster a lively class discussion that is neither dominated by the guests nor stuck on purely descriptive questions include the following:
*''Fewer guests.'' While each guest had a unique perspective on the settlement, more guests means less time for class discussion and student input.  This trade-off is especially problematic given the complexity of the settlement.
*''Pre-recorded material.'' In order to better control timing and ensure that only the most relevant and interesting guest statements are presented to the class, interviews with the guests and/or prepared statements by them could be recorded and edited.  This edited footage would then be shown to the class instead of, or in conjunction with, live guests.  Recorded interviews would also enable the presentation of guests who are not available on the day of the class session; for example, our class might have benefited from a recorded interview with [http://history.fas.harvard.edu/people/faculty/darnton.php Robert Darnton], who was enthusiastic about participating in the discussion but was unable to join us in real time because of a schedule conflict.
*''Time limits on guests.''  Both to mitigate time constraints and also to lead students away from descriptive questions and toward more normative discussion, the amount of time for which the guests are available could be limited.  One possible arrangement would be to have the guests available for introductions and questions during the first hour, and then to have student discussion proceed during the second hour without the guests present.


''Technology Management''


:'''An embarrassment of riches -- great ideas hereNow to home in on one! [[User:JZ|JZ]] 04:52, 16 December 2008 (UTC)
Technology might be used more effectively in the following ways:
*''Interest polling for planning purposes.'' The Berkman Question tool, or even an online poll, might have been used earlier in the planning process in order to determine which of the major issues the class found most interestingThis technique could have alleviated some of the problems mentioned under "topic management" above, by allowing for a more narrowly focused class while ensuring that at least a plurality of students would be interested in the discussion.
*''Videoconferencing details.''  Appropriate camera placement should be considered in advance, in order to avoid awkward arrangements such as we experienced.  Videoconferencing also offers a convenient way of limiting the period of time during which the guests are available, whereas ushering out live guests would be more difficult.

Latest revision as of 13:39, 25 January 2024

Topic Owners: Gwen, Lee, Jon

Topic Date: March 30

back to syllabus

Concrete Questions of the Week

How will the Google book digitization project affect various interests, including those who were parties to the settlement between Google and the Authors Guild/American Association of Publishers, and those who were not? In particular, what changes will libraries (public, private, and university) and readers face going forward, and how should they respond?

Brief Overview of the Google Book Search Settlement

History of the Settlement

In 2004, Google began to digitally scan books from the university library collections of Harvard, Stanford, Oxford, and the University of Michigan, as well as from the New York Public Library system. Google provided a service, Google Book Search (GBS), whereby users could search within these digitized books. The service allowed users to search and view the entire content of books in the public domain, but it also permitted searching copyrighted books, providing short "snippets" containing the search term and some surrounding text.

Crying foul, the Authors Guild, a writers' advocacy group, initiated a class action law suit on September 20, 2005 in the U.S. District Court for the Southern District of New York, claiming copyright infringement. The Association of American Publishers (AAP) followed with a suit of its own on October 19, 2005. Google responded that its actions were lawful under the doctrine of fair use. Rather than fight that battle, however, Google decided to settle with the Authors Guild and the AAP.

The parties announced a settlement on October 28, 2008. Judge Denny Chin is supervising the settlement process. Under the current schedule, authors have until September 4, 2009 to opt out of the settlement, and the court will hold a final hearing on October 7, 2009, after which it will accept or reject the settlement. On April 14, 2009, Professor Charles Nesson filed a motion to intervene in the lawsuit as counsel on behalf of two professors and the Open Access Trust, and on April 17, 2009, the Internet Archive, an online library led by Brewster Kahle, also sought to intervene. Judge Chin denied both of these requests. On May 4, 2009, a number of library associations filed a brief requesting that the court "vigorously" supervise implementation of the settlement, should it be approved. Meanwhile, the Justice Department has reportedly commenced a review of the settlement's legality under antitrust law.

Content of the Settlement

The settlement creates an entirely new legal regime for book digitization. Under the settlement, Google will continue to offer GBS, but it will now have to pay authors and publishers for books still under copyright from the revenue it makes from advertising and selling access to the books. To facilitate payment, the settlement creates a new entity called the Book Rights Registry (BRR), an independent body chaired by an equal number of author and publisher representatives and charged with maintaining a database of book copyrights and implementing the terms of the settlement. Google will be able to continue to digitize books, free from fear of litigation (at least from those copyright owners who do not opt out of the settlement and who published a book prior to January 5, 2009). In return, 63% of revenue from advertising and book sales will be channeled to the BRR for distribution to rights owners. The BRR also possesses other powers. For instance, the BRR has some discretion over dividing revenues between publishers and authors, has some approval power over GBS’s security standards, will help set book prices, and can even license copyrights to third parties besides Google. The settlement is thus non-exclusive to Google, though it contains a “most favored nation clause” that requires the BRR to give Google at least as good terms as any other third-party for the 10 years after the settlement’s approval.

Users of GBS will continue to be able to search the contents of books, but instead of returning "snippets" for copyrighted books, search results will depend on the type of book. The settlement places each book in one of three categories. First are public domain books, which users will continue to be able to view in their entirety. It is estimated that about 20% of GBS books are in the public domain. Second are books that are in-copyright and commercially available, which means that they are available for sale new through a “customary channel of trade” (e.g. it is available new at Amazon). For these books, GBS will display only bibliographic information and "front material" (copyright page, table of contents, index, etc.). About 10% of GBS books are estimated to be in-copyright and commercially available. Third are books that are in-copyright but not commercially available, which account for an estimated 70% of GBS books, and thus make up the bulk of what is covered by the settlement. For these books, users will be able to view up to 20% of the book (with some restrictions). Rightsholders may choose to deviate from these default settings and individually set the amount of each of their books available for users to view. Institutions and individual users will also have the option of paying for permanent online access to the full content of digitized books. The initial price of the institutional subscription will be set with reference to the prices of products and services “comparable” to GBS and will vary based on the type of institution (e.g. whether it is a corporation, a library, or a government office) as well as how many people are members of the institution.

Libraries are also specifically addressed by the settlement. Each public library will be provided a single GBS terminal that will display the entire content of the Institutional Subscription Database (ISD), essentially a database comprised of all books that are in-copyright but not commercially available. Academic libraries will be allowed to have multiple terminals with such access, based on the number of full-time equivalent students enrolled at their respective schools. Institutions may also purchase subscriptions to the ISD. "Fully participating libraries" are given digital copies of any book scanned from their collections, as well as digital copies of books that are in their collection but were scanned from another library, provided that a sufficient proportion of their own collection has been digitized.

Implications of the Settlement

What does this settlement mean for libraries and the reading public? Many have hailed it both for improving access to knowledge by creating "the long dreamed of universal library" (or at least a "universal bookstore") and for providing more access than may have been permitted under fair use if the case had gone to trial. But is this settlement optimal for all interested groups? The ramifications of the settlement will affect not only those parties who participated in the negotiations (Google, authors, and publishers), but also libraries and the reading public, neither of whom had a direct voice in the settlement-drafting process. Each of these groups will likely face a different set of benefits and problems. With a focus on libraries (public, private, and academic), we aim to identify the main challenges that the Google digitization project will entail and develop ways of working within the bounds of the settlement to mitigate these concerns.

Substantive concerns generally fall within one of the following categories:

  • Access: Are the permitted free uses of copyrighted books sufficient? Is the number of public terminals provided to libraries adequate? Are subscription costs for individuals, and especially institutions, going to be too high? Could more be done to increase access to the wealth of knowledge in GBS? These questions get to the heart of the debate over GBS, and are discussed at length in the assigned Darnton and Courant pieces.
  • Antitrust and Monopoly: Does Google's de facto monopoly of digitized book search violate antitrust law? Even if not, are there protections that could be implemented to ensure that Google does not misuse its monopoly? And what about the BRR's monopoly control over licensing digitized works? Will this harm authors or would-be competitors of GBS? How does the most favored nation clause impact the ability of competitors to GBS to emerge? The assigned Grimmelmann piece provides a good discussion of these issues.
  • Copyright: What are the substantive implications of the settlement for copyright law? Was Google right to settle rather than pursue its fair use defense? Does Google's settlement make it more difficult for competitors to rely on a fair use defense? And what consequences will the settlement have for orphan works? Are Google and the BRR reaping profits from orphan works illegally or unfairly? And how will the BRR's copyright database help future licensees track down copyright owners? Will the database be openly available and readily updated?
  • International Implications: What impact will the settlement have on foreign works in the U.S.? Will foreign authors be exploited? Individuals using GBS outside the U.S. are not covered by the settlement. Could the settlement's coverage be expanded so that they too will have the increased access the settlement provides? Is there a problem with the disconnect that foreign works available in the U.S. are covered by the settlement and will create revenue for Google and the BRR whereas international users will not have increased access to GBS?
  • Privacy and Security: Will GBS monitor users' reading habits? If not, what security will it provide so that others are not able to acquire this information? If so, what protections will Google provide readers so that the information is not used or disseminated in harmful ways? And will Google censor books in GBS? If so, what measures will be taken to ensure the censorship is limited to appropriate situations (e.g. preventing minors from viewing obscenity)?
  • User Interface: Could the settlement provide more imaginative options for readers, such as better annotation or modification features? Does GBS's scanning fragment works in harmful ways? And how will Google choose which books and which editions to show at the top of GBS searches? Could the settlement have done more to resolve these issues?

Outline of Class Plan

We will be working from the assumption that the current proposed settlement will be approved by the court. Within the framework that this settlement establishes, then, how can we mitigate the concerns that have been raised by libraries and the reading public? We hope to explore the contours of these problems and produce an array of possible solutions.

Class time will be segmented as follows:

  • The class will begin with a short overview of the settlement itself, provided by the student presenters responsible for this class session.
  • Then, each of our three guests will be given ten minutes to present his views on the settlement and begin to answer the question posed above.
  • After our guests have introduced their perspectives on the settlement, we will follow up with questions we have prepared for them. These questions are designed to ensure that all important aspects of the settlement and its expected effects are brought up.
  • Finally, we will open the session to class discussion, guided by the Berkman Question Tool (see Class Participation). The discussion will be focused on normative suggestions for how institutions and readers should respond to the settlement in the wake of its (presumed) approval by the court.

Guests

Three guests will be participating in our class session. Two of them will be joining us live, and one will join us over videoconference. All three guests will be present for the duration of the class session.

Class Participation

We will be using our old friend, the Berkman Question Tool! (The instance name for our class session is IIFGBS, and it is located here.) Before the day of our class session, we will seed the question tool with some questions that we feel are particularly relevant and topical. We encourage everyone to take a look at these questions prior to the class and to make comments, add more questions, and vote up the questions you find most interesting. We will be using this list of questions to guide the class discussion, especially in the final portion of the class.

Readings

Assigned Readings

  • This New York Review of Books piece by Darnton in response to Courant's challenges. (Skip the published version of Courant's letter here, which is less detailed than the one posted on his blog at the link above, and just read Darnton's response at the bottom of the page.)

Optional Readings

  • Skim the settlement agreement, focusing on Article 4 (Economic Terms for Google Book Search) and Article 7 (Obligations and Rights of Participating Libraries).

Further Research

  • The full settlement, including all of its attachments, is available here.

Class Session Recap

The class session was dominated largely by discussion among our three guests, punctuated by questions from the class. What follows below is a thematic summary of the views that were expressed during this discussion. For a more detailed account of the class session (including diachronic notes on the guests' presentations, the live discussion, and the online discussion, as well as lists of our prepared questions), see our edited class notes.

A Private Settlement With Benefits for Everyone

Despite the many concerns (e.g. Darnton's) and settlement amendment proposals (e.g. Grimmelmann's) raised in newspapers and the blogosphere, it is important to remember that this settlement is the (presumptive) result of actual litigation between specific parties. The parties have taken care to incorporate other interests into the negotiation process, but at its core the settlement is designed to satisfy the concerns of publishers and authors that led them to sue Google in the first place. Nonetheless, major libraries had a presence during the negotiations, participating both directly and through Google. The public interest was also taken into account in a variety of ways, culminating in the provision of public access terminals to all public and academic libraries and substantial access provisions for the visually impaired. It should also be noted that copyright holders, as members of the reading public themselves, had a stake in ensuring that the public interest was not ignored during the negotiations. All three guests agreed that the settlement represents a substantial improvement over the status quo for everyone affected. Professor Fisher, however, suggested that "an ideal dream scenario" would be a more useful basis of comparison than the status quo, and once such an ideal was described, the settlement was found to fall short of the ideal.

Publishers and Authors: Upholding the Copyright Regime

From the publishers’ perspective, the litigation was essentially concerned with ensuring that copyright remains a permissions-based regime. When Google decided to start digitizing books on a massive scale, it did so without attempting to secure permission from rights holders; this move was a bold challenge to the existing regime. Publishers were also concerned about potential repercussions from Google’s practice of providing libraries with digital copies of the books scanned from their collections. Library users have special exemptions under the Copyright Act to copy books in a library’s collection, and these exemptions were carved out during an era where photocopying a book, page by page, was the only means of producing such a copy. In today’s digital era, this exemption for library users, coupled with the ease of online library "use" resulting from mass digitization, could lead to much more widespread copying of books in library collections without any revenue being generated for the rights holders. It was these concerns that motivated publishers and authors to bring suit against Google.

Libraries: Concerned About Resource Strain and the Proprietary Database

Although the settlement would result in a net increase in public access to information, there are causes for concern from the perspective of libraries. The provision of a free public access terminal to every library represents a boon to small-town local libraries, but it likely would act as a strain on a larger libraries with more user demand, such as the New York Public Library system. The digital book database that Google is amassing would be an invaluable corpus on which to perform various types of research, and research libraries are rightfully concerned about restrictions on the use of this information. Potential problems also arise from the concentration of this corpus in the hands of a single (or small handful of) powerful private actor(s). The issue of concentrated private power over vast information is particularly alarming because the settlement effectively precludes any potential competitors from offering a comparable service in the near future.

The Settlement Mechanism as a Problem-Solving Tool

Additionally, considerations of institutional competence were raised. The effects of the settlement on copyright doctrine -- and in particular, the fate of orphan works -- are uncertain but have the potential to be far-reaching. Are broad legal issues that so fundamentally affect the public interest best decided through private litigation, or should Congress step in? Is it wise to establish such a rigid, comprehensive system for the digital book realm before we, as a society, have a better idea of how we want to structure our digital world?

Concrete Normative Suggestions

  • Professor Nesson rejected our premise that the settlement will be accepted in its current form. He views the threats posed by this settlement to digital freedom as urgently severe and alarming; thus, he advocates active intervention in the settlement process itself. Indeed, following our class, Professor Nesson moved to intervene as counsel on behalf of professors Lewis Hyde and Harry Lewis and the Open Access Trust, although the request was denied.
  • Professor Zittrain sees the "golden copies" of digitized books that Google provides to participating libraries as holding great potential as instruments of change. He suggests that libraries cooperate in a large-scale creative use of these copies.
  • Professor Fisher emphasized the importance of understanding the problem on the big-picture level of legal theory and social goals before electing a particular course of action by a particular institution.
  • Professor Palfrey suggests three generalized improvements to the settlement that would begin to address many of the concerns that have been raised:
    • Ensure the possibility of a meaningful competitive landscape, such that second-comers are not barred from success.
    • Establish a means by which the public can have a meaningful level of control over the workings of the Book Rights Registry.
    • Create a system of periodic review for the settlement terms. This system would not need to involve periodic wholesale review of the entire settlement by the courts; it could instead merely involve libraries negotiating sunset provisions for individual works with publishers, authors, and other rights holders.

Teacher's Guide

This section is intended to provide guidance to anyone interested in teaching about the Google Book Search settlement in the future. Anyone who has taught on this topic is encouraged to use this wiki page to share additional wisdom gained from the experience. The development of our plan for this class session (including our selection of the Google Books Settlement as a focus within the broader topic of the internet and publication, as well as our decisions about guests, readings, and technology use) is documented on the discussion page.

Evaluation of the Class

Topic Breadth and Time Constraints

The Google Book Settlement is a large, diverse, and contentious topic. Even with our efforts to limit the scope of our discussion by (a) assuming the settlement would be approved and (b) focusing primarily on concerns raised by libraries and the reading public, two hours was enough time only to scratch the surface. One way to help alleviate this difficulty would be to focus the topic even more narrowly (for instance, by choosing to discuss only one or two of the major substantive concerns we have identified), albeit at the cost of precluding discussion on subjects that students might find more interesting. The most significant factor contributing to the palpable time constraint, however, is the sheer complexity of the settlement itself. At over 140 pages plus appendices, the settlement contains a lot of material that must be digested before potential concerns may be formulated and discussed. Band's summary does a good job of explaining the most relevant portions of the settlement, and Grimmelmann's article explains major concerns in the context of the settlement, but neither piece addresses all the relevant details. As a result, significant time was spent with Mr. Cunard answering specific questions on the terms of the settlement before the class could proceed to a more high-level discussion about concerns and potential solutions. Additionally, while each of our guests brought his own unique and insightful perspective to the settlement, we found two hours to be insufficient to appropriately accommodate three guests.

Reading Assignments

The amount of reading we assigned seemed proportional to a two-hour class session, and for the most part, the content of the readings was relevant to our discussion. We sought to provide a descriptive introduction of what the settlement encompasses and what issues it raises, as well as an introduction to the normative implications of the settlement. For descriptive pieces, we chose not to assign the entire settlement given its length and technical legal language (which would have been particularly difficult for the non-law students in our class). Instead, we assigned Band's piece to provide a readable summary of the settlement, some selected examples of provisions in the settlement to give a taste of the actual settlement, and Grimmelmann's primer on the chief issues raised by the settlement. As to normative pieces, we assigned what we felt were the most provocative (and the most discussed) pieces on the settlement: the Darnton and Courant articles, plus Darnton's response to Courant. We felt the readings all served their purpose well during the class (the students seemed relatively well informed of the descriptive aspects of the settlement and its normative implications), with the sole exception being the actual settlement provisions that we assigned but whose language we did not examine closely during class. We felt that it would be useful to have students--especially law students--glimpse what the settlement's actual language looked like, but given our time constraints, we were unable to delve into the particular language and whether it would implement the settlement in the manner that the various commentators expect it to.

Discussion Framework and Presence of Guests

A substantial portion of the class discussion focused on the question of whether the pre-settlement status quo was the appropriate comparison against which the settlement should be judged. One specific alternative “dream scenario” was suggested as a potential foil to the settlement, but it seems that more could have been done with this approach. A more lively discussion of alternatives to the settlement might have emerged had the guests only been present for the first hour of the class. Planning for a portion of class discussion to take place outside the presence of the guests would have prevented monopolization of the conversation by the guests, thereby allowing for more student input and a greater diversity of ideas and proposals. In particular, by removing the “settlement expert” after a specified period of time, the amount of time spent asking detailed descriptive questions about the settlement could have been reduced, allowing more time for an open-ended normative discussion of concerns with the settlement and potential solutions. Of course, if fundamental questions regarding the mechanics of the settlement remained or arose after the guest's departure, an informed discussion of concerns and solutions would become more difficult. In order for a closed class discussion to be productive, those leading the discussion would need to have a firm grasp of the major points of the settlement and be able to act as a fall-back experts on descriptive matters once the guests have left.

Use of Technology

This class was served well by a minimal use of technology. The Berkman Question Tool was useful in providing a means for backchannel discussion while not being too distracting. It also provided fall-back questions to use if there was a lull in the discussion, with student voting (presumably) signaling which questions and topics the class found most interesting. Videoconferencing was an effective way of communicating with a remote speaker. However, the camera placement was not ideal, as it gave the remote guest a view of students' backs.

Suggestions for Future Iterations

Topic Management

Even though our topic was narrowed to addressing the concerns of libraries and the reading public under the assumption that the proposed settlement agreement will take effect, we were able only to scratch the surface of this topic in the two hours allotted. Structuring the class around a more focused topic (perhaps looking at only one or two of the major substantive concerns we identified) would likely allow for a deeper and more substantial discussion. As the topic becomes more focused, however, it may be more difficult to ensure that all students are interested and feel competent to contribute to the discussion. A class specifically addressing antitrust concerns, for example, would likely have alienated many of the non-law students in the class.

Readings Management

Future iterations of this class would do well to use the readings we assigned, as they are comprehensive, yet readable. Furthermore, they are currently the pieces most mentioned by those discussing the settlement, so knowledge of them is crucial for being able to participate in the discussion. The one exception is the actual settlement provisions that we assigned. Careful thought should be put into assigning portions of the settlement agreement itself as reading. Any such settlement excerpts should be selected deliberately with an eye to how they will be used in class. Parsing the language of settlement terms might alienate any non-law students in a class, so the composition of student backgrounds should be taken into account in deciding whether to assign reading from the settlement agreement. Although a glimpse of what the actual settlement looks like can be educational, it could probably be done without. One further caution is that the settlement is part of a constantly changing landscape (see our summary of the settlement's current status), so future classes on the subject will need to make sure they are up to date on any recent developments -- for instance, whether the settlement has been modified, accepted, or rejected. If a major change does occur, then substitute readings may be needed.

Guest Management

Potential ideas of ways to help foster a lively class discussion that is neither dominated by the guests nor stuck on purely descriptive questions include the following:

  • Fewer guests. While each guest had a unique perspective on the settlement, more guests means less time for class discussion and student input. This trade-off is especially problematic given the complexity of the settlement.
  • Pre-recorded material. In order to better control timing and ensure that only the most relevant and interesting guest statements are presented to the class, interviews with the guests and/or prepared statements by them could be recorded and edited. This edited footage would then be shown to the class instead of, or in conjunction with, live guests. Recorded interviews would also enable the presentation of guests who are not available on the day of the class session; for example, our class might have benefited from a recorded interview with Robert Darnton, who was enthusiastic about participating in the discussion but was unable to join us in real time because of a schedule conflict.
  • Time limits on guests. Both to mitigate time constraints and also to lead students away from descriptive questions and toward more normative discussion, the amount of time for which the guests are available could be limited. One possible arrangement would be to have the guests available for introductions and questions during the first hour, and then to have student discussion proceed during the second hour without the guests present.

Technology Management

Technology might be used more effectively in the following ways:

  • Interest polling for planning purposes. The Berkman Question tool, or even an online poll, might have been used earlier in the planning process in order to determine which of the major issues the class found most interesting. This technique could have alleviated some of the problems mentioned under "topic management" above, by allowing for a more narrowly focused class while ensuring that at least a plurality of students would be interested in the discussion.
  • Videoconferencing details. Appropriate camera placement should be considered in advance, in order to avoid awkward arrangements such as we experienced. Videoconferencing also offers a convenient way of limiting the period of time during which the guests are available, whereas ushering out live guests would be more difficult.