Since its inception, the Google Book Search initiative has caused tremendous controversy within the publishing industry. While the recent settlement between Google and the publishers may have allayed some of the key concerns, urgent questions remain on how the industry can effectively capitalize on the agreement. This closing keynote session was an interview between MarkLogic CEO Dave Kellogg and Daniel J. Clancy, PhD, engineering director for Google Book Search Project.
Clancy talked about the three types of books they included in the service, books that are in print and protected by copyright, out of print and protected by copyright, or it’s a book no longer covered by copyright.
The Google Book Search project includes 10 millions scanned books, averaging 330 pages per book. The index is about half English language books.
Interestingly, Spanish language books are accessed three times more often than English language books.
Publishers don’t always disagree with the approach Google is taking. In fact, some like it a lot because their inclusion in the Book Search project increases sales for legacy books. Google provides links to where users can purchase the books included in the index.
According to the settlement, there are a variety of ways that rights holders (folks who have copyright on books scanned and included in the index) can choose to participate. Two such options are to remain in the index and get some cut of the sales (yes, Google sells access to book content) or they can ask to be removed altogether, and Google will honor their request.
When asked by an audience member how Google is handling multiple editions of the same book, Clancy said they treat the books like a library would. So, yes, they scan and index multiple editions and versions in different languages (translations).
While book publisher rights are often the focus of discussions about digital books, an audience member asked what would happen to passing books on to used book stores?
“Certain aspects of the used book market are based on scarcity,” said Clancy. “Scarcity goes away in a digital space.”
On the subject of the perception there’s a competition beteween book archivists and Google Book Search, Clancy said, “What happens in 20 or 50 years from now when Google isn’t here?” It’s important that multiple parties are scanning and making books available 100 years from now.”
It was a very interesting conversation that provided much food for thought.
[Note: Clancy was not saying Google will not be here in 20-50 years, but that the world is a very uncertain place and you never know what the future will hold.]
To learn more about the Google Book Search class action lawsuit (and to ensure I didn’t miscommunicate the details), visit the Settlement Agreement website.
Learn more about the opportunites that result from the setlement in this white paper from MarkLogic.
Discover which libraries are helping Google scan books.


