Groklaw - Interview with Margaret Boribon of Copiepresse About Google.be, by Sean Daly

	When you want to know more...

Home
Archives
Site Map
Search
About Groklaw
Awards
Legal Research
Timelines

ApplevSamsung
ApplevSamsung p.2
ArchiveExplorer
Autozone
Bilski
Cases
Cast: Lawyers
Comes v. MS
Contracts/Documents
Courts
DRM
Gordon v MS
GPL
Grokdoc
HTML How To
IPI v RH
IV v. Google
Legal Docs
Lodsys
MS Litigations
MSvB&N
News Picks
Novell v. MS
Novell-MS Deal
ODF/OOXML
OOXML Appeals
OraclevGoogle
Patents
ProjectMonterey
Psystar
Quote Database
Red Hat v SCO
Salus Book
SCEA v Hotz
SCO Appeals
SCO Bankruptcy
SCO Financials
SCO Overview
SCO v IBM
SCO v Novell
SCO:Soup2Nuts
SCOsource
Sean Daly
Software Patents
Switch to Linux
Transcripts
Unix Books

Gear

Groklaw Gear

You won't find me on Facebook

Donate

No Legal Advice

The information on Groklaw is not intended to constitute legal advice. While Mark is a lawyer and he has asked other lawyers and law students to contribute articles, all of these articles are offered to help educate, not to provide specific legal advice. They are not your lawyers.

Here's Groklaw's comments policy.

What's New

STORIES
No new stories

COMMENTS last 48 hrs
No new comments

Sponsors

Hosting:

On servers donated to ibiblio by AMD.

Webmaster

Interview with Margaret Boribon of Copiepresse About Google.be, by Sean Daly - Updated

Wednesday, October 11 2006 @ 10:28 AM EDT

Groklaw's Sean Daly had an opportunity to interview Margaret Boribon of Copiepresse in Belgium about the recent litigation against Google.be. We present the interview in both English and French. It was conducted in French, and the translation was sent to Mme. Boribon, for her approval. Here is the Ogg audio, if you prefer to listen.

As you are aware, recently Copiepresse in Belgium sued Google for copyright infringement on behalf of the authors it represents regarding Google News using headlines and a sentence or so of their material, and on September 5th in an action in which Google had not yet appeared for reasons that are not yet proven to my satisfaction, since we have not yet heard from Google on this point, the court issued an order finding Google guilty. Here's the order [PDF], in French first and then in English. If you don't like PDFs, you can find it here in English text and another more comprehensive English version is here. Now Belgian photographers have joined the class action. And the original editors are not yet satisfied. One thing that is bothering them is cache. Apparently they do not accept robot.txt files as a solution.

[Update: 4:30 AM EDT Thurs. - Now Copiepresse is reported to be threatening MSN. The article, in French, says Microsoft is being more cooperative than Google. Also, Pressbanking, which markets Belgian articles, has asked to join Copiepresse in its action. They say they've been harmed by cache, which freely makes available materials they sell. The article has one funny bit. Even if you don't read French, you'll catch it. Look for the name of the multimedia group that has joined Copiepress:

La Sofam (droits d'auteur des photographes), la SAJ (droits d'auteur de nombreux journalistes) et la Scam (droits d'auteur du multimédia) avaient déjà rejoint Copiepresse. Un jugement sur le fond est attendu le 24 novembre. Une information du journal « L'Echo ».

Not everything translates well. Finally, here's a US case where Google's cache was found to be legal, because it was deemed fair use, and because there is an easy opt-out mechanism the plaintiff chose not to use, among other reasons. Here's the ruling [PDF], in which the judge wrote that the plaintiff “attempted to manufacture a claim for copyright infringement against Google in hopes of making money from Google’s standard [caching] practice”.]

To understand the issues, you may find it helpful to read this article, in which The World Association of Newspapers discusses its unhappiness with search engines. The organization announced in January it would launch an offensive against search engines, mentioning Google by name and indicating an interest in receiving money for Google News' using their articles.

Google did not appear in the action prior to the order issuing, but now has decided to fight first in a preliminary appeal, which was unsuccessful. There will be a full hearing on November 24.

The court relied upon an expert, Luc Golvers, who is president of CLUSIB, Club de la Sécurité informatique belge. The court ruled, based on his expert report:

Considering that his research has led him to prove that, while an article is still online on the site of the Belgian publisher, Google redirects directly, via the underlying hyperlinks, to the page where the article can be found, but as soon as the article can no longer be seen on the site of the Belgian newspaper publisher, it is possible to obtain the contents of it via the “Cached” hyperlink which then goes back to the contents of the article that Google has registered in the “cached” memory of the gigantic data base which Google keeps within its enormous number of servers;...
Considering, finally, that it is deducted from the expert’s report that:
- the way in which the Google News presently operates cause the publishers of the daily press to lose control of their web sites and their contents (of the tests conducted by the expert which show the effects of the withdrawal of an article, pages 42 to 67 of the report);

You can see a picture of Golvers in an article highlighting him on the subject of security on Microsoft's website in Belgium, coincidentally enough.

A Brief Robots.txt Tutorial

Here are the instructions for being removed from Google's search engine. Please notice that you have the following choices:

Remove your entire website
Remove part of your website
Remove snippets
Remove cached pages
Remove an outdated link
Remove an image from Google Image Search
Remove a blog from Blog Search
Remove a RSS or Atom feed (i.e., block Feedfetcher)
Remove transcoded pages

So, if a site doesn't wish its material to be found in search engines' cache, here's all it has to do. Place the following in the header of the HTML of the page or pages it doesn't want cached:

META NAME="ROBOTS" CONTENT="NOARCHIVE"

The next time Google crawls the site, it will honor that instruction. (You need to put the words inside of left and right arrows, but if I do that here, you won't be able to see the words, which is the purpose of the arrows. It's instructions for robots, not for people to read.) If you can't wait for the next time Google stops by, there's an automatic URL removal system.

If you wish not to appear in Google at all, here's all you have to do --create a file that says the following:

User-agent: * Disallow: /

Put it on your root server in one place. You don't even have to put it on every page. In fact that would confuse the bots. It looks for a robots.txt file on every site. Groklaw's, for example, would be found at http://www.groklaw.net/robots.txt. Here are simple instructions.

You can tell the search engine, all search engines, it can't index your content. You can even tell only Google, while letting in MSN or Yahoo or all other search engines. Instead of the User-agent: * say User-agent: Googlebot, if that is your wish.

Because Google's crawler checks for robot.txt first, before crawling a site, that, to me, looks like it is seeking permission. Because if the robot.txt file tells it not to scoop up content, or to extract only certain content, it will obey. So sites need not lose control of their content. If one's goal is money, then robot.txt does not solve one's problem. And it's important to point out that this isn't Google's personal invention or private system. It's a web standard, a specification that webmasters have followed since 1994.

Regarding the issue of jurisdiction, while in the interview she asserts world jurisdiction for Belgian courts, I think that is unlikely to be upheld. Ask yourself this: if any country's courts can assert jurisdiction over Google because it can be accessed over the Internet, and content must be removed that is accessible not only on the regional version of Google but on all of Google, that would mean that China could assert world jurisdiction, and so could Iran and Africa and South America, anywhere. How could any business manage to exist with everyone able to tell it how to run its business and how would it comply with conflicting laws in various places? Jurisdiction is a concept that stands for an orderliness in the law, that you ought to be able to predict where you can be sued, so you can plan your activities with clarity, and that there should be no undue encroachment. Here's an article that explains it. Anyway, France already tried that with Yahoo, and what it got was limited to the French site. Any other resolution threatens not only Google but the Internet itself. Granted that the Internet raised new issues regarding jurisdiction, but the fundamental idea of jurisdiction is fairness. That's why if I do something in New York City that would be against the law in Belgium but not here, they can't normally arrest me, unless there is a treaty in effect that says it can. How would you like it if they could? Would you like to be subject to all the laws of the entire world simultaneously?

However, there are other issues, outside of the litigation, that Mme. Boribon raises that are not as easily answered. I think you will find the interview of interest and I thank her very much for the opportunity to hear her side of the debate. Google was offered the opportunity to comment but declined to do so.

*********************************

September 28th, 2006, 12:00PM Interview
Copiepresse offices, Brussels, Belgium
Interviewer: Sean Daly