Distributed Proofreaders

progetto di revisione web-based del progetto Gutenberg

Distributed Proofreaders, solitamente abbreviato in DP o PGDP, è un progetto con lo scopo di sviluppare eBook per il Project Gutenberg, permettendo a più persone di lavorare insieme alla correzione degli errori commessi dalle scansioni OCR. Si può tradurre in italiano come correttori distribuiti.

Storia

Distributed Proofreaders è stato fondato da Charles Frank nel 2000 come sito indipendente per aiutare il Project Gutenberg. DP è divenuto sito ufficiale di Project Gutenberg nel 2002.

L'8 novembre 2002 Distributed Proofreaders è stato pubblicizzato su Slashdot,[1][2] e più di 4mila nuovi membri si sono iscritti in un solo giorno, aumentando significativamente il numero di correttori e sviluppatori software; ciò portò a un miglioramento qualitativo e quantitativo degli e-book. Nell'ottobre 2004 Distributed Proofreaders ha raggiunto i 5.000 e-text inviati al Project Gutenberg, e nel marzo 2007 ha raddoppiato il numero. I 15.000 e-book sono stati raggiunti nel maggio 2009. Quasi la metà delle opere attualmente presenti su Project Gutenberg sono state create dai volontari di DP.

On 31 July, 2006, the Distributed Proofreaders Foundation was formed to provide Distributed Proofreaders with its own legal entity and not-for-profit status. IRS approval of section 501(c)(3) status was granted retroactive to 7 April, 2006.

Proofreading process

Public ___domain works, typically books with expired copyright, are scanned by volunteers or culled from digitalization projects, and the images are run through optical character recognition (OCR) software. Since OCR software is far from perfect, often a large number of errors appear in the resulting text. To correct them, pages are made available to volunteers via the Internet; the original page image and the recognized text appear side by side. This process thereby distributes the time-consuming error-correction process, akin to distributed computing.

Each page is proofread and formatted many times, and then a post-processor combines the pages and prepares the text for uploading to Project Gutenberg.

Besides custom software created to support the project, DP also runs a forum and a wiki for project coordinators and participants.

DP Europe

In January 2004, Distributed Proofreaders Europe started, hosted by Project Rastko. This site has the ability to process text in Unicode UTF-8 encoding. Books proofread are centered mainly on European culture, with a large proportion of non-English texts including Hebrew, Arabic, Urdu and many others. Template:As of, DP Europe had produced over 480 e-texts.

The original DP is sometimes referred to as "DP International" by members of DP Europe. However, DP servers are located in the United States, and therefore works must be cleared by Project Gutenberg as being in the public ___domain according to U.S. copyright law before they can be proofread and eventually published at DP.

DP Canada

On 1 December 2007, Distributed Proofreaders Canada launched to support the production of e-books for Project Gutenberg Canada and take advantage of shorter Canadian copyright terms. Although it was established by members of the original Distributed Proofreaders site, it is a separate entity. All of its projects are posted to Project Gutenberg Canada, which launched on Canada Day 2007.

In addition to preserving Canadiana, DP Canada is notable because it is the first major effort to take advantage of Canada's copyright laws which may allow more works to be preserved. Like copyright law in many other countries, Canada has a "life plus 50" copyright term. This means that works by authors who died more than fifty years ago may be preserved in Canada, whereas in other parts of the world those works may not be distributed because they are still copyright.

Notable authors whose works may be preserved in Canada but not other parts of the world include A. A. Milne, Walter de la Mare, Sheila Kaye-Smith and Amy Carmichael.

Numero di e-book

Numero e-book Data e-text Link al Project Gutenberg
1,000 19 febbraio 2002
2,000th 3 Sep 2003 Hamlet -- the 'Bad Quarto', William Shakespeare etext 9077
3,000th 14 Jan 2004 The Anatomy of Melancholy, Robert Burton etext 10800
4,000th 6 Apr 2004 Aventures du Capitaine Hatteras, Jules Verne etext 11927
5,000th 24 Aug 2004 A short biographical dictionary of English literature, John William Cousin etext 13240
6,000th 2 Feb 2005 The Journal of Sir Walter Scott, Sir Walter Scott etext 14860
7,000th 23 Jun 2005
8,000th 8 Feb 2006 The Suppression of the African slave-trade to the United States of America, 1638-1870, W. E. B. Dubois etext 19179
9,000th 8 Sep 2006
10,000th 9 Mar 2007
11,000th 12 Sep 2007 Northern Nut Growers Association Thirty-Fourth Annual Report 1943, Northern Nut Growers Association etext 22587
12,000th 26 Jan 2008 Zur Psychopathologie des Alltagslebens, Sigmund Freud etext 24429
13,000th 24 Jun 2008 A world of girls, L. T. Meade etext 25870
14,000th 1 Dec 2008 The Art of Stage Dancing, Ned Wayburn etext 27367
15,000th 12 May 2009 Philosophical Transactions of the Royal Society - Vol 1 - 1666, Various. Henry Oldenburg (editor) etext 28758


10,000th E-book

On 9 March 2007, Distributed Proofreaders announced completing more than 10,000 titles. In celebration, a block of 15 titles was published:

See also

References