Information About Robot Access to our Pages

The Project Gutenberg website is intended for human users only. Any perceived use of automated tools to access the Project Gutenberg website will result in a temporary or permanent block of your IP address. The only exceptions to this rule are below.

Contents

How to Get All Ebook Files
How to Get Certain Ebook Files
How to Mirror Project Gutenberg
How to Get Catalog Data

How to Get all Ebook Files

The best way to have a local up-to-date copy of all files is to setup a private mirror: See: the Mirroring How-To

How to Get Certain Ebook files

wget is free software and available for Linux, Windows, and Mac OS X at www.gnu.org/software/wget/

This is an example of how to get files using wget:

wget -w 2 -m -H "http://www.gutenberg.org/robot/harvest?filetypes[]=html"

Replace html with the file type you are interested in.

html
txt
epub.images
epub.noimages
kindle.images
kindle.noimages
mp3

If you want only files in a given language say:

wget -w 2 -m -H "http://www.gutenberg.org/robot/harvest?filetypes[]=html&langs[]=de"

Replace ‘de’ with the ISO language code you are interested in. Tip: you can learn the language code of any language in the Project Gutenberg catalog by looking at the status window of your browser while moving your cursor over the language at this page

How to Mirror Project Gutenberg

If you want to setup a mirror of Project Gutenberg, read the Mirroring How-To

How to Get Catalog Data

You can extract the whole Project Gutenberg catalog data from the Project Gutenberg catalog in machine-readable format. The catalog data are granted to the public domain.

Files change frequently

Project Gutenberg publishes hundreds of new eBooks every month, and we also fix hundreds of books when issues like typos are reported. In addition, the ‘generated’ formats (under the ‘cache’ top-level directory) are re-built monthly and sometimes have changes due to improvements in the software that builds them.

When you save a copy of a Project Gutenberg eBook, consider periodically checking for any changes.