British National Corpus


A 100 million word snapshot of British English, both spoken and written, at the end of the 20th century, containing over 4,000 text extracts selected to represent the full variety of the language.

The corpus is distributed in compressed form as a tar archive, in TEI format with an additional special-purpose index for use with the SARA retrieval program. It can be used under Unix (networked) or Windows (standalone). The DTD is TEI-conformant, with some modifications (see for details).



– Lou Burnard


British National Corpus
Oxford University Computing Services
13 Banbury Road
Oxford OX2 6NN
Tel. +44 (0)1865 273 221
Fax +44 (0)1865 273 275

Copyright TEI Consortium. Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution 3.0 Unported license and a BSD 2-Clause license.
Last recorded change to this page: 2007-09-20  •  For corrections or updates, contact webmaster AT