TEI: British National Corpus

For inclusion in the TEI Application Page

Information provided by Lou Burnard.
Categories: English (including Old/Middle English); Language Corpora

Revision history:

  • 20 September 2007, Chris Ruotolo: Updated links; converted to TEI P5.
  • 21 January 2002, Stuart Brown: Update as per info from Lou.
  • 18 December 2001, Stuart Brown: Minor edit; URLs checked and OK.
  • 16 June 2000, Frances Condron: Minor changes made to layout.
  • 12 August 1996, WP: Amended source description and added source attribution to text.
  • 25 June 1996, WP: Created file.

  • Host: Oxford University Computing Services
  • URL:


A 100-million-word snapshot of British English, both spoken and written, at the end of the 20th century, containing over 4,000 text extracts selected to represent the full variety of the language.

The corpus is distributed in compressed form as a tar archive, in TEI format with an additional special-purpose index for use with the SARA retrieval program. It can be used under Unix (networked) or Windows (standalone). The DTD is TEI-conformant, with some modifications (see for details).
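As a rough illustration of working with a distribution of this kind, the sketch below inspects a compressed tar archive and counts its files by extension before anything is unpacked. It uses only Python's standard tarfile module. The archive name and the file extensions in the usage note are hypothetical, since the entry above does not specify them.

```python
import tarfile
from collections import Counter
from pathlib import PurePosixPath


def summarize_archive(archive_path):
    """Count the regular files in a tar archive, grouped by file extension."""
    counts = Counter()
    # Mode "r:*" lets tarfile detect the compression (gzip, bzip2, xz, or none).
    with tarfile.open(archive_path, mode="r:*") as archive:
        for member in archive:
            if member.isfile():
                # Files without an extension are grouped under "(none)".
                suffix = PurePosixPath(member.name).suffix or "(none)"
                counts[suffix] += 1
    return counts
```

For example, summarize_archive("corpus.tar.gz") (a hypothetical file name) would return a mapping from extension to file count, which gives a quick check that the text files and the index files both arrived intact.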


  • Extensive documentation on website.
  • Many published articles; see .
  • Detailed user's manual distributed with the corpus in TEI, PDF, or HTML; browsable at .


– Lou Burnard


Oxford University Computing Services
13 Banbury Road
Oxford OX2 6NN
UK
Tel. +44 (0)1865 273 221
Fax +44 (0)1865 273 275
Enquiries: natcorp@oucs.ox.ac.uk
Errors: bugs@natcorp.ox.ac.uk