Corpus linguistics is a rapidly growing methodology that uses the statistical analysis of large collections of written or spoken data (corpora) to investigate linguistic phenomena.