  |
LDC - Linguistic Data Consortium - http://ldc.upenn.edu/
The Linguistic Data Consortium (LDC) creates, collects and distributes speech and text databases, annotated corpora, treebanks, lexicons and other linguistic resources for research, education and development. |
  |
American National Corpus - http://americannationalcorpus.org/
Information about this freely available database of American English. |
  |
Centre for English Corpus Linguistics - http://juppiter.fltr.ucl.ac.be/FLTR/GERM/ETAN/CECL/cecl.html
At the Catholic University of Louvain (UCL), this institute focuses on cross-linguistic corpora and learner corpora. Research, events, staff, publications. |
  |
ELRA catalog of language resources - http://catalog.elra.info/
Language resources and evaluation packages for Human Language Technology (HLT), available from the European Language Resources Association (ELRA). Distribution is handled by ELDA, ELRA's operational body. |
  |
International Journal of Corpus Linguistics - http://www.benjamins.com/cgi-bin/t_seriesview.cgi?series=IJCL
A journal published twice a year, presenting articles from linguists, lexicographers and language engineers. Contents, abstracts, submission information. |
  |
National Corpus of Polish - http://nkjp.pl/
The National Corpus of Polish is a large, balanced, linguistically annotated and publicly available corpus of Polish. |
  |
SIGANN: ACL Special Interest Group for Annotation - http://www.cs.vassar.edu/sigann/
A subgroup of the Association for Computational Linguistics (ACL), this group is concerned with all aspects of linguistic annotation of language resources (linguistic corpora), especially the advancement of interoperability. Sponsors the annual Linguistic Annotation Workshop (LAW). |
  |
Hungarian National Corpus - http://corpus.nytud.hu/mnsz/index_eng.html
More than 150 million words of Hungarian, modeling the language as used in the 1990s. Free and extensive query system. [Hungarian, English] |
  |
Free online parallel corpus - http://korpus.hiztegia.org
This website allows you to search online for words in Basque, Polish, English, French or Spanish, and displays results in all these languages, aligned by paragraph. |
  |
Corpus Encoding Standard - http://www.cs.vassar.edu/CES/
Application of SGML to corpus encoding. Covers the standard and projects currently using it. |