European
Data Catalogues
Dataset

CKAN

Sub menu


Brown Corpus in RDF/NIF

Dataset Profile

Odm ID
fa524a0a-8e2a-4c6a-b1ec-20ba694d5057
Title
Brown Corpus in RDF/NIF
Notes
RDF version of the Brown Corpus (W. N. Francis, H. Kucera; Brown University; 1979). 1,014,312 words in 500 documents, taken from newspapers texts on diverse topics, non-fiction and fiction books as well as government documents.

Original corpus contains manually annotated sentence and token boundaries as well as word class annotations(such as POS, inflectional morphemes, such as noun plural, verb tense and adjective comparison and special tags for foreign words and proper nouns).

Converted corpus contains complete texts reconstructed from TEI/XML version of the Brown corpus. Word classes where linked via OLiA to ontological categories for aggregated querying.
Author
Martin Brümmer
Author Email
Catalogue Url
Dataset Url
Metadata Updated
2015-09-14 18:13:41
Tags
Date Released
Date Updated
Update Frequency
Organisation
AKSW
Country
State
Platform
ckan
Language
en
Version
(not set)