Dataset | OpenDataMonitor

CKAN

Odm ID	fa524a0a-8e2a-4c6a-b1ec-20ba694d5057
Title	Brown Corpus in RDF/NIF
Notes	RDF version of the Brown Corpus (W. N. Francis, H. Kucera; Brown University; 1979). The corpus comprises 1,014,312 words in 500 documents, taken from newspaper texts on diverse topics, non-fiction and fiction books, and government documents. The original corpus contains manually annotated sentence and token boundaries as well as word class annotations (POS tags; inflectional morphemes such as noun plural, verb tense and adjective comparison; and special tags for foreign words and proper nouns). The converted corpus contains the complete texts, reconstructed from the TEI/XML version of the Brown Corpus. Word classes were linked via OLiA to ontological categories for aggregated querying.
Author	Martin Brümmer
Author Email	bruemmer@informatik.uni-leipzig.de
Catalogue Url	http://datahub.io/
Dataset Url	http://thedatahub.org/dataset/brown-corpus-in-rdf-nif
Metadata Updated	2015-09-14 18:13:41
Tags
Date Released
Date Updated
Update Frequency
Organisation	AKSW
Country
State
Platform	ckan
Language	en
Version	(not set)