Repository logo
 

SPECTRa-T / TheOREM Test Corpus


No Thumbnail Available

Type

Dataset

Change log

Authors

Day, Nick 
Townsend, Joseph A 

Description

These theses were used as test documents in the JISC sponsored SPECTRa-T and TheOREM projects, the former looking at text mining from thesis documents, the latter researching techniques for describing the structure of theses in the OAI ORE standard.

Version

Software / Usage instructions

Zipped archive containing Microsoft OOXML files (docx), SciXML files compatible with OSCAR, and extracted binary plugin files

Keywords

theses, jisc, nlp, semantic web

Publisher

University of Cambridge
Sponsorship
JISC SPECTRa-T, JISC TheOREM