Cambridge Law Corpus
Repository URI
Repository DOI
Change log
Authors
Description
We introduce the Cambridge Law Corpus (CLC), a corpus for legal AI research. It consists of over 250,000 court cases from the UK. Most cases are from the 21st century, but the corpus includes cases dating back to the 16th century. Alongside the corpus, we provide case-outcome annotations for 638 cases, produced by legal experts.
This dataset consists of 15 selected cases from the CLC. The selected cases are publicly available under the Open Justice License.
Version
Software / Usage instructions
The selection focuses on well-known cases across various courts and legal issues. Due to ethical considerations, access to the full CLC will be restricted to research use only. Researchers will be able to apply for the full CLC by following the detailed instructions on our project website: https://www.cst.cam.ac.uk/research/srg/projects/law.
Keywords
artificial intelligence, dataset, law
Publisher
Rights
Open Justice License
Sponsorship
ESRC (ES/T006315/1)
UKRI-JST