Repository logo
 

Cambridge Law Corpus


No Thumbnail Available

Type

Dataset

Change log

Authors

Östling, Andreas 
Sargeant, Holli 
Xie, Huiyuan 
Bull, Ludwig 
Alexander, Terenin 

Description

We introduce the Cambridge Law Corpus (CLC), a corpus for legal AI research. It consists of over 250,000 court cases from the UK. Most cases are from the 21st century, but the corpus includes cases as old as the 16th century. Together with the corpus, we provide annotations on case outcomes for 638 cases, done by legal experts.

This dataset consists of 15 selected cases from the CLC. The selected cases are publicly available under the Open Justice License.

Version

Software / Usage instructions

We have focused on well-known cases across various courts and legal issues. Due to ethical considerations, access to the full CLC dataset will be restricted for research use only. Researchers will be able to apply for the full CLC via the detailed instructions on our project website: https://www.cst.cam.ac.uk/research/srg/projects/law.

Keywords

artificial intelligence, dataset, law

Publisher

Rights

Open Justice License
Sponsorship
ESRC (ES/T006315/1)
UKRI-JST