Data Collections

Overview about existing data collections and copora of legal language worldwide in various sizes, languages, and text types.
»Read more

Computer Assisted Legal Linguistics Laboratory (CAL²Lab)

A language lab about German legal language: The platform offers access to statistical data and evaluations of the 199.514 most frequent lemmas. The calculation is based on the Corpus of German Law with 379.802 texts and approx. 1.2 billion tokens. Both platform and data are in German only.
»Open Cal²Lab in a new window