Data and Tools

Over the past decades, specialized data (text corpora), tools, and platforms have been developed worldwide for research in legal linguistics. SOULL documents these data sources and platforms to support international networking and make them easier to use.

Data Collections

An overview of existing data collections and corpora of legal language worldwide, covering various sizes, languages, and text types.
»Read more

Computer Assisted Legal Linguistics Laboratory (CAL²Lab)

A language lab for German legal language. The platform offers access to statistical data and analyses of the 199,514 most frequent lemmas. The calculations are based on the Corpus of German Law, which comprises 379,802 texts and approx. 1.2 billion tokens. Both the platform and the data are available in German only.
»Open CAL²Lab in a new window