Data and Tools
Over the past decades, specialized data (text corpora), tools, and platforms for research in legal linguistics have been developed worldwide. SOULL documents these data sources and platforms to support international networking and make them easier to use.
Overview of existing data collections and corpora of legal language worldwide, in various sizes, languages, and text types.
Computer Assisted Legal Linguistics Laboratory (CAL²Lab)
A language lab for German legal language: the platform offers access to statistical data and evaluations of the 199,514 most frequent lemmas. The calculations are based on the Corpus of German Law, which comprises 379,802 texts and approx. 1.2 billion tokens. Both the platform and the data are available in German only.