Using Data Packages to Ship Annotated Corpora of Parliamentary Protocols: The GermaParl R Package
This paper proposes disseminating linguistically annotated and indexed versions of corpora of parliamentary debates as R data packages. The GermaParl Corpus of Parliamentary Protocols serves as an example to illustrate the advantages of this approach. Data packages offer established routines for versioning and documenting data and for ensuring reproducibility. They may also include further annotation layers, along with functionality to exploit these additional annotations.