UKParl: A Semantified and Topically Organized Corpus of Political Speeches
We present a dataset created from the archived Hansard House of Commons debates of the UK Parliament (2013-2016). The resource includes fine-grained topic annotations at the document level and is enriched with additional semantic information, such as entity links. We assess the quality and usefulness of the corpus with two benchmarks: topic classification and ranking.