Menu

A Sentiment-labelled Corpus of Hansard Parliamentary Debate Speeches

calendar icon May 30, 2018 789 views
split view icon
video icon
presentation icon
video with chapters icon
video thumbnail
Pause
Mute
speed icon
speed icon
0.25
0.5
0.75
1
1.25
1.5
1.75
2

Hansard transcripts provide access to the opinions of MPs on many important issues, but are rather difficult for people to effectively process. Existing corpora for sentiment analysis in Hansard debates rely on speakers' votes as sentiment polarity labels, but these votes are known to be constrained by speakers' party affiliations. Over two rounds of manual annotation, we develop an annotation scheme and create a novel corpus designed for use in the evaluation of automatic sentiment analysis systems using both automatically and manually applied speech sentiment polarity class labels. Following observations of the effects on speech sentiment of differing sentiment polarities in debate motions (proposals), we also apply sentiment labels to the debate motions. We find that humans are able to reach high agreement in identifying sentiment polarity in these debates, and also that manually applied and automatically retrieved class labels differ somewhat, suggesting that speech content does not always reflect the voting behaviour of Members of the Parliament.

RELATED CATEGORIES

MORE VIDEOS FROM THE SAME CATEGORIES

Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International license.