Menu

On the Stratification of Multi-Label Data

calendar icon Oct 3, 2011 4763 views
split view icon
video icon
presentation icon
video with chapters icon
video thumbnail
Pause
Mute
speed icon
speed icon
0.25
0.5
0.75
1
1.25
1.5
1.75
2

Strati ed sampling is a sampling method that takes into account the existence of disjoint groups within a population and produces samples where the proportion of these groups is maintained. In single-label classi cation tasks, groups are di erentiated based on the value of the target variable. In multi-label learning tasks, however, where there are multiple target variables, it is not clear how strati ed sampling could/should be performed. This paper investigates strati cation in the multi-label data context. It considers two strati cation methods for multi-label data and empirically compares them along with random sampling on a number of datasets and based on a number of evaluation criteria. The results reveal some interesting conclusions with respect to the utility of each method for particular types of multi-label datasets.

RELATED CATEGORIES

MORE VIDEOS FROM THE EVENT

MORE VIDEOS FROM THE SAME CATEGORIES

Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International license.