Description

The characters a and b are class labels for the classification task. One is signal and the other is background, but which is which should not matter here. The task is to decide which feature set is "best" (e.g. which requires the least training data to reach a given performance). Note that the units are poorly chosen, so most of these features have absurd magnitudes (of order 10^32). These scales should be removed by, at the very least, a linear transformation.
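One possible linear transformation is per-feature standardization (subtract the mean, divide by the standard deviation). The sketch below is only an illustration under assumptions: the function name standardize is hypothetical, the inputs are assumed to be 2-D arrays with one row per event and one column per feature, and the numbers are made up rather than taken from the files.

```python
import numpy as np

def standardize(a, b):
    """Rescale each feature column to zero mean and unit variance.

    The shift and scale are computed from the pooled a+b sample, so both
    classes receive exactly the same linear transformation.
    Assumes 2-D inputs: rows are events, columns are features.
    """
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    pooled = np.vstack([a, b])
    mean = pooled.mean(axis=0)
    std = pooled.std(axis=0)
    std[std == 0] = 1.0  # leave constant columns unscaled instead of dividing by zero
    return (a - mean) / std, (b - mean) / std

# Made-up values for illustration only (not from the data files).
a = np.array([[1e32, 2.0], [3e32, 4.0]])
b = np.array([[5e32, 6.0], [7e32, 8.0]])
a_scaled, b_scaled = standardize(a, b)
```

Computing the shift and scale from the pooled sample keeps the relative positions of the two classes intact, which a separate rescaling of each class would not.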

Feature set 1

aSet1.txt.gz
bSet1.txt.gz

Feature set 2

aSet2.txt.gz
bSet2.txt.gz

Feature set 3

aSet3.txt.gz
bSet3.txt.gz
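
The files listed above could be read along the following lines. This is a sketch under a loud assumption: the file layout is not stated here, so the code assumes each file is gzip-compressed text with one event per line and whitespace-separated numeric features. The function name load_feature_set and the 0/1 label assignment (a as 0, b as 1) are hypothetical choices made for illustration.

```python
import os
import numpy as np

def load_feature_set(directory, set_number):
    """Load aSetN.txt.gz and bSetN.txt.gz into a feature matrix and label vector.

    ASSUMPTION: each file holds whitespace-separated numbers, one event per row.
    np.loadtxt decompresses .gz files transparently based on the extension.
    """
    a_path = os.path.join(directory, f"aSet{set_number}.txt.gz")
    b_path = os.path.join(directory, f"bSet{set_number}.txt.gz")
    a = np.loadtxt(a_path, ndmin=2)  # ndmin=2 keeps a single-row file 2-D
    b = np.loadtxt(b_path, ndmin=2)
    X = np.vstack([a, b])
    # Arbitrary label choice: class a is 0, class b is 1.
    y = np.concatenate([np.zeros(len(a), dtype=int), np.ones(len(b), dtype=int)])
    return X, y
```

For example, load_feature_set(".", 1) would read aSet1.txt.gz and bSet1.txt.gz from the current directory, if the files follow the assumed layout.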