Ad verba per numeros
Tuesday, April 21, 2009, 06:02 PM
The WordSimilarity-353 Test Collection contains two sets of English word pairs along with human-assigned similarity judgements. The collection can be used to train and/or test computer algorithms implementing semantic similarity measures (i.e., algorithms that numerically estimate similarity of natural language words).
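A common way to test a similarity measure against human judgements of this kind is to compare the two rankings with Spearman's rank correlation. The sketch below shows that idea under stated assumptions: the word pairs and scores are invented placeholders rather than entries from the collection, `toy_similarity` is a hypothetical stand-in for a real semantic measure, and the function names are mine, not part of any published tooling.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


def spearman(xs, ys):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


def evaluate(scored_pairs, similarity):
    """Correlate a similarity function with human scores.

    scored_pairs: iterable of (word1, word2, human_score) tuples.
    similarity:   callable taking two words and returning a number.
    """
    scored_pairs = list(scored_pairs)
    human = [score for _, _, score in scored_pairs]
    predicted = [similarity(a, b) for a, b, _ in scored_pairs]
    return spearman(human, predicted)


def toy_similarity(a, b):
    """Hypothetical stand-in: Jaccard overlap of the letters in each word."""
    sa, sb = set(a), set(b)
    return len(sa & sb) / len(sa | sb)


if __name__ == "__main__":
    # Invented placeholder data, NOT taken from WordSimilarity-353.
    toy_pairs = [
        ("car", "automobile", 9.0),
        ("coast", "shore", 8.5),
        ("cup", "coffee", 6.0),
        ("noon", "string", 0.5),
    ]
    print(evaluate(toy_pairs, toy_similarity))
```

To run this on the real collection, the placeholder list would be replaced with the pairs and scores read from the distributed files, and `toy_similarity` with the measure under test. Rank correlation is used here because it depends only on the ordering of the scores, not on their scale.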
Eneko Agirre et al. proposed splitting the WordSimilarity-353 collection into two datasets, one focused on measuring similarity and the other on relatedness.