A Survey of Binary Similarity and Distance Measures
Seung-Seok Choi, Sung-Hyuk Cha, Charles C. Tappert
The binary feature vector is one of the most common
representations of patterns, and similarity and distance
measures play a critical role in many problems such as
clustering and classification. Ever since Jaccard
proposed a similarity measure to classify ecological
species in 1901, numerous binary similarity and distance
measures have been proposed in various fields. Applying
appropriate measures results in more accurate data
analysis. Nevertheless, few comprehensive surveys
on binary measures have been conducted. Hence we
collected 76 binary similarity and distance measures used
over the last century and revealed their correlations through
the hierarchical clustering technique.
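As a concrete illustration of the kind of measure surveyed, the sketch below computes the Jaccard similarity of two binary feature vectors, along with the distance derived from it. It assumes the commonly used definition (features present in both vectors divided by features present in at least one). The function names and the handling of two all-zero vectors are choices made for this example, not taken from the paper.

```python
def jaccard_similarity(x, y):
    """Jaccard similarity of two equal-length binary (0/1) sequences."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    # Positions where both vectors have a 1 (joint presence).
    both = sum(1 for xi, yi in zip(x, y) if xi and yi)
    # Positions where at least one vector has a 1.
    either = sum(1 for xi, yi in zip(x, y) if xi or yi)
    # Assumption for this sketch: two all-zero vectors count as identical.
    return 1.0 if either == 0 else both / either


def jaccard_distance(x, y):
    """Distance counterpart, defined here as 1 minus the similarity."""
    return 1.0 - jaccard_similarity(x, y)


u = [1, 1, 0, 1, 0]
v = [1, 0, 0, 1, 1]
print(jaccard_similarity(u, v))  # 2 shared features out of 4 present -> 0.5
print(jaccard_distance(u, v))    # 0.5
```

Joint absences (positions where both vectors are 0) do not affect the result, which is one of the properties that distinguishes this measure from others that do count them.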