The Metric-Frequency Measure of Similarity for Ill-Structured Data Sets, with an Application to Family Therapy.

C.M.D. Sales, P.P. Wakker*

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review


Similarity measures have been studied extensively in many domains, but usually with well-structured data sets. In many psychological applications, however, such data sets are not available. It often cannot even be predicted how many items will be observed, or what exactly they will entail. This paper introduces a similarity measure, called the metric-frequency (mf) measure, that can be applied to such data sets. If it is not known beforehand how many items will be observed, then the number of items actually observed in itself carries information. A typical feature of the mf is that it incorporates such information. The primary purpose of our measure is that it should be pragmatic, widely applicable, and tractable, even if data are complex. The mf generalizes tversky's set-theoretic measure of similarity to cases where items may be present or absent and at the same time can be numerical as with shepard's metric measure, but need not be so. As an illustration, we apply the mf to family therapy where it cannot be predicted what issues the clients will raise in therapy sessions. The mf is flexible enough to be applicable to idiographic data.
Original languageEnglish
Pages (from-to)663-682
Number of pages20
JournalBritish Journal of Mathematical & Statistical Psychology
Publication statusPublished - 1 Jan 2009

Cite this