Ranking Accuracy for Logistic-GEE Models

Nasser Davarzani*, Ralf Peeters, Evgueni Smirnov, Joël Karel, Hans-Peter Brunner-La Rocca

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding, Chapter, Academic


Abstract

Logistic generalized estimating equations (logistic-GEE) models have been extensively used for analyzing clustered binary data. However, assessing the goodness-of-fit and predictability of these models is problematic because no likelihood is available and the observations can be correlated within a cluster. In this paper we propose a new measure for estimating the generalization performance of logistic-GEE models, namely ranking accuracy for models based on clustered data (RAMCD). We define RAMCD as the probability that a randomly selected positive observation is ranked higher than a randomly selected negative observation from another cluster. We propose a computationally efficient algorithm for computing RAMCD. The algorithm can be applied in two cases: (1) when we estimate RAMCD as a goodness-of-fit criterion and (2) when we estimate RAMCD as a predictability criterion. This is shown experimentally on clustered data from a simulation study and a biomarker study.
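
The definition of RAMCD in the abstract can be illustrated with a minimal sketch. This is a naive pairwise estimate, not the computationally efficient algorithm the paper proposes. The function name `ramcd_naive`, its parameters, and the choice to count tied scores as half a correct ranking (as is common for AUC-style measures) are assumptions made here for illustration only.

```python
from typing import Hashable, Sequence


def ramcd_naive(
    scores: Sequence[float],
    labels: Sequence[int],
    clusters: Sequence[Hashable],
) -> float:
    """Naive O(n^2) estimate of ranking accuracy on clustered data.

    Counts, over all (positive, negative) pairs whose observations come
    from different clusters, the fraction in which the positive
    observation has the higher score. Ties count as 0.5 (an assumption,
    not taken from the paper).
    """
    wins = 0.0
    pairs = 0
    for s_pos, y_pos, c_pos in zip(scores, labels, clusters):
        if y_pos != 1:
            continue  # outer loop visits positive observations only
        for s_neg, y_neg, c_neg in zip(scores, labels, clusters):
            # keep only negatives that belong to another cluster
            if y_neg != 0 or c_neg == c_pos:
                continue
            pairs += 1
            if s_pos > s_neg:
                wins += 1.0
            elif s_pos == s_neg:
                wins += 0.5
    if pairs == 0:
        raise ValueError("no positive/negative pair from different clusters")
    return wins / pairs


if __name__ == "__main__":
    # Hypothetical predicted probabilities for two clusters "a" and "b".
    scores = [0.9, 0.2, 0.4, 0.7]
    labels = [1, 0, 1, 0]
    clusters = ["a", "a", "b", "b"]
    print(ramcd_naive(scores, labels, clusters))
```

Under these assumptions, the scores could be fitted probabilities from the training data (the goodness-of-fit case) or predictions for held-out clusters (the predictability case). Pairs within the same cluster are skipped, which is what distinguishes this quantity from an ordinary AUC.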
Original language: English
Title of host publication: IDA 2016: Advances in Intelligent Data Analysis XV
Editors: H Boström, A Knobbe, C Soares, P Papapetrou
Publisher: Springer International Publishing AG
Chapter: 2
Pages: 14-25
Number of pages: 12
ISBN (Electronic): 978-3-319-46349-0
ISBN (Print): 978-3-319-46348-3
Publication status: Published - 21 Sept 2016
Event: 15th International Symposium on Intelligent Data Analysis (IDA): IDA 2016 - Stockholm, Sweden
Duration: 13 Oct 2016 to 15 Oct 2016

Publication series

Series: Lecture Notes in Computer Science
Volume: 9897
ISSN: 0302-9743

Symposium

Symposium: 15th International Symposium on Intelligent Data Analysis (IDA)
Country/Territory: Sweden
City: Stockholm
Period: 13/10/16 to 15/10/16

Keywords

  • Clustered data
  • Generalized Estimating Equation
  • Goodness-of-fit
  • Predictability
  • Ranking accuracy
  • OF-FIT TESTS
  • LONGITUDINAL DATA-ANALYSIS
  • CONGESTIVE-HEART-FAILURE
  • MANAGEMENT
