Mathematics and Statistics Faculty Publications and Presentations

A Constrained Maximum Likelihood Approach to Developing Well-Calibrated Models for Predicting Binary Outcomes.

Yaqi Cao, Minzu University of China
Weidong Ma, University of Pennsylvania
Ge Zhao, Portland State UniversityFollow
Anne Marie McCarthy, University of Pennsylvania
Jinbo Chen, University of Pennsylvania

Published In

Lifetime Data Analysis

Document Type

Article

Publication Date

5-8-2024

Subjects

Mathematical models, Computational modeling, Parallel processing -- Data models

Abstract

The added value of candidate predictors for risk modeling is routinely evaluated by comparing the performance of models with or without including candidate predictors. Such comparison is most meaningful when the estimated risk by the two models are both unbiased in the target population. Very often data for candidate predictors are sourced from nonrepresentative convenience samples. Updating the base model using the study data without acknowledging the discrepancy between the underlying distribution of the study data and that in the target population can lead to biased risk estimates and therefore an unfair evaluation of candidate predictors. To address this issue assuming access to a well-calibrated base model, we propose a semiparametric method for model fitting that enforces good calibration. The central idea is to calibrate the fitted model against the base model by enforcing suitable constraints in maximizing the likelihood function. This approach enables unbiased assessment of model improvement offered by candidate predictors without requiring a representative sample from the target population, thus overcoming a significant practical challenge. We study theoretical properties for model parameter estimates, and demonstrate improvement in model calibration via extensive simulation studies. Finally, we apply the proposed method to data extracted from Penn Medicine Biobank to inform the added value of breast density for breast cancer risk assessment in the Caucasian woman population.

Rights

This work is licensed under a Creative Commons Attribution 4.0 International License.

Locate the Document

https://doi.org/10.1007/s10985-024-09628-9

DOI

10.1007/s10985-024-09628-9

Persistent Identifier

https://archives.pdx.edu/ds/psu/41793

Citation Details

Cao, Y., Ma, W., Zhao, G., McCarthy, A. M., & Chen, J. (2024). A constrained maximum likelihood approach to developing well-calibrated models for predicting binary outcomes. Lifetime Data Analysis.

Download

Included in

Physical Sciences and Mathematics Commons

COinS

Mathematics and Statistics Faculty Publications and Presentations

A Constrained Maximum Likelihood Approach to Developing Well-Calibrated Models for Predicting Binary Outcomes.

Published In

Document Type

Publication Date

Subjects

Abstract

Rights

Locate the Document

DOI

Persistent Identifier

Citation Details

Included in

Find

Connect

Mathematics and Statistics Faculty Publications and Presentations

A Constrained Maximum Likelihood Approach to Developing Well-Calibrated Models for Predicting Binary Outcomes.

Authors

Published In

Document Type

Publication Date

Subjects

Abstract

Rights

Locate the Document

DOI

Persistent Identifier

Citation Details

Included in

Share

Find

Connect