著者
Masahiko Gosho Tomohiro Ohigashi Kengo Nagashima Yuri Ito Kazushi Maruo
出版者
Japan Epidemiological Association
雑誌
Journal of Epidemiology (ISSN:09175040)
巻号頁・発行日
pp.JE20210089, (Released:2021-09-25)
参考文献数
46
被引用文献数
7

Background: Logistic regression models are widely used to evaluate the association between a binary outcome and a set of covariates. However, when there are few study participants at the outcome and covariate levels, the models lead to bias of the odds ratio (OR) estimated using the maximum likelihood (ML) method. This bias is known as sparse data bias, and the estimated OR can yield impossibly large values because of data sparsity. However, this bias has been ignored in most epidemiological studies.Methods: We review several methods for reducing sparse data bias in logistic regression. The primary aim is to evaluate the Bayesian methods in comparison with the classical methods, such as the ML, Firth’s, and exact methods using a simulation study. We also apply these methods to a real data set.Results: Our simulation results indicate that the bias of the OR from the ML, Firth’s, and exact methods is considerable. Furthermore, the Bayesian methods with hyper-g prior modeling of the prior covariance matrix for regression coefficients reduced the bias under the null hypothesis, whereas the Bayesian methods with log F-type priors reduced the bias under the alternative hypothesis.Conclusion: The Bayesian methods using log F-type priors and hyper-g prior are superior to the ML, Firth’s, and exact methods when fitting logistic models to sparse data sets. The choice of a preferable method depends on the null and alternative hypothesis. Sensitivity analysis is important to understand the robustness of the results in sparse data analysis.

言及状況

外部データベース (DOI)

Twitter (22 users, 26 posts, 95 favorites)

#J_Epidemi Most viewed on J-Stage (July 2023): Bias in Odds Ratios From Logistic Regression Methods With Sparse Data Sets Masahiko Gosho et al. https://t.co/1QWIzX7TYv @J_Epidemi https://t.co/yym1hiEVZo
#J_Epidemi Most viewed on J-Stage (June 2023): Bias in Odds Ratios From Logistic Regression Methods With Sparse Data Sets Masahiko Gosho et al. https://t.co/1QWIzX7TYv @J_Epidemi https://t.co/4xbhSVQgGA
#J_Epidemi 2023 June Issue: Bias in Odds Ratios From Logistic Regression Methods With Sparse Data Sets Masahiko Gosho et al. https://t.co/1QWIzX7TYv @J_Epidemi https://t.co/zK2qtUQNu0
The latest paper on sparse-data bias by Japanese colleagues: https://t.co/kp7vc7JrXL

収集済み URL リスト