학술논문

A semi‐parametric Bayesian model for semi‐continuous longitudinal data
Document Type
article
Source
Statistics in Medicine. 41(13)
Subject
Alcoholism
Alcohol Use and Health
Pediatric
Substance Misuse
Good Health and Well Being
Algorithms
Bayes Theorem
Humans
Markov Chains
Models
Statistical
Monte Carlo Method
B-spline
Bayesian
longitudinal
Markov chain Monte Carlo
semi-continuous
semi-parametric
Statistics
Public Health and Health Services
Statistics & Probability
Language
Abstract
Semi-continuous data present challenges in both model fitting and interpretation. Parametric distributions may be inappropriate for extreme long right tails of the data. Mean effects of covariates, susceptible to extreme values, may fail to capture relevant information for most of the sample. We propose a two-component semi-parametric Bayesian mixture model, with the discrete component captured by a probability mass (typically at zero) and the continuous component of the density modeled by a mixture of B-spline densities that can be flexibly fit to any data distribution. The model includes random effects of subjects to allow for application to longitudinal data. We specify prior distributions on parameters and perform model inference using a Markov chain Monte Carlo (MCMC) Gibbs-sampling algorithm programmed in R. Statistical inference can be made for multiple quantiles of the covariate effects simultaneously providing a comprehensive view. Various MCMC sampling techniques are used to facilitate convergence. We demonstrate the performance and the interpretability of the model via simulations and analyses on the National Consortium on Alcohol and Neurodevelopment in Adolescence study (NCANDA) data on alcohol binge drinking.