https://www.banglajol.info/index.php/JStR/issue/feed Journal of Statistical Research 2023-07-09T11:29:48+00:00 Dr. Mohammad Shafiqur Rahman shafiq@isrt.ac.bd Open Journal Systems <p>Published by the <strong>Institute of Statistical Research and Training (ISRT)</strong>, University of Dhaka, Bangladesh<strong>. </strong>Full-text articles available.</p> <p><a href="http://creativecommons.org/licenses/by-nc/4.0/" rel="license"><img class="alignright" src="https://i.creativecommons.org/l/by-nc-nd/4.0/88x31.png" alt="image" width="88" height="31" /></a><br />Articles in <strong>Journal of Statistical Research </strong>(<strong>JSR</strong>) are licensed under a Creative Commons CC BY-NC-ND License Attribution-NonCommercial-NoDerivatives 4.0 International (<strong>CC-BY-NC-ND 4.0</strong>). This license permits <strong>Share</strong> —copy and redistribute the material in any medium or format.</p> https://www.banglajol.info/index.php/JStR/article/view/67463 Optimal allocation schemes in mixed ANCOVA models for longitudinal data 2023-07-05T10:23:30+00:00 Xiaojian Xu xxu@brocku.ca Sanjoy K Sinha sinha@math.carleton.ca <div class="page" title="Page 1"> <div class="layoutArea"> <div class="column"> <p>We discuss the construction of optimal allocation schemes for the linear mixed model with clustered outcomes or repeated measurements often encountered in longitudinal studies. We consider both treatment and covariate effects in the mixed model, where latent pro- cesses are used to describe random cluster or subject effects. A goal of optimal design schemes is to determine proportions of sample units allocated to each treatment for a given total sample size. We develop the optimal designs in a general setting using both D- and A- optimal design criteria. Specifically, we propose a two-stage design approach to deal with unknown parameters in the linear mixed model, where the variances of the random effects across the treatment groups are considered different. We study the empirical properties of the proposed designs using Monte Carlo simulations. An application is also provided using actual clinical data from a longitudinal study.</p> <p>Journal of Statistical Research, Vol 56, No 2, p101-114</p> </div> </div> </div> 2023-07-09T00:00:00+00:00 Copyright (c) 2022 Journal of Statistical Research https://www.banglajol.info/index.php/JStR/article/view/67466 Debias random forest regression predictors 2023-07-05T10:44:01+00:00 Lihua Chen chen3lx@jmu.edu Prabhashi Withana Gamage withanpw@jmu.edu John Ryan john.p.ryan@wisc.edu <div class="page" title="Page 1"> <div class="layoutArea"> <div class="column"> <p>The random forest can reduce the variance of regression predictors through bagging while leaving the bias mostly unchanged. In general, the bias is not negligible and consequently bias correction is necessary. The default bias correction method implemented in the R package randomForest often works poorly. Several approaches have been developed which in general outperform the R default. However, little work has been done to com- prehensively evaluate the performance of these methods and thus guide users to select an appropriate method for bias correction. This paper fills this gap by providing an informa- tive ranking of these bias correction methods based on an extensive numerical study. We further offered practical suggestions on the application of the winner of these methods and suggested a visualization technique to help users decide when bias correction is needed.</p> <p>Journal of Statistical Research 2022, Vol. 56, No. 2, pp. 115-131</p> </div> </div> </div> 2023-07-09T00:00:00+00:00 Copyright (c) 2022 Journal of Statistical Research https://www.banglajol.info/index.php/JStR/article/view/67467 Properties of inverse probability of adherence weighted estimator of the per-protocol effect for sustained treatment strategies under different data-generating mechanisms and adherence patterns 2023-07-05T12:52:34+00:00 Lucy Mosquera bellemarelucy@gmail.com Mohammad Ehsanul Karim ehsan.karim@ubc.ca Md Belal Hossain belal.hossain@ubc.ca <div class="page" title="Page 1"> <div class="layoutArea"> <div class="column"> <p>Inverse Probability (of Adherence) Weighted per-protocol (IPW-PP) estimators are get- ting popular in addressing medication non-adherence while analyzing pragmatic trial data. However, their finite sample properties under different data generating mechanisms (DGMs) have not been investigated comprehensively. In the current work, we investigated the finite sample performances of such estimators in the context of a pragmatic random- ized controlled trial. We compared the performances of IPW-PP estimators with commonly used naive and baseline-adjusted per-protocol estimators, under different DGMs emulating pragmatic trials, comparing two sustained treatment strategies, possibly with a non-null effect. DGMs include (i) different roles of a baseline variable; whether future time-varying prognostic factors are impacted by past adherence; and whether the baseline variable is measured, (ii) whether adherence patterns observed in two arms are differential, and when we have access to measurements of adherence and confounders that are recorded infre- quently (sparsely). When baseline confounders are adjusted, we generally obtain unbiased estimates, but if some necessary variables are not measured, the IPW-PP estimator may still be preferable. High non-adherence patterns might negatively impact IPW-PP effect estimators, particularly when DGMs include confounding that may be influenced by previ- ous adherence history. We used the above estimators to analyze a case study from the Lipid Research Clinics Coronary Primary Prevention Trial data in the presence of non-adherence.</p> <p>Journal of Statistical Research 2022, Vol. 56, No. 2, pp.134-154</p> </div> </div> </div> 2023-07-09T00:00:00+00:00 Copyright (c) 2022 Journal of Statistical Research https://www.banglajol.info/index.php/JStR/article/view/67468 Approximate methods for analyzing semiparametric longitudinal models with nonignorable missing responses 2023-07-05T13:12:54+00:00 Najla Aloraini arienie@qu.edu.sa Sanjoy Sinha sinha@math.carleton.ca <p>We often encounter missing data in longitudinal studies. When the missingness in longitudinal data is nonignorable, it is necessary to incorporate the missing data mechanism into the observed data likelihood function for a valid statistical inference. In this article, we propose and explore a novel semiparametric approach to estimating the regression parameters and variance components using a partially linear mixed model with nonignorable and nonmonotone missing responses. The finite sample properties of the proposed method are studied using Monte Carlo simulations, where our method is found to be very effective in capturing any curvilinear pattern in the mean response. The method is also illustrated using some actual longitudinal data obtained from a public health survey, referred to as the Health and Retirement Study (HRS).</p> <p>Journal of Statistical Research 2022, Vol. 56, No. 2, pp. 155-183</p> 2023-07-09T00:00:00+00:00 Copyright (c) 2022 Journal of Statistical Research https://www.banglajol.info/index.php/JStR/article/view/67469 Penalized logistic normal multinomial factor analyzers for high dimensional compositional data 2023-07-05T13:37:43+00:00 Wangshu Tu wangshu.tu@carleton.ca Sanjeena Subedi sanjeena.dang@carleton.ca <div class="page" title="Page 1"> <div class="layoutArea"> <div class="column"> <p>Model-based clustering utilizes a finite mixture model to identify underlying patterns or clusters across samples. A finite mixture model is a convex combination of two or more distributions, where appropriate distributions are chosen depending on the type of the data. Recently, there has been a great interest in clustering human microbiome data. Microbiome data are compositional (yielding relative abundance) and are high-dimensional. Previously, a family of logistic normal multinomial factor analyzers (LNM-FA) for model-based clus- tering of high-dimensional microbiome data was proposed via a factor analyzer structure. This reduced the number of parameters and computation overhead compared to a traditional mixtures of logistic normal multinomial models. Here, we propose a penalized LNM-FA (PLNM-FA) model by utilizing lasso regularization to each entry of the loading matrix. This introduces further parsimony compared to LNM-FA and also estimates the number of latent factors simultaneously. Parameter estimation is done using a variational variant of the alternating expectation conditional maximization algorithm to maximize the penalized maximum likelihood. The performance of proposed algorithm is evaluated using simula- tion studies and real data.</p> <p>Journal of Statistical Research 2022, Vol. 56, No. 2, pp.185-216&nbsp;</p> </div> </div> </div> 2023-07-09T00:00:00+00:00 Copyright (c) 2022 Journal of Statistical Research