Only 3 days left until the start of the Statistics Globe online workshop, Missing Data Imputation in R.
Kicking off on February 20, this workshop includes eight weekly live sessions, beginning with the basics of handling missing data and advancing to sophisticated imputation techniques in R.
The workshop is limited to 15 participants, so enroll now to secure your spot.
Learn more and sign up here: https://statisticsglobe.com/online-workshop-missing-data-imputation-r
New year, new learning format! I'm thrilled to announce the very first interactive online workshop ever conducted at Statistics Globe!
The Topic: Missing Data Imputation in R
Click here for more info about the workshop: https://statisticsglobe.com/online-workshop-missing-data-imputation-r
Pipeline release! nf-core/phaseimpute v1.0.0 - 1.0.0 - Black Labrador!
Please see the changelog: https://github.com/nf-core/phaseimpute/releases/tag/1.0.0
And it’s finally here {resurface} my #rstats package for imputing missing genotype allele frequencies, such as those from pooled samples or populations. Stems from some scripts I wrote nearly 10 years ago, which I can now finally say is open. I hope to progressively add more imputation options.
#imputation #genomics #genotyping #bioinformatics
https://github.com/lpembleton/resurface
1/I’ve been developing some genotype allele frequency (AF) #imputation #rstat scripts. While my algorithm returns a low raw bias (RB) of 0.05 to 0.1, simply imputing with the meanAF gives slightly worse but largely similar RB values. I’m wanting to quantifying accuracy better because despite the similar RB values, I find meanAF imputation inferior.
#genomics
'Nonparametric Copula Models for Multivariate, Mixed, and Missing Data', by Joseph Feldman, Daniel R. Kowal.
http://jmlr.org/papers/v25/23-0495.html
#copula #imputation #missingness
'Semi-supervised Inference for Block-wise Missing Data without Imputation', by Shanshan Song, Yuanyuan Lin, Yong Zhou.
http://jmlr.org/papers/v25/21-1504.html
#imputation #supervised #neuroimaging
... The student contacted Heshmati and eventually obtained spreadsheets of the data he had used in the paper. Heshmati acknowledged that, although he and his coauthor had not mentioned this fact in the paper, the data had gaps. He revealed in an email that these gaps had been filled by using Excel’s autofill function”....
"Heshmati told the student he had used Excel’s autofill function to mend the data. ... where there were no observations to use for the autofill operation, the professor had taken the values from an adjacent country in the spreadsheet."
#economics #imputation #heshmati #retraction #retractionwatch #Excel #AutoFill #datainvention #fudge #fake
"Journal of Cleaner Production":
https://www.sciencedirect.com/science/article/pii/S0959652623022503
"#Green #innovations and #patents in #OECD countries"
What model #statistics should one report after using multiple #imputation and #multilevel regressions, and how are they obtained? I'm using the #mice package in #rstats, and #lme4 on each imputed dataset. When pooling results, summary() yields what I need for each model term, but nothing for the whole model. If I didn't impute but deleted listwise, I would normally report AIC, BIC, Loglik. These are all in the mipo object, for each result for each imputed dataset, but they're not pooled. I'm sure I'm missing something here. Does anyone know an example article where such results are presented neatly?
'Surrogate Assisted Semi-supervised Inference for High Dimensional Risk Prediction', by Jue Hou, Zijian Guo, Tianxi Cai.
http://jmlr.org/papers/v24/21-1075.html
#imputation #predictors #supervised
RIFLE: Imputation and Robust Inference from Low Order Marginals
Numerical Data Imputation for Multimodal Data Sets: A Probabilistic Nearest-Neighbor Kernel Densi...
Florian Lalande, Kenji Doya
Action editor: Pierre Alquier.
New #ReproducibilityCertification:
Numerical Data Imputation for Multimodal Data Sets: A Probabilistic Nearest-Neighbor Kernel Densi...
Florian Lalande, Kenji Doya
Numerical Data Imputation for Multimodal Data Sets: A Probabilistic Nearest-Neighbor Kernel Density Approach
'Distributed Nonparametric Regression Imputation for Missing Response Problems with Large-scale Data', by Ruoyu Wang, Miaomiao Su, Qihua Wang.
http://jmlr.org/papers/v24/21-0673.html
#imputation #nonparametric #semiparametric
A Modulation Layer to Increase Neural Network Robustness Against Data Quality Issues
Mohamed Abdelhack, Jiaming Zhang, Sandhya Tripathi et al.