Search In this Thesis
   Search In this Thesis  
العنوان
Handling mixed missing data /
الناشر
Mai Ahmed Mohsen Moustafa ,
المؤلف
Mai Ahmed Mohsen Moustafa
هيئة الاعداد
باحث / Mai Ahmed Mohsen Moustafa
مشرف / Amany Mousa Mohamed
مشرف / Yasmin Mohamed Ibrahim
مناقش / Amany Mousa Mohamed
تاريخ النشر
2018
عدد الصفحات
154 Leaves :
اللغة
الإنجليزية
الدرجة
ماجستير
التخصص
الإحصاء والاحتمالات
تاريخ الإجازة
22/5/2019
مكان الإجازة
اتحاد مكتبات الجامعات المصرية - Statistics and Econometrics
الفهرس
Only 14 pages are availabe for public view

from 173

from 173

Abstract

Incomplete data is often an unavoidable problem faced by most applied researchers as survey results often include some non-response. Various techniques have been developed for dealing with missing values in data sets with homogeneous attributes (their independent attributes are all either continuous or discrete). However, these imputation algorithms cannot be directly applied to many real data sets, as survey data sets in general often consist of large numbers of variables which have mixed data types i.e. different measurement scales. Specific methods and modification in existing methods are found for dealing with such kind of data. This thesis reviews some methods for such kind of data and applies six imputation methods out of them. Assessing the performance of the six imputation methods which are MICE, MICE-CART, MICE-RF, MissForest, MissRanger and KNN is performed using 3 real datasets at 5 different missing rates. Complete datasets have been used and variables were artificially made 2missing at random3and results were assessed using different criteria. Across the imputed datasets MissForest and MissRanger tend to have the best results while MICE-RF and KNN tend to have the worst results