TY - JOUR
T1 - Probabilistic recovery of incomplete sensed data in IoT
AU - Fekade, Berihun
AU - Maksymyuk, Taras
AU - Kyryk, Maryan
AU - Jo, Minho
PY - 2018/8
Y1 - 2018/8
N2 - Reliable data delivery in the Internet of Things (IoT) is very important in order to provide IoT-based services with the required quality. However, IoT data delivery may not be successful for different reasons, such as connection errors, external attacks, or sensing errors. This results in data incompleteness, which decreases the performance of IoT applications. In particular, the recovery of missing data among the massive sensed data of the IoT is so important that it should be solved. In this paper, we propose a probabilistic method to recover missing (incomplete) data from IoT sensors by utilizing data from related sensors. The main idea of the proposed method is to perform probabilistic matrix factorization (PMF) within the preliminary assigned group of sensors. Unlike previous PMF approaches, the proposed model measures the similarity in data among neighboring sensors and splits them into different clusters with a K-means algorithm. Simulation results show that the proposed PMF model with clustering outperforms support vector machine (SVM) and deep neural network (DNN) algorithms in terms of accuracy and root mean square error. By using normalized datasets, PMF shows faster execution time than SVM, and almost the same execution time as the DNN method. This proposed incomplete data-recovery approach is a promising alternative to traditional DNN and SVM methods for IoT telemetry applications.
AB - Reliable data delivery in the Internet of Things (IoT) is very important in order to provide IoT-based services with the required quality. However, IoT data delivery may not be successful for different reasons, such as connection errors, external attacks, or sensing errors. This results in data incompleteness, which decreases the performance of IoT applications. In particular, the recovery of missing data among the massive sensed data of the IoT is so important that it should be solved. In this paper, we propose a probabilistic method to recover missing (incomplete) data from IoT sensors by utilizing data from related sensors. The main idea of the proposed method is to perform probabilistic matrix factorization (PMF) within the preliminary assigned group of sensors. Unlike previous PMF approaches, the proposed model measures the similarity in data among neighboring sensors and splits them into different clusters with a K-means algorithm. Simulation results show that the proposed PMF model with clustering outperforms support vector machine (SVM) and deep neural network (DNN) algorithms in terms of accuracy and root mean square error. By using normalized datasets, PMF shows faster execution time than SVM, and almost the same execution time as the DNN method. This proposed incomplete data-recovery approach is a promising alternative to traditional DNN and SVM methods for IoT telemetry applications.
KW - Clustering algorithms
KW - Internet of Things
KW - Internet of Things (IoT)
KW - massive sensed data.
KW - Probabilistic logic
KW - probabilistic matrix factorization
KW - recovery of missing sensor data
KW - Sensor systems
KW - Support vector machines
KW - Wireless sensor networks
UR - http://www.scopus.com/inward/record.url?scp=85028467073&partnerID=8YFLogxK
U2 - 10.1109/JIOT.2017.2730360
DO - 10.1109/JIOT.2017.2730360
M3 - Article
AN - SCOPUS:85028467073
SN - 2327-4662
VL - 5
SP - 2282
EP - 2292
JO - IEEE Internet of Things Journal
JF - IEEE Internet of Things Journal
IS - 4
ER -