Improving IoT data quality in mobile crowd sensing: A cross validation approach

Tie Luo, Jianwei Huang, Salil S. Kanhere, Jie Zhang, Sajal K. Das

Research output: Contribution to journalArticlepeer-review

85 Scopus citations

Abstract

Data quality, or sometimes referred to as data credibility, is a critical issue in mobile crowd sensing (MCS) and more generally Internet of Things (IoT). While candidate solutions, such as incentive mechanisms and data mining have been well explored in the literature, the power of crowds has been largely overlooked or under-exploited. In this paper, we propose a cross validation approach which seeks a validating crowd to ratify the contributing crowd in terms of the sensor data contributed by the latter, and uses the validation result to reshape data into a more credible posterior belief of the ground truth. This approach consists of a framework and a mechanism, where the framework outlines a four-step procedure and the mechanism implements it with specific technical components, including a weighted random oversampling (WRoS) technique and a privacy-aware trust-oriented probabilistic push (PATOP2) algorithm. Unlike most prior work, our proposed approach augments rather than redesigning existing MCS systems, and requires minimal effort from the crowd, making it conducive to practical adoption. We evaluate our proposed mechanism using a real-world MCS IoT dataset and demonstrate remarkable (up to 475%) improvement of data quality. In particular, it offers a unified solution to reconciling two disparate needs: reinforcing obscure (weakly recognizable) ground truths and discovering hidden (unrecognized) ground truths.

Original languageEnglish
Article number8666717
Pages (from-to)5651-5664
Number of pages14
JournalIEEE Internet of Things Journal
Volume6
Issue number3
DOIs
StatePublished - Jun 2019

Bibliographical note

Publisher Copyright:
© 2014 IEEE.

Funding

Manuscript received November 3, 2018; revised March 4, 2019; accepted March 8, 2019. Date of publication March 13, 2019; date of current version June 19, 2019. This work was supported in part by the Hong Kong General Research Fund under Grant CUHK1421906, in part by the Presidential Fund from the Chinese University of Hong Kong, Shenzhen, and in part by the National Science Foundation under Grant CNS-1818942, Grant CCF-1725755, Grant CNS-1545050, and Grant CCF-1533918. (Corresponding author: Tie Luo.) T. Luo is with the Institute for Infocomm Research, A*STAR, Singapore (e-mail: [email protected]).

FundersFunder number
Hong Kong General Research FundCUHK1421906
U.S. Department of Energy Chinese Academy of Sciences Guangzhou Municipal Science and Technology Project Oak Ridge National Laboratory Extreme Science and Engineering Discovery Environment National Science Foundation National Energy Research Scientific Computing Center National Natural Science Foundation of ChinaCCF-1533918, CCF-1725755, CNS-1545050, CNS-1818942
U.S. Department of Energy Chinese Academy of Sciences Guangzhou Municipal Science and Technology Project Oak Ridge National Laboratory Extreme Science and Engineering Discovery Environment National Science Foundation National Energy Research Scientific Computing Center National Natural Science Foundation of China
Shenzhen Graduate School, Peking University
Chinese University of Hong Kong

    Keywords

    • Chance-constrained programming
    • Crowdsourcing
    • Data quality
    • Exploration-exploitation tradeoff
    • Internet of Things (IoT)
    • Kullback-Leibler divergence
    • Privacy
    • Trust

    ASJC Scopus subject areas

    • Signal Processing
    • Information Systems
    • Hardware and Architecture
    • Computer Science Applications
    • Computer Networks and Communications

    Fingerprint

    Dive into the research topics of 'Improving IoT data quality in mobile crowd sensing: A cross validation approach'. Together they form a unique fingerprint.

    Cite this