Data smashing

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

From automatic speech recognition to discovering unusual stars, underlying almost all automated discovery tasks is the ability to compare and contrast data streams with each other, to identify connections and spot outliers. Despite the prevalence of data, however, automated methods are not keeping pace. A key bottleneck is that most data comparison algorithms today rely on a human expert to specify what "features" of the data are relevant for comparison. Here, we propose a new principle for estimating the similarity between the sources of arbitrary data streams, using neither domain knowledge nor learning. We demonstrate the application of this principle to the analysis of data from a number of real-world challenge problems, including the disambiguation of electro-encephalograph patterns pertaining to epileptic seizures, detection of anomalous cardiac activity from heart sound recordings, and classification of astronomical objects from raw photometry. In all these cases and without access to any domain knowledge, we demonstrate performance on par with the accuracy achieved by specialized algorithms and heuristics devised by domain experts. We suggest that data smashing principles may open the door to understanding increasingly complex observations, especially when experts do not know what to look for.

Original language: English
Title of host publication: Discovery Informatics - Papers Presented at the 28th AAAI Conference on Artificial Intelligence, Technical Report
Publisher: AI Access Foundation
Pages: 7-14
Number of pages: 8
ISBN (Electronic): 9781577356660
State: Published - 2014
Event: 28th AAAI Conference on Artificial Intelligence, AAAI 2014 - Quebec City, Canada
Duration: Jul 28 2014 → …

Publication series

Name: AAAI Workshop - Technical Report
Volume: WS-14-05

Conference

Conference: 28th AAAI Conference on Artificial Intelligence, AAAI 2014
Country/Territory: Canada
City: Quebec City
Period: 7/28/14 → …

Bibliographical note

Publisher Copyright:
© 2014, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

ASJC Scopus subject areas

  • General Engineering
