Subjectively interesting subgroup discovery on real-valued targets

Jefrey Lijffijt, Bo Kang, W. Duivesteijn, Kai Puolamäki, Emilia Oikarinen, Tijl de Bie

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

1 Citation (Scopus)

Abstract

Deriving insights from high-dimensional data is one of the core problems in data mining. The difficulty mainly stems from the large number of variable combinations to potentially consider. Hence, an obvious question is whether we can automate the search for interesting patterns. Here, we consider the setting where a user wants to learn as efficiently as possible about real-valued attributes. We introduce a method to find subgroups in the data that are maximally informative (in the Information Theoretic sense) with respect to one or more real-valued target attributes. The succinct subgroup descriptions are in terms of arbitrarily-typed description attributes. The approach is based on the Subjective Interestingness framework FORSIED to use prior knowledge when mining most informative patterns.
Original languageEnglish
Title of host publicationProceedings of the 34th IEEE International Conference on Data Engineering (ICDE 2018)
Place of PublicationPiscataway
PublisherInstitute of Electrical and Electronics Engineers
Pages1352-1355
ISBN (Electronic)978-1-5386-5520-7
ISBN (Print)978-1-5386-5521-4
DOIs
Publication statusPublished - 2018
Event2018 IEEE 34th International Conference on Data Engineering (ICDE) - Paris, France
Duration: 16 Apr 201819 Apr 2018

Conference

Conference2018 IEEE 34th International Conference on Data Engineering (ICDE)
CountryFrance
CityParis
Period16/04/1819/04/18

Fingerprint

Data mining

Cite this

Lijffijt, J., Kang, B., Duivesteijn, W., Puolamäki, K., Oikarinen, E., & de Bie, T. (2018). Subjectively interesting subgroup discovery on real-valued targets. In Proceedings of the 34th IEEE International Conference on Data Engineering (ICDE 2018) (pp. 1352-1355). [8509369] Piscataway: Institute of Electrical and Electronics Engineers. https://doi.org/10.1109/ICDE.2018.00148
Lijffijt, Jefrey ; Kang, Bo ; Duivesteijn, W. ; Puolamäki, Kai ; Oikarinen, Emilia ; de Bie, Tijl. / Subjectively interesting subgroup discovery on real-valued targets. Proceedings of the 34th IEEE International Conference on Data Engineering (ICDE 2018). Piscataway : Institute of Electrical and Electronics Engineers, 2018. pp. 1352-1355
@inproceedings{c6f7d0cb2e7840b4a83d55f23d0d269b,
title = "Subjectively interesting subgroup discovery on real-valued targets",
abstract = "Deriving insights from high-dimensional data is one of the core problems in data mining. The difficulty mainly stems from the large number of variable combinations to potentially consider. Hence, an obvious question is whether we can automate the search for interesting patterns. Here, we consider the setting where a user wants to learn as efficiently as possible about real-valued attributes. We introduce a method to find subgroups in the data that are maximally informative (in the Information Theoretic sense) with respect to one or more real-valued target attributes. The succinct subgroup descriptions are in terms of arbitrarily-typed description attributes. The approach is based on the Subjective Interestingness framework FORSIED to use prior knowledge when mining most informative patterns.",
author = "Jefrey Lijffijt and Bo Kang and W. Duivesteijn and Kai Puolam{\"a}ki and Emilia Oikarinen and {de Bie}, Tijl",
year = "2018",
doi = "10.1109/ICDE.2018.00148",
language = "English",
isbn = "978-1-5386-5521-4",
pages = "1352--1355",
booktitle = "Proceedings of the 34th IEEE International Conference on Data Engineering (ICDE 2018)",
publisher = "Institute of Electrical and Electronics Engineers",
address = "United States",

}

Lijffijt, J, Kang, B, Duivesteijn, W, Puolamäki, K, Oikarinen, E & de Bie, T 2018, Subjectively interesting subgroup discovery on real-valued targets. in Proceedings of the 34th IEEE International Conference on Data Engineering (ICDE 2018)., 8509369, Institute of Electrical and Electronics Engineers, Piscataway, pp. 1352-1355, 2018 IEEE 34th International Conference on Data Engineering (ICDE), Paris, France, 16/04/18. https://doi.org/10.1109/ICDE.2018.00148

Subjectively interesting subgroup discovery on real-valued targets. / Lijffijt, Jefrey; Kang, Bo; Duivesteijn, W.; Puolamäki, Kai; Oikarinen, Emilia; de Bie, Tijl.

Proceedings of the 34th IEEE International Conference on Data Engineering (ICDE 2018). Piscataway : Institute of Electrical and Electronics Engineers, 2018. p. 1352-1355 8509369.

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

TY - GEN

T1 - Subjectively interesting subgroup discovery on real-valued targets

AU - Lijffijt, Jefrey

AU - Kang, Bo

AU - Duivesteijn, W.

AU - Puolamäki, Kai

AU - Oikarinen, Emilia

AU - de Bie, Tijl

PY - 2018

Y1 - 2018

N2 - Deriving insights from high-dimensional data is one of the core problems in data mining. The difficulty mainly stems from the large number of variable combinations to potentially consider. Hence, an obvious question is whether we can automate the search for interesting patterns. Here, we consider the setting where a user wants to learn as efficiently as possible about real-valued attributes. We introduce a method to find subgroups in the data that are maximally informative (in the Information Theoretic sense) with respect to one or more real-valued target attributes. The succinct subgroup descriptions are in terms of arbitrarily-typed description attributes. The approach is based on the Subjective Interestingness framework FORSIED to use prior knowledge when mining most informative patterns.

AB - Deriving insights from high-dimensional data is one of the core problems in data mining. The difficulty mainly stems from the large number of variable combinations to potentially consider. Hence, an obvious question is whether we can automate the search for interesting patterns. Here, we consider the setting where a user wants to learn as efficiently as possible about real-valued attributes. We introduce a method to find subgroups in the data that are maximally informative (in the Information Theoretic sense) with respect to one or more real-valued target attributes. The succinct subgroup descriptions are in terms of arbitrarily-typed description attributes. The approach is based on the Subjective Interestingness framework FORSIED to use prior knowledge when mining most informative patterns.

U2 - 10.1109/ICDE.2018.00148

DO - 10.1109/ICDE.2018.00148

M3 - Conference contribution

SN - 978-1-5386-5521-4

SP - 1352

EP - 1355

BT - Proceedings of the 34th IEEE International Conference on Data Engineering (ICDE 2018)

PB - Institute of Electrical and Electronics Engineers

CY - Piscataway

ER -

Lijffijt J, Kang B, Duivesteijn W, Puolamäki K, Oikarinen E, de Bie T. Subjectively interesting subgroup discovery on real-valued targets. In Proceedings of the 34th IEEE International Conference on Data Engineering (ICDE 2018). Piscataway: Institute of Electrical and Electronics Engineers. 2018. p. 1352-1355. 8509369 https://doi.org/10.1109/ICDE.2018.00148