Green data science: using big data in an "environmentally friendly" manner

W.M.P. Van Der Aalst

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

15 Citations (Scopus)

Abstract

The widespread use of "Big Data" is heavily impacting organizations and individuals for which these data are collected. Sophisticated data science techniques aim to extract as much value from data as possible. Powerful mixtures of Big Data and analytics are rapidly changing the way we do business, socialize, conduct research, and govern society. Big Data is considered as the "new oil" and data science aims to transform this into new forms of "energy": insights, diagnostics, predictions, and automated decisions. However, the process of transforming "new oil" (data) into "new energy" (analytics) may negatively impact citizens, patients, customers, and employees. Systematic discrimination based on data, invasions of privacy, non-transparent life-changing decisions, and inaccurate conclusions illustrate that data science techniques may lead to new forms of "pollution". We use the term "Green Data Science" for technological solutions that enable individuals, organizations and society to reap the benefits from the widespread availability of data while ensuring fairness, confidentiality, accuracy, and transparency. To illustrate the scientific challenges related to "Green Data Science", we focus on process mining as a concrete example. Recent breakthroughs in process mining resulted in powerful techniques to discover the real processes, to detect deviations from normative process models, and to analyze bottlenecks and waste. Therefore, this paper poses the question: How to benefit from process mining while avoiding "pollutions" related to unfairness, undesired disclosures, inaccuracies, and non-transparency?

Original languageEnglish
Title of host publicationICEIS 2016 - Proceedings of the 18th International Conference on Enterprise Information Systems, ICEIS 2016, 25-28 April 2016, Rome, Italy
Place of Publications.l.
PublisherSciTePress Digital Library
Pages9-21
Number of pages13
ISBN (Electronic)9789897581878
Publication statusPublished - 2016
Event18th International Conference on Enterprise Information Systems (ICEIS 2016) - Rome, Italy
Duration: 25 Apr 201628 Apr 2016
Conference number: 18
http://www.iceis.org/?y=2016

Conference

Conference18th International Conference on Enterprise Information Systems (ICEIS 2016)
Abbreviated titleICEIS 2016
Country/TerritoryItaly
CityRome
Period25/04/1628/04/16
Internet address

Keywords

  • Accuracy
  • Big Data
  • Confidentiality
  • Data science
  • Fairness
  • Process mining
  • Transparency

Fingerprint

Dive into the research topics of 'Green data science: using big data in an "environmentally friendly" manner'. Together they form a unique fingerprint.

Cite this