TY - GEN
T1 - Predicting “What is interesting” by mining interactive-data-analysis session logs
AU - Milo, Tova
AU - Ozeri, Chai
AU - Somech, Amit
N1 - Publisher Copyright: © 2019 Copyright held by the owner/author(s).
PY - 2019
Y1 - 2019
N2 - Assessing the interestingness of data analysis actions has been the subject of extensive previous work, and a multitude of interestingness measures have been devised, each capturing a different facet of the broad concept. While such measures are a core component in many analysis platforms (e.g., for ranking association rules, recommending visualizations, and query formulation), choosing the most adequate measure for a specific analysis task or an application domain is known to be a difficult task. In this work we focus on the choice of interestingness measures particularly for Interactive Data Analysis (IDA), where users examine datasets by performing sessions of analysis actions. Our goal is to determine the most suitable interestingness measure that adequately captures the user’s current interest at each step of an interactive analysis session. We propose a novel solution that is based on the mining of IDA session logs. First, we perform an offline analysis of the logs, and identify unique characteristics of interestingness in IDA sessions. We then define a classification problem and build a predictive model that can select the best measure for a given a state of a user session. Our experimental evaluation, performed over real-life session logs, demonstrates the sensibility and adequacy of our approach.
AB - Assessing the interestingness of data analysis actions has been the subject of extensive previous work, and a multitude of interestingness measures have been devised, each capturing a different facet of the broad concept. While such measures are a core component in many analysis platforms (e.g., for ranking association rules, recommending visualizations, and query formulation), choosing the most adequate measure for a specific analysis task or an application domain is known to be a difficult task. In this work we focus on the choice of interestingness measures particularly for Interactive Data Analysis (IDA), where users examine datasets by performing sessions of analysis actions. Our goal is to determine the most suitable interestingness measure that adequately captures the user’s current interest at each step of an interactive analysis session. We propose a novel solution that is based on the mining of IDA session logs. First, we perform an offline analysis of the logs, and identify unique characteristics of interestingness in IDA sessions. We then define a classification problem and build a predictive model that can select the best measure for a given a state of a user session. Our experimental evaluation, performed over real-life session logs, demonstrates the sensibility and adequacy of our approach.
UR - http://www.scopus.com/inward/record.url?scp=85064952174&partnerID=8YFLogxK
U2 - 10.5441/002/edbt.2019.42
DO - 10.5441/002/edbt.2019.42
M3 - منشور من مؤتمر
T3 - Advances in Database Technology - EDBT
SP - 456
EP - 467
BT - Advances in Database Technology - EDBT 2019
A2 - Galhardas, Helena
A2 - Kaoudi, Zoi
A2 - Reinwald, Berthold
A2 - Herschel, Melanie
A2 - Binnig, Carsten
A2 - Fundulaki, Irini
T2 - 22nd International Conference on Extending Database Technology, EDBT 2019
Y2 - 26 March 2019 through 29 March 2019
ER -