TY - GEN
T1 - Instant Exceptional Model Mining using weighted controlled pattern sampling
AU - Moens, Sandy
AU - Boley, Mario
N1 - Publisher Copyright: © Springer International Publishing Switzerland 2014
PY - 2014
Y1 - 2014
N2 - When plugged into instant interactive data analytics processes, pattern mining algorithms are required to produce small collections of high quality patterns in short amounts of time. In the case of Exceptional Model Mining (EMM), even heuristic approaches like beam search can fail to deliver this requirement, because in EMM each search step requires a relatively expensive model induction. In this work, we extend previous work on high performance controlled pattern sampling by introducing extra weighting functionality, to give more importance to certain data records in a dataset. We use the extended framework to quickly obtain patterns that are likely to show highly deviating models. Additionally, we combine this randomized approach with a heuristic pruning procedure that optimizes the pattern quality further. Experiments show that in contrast to traditional beam search, this combined method is able to find higher quality patterns using short time budgets.
AB - When plugged into instant interactive data analytics processes, pattern mining algorithms are required to produce small collections of high quality patterns in short amounts of time. In the case of Exceptional Model Mining (EMM), even heuristic approaches like beam search can fail to deliver this requirement, because in EMM each search step requires a relatively expensive model induction. In this work, we extend previous work on high performance controlled pattern sampling by introducing extra weighting functionality, to give more importance to certain data records in a dataset. We use the extended framework to quickly obtain patterns that are likely to show highly deviating models. Additionally, we combine this randomized approach with a heuristic pruning procedure that optimizes the pattern quality further. Experiments show that in contrast to traditional beam search, this combined method is able to find higher quality patterns using short time budgets.
KW - Controlled pattern sampling
KW - Exceptional Model Mining
KW - Subgroup discovery
UR - http://www.scopus.com/inward/record.url?scp=84910002424&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-12571-8_18
DO - 10.1007/978-3-319-12571-8_18
M3 - Conference contribution
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 203
EP - 214
BT - Advances in Intelligent DataAnalysis XIII - 13th International Symposium, IDA 2014, Proceedings
A2 - Blockeel, Hendrik
A2 - van Leeuwen, Matthijs
A2 - Vinciotti, Veronica
PB - Springer Verlag
T2 - PAKDD 2006 International Workshop on Knowledge Discovery in Life Science Literature, KDLL 2006
Y2 - 9 April 2006 through 9 April 2006
ER -