TY - GEN
T1 - Diverse user selection for opinion procurement
AU - Amsterdamer, Yael
AU - Goldreich, Oded
N1 - Publisher Copyright: © 2020 Copyright held by the owner/author(s).
PY - 2020
Y1 - 2020
N2 - Many applications maintain a repository of user profiles with semantically rich information on each user. Such repositories have a potential of allowing active opinion procurement: reaching out to users to ask for their opinions on different topics. An important desideratum of the procurement process is that it targets a diverse set of users. To realize this potential, we present Podium: a first framework, to our knowledge, that supports the selection of diverse representatives in presence of high-dimensional, semantically rich user profiles. We demonstrate that data dimensionality is a challenge for both defining and achieving diversification. We address these challenges by proposing a lightweight, flexible notion of diversity that in turn allows explanations and customization of diversification results. We show that the problem of finding an optimally diverse user subset is intractable, and provide a greedy algorithm that computes an approximate solution. We have implemented our solution in a system prototype and tested it on real-world crowdsourcing platform data. Our experimental results show that Podium is effective in selecting users with diverse properties, and in turn that the opinions of these users are diverse according to multiple metrics.
AB - Many applications maintain a repository of user profiles with semantically rich information on each user. Such repositories have a potential of allowing active opinion procurement: reaching out to users to ask for their opinions on different topics. An important desideratum of the procurement process is that it targets a diverse set of users. To realize this potential, we present Podium: a first framework, to our knowledge, that supports the selection of diverse representatives in presence of high-dimensional, semantically rich user profiles. We demonstrate that data dimensionality is a challenge for both defining and achieving diversification. We address these challenges by proposing a lightweight, flexible notion of diversity that in turn allows explanations and customization of diversification results. We show that the problem of finding an optimally diverse user subset is intractable, and provide a greedy algorithm that computes an approximate solution. We have implemented our solution in a system prototype and tested it on real-world crowdsourcing platform data. Our experimental results show that Podium is effective in selecting users with diverse properties, and in turn that the opinions of these users are diverse according to multiple metrics.
UR - http://www.scopus.com/inward/record.url?scp=85084183041&partnerID=8YFLogxK
U2 - https://doi.org/10.5441/002/edbt.2020.60
DO - https://doi.org/10.5441/002/edbt.2020.60
M3 - منشور من مؤتمر
T3 - Advances in Database Technology - EDBT
SP - 486
EP - 497
BT - Advances in Database Technology - EDBT 2020
A2 - Bonifati, Angela
A2 - Zhou, Yongluan
A2 - Vaz Salles, Marcos Antonio
A2 - Bohm, Alexander
A2 - Olteanu, Dan
A2 - Fletcher, George
A2 - Khan, Arijit
A2 - Yang, Bin
T2 - 23rd International Conference on Extending Database Technology, EDBT 2020
Y2 - 30 March 2020 through 2 April 2020
ER -