TY - GEN
T1 - Predict demographic information using Word2vec on spatial trajectories
AU - Solomon, Adir
AU - Bar, Ariel
AU - Yanai, Chen
AU - Shapira, Bracha
AU - Rokach, Lior
N1 - Publisher Copyright: © 2018 Association for Computing Machinery.
PY - 2018/7/3
Y1 - 2018/7/3
N2 - Inferring socio-demographic attributes of users is an important and challenging task that could help with personalization, recommendation,advertising,etc.Sensor data collected from mobile devices can be utilized for inferring such attributes. Previous works have focused on combining different typesofsensors,such as applications, accelerometer, GPS, battery,and many others,to achieve this task. In this study, we were able to infer attributes,such as gender, age, marital status, and whether the user has children, using solely the GPS sensor. We suggest a novel inference technique, which learns an embeddingrepresentation of preprocessedspatial GPS trajectoriesusing an adaption of the Word2vec approach. Based on the embedding representation, we later train multiple classification models to achieve the inference goals.Our empirical results indicate that the suggested embedding approach outperformsaclassification approach which does not takeinto consideration the embedding patterns.Experiments on real datasets collected from Android devices show that the proposed method achieves over 80% accuracy for variousdemographic prediction tasks.
AB - Inferring socio-demographic attributes of users is an important and challenging task that could help with personalization, recommendation,advertising,etc.Sensor data collected from mobile devices can be utilized for inferring such attributes. Previous works have focused on combining different typesofsensors,such as applications, accelerometer, GPS, battery,and many others,to achieve this task. In this study, we were able to infer attributes,such as gender, age, marital status, and whether the user has children, using solely the GPS sensor. We suggest a novel inference technique, which learns an embeddingrepresentation of preprocessedspatial GPS trajectoriesusing an adaption of the Word2vec approach. Based on the embedding representation, we later train multiple classification models to achieve the inference goals.Our empirical results indicate that the suggested embedding approach outperformsaclassification approach which does not takeinto consideration the embedding patterns.Experiments on real datasets collected from Android devices show that the proposed method achieves over 80% accuracy for variousdemographic prediction tasks.
KW - DeepLearning
KW - Embedding
KW - Trajectories
KW - Word2vec
UR - http://www.scopus.com/inward/record.url?scp=85051720969&partnerID=8YFLogxK
U2 - https://doi.org/10.1145/3209219.3209224
DO - https://doi.org/10.1145/3209219.3209224
M3 - Conference contribution
T3 - UMAP 2018 - Proceedings of the 26th Conference on User Modeling, Adaptation and Personalization
SP - 331
EP - 339
BT - UMAP 2018 - Proceedings of the 26th Conference on User Modeling, Adaptation and Personalization
T2 - 26th ACM International Conference on User Modeling, Adaptation and Personalization, UMAP 2018
Y2 - 8 July 2018 through 11 July 2018
ER -