A Dataset of 10,000 Situations for Research in Computational Social Sciences Psychology and the Humanities

Yair Neuman, Yochai Cohen

Research output: Contribution to journal, Article, peer-review

Abstract

It has been realized that situational dimensions, as represented by human beings, are crucial for understanding human behavior. The Riverside Situational Q (RSQ) is a tool that measures the psychological properties of situations. However, the RSQ-4 includes only 90 items and may have limited use for researchers interested in measuring situational dimensions using a computational approach. Here we present a corpus of 10,000 artificially generated situations corresponding mostly with the RSQ-4. The dataset was generated using GPT, the state-of-the-art large language model. The dataset validity is established through inter-judge reliability, and four experiments on large datasets support its quality. The dataset and the code used for generating 100 situational dimensions may be useful for researchers interested in measuring situational dimensions in textual data.

Original language: American English
Article number: 505
Journal: Scientific Data
Volume: 10
Issue number: 1
State: Published - 1 Dec 2023

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Education
  • Library and Information Sciences
  • Statistics and Probability
  • Computer Science Applications
  • Statistics, Probability and Uncertainty
