Skip to main navigation Skip to search Skip to main content

A generic unsupervised method for decomposing multi-author documents

Research output: Contribution to journalArticlepeer-review

Abstract

Given an unsegmented multi-author text, we wish to automatically separate out distinct authorial threads. We present a novel, entirely unsupervised, method that achieves strong results on multiple testbeds, including those for which authorial threads are topically identical. Unlike previous work, our method requires no specialized linguistic tools and can be easily applied to any text.

Original languageEnglish
Pages (from-to)2256-2264
Number of pages9
JournalJournal of the American Society for Information Science and Technology
Volume64
Issue number11
DOIs
StatePublished - Nov 2013

Keywords

  • machine learning
  • natural language processing
  • text mining

ASJC Scopus subject areas

  • Software
  • Information Systems
  • Human-Computer Interaction
  • Computer Networks and Communications
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'A generic unsupervised method for decomposing multi-author documents'. Together they form a unique fingerprint.

Cite this