Abstract
Transcriptions of phone calls are of significant value across diverse fields, such as sales, customer service, healthcare, and law enforcement. Nevertheless, the analysis of these recorded conversations can be an arduous and time-intensive process, especially when dealing with long and multifaceted dialogues. In this work, we propose a novel method, which we name SegLLM, for efficient and accurate call segmentation and topic extraction. SegLLM is composed of offline and online phases. The offline phase is applied once to a given list of topics and involves generating a distribution of synthetic sentences for each topic using a large language model (LLM). The online phase is applied to every call separately and scores the similarity between the transcripted conversation and the topic anchors found in the offline phase. The proposed paradigm provides an accurate and efficient method for call segmentation and topic extraction that does not require labeled data, thus making it a versatile approach applicable to various domains.
Original language | English |
---|---|
Title of host publication | ICASSP |
Subtitle of host publication | 2024 IEEE International Conference on Acoustics, Speech and Signal Processing |
Pages | 11361-11365 |
Number of pages | 5 |
DOIs | |
State | Published - 2024 |
Event | 49th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Seoul, Korea, Republic of Duration: 14 Apr 2024 → 19 Apr 2024 |
Conference
Conference | 49th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 |
---|---|
Country/Territory | Korea, Republic of |
City | Seoul |
Period | 14/04/24 → 19/04/24 |
Keywords
- Call Segmentation
- Data mining
- LLM
- Law enforcement
- Medical services
- Oral communication
- Signal processing
- Speech processing
- Tagging
- Transformers