Light syntax parsing and fuzzy systems for rhetorical structure theory text segmentation

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present fuzzy boundaries, a method of segmenting text whereby we consider a population of boundaries to be in a fuzzy-state. Furthermore, we aim to use this method to present a multifaceted segmentation approach that is applicable across various domains that require the use of text segmentation: Text summerisation, Rhetorical Stricture Theory, sentence-based segmentation, paragraph-based segmentation and search-algorithms that require topic-based querying. The work first outlines the subject domain together with previous work surrounding the topic of text segmentation. We move to discussing the rationale, exploring our motivations for such a method. The model and its composition is then described next along side the direction taken in the implementation of the model. We move to the results and performance of our method concluding finally with a discussion on the benefits and justifications for our propositions.

Original languageEnglish
Title of host publication2021 International Conference on INnovations in Intelligent SysTems and Applications, INISTA 2021 - Proceedings
EditorsZeynep Hilal Kilimci, Tulay Yildirim, Vincenzo Piuri, Ireneusz Czarnowski, David Camacho, Yannis Manolopoulos, Serdar Solak
PublisherInstitute of Electrical and Electronics Engineers
Number of pages6
ISBN (Electronic)9781665436038
ISBN (Print)9781665411622
DOIs
Publication statusPublished - 30 Sep 2021
Event2021 International Conference on Innovations in Intelligent SysTems and Applications - Kocaeli, Turkey
Duration: 25 Aug 202127 Aug 2021

Conference

Conference2021 International Conference on Innovations in Intelligent SysTems and Applications
Abbreviated titleINISTA 2021
Country/TerritoryTurkey
CityKocaeli
Period25/08/2127/08/21

Keywords

  • fuzzy systems
  • rhetorical structure theory
  • text segmentation

Fingerprint

Dive into the research topics of 'Light syntax parsing and fuzzy systems for rhetorical structure theory text segmentation'. Together they form a unique fingerprint.

Cite this