Polarity analysis of texts using discourse structure

B.M.W.T. Heerschop, F. Goossen, A.C. Hogenboom, F. Frasincar, U. Kaymak, F.M.G. Jong, de

Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

92 Citaten (Scopus)

Samenvatting

Sentiment analysis has applications in many areas and the exploration of its potential has only just begun. We propose Pathos, a framework which performs document sentiment analysis (partly) based on a document’s discourse structure. We hypothesize that by splitting a text into important and less important text spans, and by subsequently making use of this information by weighting the sentiment conveyed by distinct text spans in accordance with their importance, we can improve the performance of a sentiment classifier. A document’s discourse structure is obtained by applying Rhetorical Structure Theory on sentence level. When controlling for each considered method’s structural bias towards positive classifications, weights optimized by a genetic algorithm yield an improvement in sentiment classification accuracy and macro-level F1 score on documents of 4.5% and 4.7%, respectively, in comparison to a baseline not taking into account discourse structure.
Originele taal-2Engels
TitelProceedings of the 20th ACM Conference on Information and Knowledge Management (CIKM 2011), 24-28 October 2011, Glasgow, UK
Plaats van productieNew York
UitgeverijAssociation for Computing Machinery, Inc
Pagina's1061-1070
ISBN van geprinte versie978-1-4503-0717-8
DOI's
StatusGepubliceerd - 2011
Evenementconference; 20th ACM Conference on Information and Knowledge Management (CIKM 2011); 2011-10-24; 2011-10-28 -
Duur: 24 okt. 201128 okt. 2011

Congres

Congresconference; 20th ACM Conference on Information and Knowledge Management (CIKM 2011); 2011-10-24; 2011-10-28
Periode24/10/1128/10/11
Ander20th ACM Conference on Information and Knowledge Management (CIKM 2011)

Vingerafdruk

Duik in de onderzoeksthema's van 'Polarity analysis of texts using discourse structure'. Samen vormen ze een unieke vingerafdruk.

Citeer dit