Technische Universität Darmstadt (Technical University of Darmstadt)
Universität Hamburg (University of Hamburg)
Technische Universität Hamburg
Universität Hamburg (University of Hamburg)
Technische Universität Hamburg
Sentiment analysis for literary texts Sentiment analysis has been used for fictional texts for about 15 years now, with a focus on plots and the development of characters.1Despite some interesting findings, sentiment analysis methods seem not to be directly deployable for literary text analysis yet. In addition to general problems (creation of suitable dictionaries, recognition of negations and other dependency-related changes in meaning), the high complexity of literary language and text structure as well as the considerably higher number of potential opinion targets or holders of sentiment (cf. Liu 2012: 18) are problematic.In comparison to the still challenging high complexity of language (i.e., metaphors, irony, and other forms of improper speech), tackling text structure and sentiment holders seems more straightforward. A feasible use of sentiment analysis rooted in literary studies should therefore combine (i) simple, but established procedures for sentiment analysis with (ii) more elaborate understandings of texts and their entities.We tried to implement this by developing the approach presented in this contribution. With two adjustments we made sentiment analysis more adequate for literary text analysis:Segmentation: Since text segmentation should correspond to hermeneutic text approaches, we only analyze segments that are relevant for character analysis (for now: character speech) and combine them with thematic aspects.3Sentiment sensitivity: Due to the problems discussed, we conduct a comparatively simple sentiment analysis ignoring the positivity or negativity of sentiment values. We thus determine what we regard as sentiment sensitivity. Thereby, we avoid both attributing false sentiment values to negations or improper speech and positive and negative values cancelling each other out.ImplementationFor our two experiments we automatically extracted direct speech and manually annotated speakers for six exemplary texts from corpora of the hermA project4. We also created the semantic fields 'state and society' and 'illness'.5 Then a sentiment value was determined for each speech segment consisting of the sum of all values provided by the SentiWS sentiment lexicon (Remus et al. 2010), normalized by their number. Finally, speech segments including words from the pertaining semantic fields were identified. Figure 1: example of sentiment analysis in character speech based on semantic fields (Zeh: Corpus Delicti, crosses indicating semantic field segments)Thus, we analyzed when, to what extent and with which sentiment values character speak as well as their relation to certain topics. The results concerning our research interests are sketched briefly below.Example 1: Gender and IllnessIn our first experiment with novels from around 1900, general sentiment sensitivity resulted higher for male than for female characters (cf. Table 1 and Figure 2). Interestingly, this difference almost levels out in segments containing the semantic field 'illness'. Therefore, while showing general differences between female and male speech around 1900, our sentiment approach proves useful for looking closer at the relation between gender and certain topics as well. Table 1: Overview of annotations and sentiment scores of Aus guter Familie, Das kleine Glück und Ellernklipp Figure 2: sentiment sensitivity of female and male characters from Aus guter Familie, Das kleine Glück und Ellernklipp (general and semantic field (sf)-related values) Example 2: Dystopias as GenreIn the second experiment we examined whether in contemporary dystopian novels the presumably crucial semantic field of state and society is sentiment sensitive. The values in Table 2 show that character speech about state and society in dystopias tends to be emotionalized. Thus, topic-related sentiment sensitivity seems to be an interesting feature for the automatic detection of dystopia as a genre.Table 2: sentiment sensitivity of character speech (with and without affiliation to the semantic field of ‘state and society’) in dystopian novelsIn summary, our concept of sentiment sensibility, as calculated by sentiment analysis, seems to be an appropriate way of dealing with the ambiguity of literary texts and the interests of literary scholarship.
If this content appears in violation of your intellectual property rights, or you see errors or omissions, please reach out to Scott B. Weingart to discuss removing or amending the materials.
In review
Hosted at Carleton University, Université d'Ottawa (University of Ottawa)
Ottawa, Ontario, Canada
July 20, 2020 - July 25, 2020
475 works by 1078 authors indexed
Conference cancelled due to coronavirus. Online conference held at https://hcommons.org/groups/dh2020/. Data for this conference were initially prepared and cleaned by May Ning.
Conference website: https://dh2020.adho.org/
References: https://dh2020.adho.org/abstracts/
Series: ADHO (15)
Organizers: ADHO