The article presents a multidimensional model of Sebastian Unger’s idiostyle built with corpus analysis and natural language processing (NLP) methods. The study follows a structured approach to analyzing authorial style that comprises text certification, topic modeling, and stylometric evaluation. A subcorpus of Unger’s texts (SPU) was compiled and processed automatically using word vectorization (Word2Vec, TF-IDF), topic modeling (LDA, BERT), syntactic and morphological analysis, and emotional modeling (sentiment analysis). The analysis reveals clear stylistic markers in Unger’s work, including metaphorical structures, fragmentary composition, a predominance of expressive vocabulary, and characteristic syntactic patterns. The author’s poetry gravitates toward the themes of “nature”, “myth”, and “philosophy”, as confirmed by thematic clustering and key-concept analysis. The proposed corpus methodology makes it possible to automate the identification of an author’s style, provides a quantitative assessment of the author’s linguistic features, and opens new perspectives for digital stylometry and authorship attribution.
Keywords
Idiostyle; Corpus Analysis; NLP; Topic Modeling; Automated Text Analysis; Emotional Modeling; Stylometry; Sebastian Unger
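As an illustration of the TF-IDF weighting named in the abstract, the following minimal sketch scores terms by how distinctive they are for one text relative to the others. It is not the study's implementation, which the abstract does not describe. The function name `tf_idf`, the unsmoothed logarithmic IDF, and the toy English documents are all assumptions made for this example.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumption: plain relative term frequency times unsmoothed log IDF.
    Library implementations often apply smoothing and normalization.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in tf.items()
        })
    return weights

# Hypothetical toy documents, not drawn from the SPU subcorpus.
docs = [
    "the lamb grows on the stalk".split(),
    "the horse is its own rider".split(),
    "the roofs slope down to the river".split(),
]
weights = tf_idf(docs)

# Terms that occur in every document (here "the") receive weight 0,
# so the highest-ranked terms are those specific to one text.
top_terms = sorted(weights[0], key=weights[0].get, reverse=True)[:3]
print(top_terms)
```

In a stylometric setting, the same ranking applied to an author's subcorpus against a reference corpus would surface candidate marker vocabulary, which could then be inspected manually.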