ΣΝΕΛ: Ένας νέος γλωσσικός πόρος για τη μελέτη της λογοτεχνίας στα ελληνικά
Abstract
This paper presents the principles and procedures involved in creating a new linguistic resource for Greek, the Corpus of Modern Greek Literature (CMGL), designed to support the systematic diachronic study of twentieth-century Greek literature. We first outline the conceptual framework behind the development of CMGL, placing it in relation to comparable resources in other languages, for which a brief overview is provided. We then describe the process of compiling the corpus, with particular emphasis on the Logios platform, developed specifically for the digitization of polytonic texts (Perifanos & Goutsos 2025). At its current stage, CMGL contains 133 works of modern Greek literature, in the polytonic or monotonic spelling system, published between 1927 and 1999, amounting to approximately 5.5 million words. The target size is 146 literary works. The corpus includes novels, short-story collections, poetry collections and theatrical plays. The article concludes with a presentation of preliminary findings that highlight the analytical possibilities offered by corpus-based stylistic methods. Specifically, we present frequency lists derived from CMGL, including lexical bundles, as well as measurements of lexical density, average sentence length and readability scores for the texts included in the corpus. In addition, we provide charts illustrating the diachronic development of grammatical and lexical forms, which point to research directions that can be further expanded.
Article Details
- How to Cite
-
Goutsos, D., Νίκα Χ., Περήφανος Κ., & Φραγκάκη Γ. (2026). ΣΝΕΛ: Ένας νέος γλωσσικός πόρος για τη μελέτη της λογοτεχνίας στα ελληνικά. Comparison, (34), 60–86. Retrieved from https://ejournals.epublishing.ekt.gr/index.php/sygkrisi/article/view/43858
- Section
- Articles

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution Non-Commercial License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g. post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (preferably in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).