The Four Elements

These visualizations show the results of an analysis of the presence of the four elements in António Ramos Rosa’s poetry. The values are expressed as absolute frequency (integers) and relative frequency (percentages). The quantitative analysis was carried out after removing stop words and lemmatizing the corpus. Removing stop words leaves articles, conjunctions and other irrelevant words out of the analysis. The lemmatization allows, for example, the 1540 occurrences of the term “água” [“water”] to be counted in both singular and plural. Consisting of 79 books, the corpus includes 391,890 words, reduced to 181,291 after removing the stop words. The analyses were developed in R language within the RStudio environment, and the visualizations were produced using RAWGraphs. The code is available for inspection and reuse (↓ script R).

Terms of the Four Elements

What presence do the four elements have in António Ramos Rosa’s poetry?

data


What is the distribution of the four elements by book?

data


Which terms of the four elements appear in most books?

data


Terms associated with the Four Elements

What is the expression of the terms related to the four elements?

data


What presence do the four elements have if we consider an expanded set of related terms?

data


What is the distribution of the related terms by book?

data


Which terms associated with the four elements appear in most books?

data