Introduction
To begin with, the assignment for this project was to conduct a textual analysis relating to a research question of our choice. In order to perform the textual analysis, we were asked to use the computer programming language Rstudio, to run our corpora and train our models for extensive analysis. Within Rstudio, we utilized the word2vec package which operates as a word embedding tool that computes the semantic linguistic relationships between words by giving words a “number”. Through the different tools in word2vec, such as clustering, analogies, closest to, and plots (to name a few), we were able to train our models and perform different queries to get at our proposed research questions.
Research Question?
Initially, it was difficult for me to construct a research question. After performing several queries into RStudio I decided I wanted to explore the effect of magic on different power dynamics in children’s literature. More specifically, my research question was: How do varying associations of magic (positive and negative) affect the agency of power dynamics displayed in children’s literature? Through external research, I also chose to explore the effect of magic in children’s literature and how different magical elements enhance children’s perception of power and reality.
Data Prepping
Since I knew I wanted to focus on children’s literature, I knew I needed to find a corpus with a public domain of popular works of literature. Immediately, I went to Project Gutenberg, an online corpus with over 57,000 ebooks of several works of literature. While I was looking for children’s literature, I wanted to find pieces that I have read in my childhood and other popular works that are still relevant to today. Ultimately, I constructed a corpus composed of 22, 19th-21st-century children’s literature texts to research for this assignment.
List of the ebooks that I used for my corpus
Metadata
In order to prepare my data for Rstudio, I began by copying and pasting the texts into Oxygen XML Editor in order to remove the metadata from the Project Gutenberg website. After removing the metadata from the texts, I also removed the asterisks, illustration descriptions, chapter numbers, and page numbers. I chose to remove this data in order for my data to focus solely on the texts, and the semantics of the actual story rather than the other common terms found in every form of literature. After stripping the metadata from the texts, I compiled all of the .txt files into a folder and compressed the file into a .zip file in order to upload unto Rstudio to begin to train my model.
Data Collection
I wanted to have a broad perspective of the semantics in children’s literature, so I began my data collection by running the clustering tool in Rstudio. Under the parameters of 10 random clusters, with 10-word associations, I came across this initial cluster:
I was particularly interested in the sixth group because I thought that it related closely to my research question regarding power dynamics, which happened to be in relation to class. I decided to look at words closest to “servant”, to see what relationships would appear.
It was intriguing to see that the word closest to “servant” was “faithful”. I continued to look at the terms and I noticed that “king’s” was one of them, which for me referred to a sense of ownership and power. From here I queried more terms relating to gender: women, men, queens (which may be found in Appendix A below). In one of my searches regarding women and magic, I came across Glinda, the good witch from The Wonderful Wizard of Oz. After realizing that Glinda is a form of “positive magic” in the book, I wanted to see how men and women are portrayed in relation to magic in children’s literature. I decided to perform these queries:
I thought it was interesting that words closest to “men + magic” include words that related to animals, such as beasts, lions, and snakes. On the other hand, “women + magic” included words such as fortunes, tales, witches, and riches. Coincidently, the term “possessed” appeared for both queries, so I decided to search for the closest terms to “possessed”.
Two terms that took me by surprise were “attractive” and “virtue”. The other words such as “mirror, fortune, riches, and information” were all relative to “women +magic”. While the terms “creatures, persons, gained, and haunted” related closely to the terms from “men + magic”. In order to understand how power was relevant to the different perceptions of magic that are gendered in children’s literature, I decided to query the terms “virtue + attractive”.
After performing this query, I noticed that virtuous and attractive components of children’s literature are semantically close to areas of importance and honor, but also the term “possessed” appeared again, emphasizing the relationship to magical beings (both men and women) in the stories.
Data Analysis
Following my several searches, queries, and word clouds, I was able to discern from my data that magic in children’s literature is not always in positive and negative forms or good and dark, instead, it may be attached to gender roles. Men in children’s literature may have magic that relates to more animal-like qualities, while women are more closely related to witches, fortunes, and tales. I was also able to conclude from my data that magic is generally a source of power in works of children’s literature, and the characters that “possess” different forms of magic may be considered honorable, influential, and of importance to the rest of the book. Similarly, in the journal article Where Theories of Mind Meet Magic, the authors point out that “children as young as 3 understand that one cannot physically transform an object ‘just by thinking about it’ and it takes actual physical contact to achieve such results” (Woolley et al. 572). From this reading, as well as the results I received from my queries using word2vec, I am able to conclude that children are able to comprehend from children’s literature that characters of importance have to possess certain powers in order to be an influential figure and perform magic. Of course, through more research and a different approach to queries, there may be alternate agencies of power in children’s literature rather than just possessing a form of magic and virtuous qualities. Therefore, I leave this project to be continued for others to not only question the use of magic as an agency of power in children’s literature but to also forward my results and explore a different realm of ideas.
Works Cited
Woolley, Jacqueline D., et al. “Where Theories of Mind Meet Magic: The Development of Children’s
Beliefs about Wishing.” Child Development, vol. 70, no. 3, 1999, pp. 571–587. JSTOR
Hart, Michael S. Gutenberg. 1 Dec. 1971, https://www.gutenberg.org/. Accessed 12 Nov. 2018.
Appendix A (other queries)
Appendix B (word clouds – word associations in popular books)