<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article article-type="research-article" xml:lang="EN" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">EMLC</journal-id>
<journal-title-group>
<journal-title>Early Modern Low Countries</journal-title>
</journal-title-group>
<issn pub-type="epub">2543-1587</issn>
<publisher>
<publisher-name>Stichting EMLC, supported by Utrecht University Library Open Access Journals</publisher-name>
<publisher-loc>The Netherlands</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">emlc.19653</article-id>
<article-id pub-id-type="doi">10.51750/emlc.19653</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Performing Gender Through Dialogue: A Computational Approach to Male and Female Speech in Dutch Drama (1600-1800)</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>van der Deijl</surname>
<given-names>Lucas</given-names>
</name>
<bio><p><bold>Lucas van der Deijl</bold> is assistant professor in early modern Dutch literature at the University of Groningen. His research focuses on the translation and adaptation of theatre and philosophy in the vernacular culture of the Dutch Republic, integrating computational analysis with methods from cultural history, translation studies, and literary history.</p></bio>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Lassche</surname>
<given-names>Alie</given-names>
</name>
<bio><p><bold>Alie Lassche</bold> is a postdoctoral researcher at the Center for Humanities Computing at Aarhus University. Her research employs computational and statistical methods to analyse historical texts, with a particular interest in how the middle classes engaged with and shaped the flow of information from the early modern period through the nineteenth century.</p></bio>
</contrib>
</contrib-group>
<pub-date pub-type="epub">
<month>11</month>
<year>2025</year>
</pub-date>
<volume>9</volume>
<issue>2</issue>
<fpage>321</fpage>
<lpage>348</lpage>
<permissions>
<copyright-statement>Copyright: The Author(s).</copyright-statement>
<copyright-year>2025</copyright-year>
<license xlink:href="https://creativecommons.org/licenses/by-nc/4.0/" license-type="open-access">
<license-p>This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.</license-p>
</license>
</permissions>
<self-uri xlink:href="https://www.emlc-journal.org/articles/10.51750/emlc.19653"/>
<abstract>
<p>This article presents a computational approach to the relationship between gender and character speech in early modern Dutch drama. It evaluates the possibility of automatically classifying gender based on the speeches of 1141 characters and character groups from 98 early modern Dutch plays (1613-1786). The experiment combines three approaches to gender classification: lexical, semantic, and stylistic. The results show that each approach fails to reliably capture distinctions between male and female speech in early modern Dutch drama, in contrast to similar studies of gender distinctions in other literary corpora. The inability to measure a gender binary in Dutch dramatic discourse indicates that gender generally was not performed through the vocabularies of Dutch male and female characters. The absence of clear gender distinctions in character speech is read as a product of the persistent tradition of cross-dressing in the Dutch Republic, creating fictional realities in which gender became fluid and complex.</p>
</abstract>
<kwd-group>
<title>Keywords</title>
<kwd>gender performance</kwd>
<kwd>Dutch drama</kwd>
<kwd>computational analysis</kwd>
<kwd>cross-dressing</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<p>Until the second half of the seventeenth century, all-male casts remained the norm in the theatres of England, Germany, and the Dutch Republic. Male actors played both male and female roles, in male or female attire, or a combination of both. As a result, the gender identities on stage were not expressed as stable categories determined by the bodies of the actors who impersonated them. Instead, a character&#x2019;s gender had to be <italic>performed</italic>.<xref ref-type="fn" rid="fn1"><sup>1</sup></xref> In theatres where female roles were played by boys and men, representations of masculinity and femininity thus depended on dramatic actions, such as tone of voice, emotionality, costumes, acting styles, and mise-en-sc&#x00E8;ne. These theatrical techniques defined the performance of gender both in the reality of the theatre and the reality of the dramatic story world, where popular narrative devices such as disguise, delusion, and travesty further complicated the representation of gender roles in early modern drama.</p>
<p>Theatre historians have extensively documented how this widespread habit of cross-dressing &#x2013; by both actors and characters &#x2013; affected the performance and representation of gender on the early modern stage. Their work has repeatedly shown the complexity of gender relationships in a domain that was dominated by men, even though the notion of the &#x2018;all-male stage&#x2019; has been challenged convincingly.<xref ref-type="fn" rid="fn2"><sup>2</sup></xref> They have, for example, studied the theatrical problems that arise when boys impersonate girls, women, or adult men. Gina Bloom examines the English discourses about the use of voice on stage and the inevitable issue faced by boys going through puberty when they have to sing or recite in large, noisy theatre halls: a cracking voice.<xref ref-type="fn" rid="fn3"><sup>3</sup></xref> Louis Grijp did similar research on the Dutch situation, reconstructing the ages and roles of child actors on the stage of the Amsterdam city theatre between 1638 and 1659.<xref ref-type="fn" rid="fn4"><sup>4</sup></xref> A specialist in song culture, Grijp was particularly interested in the musical aspects of Dutch drama and tried to determine how boys and men sang when playing female parts. From a different perspective on cross-dressing, Vergeer and Van der Haven study the political satire in the Dutch morality play <italic>Tieranny van Eigenbaat</italic> (1679), reading the homoerotic desires and cross-dressing by the allegorical character Egoism (which represented Stadtholder William <sc>iii</sc>) as an allusion to Jacobite propaganda about William <sc>iii</sc>&#x2019;s supposed homosexuality.<xref ref-type="fn" rid="fn5"><sup>5</sup></xref> Studies like these help us to reconstruct the various material, physiological, and political effects of cross-dressing in early modern drama.</p>
<p>In this article we build upon the existing body of drama studies dedicated to early modern gender performance. Instead of tone of voice, costumes, or narrative devices, we focus on another important ingredient of a character&#x2019;s gender identity that is often overlooked: their language. Amanda Pipkin, Martine van Elk, Olga van Marion, Johanna Ferket, and other theatre historians have shown how gender roles on the early modern Dutch stage could question the gender norms that were predominant both in and outside the theatre.<xref ref-type="fn" rid="fn6"><sup>6</sup></xref> Despite the relatively minor roles of women on stage in the Dutch Republic, ideals attached to femininity, gendered norms surrounding sexual behaviour, and the male dominance in theatrical discourse were challenged through <italic>performances</italic> of female identities &#x2013; by either male or female actors. Since theatrical performances of femininity and masculinity primarily relied on what characters said, we test how character speech contributed to the construction of gender identities and gender distinctions in a theatrical culture in which both gender norms and social constraints for female actors were changing, while cross-dressing remained the norm even after female actors entered the stage. If the early modern Dutch theatre could simultaneously create and criticize constructions of &#x2018;maleness&#x2019; and &#x2018;femaleness&#x2019;, then we would expect distinctions between those categories to become visible in the lexical preferences, discursive features, and styles of male and female character speech. Formalising and measuring those distinctions on a comparative scale would help us to assess the function of the theatre as a space for fictional realities in which alternative gender roles and norms could emerge.</p>
<p>We used computational methods to question the relationship between gender and character speech based on 1141 characters from 98 early modern Dutch plays first printed and/or staged between 1613 and 1786. We combine and evaluate three textual approaches to gender classification: a lexical approach using word-document matrices, a semantic approach using large language models, and a stylistic approach using computational stylometry.<xref ref-type="fn" rid="fn7"><sup>7</sup></xref> Our findings show that each approach can only reliably capture distinctions between male and female speech from early modern Dutch drama in particular circumstances. We interpret this inability as an indication that &#x2013; other than in the case of &#x2018;stereotypical&#x2019; registers concerning for example love, family, warfare, and politics &#x2013; gender generally was not marked explicitly in the words and/or styles of male and female characters from Dutch plays. We argue that the unmarked nature of gender in the speech of characters is explained by the persistent tradition of cross-dressing in the Dutch Republic, and should be read in the context of the Republic&#x2019;s culture, where discussions in many public spaces were generally not segregated into gender groups.</p>
<sec id="s1">
<title>Gender Performance on the Dutch Stage</title>
<p>Compared to the French, Italian, and Spanish theatre traditions, which had normalised female actors in the sixteenth and early seventeenth century, the Dutch were relatively late in allowing women on the stage. Ariana van den Bergh (1626/1628-1661) famously broke the taboo in 1655, when she played her namesake Ariane in <italic>Onvergelijkelijke Ariane</italic> (1644), a play by Jan Jacobsz Schipper. She became a celebrity immediately after her first performance in the Schouwburg, paving the way for other Dutch actresses such as Susanna van Lee, Elisabeth de Baer, Alida Molensteen, Emerentia Veltzen, Joanna Vissers, Anna de Prendre, Maria Besems, Cornelia de Vlieger, Van den Bergh&#x2019;s daughter Maria Nozeman, and Van Lee&#x2019;s daughter Adriana Eeckhout.<xref ref-type="fn" rid="fn8"><sup>8</sup></xref> Many actresses had marital or family ties to professional male actors.<xref ref-type="fn" rid="fn9"><sup>9</sup></xref> Women had occasionally performed on Dutch stages before 1655, usually as singers or in travelling theatre companies, but Van den Bergh&#x2019;s debut in the spotlights of the prestigious theatre at the Keizersgracht marked a clear break in Dutch theatre history.</p>
<p>However, Ariana&#x2019;s stage debut did not mean that female characters were no longer embodied by men after 1655. The stage continued to be dominated by male bodies, either in male or female dress. Reading the <italic>Personageboek</italic>, a unique source which documented the distribution of roles among actors in the theatre season of 1658-1659, Louis Grijp concluded that in this year the main female roles were usually played by the professional actresses Van den Bergh, Van Lee, and Kalbergen. Other female characters, however, were mostly played by men. Genre seemed to have played a role, since male impersonators of female characters were more common in farces than in tragedies, possibly because of the more sexual nature of those roles.<xref ref-type="fn" rid="fn10"><sup>10</sup></xref> Furthermore, cross-dressing was not limited to male actors: in some cases, Van Lee and Van den Bergh played minor male roles.<xref ref-type="fn" rid="fn11"><sup>11</sup></xref> There is also evidence that female roles could be played by either a male or a female actor depending on the staging, which again confirms the flexible approach to gender roles during the early years of Dutch female acting. Even in singing roles, there was not a necessary connection between the gender of the characters and the gender of the actors. In the seven stagings of Vondel&#x2019;s <italic>Gysbreght van Aemstel</italic> (1637) in the 1658-1659 season for example, the dramatic <italic>Rey van Klaerissen</italic>, voiced by a group of nuns whose lament of the biblical Massacre of the Innocents allude to their own cruel rape and murder reported in the fifth act, was sung by male singers.<xref ref-type="fn" rid="fn12"><sup>12</sup></xref></p>
<p>The introduction of female actors not only affected the theatre&#x2019;s stage, but also its ticket office. In the archives of the <italic>Schouwburg</italic> in Amsterdam, digitised in the ONSTAGE project, Olga van Marion and Frans Blom found evidence of a steep increase in the theatre&#x2019;s annual income after Ariana van den Bergh&#x2019;s debut on stage in 1655.<xref ref-type="fn" rid="fn13"><sup>13</sup></xref> The opportunity to see women performing apparently drew big crowds to the Keizersgracht. By counting the female roles in several plays, Van Marion concludes that their share increased after female actors entered the stage, possibly as a result of the financial incentive to create more space for women in the story worlds of plays.<xref ref-type="fn" rid="fn14"><sup>14</sup></xref> She also tested (but could not confirm) the hypothesis that female playwrights were more inclined to attribute more lines to female characters than their male colleagues.<xref ref-type="fn" rid="fn15"><sup>15</sup></xref> In this article, we will complement her findings with new statistics on the visibility and characteristics of female voices on stage.</p>
<p>Besides female actors, female authors also played a role in the changing performances of female identities. A few female playwrights, including Katharyne Lescailje, Catharina Questiers, and Catharina Verwers Dusart, claimed their position in the male-dominated domain of the theatre by writing plays that were staged in the Schouwburg. In her readings of the first Dutch plays authored by women, Amanda Pipkin recognises a tendency among female playwrights to rewrite the moral codes concerning sexual misbehaviour. Through their dramatic and poetic criticism of gender relationships, authors such as Verwers and Questiers rehabilitated women and rejected &#x2018;the most injurious stereotype that women were the source of all sexual immorality&#x2019;.<xref ref-type="fn" rid="fn16"><sup>16</sup></xref> Martine van Elk has shown how Verwers and Questiers in their plays negotiated early modern ideals about public and private femininity, in dialogue with the sources of their adaptations, which originated in the work of Jacob Cats and Spanish authors Antonio Enr&#x00ED;quez G&#x00F3;mez, F&#x00E9;lix Lope de Vega, and Miguel de Cervantes.<xref ref-type="fn" rid="fn17"><sup>17</sup></xref> Johanna Ferket contributed to this debate by analysing the female roles in (male-authored) Dutch farces, arguing that their female characters often engage in different discourses than those of their male counterparts. Even in farces, female characters were not limited to sexual stereotypes but often mimicked female roles and female positions in everyday life and everyday discussions, for example allowing &#x2018;women to speak out on stage about their roles as mothers&#x2019;.<xref ref-type="fn" rid="fn18"><sup>18</sup></xref></p>
<p>These readings highlight the way both female and male playwrights used the stage as a space where gender relationships and early modern ideals of femininity and girlhood could be questioned and reimagined.<xref ref-type="fn" rid="fn19"><sup>19</sup></xref> Meanwhile, the increasing presence and participation of women before and behind the scenes seems to have affected the role of Dutch female characters and the representation of gender relationships. But how exactly? We lack an empirical understanding of the historical consequences of this increased visibility and participation of women in the theatre. Does a more equal (or less unequal) gender balance lead to more pronounced distinctions between gender roles, or, conversely, to more fluid boundaries between male and female identities? After the gradual normalisation of female actors on the Dutch stage in 1655, gender performativity possibly became more rather than less fluid. While female characters were always men in female dress before the 1650s &#x2013; which may have guaranteed a certain stability and conventionality in the performance of gender &#x2013; in the second half of the seventeenth century male and female bodies could be impersonated by men or women. There must have been stagings where female characters played by men interacted with female characters played by women, alongside male characters embodied by men, and female actors in male dress. The common dramatical use of disguises and false identities thus acquired a new layer of complexity, especially in the many comedies and other plays that made gender switching a driving force of their plot. A well-documented example is the role of Rozaura in the Dutch adaptation of Pedro Calder&#x00F3;n de la Barca&#x2019;s <italic>La vida es sue&#x00F1;o</italic> (1636), titled <italic>Sigismundus, prin&#x00E7;e van Poolen</italic> (1654). In later stagings of that play, Ariana van den Bergh first played a male traveller in drag, then switched to a court lady, and finally switched again to a (male) soldier.<xref ref-type="fn" rid="fn20"><sup>20</sup></xref></p>
<p>This flexibility in the performance of gender roles leads us to formulate two, contradicting hypotheses: it made gender roles on the Dutch stage either more stereotypical (to compensate for their lack of clarity), or less pronounced (because apparently gender was not so strictly defined). As a historical and comparative question, this research problem requires a large sample that is illustrative, if not representative, of dramatic productions in the studied period. Our selection of ninety-eight plays is too limited for that purpose, but it offers a solid starting point for further analysis from both a historical and a transnational perspective. We consider our analysis of the relationship between gender and character speech in early modern Dutch drama as a first step towards a more comprehensive history of changing gender roles on both Dutch and European stages.</p>
</sec>
<sec id="s2">
<title>Corpus and Data</title>
<p>The corpus was extracted from DutchDraCor, a collection of <sc>tei</sc>-encoded editions of early modern Dutch drama that contained ninety-eight plays at the moment of extraction (April 2024).<xref ref-type="fn" rid="fn21"><sup>21</sup></xref> All selected plays were first printed between 1613 and 1786. DutchDraCor is one of the subcorpora integrated into the Drama Corpora Project (DraCor), an infrastructure and &#x2018;programmable corpus&#x2019; that facilitates (cross-lingual and longitudinal) computational analysis of theatre corpora in several European languages.<xref ref-type="fn" rid="fn22"><sup>22</sup></xref> Each play included in DraCor has been encoded in all elements from the hierarchical dramatic structure: acts, scenes, speeches, and lines. All speeches are marked by a character <sc>id</sc> and all characters are labelled with a gender code: female, male, unknown, or other.</p>
<p>Our selection includes the largest sample of fully encoded Dutch plays currently available, even though it contains only a fraction of the total production of Dutch drama from the seventeenth and eighteenth centuries. There is no complete overview of all Dutch plays printed in this period, but there must have been several hundred. Mieke Smits-Veldt has identified more than 180 discrete tragic plays published in Amsterdam or by Amsterdammers between 1600 and 1650.<xref ref-type="fn" rid="fn23"><sup>23</sup></xref> Her estimation was based on Hubert Meeus&#x2019;s <italic>Repertorium van het ernstige drama in de Nederlanden 1600-1650</italic>, which documented no fewer than three hundred plays from the Low Countries in this period.<xref ref-type="fn" rid="fn24"><sup>24</sup></xref> The online database of the Amsterdam City Theatre from the aforementioned ONSTAGE project records 999 distinct Dutch plays first printed in the period studied in this article (1613-1786).<xref ref-type="fn" rid="fn25"><sup>25</sup></xref> We consider a sample of ninety-eight plays (ca. 10 percent of the relevant plays included in the ONSTAGE database) sufficient to develop an informed hypothesis about the relationship between gender and speech in Dutch drama in the seventeenth and eighteenth century. In the future, our analysis will have to be replicated and complemented from larger samples.</p>
<p>The corpus contains plays from six different genres: sixty-three tragedies, seventeen comedies, eight farces, five pastoral plays, three tragicomedies, and two morality plays. All genre labels were derived from the titlepages or other paratexts from the original editions. Eighteen plays (18.4 percent) were written by female authors: seven by Katharyne Lescailje, seven by Lucretia Wilhelmina van Merken, three by Catharina Questiers, and one by Catharina Verwers. The plays are not equally distributed over the studied period: the current state of DutchDraCor has a bias towards the seventeenth century (<xref ref-type="fig" rid="fg001">fig. 1</xref>).</p>
<fig id="fg001" position="float">
<label>Fig. 1</label>
<caption><p>Number of Dutch plays per decade, 1600-1790.</p>
<p>Source: DutchDraCor, version April 2024.</p></caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="figures/emlc.19653_fig1.jpg"/>
</fig>
<p>We extracted all speeches from all plays and grouped them by character and by gender, excluding characters with gender code &#x2018;unknown&#x2019; or &#x2018;other&#x2019;. This resulted in a dataset of 37,976 speeches by 1141 characters or single-gender character groups (such as <italic>reien</italic>, choruses). A large majority (70.3 percent) of the characters is male, which also means that male speeches outnumber female speeches by a factor of 2.2 (<xref ref-type="table" rid="tb001">tab. 1</xref>).</p>
<table-wrap id="tb001">
<label>Tab. 1</label>
<caption><p>Volume of character speech by gender.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Gender</th>
<th align="left" valign="top">Characters</th>
<th align="left" valign="top">Speech turns</th>
<th align="left" valign="top">Lines</th>
<th align="left" valign="top">Tokens</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Male</td>
<td align="left" valign="top">802</td>
<td align="left" valign="top">26,167</td>
<td align="left" valign="top">112,419</td>
<td align="left" valign="top">945,949</td>
</tr>
<tr>
<td align="left" valign="top">Female</td>
<td align="left" valign="top">339</td>
<td align="left" valign="top">11,809</td>
<td align="left" valign="top">54,840</td>
<td align="left" valign="top">448,503</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Source: DutchDraCor, version April 2024.</p>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="s3">
<title>Method</title>
<p>In theory, character speech can be gendered in various ways: male characters might use different words than female characters; they could act in different social situations and therefore speak about different topics and in different discourses; or they could speak in different styles. Although these textual features are likely to overlap to a certain degree &#x2013; a different style usually leads to different lexical preferences &#x2013; we differentiate between the lexical, semantic, and stylistic level of character speech in order to capture the complexity of potential gender differences in dramatic language. Each level requires its own computational approach, focusing on relative frequencies of content words (vocabulary), semantic connotations of words and sentences (semantics), and patterns of the most frequent (function) words (style).</p>
<p>In both the lexical and semantic approach, we approach the question whether there is a relationship between the gender of a character and their speech as a classification task. We use machine-learning techniques to predict or determine the gender of a character in a text based solely on the language used within that text. This is a popular method in the field of quantitative text analysis and has been applied to several contexts. Examples are gender identification of the authors of blog posts, tweets, e-mails, and forum messages.<xref ref-type="fn" rid="fn26"><sup>26</sup></xref> In the context of literary studies, the study of Hota et al. on gendered speech in Shakespeare theatre texts is noteworthy.<xref ref-type="fn" rid="fn27"><sup>27</sup></xref> More recent studies in which a similar task is executed include the work of Sven Vitse, who analysed gendered character descriptions in a corpus of modern Dutch novels.<xref ref-type="fn" rid="fn28"><sup>28</sup></xref></p>
</sec>
<sec id="s4">
<title>The Lexical Approach</title>
<p>The central question is thus whether we can predict the gender of a character based on their speech text. Our first attempt to answer that question focused on lexical preferences by characters, or simply: their vocabulary. We trained and evaluated a Term Frequency-Inverse Document Frequency (<sc>tf</sc>-<sc>idf</sc>) classification model using logistic regression. With <sc>tf</sc>-<sc>idf</sc>, the importance of a word is determined by weighing its occurrence in the document and computing how often the same word occurs in other documents. A word frequency matrix is built of all words in the corpus, in this case of all the words in all speech texts. Prior to that, some preprocessing steps are taken to prepare the data.</p>
<p>The procedure is as follows. First, we group the speeches of every character. Next, we make chunks of equal size from these speech texts, experimenting with different chunk sizes. This means that if the speeches of character X have a total length of ten thousand words, they are chunked in twenty fragments of five hundred words, or in twenty-five fragments of four hundred words. With this step we prevent bias being introduced to the model by the varying lengths of speech texts. Punctuation is removed from the speech texts, after which the fragments are lemmatised (converting all inflected words to their lemma or dictionary form) to decrease the number of unique words in the corpus and thus reduce the size of the <sc>tf</sc>-<sc>idf</sc> matrix.</p>
<p>A balanced dataset is created, meaning there are as many speech texts by male speakers as female speakers. The size of the dataset depends on the chunk size &#x2013; larger chunks result in a smaller dataset. We create random samples for both genders, using 80 percent for training and 20 percent for testing. The TfIdfVectorizer from the Python package scikit-learn is used to transform the text data, and a LogisticRegression model is trained on the training set across fifty iterations. The trained model is then used to make predictions on the test set.</p>
</sec>
<sec id="s5">
<title>The Semantic Approach</title>
<p>The semantic approach builds on the lexical approach, with the important difference that this time texts are represented using a transformer model, also known as a <sc>bert</sc> model. This is a type of machine-learning model that processes text and generates representations, called embeddings, which capture the meaning and structure of documents. These embeddings encode both the semantic content &#x2013; what the text is about &#x2013; and stylistic features, such as tone or writing style, allowing the model to understand and compare documents based on deeper linguistic patterns. This makes transformer models highly effective for tasks like clustering or categorising texts. There are many different transformer models, each trained on specific types of data and optimised for various tasks. The variation in training data means these models can capture different nuances, ranging from everyday language to technical or historical styles, depending on the specific context they were designed for.</p>
<p>To decide which model suits our task and corpus best, we perform a clustering task in which we test five potentially suitable models, including one historical Dutch language model, two contemporary Dutch language models, and two multilingual models.<xref ref-type="fn" rid="fn29"><sup>29</sup></xref> The clustering task includes the following steps: we restrict our corpus to the forty characters with the most speech text, and chunk their speech text into fragments of similar length. We experiment with chunk sizes of two hundred, three hundred, and four hundred tokens (word instances). We create embeddings and apply K-means clustering to see with which embeddings the text fragments of each character cluster together the best. The performance is measured using the V-score, a clustering evaluation metric that measures the harmonic mean of homogeneity and completeness. We work from the assumption that the model with the highest performance is the most suitable to encode the semantic and stylistic features of early modern Dutch theatre text. The results in <xref ref-type="table" rid="tb002">tab. 2</xref> show that the <italic>m-e5-large</italic> model consistently performs either best or second best across all experiments. However, when the chunk size is reduced to two hundred tokens the <italic>GysBERT-v2</italic> model outperforms the <italic>m-e5-large</italic> model. In order to compare the performance of both models, we continue our analysis with <italic>m-e5-large</italic> and <italic>GysBERT-v2.</italic></p>
<table-wrap id="tb002">
<label>Tab. 2</label>
<caption><p>V-scores in clustering tasks with different models, using the forty characters with the most speech text.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Model</th>
<th align="left" valign="top">Chunk size (words)</th>
<th align="left" valign="top">Sample size</th>
<th align="left" valign="top">V-score</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="3" align="left" valign="top">GysBERT-v2</td>
<td align="left" valign="top">200</td>
<td align="left" valign="top">482</td>
<td align="left" valign="top"><bold>0.7499</bold></td>
</tr>
<tr>
<td align="left" valign="top">300</td>
<td align="left" valign="top">647</td>
<td align="left" valign="top">0.7810</td>
</tr>
<tr>
<td align="left" valign="top">400</td>
<td align="left" valign="top">981</td>
<td align="left" valign="top"><underline>0.7863</underline></td>
</tr>
<tr>
<td rowspan="3" align="left" valign="top">robbert-2023-large</td>
<td align="left" valign="top">200</td>
<td align="left" valign="top">482</td>
<td align="left" valign="top">0.7084</td>
</tr>
<tr>
<td align="left" valign="top">300</td>
<td align="left" valign="top">647</td>
<td align="left" valign="top">0.7452</td>
</tr>
<tr>
<td align="left" valign="top">400</td>
<td align="left" valign="top">981</td>
<td align="left" valign="top">0.7300</td>
</tr>
<tr>
<td rowspan="3" align="left" valign="top">bert-base-dutch-cased</td>
<td align="left" valign="top">200</td>
<td align="left" valign="top">482</td>
<td align="left" valign="top">0.7054</td>
</tr>
<tr>
<td align="left" valign="top">300</td>
<td align="left" valign="top">647</td>
<td align="left" valign="top"><underline>0.7824</underline></td>
</tr>
<tr>
<td align="left" valign="top">400</td>
<td align="left" valign="top">981</td>
<td align="left" valign="top">0.7816</td>
</tr>
<tr>
<td rowspan="3" align="left" valign="top">xlm-roberta-large</td>
<td align="left" valign="top">200</td>
<td align="left" valign="top">482</td>
<td align="left" valign="top">0.6795</td>
</tr>
<tr>
<td align="left" valign="top">300</td>
<td align="left" valign="top">647</td>
<td align="left" valign="top">0.7292</td>
</tr>
<tr>
<td align="left" valign="top">400</td>
<td align="left" valign="top">981</td>
<td align="left" valign="top">0.7210</td>
</tr>
<tr>
<td rowspan="3" align="left" valign="top">m-e5-large</td>
<td align="left" valign="top">200</td>
<td align="left" valign="top">482</td>
<td align="left" valign="top"><underline>0.7493</underline></td>
</tr>
<tr>
<td align="left" valign="top">300</td>
<td align="left" valign="top">647</td>
<td align="left" valign="top"><bold>0.8138</bold></td>
</tr>
<tr>
<td align="left" valign="top">400</td>
<td align="left" valign="top">981</td>
<td align="left" valign="top"><bold>0.8081</bold></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Highest V-scores per chunk size are in bold, second highest performances are underlined.</p>
<p>Source: DutchDraCor, version April 2024.</p>
</table-wrap-foot>
</table-wrap>
<p>Preparing data for fine-tuning a <sc>bert</sc> model requires pre-processing steps that differ slightly from the pipeline described in the lexical approach. Because <sc>bert</sc> takes context into account, we do not remove punctuation, nor do we perform lemmatisation. The texts are tokenised and encoded using the pre-trained <sc>bert</sc> model to generate contextual embeddings. These embeddings are then used as input features for a Logistic Regression model, which is trained on the training set across fifty iterations. As in the lexical approach, the trained classifier is used to make predictions on the test set, enabling a direct comparison of model performance across representations.</p>
</sec>
<sec id="s6">
<title>The Stylistic Approach</title>
<p>Besides the lexical and semantic dimensions of character speech, we tested whether a character&#x2019;s gender was marked by the stylistic features of their lines. In the field of literary studies, there are myriad definitions of what literary scholars tend to call &#x2018;style&#x2019;. We specifically follow the conceptualisation used in the field of computational stylometry, which views literary style not only in qualitative terms, but also from a formal and quantitative perspective.<xref ref-type="fn" rid="fn30"><sup>30</sup></xref> Computational literary scholars have developed and tested several ways to operationalise their focus on quantifiable features of style, but in general they measure style as patterns of word frequencies in a given text that are characteristic for an author or character. They continue to engage in vibrant arguments about the distance metrics and parameters that best fit a given question, as they tend to agree that there is no one-size-fits-all approach to stylometric problems in general.<xref ref-type="fn" rid="fn31"><sup>31</sup></xref> We decided to use one of the oldest and most generic distance metrics, Burrows&#x2019;s Delta, because it often functions as a benchmark and its general applicability across different genres and use cases has been demonstrated sufficiently.<xref ref-type="fn" rid="fn32"><sup>32</sup></xref> Burrows&#x2019;s Delta was designed to normalise the weight of individual words regardless of their frequency, which enables a balanced stylometric analysis of the vocabulary despite the unequal distribution of words in any text. Stylometric distance was computed based on a small selection of the vocabulary: the hundred most frequent words (<sc>mfw</sc>) of the combined vocabulary of all plays selected for analysis. We decided to use a relatively small section of the <sc>mfw</sc> list compared to other applications of Burrows&#x2019;s Delta, because plays are relatively short texts compared to novels, with an average length of 12,273 tokens (character speech only), and because they are segmented in (even shorter) selections of male and female character speech.<xref ref-type="fn" rid="fn33"><sup>33</sup></xref> We used the R package &#x2018;Stylo&#x2019; developed by Eder et al. to compute the stylometric distance between all male character speech and all female character speech in a given play from a given oeuvre by four playwrights: Jan Harmensz. Krul, Joost van den Vondel, Katharyne Lescailje, and Lucretia Wilhelmina van Merken.<xref ref-type="fn" rid="fn34"><sup>34</sup></xref> We grouped plays by author to minimise the effect of the author&#x2019;s stylistic fingerprint.</p>
</sec>
<sec id="s7">
<title>Results <sc>i</sc>: Visibility of Male and Female Characters</title>
<p>A first step to evaluate the performance of gender is to compare the visibility and stage presence of male and female characters. We first measured the visibility of male and female characters by computing the percentage of male and female characters and the relative number of lines per gender group on the level of the play. Secondly, we computed visibility on the character level, based on the relative number of lines per character and the relative number of scenes in which each character speaks. All indicators can be normalised on a scale from 0.0 to 1.0, which enables comparisons between plays and characters from different plays. The high diversity and breadth of difference between main and side characters means the average values are of little use. Instead, we visualise the distribution of the data on the play level (<xref ref-type="fig" rid="fg002">fig. 2A</xref>, <xref ref-type="fig" rid="fg002">2B</xref>) and the character level (<xref ref-type="fig" rid="fg003">fig. 3A</xref>, <xref ref-type="fig" rid="fg003">3B</xref>) as violin plots. The horizontal bars in the belly of the violins represent the average proportion of male and female characters and lines in all plays.</p>
<fig id="fg002" position="float">
<label>Fig. 2</label>
<caption><p>(A) Proportions of male and female characters in each play, by gender group. (B) Proportions of male and female lines in each play, by gender group.</p>
<p>Source: DutchDraCor, version April 2024.</p></caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="figures/emlc.19653_fig2.jpg"/>
</fig>
<fig id="fg003" position="float">
<label>Fig. 3</label>
<caption><p>(a) Proportions of all speeches per play spoken by each character, separated by gender. (b) Proportions of all scenes per play in which each character appears, separated by gender.</p>
<p>Source: DutchDraCor, version April 2024.</p></caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="figures/emlc.19653_fig3.jpg"/></fig>
<p><xref ref-type="fig" rid="fg002">Fig. 2A</xref>, <xref ref-type="fig" rid="fg002">2B</xref> specify the dominance of male characters in the corpus. The uneven distributions visualised by the violin plots indicate that in most plays, male characters outnumber the female. In thirty-nine plays (40 percent), less than 25 percent of all characters are female, whereas there are no plays with less than 25 percent male characters. This imbalance in stage presence between the gender groups correlates with an imbalance in the discourse on stage: in eighty-one plays (82.7 percent), more than 50 percent of all lines are uttered by male characters.</p>
<p>A closer look at the visibility of the gender groups on the character level (<xref ref-type="fig" rid="fg003">fig. 3A</xref>, <xref ref-type="fig" rid="fg003">3B</xref>) reveals that female characters are not necessarily likely to be less prominent or less visible on stage than male characters. The relatively similar violin plots indicate that an average female character speaks more lines and appears in more scenes than an average male character, despite the lower stage presence of female characters overall. In other words: female characters are a minority on the early modern Dutch stage, but when they do appear, they generally tend to play larger roles than most of their male counterparts.</p>
</sec>
<sec id="s8">
<title>Results <sc>ii</sc>: Narrative Position of Male and Female Characters</title>
<p>Besides visibility, gender differences can be measured in terms of the narrative position of male and female characters. Following previous applications of network theory to narratological analyses of drama, we operationalised narrative position as the centrality in the social network of the play.<xref ref-type="fn" rid="fn35"><sup>35</sup></xref> We constructed relationships (edges) between two characters (nodes) when they appear in the same scene and if they both speak at least one line during that scene. The higher the number of scenes in which they both speak, the stronger (the weight of) their relationship. We can construct interaction networks of single plays if we accumulate all those interactions for all possible character pairs. Characters who play a key role in the story world &#x2013; i.e., who interact with many other characters &#x2013; end up in the heart of those networks. Based on these interaction networks, we can compute the normalised centrality of each character as the number of edges per character compared to all characters in the play. There are more advanced metrics for centrality such as betweenness or eigenvector centrality to operationalise an individual&#x2019;s &#x2018;power&#x2019; or &#x2018;influence&#x2019; on the network, but relative degree centrality suffices as a straightforward indication of a character&#x2019;s position in the social world of the play.<xref ref-type="fn" rid="fn36"><sup>36</sup></xref></p>
<p>A comparison between the centrality scores of male and female characters shows that the narrative position of the two gender groups is quite similar. <xref ref-type="fig" rid="fg004">Fig. 4</xref> visualises the distribution of degree centrality scores, which is similar in each gender group. The average centrality of female characters (0.60) is slightly higher than male characters (0.54), but given the large variance of centrality values in both gender groups (indicated by the oblong-like shape of both violins), the difference proves marginal. This means that on average, female characters are not more or less central than their male counterparts in the social worlds of early modern Dutch plays.</p>
<fig id="fg004" position="float">
<label>Fig. 4</label>
<caption><p>Relative degree centrality scores of each character, separated by gender.</p>
<p>Source: DutchDraCor, version April 2024.</p></caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="figures/emlc.19653_fig4.jpg"/></fig>
<p>However, as a result of the overall imbalance between male and female characters in most plays, the range of interactions is not equal between the gender groups. The polyphony that is so fundamental to theatre is in many cases limited to dialogues between men: of all the edges constructed in the social worlds of the plays, 1937 represented relationships between two male characters; 1551 of all the edges concerned a relationship between a male and a female character. Only 358 edges were female to female. Whereas these numbers should not surprise us, given the high number of male characters in the corpus, it is good to reflect on the consequences of these numbers for the question at stake in this article. When male characters dominate the interactions and therefore the discourse on stage, it is possible that dramatic discourse in general has a tendency towards &#x2018;masculine&#x2019; (socio-)linguistic norms. In the next sections, we apply various forms of computational text analysis to evaluate the existence and qualities of this supposed binary opposition between &#x2018;masculine&#x2019; versus &#x2018;feminine&#x2019; language in character speech.</p>
</sec>
<sec id="s9">
<title>Results <sc>iii</sc>: The Lexical Approach</title>
<p>In our first experiment, the lexical approach, we tested whether characters tend to have gendered lexical preferences. Such preferences could be revealed by socio-linguistic vocabularies that are characteristic of their gender group. To measure the existence of these preferences, we ran several experiments with different settings, which are listed in <xref ref-type="table" rid="tb003">tab. 3</xref>. In each experiment, character speech was fragmented into several &#x2018;chunks&#x2019; of text. The column <italic>chunk size (words)</italic> refers to the length of the speech fragments that are used. This length is connected to not only the <italic>sample size (gender)</italic> which refers to the size of the number of male and female speech fragments, but also to the <italic>test set size (gender)</italic>, which refers to the number of male and female speech fragments on which the model was evaluated. We experimented with the <italic>n-gram range</italic>: a range of (1,1) means that the model only looks at unigrams, in other words, single tokens. A range of (1,2) means that both single tokens and bi-grams (combinations of two words) are taken into account, and a range of (2,3) means that only bi-grams and tri-grams are taken into account. We have also run experiments without lemmatising the data, but since their performance was slightly lower than the experiments in which we did use lemmatisation, we only continued with experiments on lemmatised speech texts.</p>
<table-wrap id="tb003">
<label>Tab. 3</label>
<caption><p>F1-scores of the classification tasks with the TfIdf Vectorizer.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Experiment</th>
<th align="left" valign="top">Chunk size (words)</th>
<th align="left" valign="top">Sample size (per gender)</th>
<th align="left" valign="top">Test set size (per gender)</th>
<th align="left" valign="top">N-gram range</th>
<th align="left" valign="top">F1-score &#x2640;</th>
<th align="left" valign="top">F1-score &#x2642;</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">1a</td>
<td rowspan="4" align="left" valign="top">300</td>
<td rowspan="4" align="left" valign="top">1400</td>
<td rowspan="4" align="left" valign="top">280</td>
<td align="left" valign="top">(1,1)</td>
<td align="left" valign="top">0.729</td>
<td align="left" valign="top">0.726</td>
</tr>
<tr>
<td align="left" valign="top">1b</td>
<td align="left" valign="top">(1,2)</td>
<td align="left" valign="top">0.737</td>
<td align="left" valign="top">0.734</td>
</tr>
<tr>
<td align="left" valign="top">1c</td>
<td align="left" valign="top">(1,3)</td>
<td align="left" valign="top">0.743</td>
<td align="left" valign="top">0.741</td>
</tr>
<tr>
<td align="left" valign="top">1d</td>
<td align="left" valign="top">(2,3)</td>
<td align="left" valign="top"><bold>0.747</bold></td>
<td align="left" valign="top"><bold>0.747</bold></td>
</tr>
<tr>
<td align="left" valign="top">2a</td>
<td rowspan="4" align="left" valign="top">400</td>
<td rowspan="4" align="left" valign="top">1100</td>
<td rowspan="4" align="left" valign="top">220</td>
<td align="left" valign="top">(1,1)</td>
<td align="left" valign="top"><bold>0.750</bold></td>
<td align="left" valign="top"><bold>0.751</bold></td>
</tr>
<tr>
<td align="left" valign="top">2b</td>
<td align="left" valign="top">(1,2)</td>
<td align="left" valign="top">0.748</td>
<td align="left" valign="top">0.749</td>
</tr>
<tr>
<td align="left" valign="top">2c</td>
<td align="left" valign="top">(1,3)</td>
<td align="left" valign="top">0.744</td>
<td align="left" valign="top">0.745</td>
</tr>
<tr>
<td align="left" valign="top">2d</td>
<td align="left" valign="top">(2,3)</td>
<td align="left" valign="top">0.741</td>
<td align="left" valign="top">0.740</td>
</tr>
<tr>
<td align="left" valign="top">3a</td>
<td rowspan="4" align="left" valign="top">500</td>
<td rowspan="4" align="left" valign="top">900</td>
<td rowspan="4" align="left" valign="top">180</td>
<td align="left" valign="top">(1,1)</td>
<td align="left" valign="top">0.753</td>
<td align="left" valign="top">0.753</td>
</tr>
<tr>
<td align="left" valign="top">3b</td>
<td align="left" valign="top">(1,2)</td>
<td align="left" valign="top">0.752</td>
<td align="left" valign="top">0.752</td>
</tr>
<tr>
<td align="left" valign="top">3c</td>
<td align="left" valign="top">(1,3)</td>
<td align="left" valign="top">0.754</td>
<td align="left" valign="top">0.754</td>
</tr>
<tr>
<td align="left" valign="top">3d</td>
<td align="left" valign="top">(2,3)</td>
<td align="left" valign="top"><bold>0.756</bold></td>
<td align="left" valign="top"><bold>0.757</bold></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Highest F1-scores are in bold.</p>
<p>Source: DutchDraCor, version April 2024.</p>
</table-wrap-foot>
</table-wrap>
<p>Applications of machine learning like the one at stake here are usually evaluated with the so-called F1-score, a number representing the proportion of correct classifications compared to all cases that had to be classified (&#x2018;recall&#x2019;) and to all cases that were classified, correctly and incorrectly (&#x2018;precision&#x2019;). In our case, the F1-scores per gender indicate how well the model is able to predict the gender of a certain character&#x2019;s speech. We report the average F1-scores over fifty iterations. Overall, we see that the F1-scores increase when we increase the chunk size, suggesting that larger text fragments provide more reliable gender cues. Including bi-grams or tri-grams has no substantial effect on the models&#x2019; performances: for a chunk size of four hundred, the best results are obtained with unigrams, while for chunk sizes of three hundred and five hundred, bi-grams and tri-grams perform slightly better. However, the differences are minimal, with all scores falling between 0.726 and 0.757. While the performance improves with longer chunks, the model still makes a notable number of misclassifications.</p>
</sec>
<sec id="s10">
<title>Results <sc>iv</sc>: The Semantic Approach</title>
<p>Besides specific gendered vocabularies, we tested whether characters speak in gendered semantic fields or &#x2018;discourses&#x2019; &#x2013; using language in a specific context and sense that is distinguishable as either typical for male or typical for female characters. We considered this experiment the &#x2018;semantic&#x2019; approach. In our experiments with the models <italic>GysBERT-v2</italic> and <italic>m-e5-large</italic>, we use the same chunk sizes as in our previous lexical experiments (<xref ref-type="table" rid="tb003">tab. 3</xref>). The F1-scores per gender are included in <xref ref-type="table" rid="tb004">tab. 4</xref>.</p>
<table-wrap id="tb004">
<label>Tab. 4</label>
<caption><p>F1-scores of the classification tasks with <italic>Gysbert-v2</italic> and <italic>m-e5-large</italic>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Experiment</th>
<th align="left" valign="top">Model</th>
<th align="left" valign="top">Chunk size (words)</th>
<th align="left" valign="top">Sample size (per gender)</th>
<th align="left" valign="top">Test set size (per gender)</th>
<th align="left" valign="top">Accuracy</th>
<th align="left" valign="top">F1-score &#x2640;</th>
<th align="left" valign="top">F1-score &#x2642;</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">4a</td>
<td align="left" valign="top">GysBERT-v2</td>
<td rowspan="2" align="left" valign="top">300</td>
<td rowspan="2" align="left" valign="top">1400</td>
<td rowspan="2" align="left" valign="top">280</td>
<td align="left" valign="top">0.753</td>
<td align="left" valign="top"><bold>0.753</bold></td>
<td align="left" valign="top"><bold>0.753</bold></td>
</tr>
<tr>
<td align="left" valign="top">4b</td>
<td align="left" valign="top">m-e5-large</td>
<td align="left" valign="top">0.742</td>
<td align="left" valign="top">0.742</td>
<td align="left" valign="top">0.741</td>
</tr>
<tr>
<td align="left" valign="top">5a</td>
<td align="left" valign="top">GysBERT-v2</td>
<td rowspan="2" align="left" valign="top">400</td>
<td rowspan="2" align="left" valign="top">1100</td>
<td rowspan="2" align="left" valign="top">220</td>
<td align="left" valign="top">0.754</td>
<td align="left" valign="top"><bold>0.754</bold></td>
<td align="left" valign="top"><bold>0.754</bold></td>
</tr>
<tr>
<td align="left" valign="top">5b</td>
<td align="left" valign="top">m-e5-large</td>
<td align="left" valign="top">0.736</td>
<td align="left" valign="top">0.736</td>
<td align="left" valign="top">0.735</td>
</tr>
<tr>
<td align="left" valign="top">6a</td>
<td align="left" valign="top">GysBERT-v2</td>
<td rowspan="2" align="left" valign="top">500</td>
<td rowspan="2" align="left" valign="top">900</td>
<td rowspan="2" align="left" valign="top">180</td>
<td align="left" valign="top">0.756</td>
<td align="left" valign="top"><bold>0.755</bold></td>
<td align="left" valign="top"><bold>0.756</bold></td>
</tr>
<tr>
<td align="left" valign="top">6b</td>
<td align="left" valign="top">m-e5-large</td>
<td align="left" valign="top">0.749</td>
<td align="left" valign="top">0.749</td>
<td align="left" valign="top">0.749</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Highest F1-scores are in bold.</p>
<p>Source: DutchDraCor, version April 2024.</p>
</table-wrap-foot>
</table-wrap>
<p>Here, we see that the scores remain stable across chunk sizes. With scores between 0.735 and 0.756, they fall in the same range as the performances with the lexical models. For both embedding models, the F1-scores are nearly identical across genders, suggesting a balanced performance. The <italic>GysBERT-v2</italic> model slightly outperforms the <italic>m-e5-large</italic> model, but the differences are small. While these results indicate that all models capture some gendered semantic patterns, they are not strong or consistent enough to enable highly accurate classification.</p>
</sec>
<sec id="s11">
<title>Results <sc>v</sc>: The Stylometric Approach</title>
<p>In addition to gendered vocabularies and gendered discourses, we searched for traces of gendered &#x2018;styles&#x2019; in character speech. To assess the existence of a gender binary in the style of character speech, we performed a cluster analysis of all male and female character speech extracted from a selection of plays written by one single playwright. This analysis constructs dendrograms (or &#x2018;kinship trees&#x2019;) visualising clusters of stylistically similar documents based on the closeness (or &#x2018;kinship&#x2019;) between all possible document pairs in the selected sample. In our case, a document represented all speech by all male or by all female characters in a given play. This means that each edition is split into two documents: one for male character speech, one for female. We thus questioned whether the clustering of character speech could be explained by gender.</p>
<p>The dendrograms in <xref ref-type="fig" rid="fg005">fig. 5A</xref>, <xref ref-type="fig" rid="fg005">5B</xref> represent the stylistic clusters of all-male and all-female speech in the four sub-corpora, where male speech is represented by a grey/gray colour label, and female by a black label. The trees represent a simplification of all document-relationships in the corpus. They can be read like a family tree. The position of each sub-branch on the horizontal scale indicates the stylistic closeness (or &#x2018;kinship&#x2019;) of the (clusters of) documents; &#x2018;siblings&#x2019; are genetically more closely related than &#x2018;second cousins&#x2019;, for example. The question at stake is whether characters are more closely related &#x2013; in terms of their style &#x2013; to other characters with the same gender identity from other plays by the same playwright. If this were the case, we would expect to find coloured clusters in the dendrogram, with a clear separation between &#x2018;grey/gray&#x2019; and &#x2018;black&#x2019; documents. This pattern does not occur in the oeuvres by Krul, Lescailje, and Vondel, even if we repeat the analysis with different parameters, using smaller and larger selections from the vocabulary (with 50, 150, 200, and 250 <sc>mfw</sc>), or bigrams and trigrams instead of single words. The conclusion here is that gender cannot explain the stylistic clustering in these selections of plays: there is no correlation between character gender and speech style. Van Merken&#x2019;s oeuvre, however, seems to be an interesting exception, although the pattern does not always reappear with different parameters. Still, this analysis provides some evidence of a &#x2018;typical&#x2019; male or female style in Van Merken&#x2019;s characters.</p>
<fig id="fg005" position="float">
<label>Fig. 5</label> 
<caption><p>(A) Stylometric clustering of male and female character speech in nine plays by Jan Harmensz. Krul. (B) Stylometric clustering of male and female character speech in twenty-nine plays by Joost van den Vondel. (C) Stylometric clustering of male and female character speech in seven plays by Katharyne Lescailje. (D) Stylometric clustering of male and female character speech in seven plays by Lucretia Wilhelmina van Merken.</p>
<p>Source: DutchDraCor, version April 2024.</p></caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="figures/emlc.19653_fig5a.jpg"/>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="figures/emlc.19653_fig5b.jpg"/></fig>
<p>Finally, we used the oppose-function in Stylo to determine the strongest lexical markers of the distinction between male and female speech. This function computes Zeta-scores for all words appearing in the two text groups (male versus female speech). The Zeta metric is essentially a normalised indication of the deviation from the average frequency of a single word in all the tested documents. If, for example, the frequency of the word &#x2018;gun&#x2019; in male speech is higher than one would expect given its frequency in the speech of an average character, then this would translate into a higher Zeta score. <xref ref-type="fig" rid="fg006">Fig. 6</xref> shows the words with a high negative Zeta score (&#x2018;Avoided&#x2019;) and a high positive Zeta score (&#x2018;Preferred&#x2019;) in female speech compared to male speech. The graph shows all words in the vocabulary with a Zeta score higher than 0.05. The higher the score, higher on the list, the higher the predictive value of each word.</p>
<fig id="fg006" position="float">
<label>Fig. 6</label>
<caption><p>Lexical markers of female (&#x2018;Preferred&#x2019;) versus male character speech (&#x2018;Avoided&#x2019;) using Craig&#x2019;s Zeta.</p>
<p>Source: DutchDraCor, version April 2024.</p></caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="figures/emlc.19653_fig6.jpg"/></fig>
<p>By instructing the program to determine the strongest indications of &#x2018;maleness&#x2019; and &#x2018;femaleness&#x2019;, we are asking it to find the most stereotypical differences between male and female speech. Therefore, we should not be surprised that it gives us stereotypes in response. The most predictive words for female characters are words related to emotional descriptions (<italic>verdriet</italic>, <italic>gemoed</italic>, <italic>smart</italic>, <italic>rouw</italic>, <italic>hart</italic>), love (<italic>liefde</italic>, <italic>lief</italic>, <italic>lust</italic>), emotional exclamations (<italic>ach</italic>, <italic>helaas</italic>, <italic>ei</italic>, <italic>och</italic>), and family (<italic>moeder</italic>, <italic>vader</italic>). Typical male words are words related to politics (<italic>volk</italic>, <italic>land</italic>, <italic>rijk</italic>, <italic>troon</italic>, <italic>koning</italic>) and warfare (<italic>heir</italic>, <italic>kracht</italic>, <italic>geweer</italic>, <italic>wapen</italic>). These results confirm that some registers are indeed gendered. Moreover, they might in these cases help us to identify gendered dramatic functions. The fact that female characters are more likely to do the emotional labour in the play (indicated by the high frequency of emotion words in female speech), might lead to the interpretation that female characters often function as the mirrors reflecting the emotions expressed on stage by making those emotions explicit. However, we should not forget that <xref ref-type="fig" rid="fg006">fig. 6</xref> highlights only a small sample of the vocabulary, representing the extreme differences between the two gender groups, while obscuring the large proportion of vocabulary that is apparently not gendered at all. It is a common fallacy in computational gender studies to ascribe too much meaning to observed gender differences based on small selections from the vocabulary, computed with metrics designed to find deviations and exceptions.<xref ref-type="fn" rid="fn37"><sup>37</sup></xref> We prefer to read the small number of lexical markers of gendered speech that passed the 0.05-threshold as another confirmation of our assessment that most of the male vocabulary is not drastically different from most of the female vocabulary.</p>
</sec>
<sec id="s12">
<title>Discussion and Conclusion</title>
<p>Our analysis confirms that male characters significantly outnumbered female characters on the early modern Dutch stage. Given the fact that most actors were male in the studied period, Dutch dramatic discourse was mostly spoken by male characters embodied by male actors. In terms of stage presence and visibility, women were a minority in the fictional worlds of the Dutch theatre, in proportions similar to the Spanish and English story worlds created by Lope de Vega and Shakespeare.<xref ref-type="fn" rid="fn38"><sup>38</sup></xref> In terms of network centrality, however, female characters often hold a key position within their fictional social worlds, which seems to underline their relative exceptionality: female characters appear less on stage, but when they do, they tend to play important roles.</p>
<p>We questioned the function of character speech in the performance of early modern gender roles within this theatrical reality dominated by men. We tested the distinction between male and female character speech on three levels (lexical, semantic, and stylistic) and found only weak evidence of this distinction, except in a few stereotypical registers. Our lexical and semantic approach demonstrates that machine learning and large language models can be used to make an educated guess about the gender based on character speech. However, the performance of these models is relatively poor, especially compared to similar studies applying <sc>llm</sc>s for gender classification in English or contemporary literary texts. Note that our models perform better than a random classification, which would lead to F1-scores around 0.50. Nevertheless, 20 to 25 percent of the cases are still misclassified. We interpret this inability to convincingly classify gender using computational methods as an indication that the binary categories we are modelling (male versus female) do not operate in such a binary manner in the texts under scrutiny, early modern Dutch theatre editions. If computational methods fail to capture gender distinctions in character speech in 20 to 25 percent of the cases, then either the methods are unfit for this task, or these distinctions are only partially present in the texts. Since similar studies have shown these methods to be fit for purpose when applied to modern English novels or modern Dutch novels, we opt for the latter explanation: there is no such thing as a typical &#x2018;male&#x2019; or &#x2018;female&#x2019; vocabulary or style in early modern Dutch character speech.<xref ref-type="fn" rid="fn39"><sup>39</sup></xref> Returning to our hypothesis about the possible effects of the fluidity of gender in Dutch drama on the performance of gender, we conclude that character speech was not specifically gendered.</p>
<p>Before we turn to an interpretation of this result, we need to consider four characteristics of our corpus that might complicate comparisons to gender binaries in other literary corpora. First, the corpus is relatively small and the variation in spelling and morphology relatively large compared to the corpora analysed in similar studies. It is possible that gender distinctions become more pronounced when using larger samples. Secondly, the form of early modern Dutch theatre is relatively conventional compared to, for example, contemporary novels: all texts from the corpus are written in (paired) verse, and usually following a strict (iambic) meter. This also means that lines spoken by different characters often rhyme. The formal conventions of Dutch drama thus complicate the emergence of gender-specific vocabularies and styles. Thirdly, dramatic discourse is by definition limited to character speech, which is less suitable for gendered character descriptions or perceptions by narrators and focalisers, for example. Finally, we already suggested above that the absence of gendered speech might be explained by the overrepresentation of male characters in the discourses on stage, and by extension in the predominantly male group of poets who were responsible for writing a large majority of the plays published and staged in the Republic. The absence of a gender divide in character speech could thus be a symptom of a skewed gender divide in the authorial styles represented in the corpus.</p>
<p>Notwithstanding these caveats, the conclusion holds that we found only weak evidence of a clear distinction between male and female speech from early modern Dutch plays. Overall, male and female characters did not speak in fundamentally different styles or about different topics. What could we infer from this fact about the role of gender in Dutch drama and in the Dutch Republic at large? The absence of a gender binary reminds us first of all that early modern gender roles were performed dialogically: characters were shaped by what they said as much as by what others said about them. The form of early modern plays thus enabled representations of individuality, identity, and gender that were constructed through interactions between characters. Moreover, these results indicate that female characters were generally not &#x2018;othered&#x2019; in their male-dominated fictional world, at least not by their contributions to the discourse on stage. Stylistically and lexically, female characters were equal participants in the conversations on stage. They were not excluded from the discourses in which they spoke based on their gender (<italic>if</italic> they spoke at all, that is). This conversational equality on stage may have been a reflection of the many domains where men and women interacted and negotiated their place in the relatively egalitarian public spaces of the Republic: the street, the market, the caf&#x00E9;, or the horse-drawn barge.<xref ref-type="fn" rid="fn40"><sup>40</sup></xref></p>
<p>We are inclined to interpret the non-existence of male and female speaking styles as part of a theatrical reality in which gender was unstable and complex due to the habit of cross-dressing &#x2013; a reality that became even more complex after the first female actors entered the stage in 1655. Gender must have been marked by narratological, visual, and auditory features such as roles (the queen), costumes (her dress), and tone of voice (falsetto), but in terms of their speech text, male characters were not radically different from female characters. This fluidity may have been typical to the theatrical culture of the seventeenth century only, given our finding that character speech was in fact more clearly gendered in the late eighteenth-century plays by Lucretia Wilhelmina van Merken. Previous scholarly readers of Van Merken&#x2019;s work noted the remarkable role of her female characters, as well as her deliberate foregrounding of female roles in the stories she dramatised.<xref ref-type="fn" rid="fn41"><sup>41</sup></xref> Our stylometric approach suggests that she also used distinctive stylistic features in her characterisations of female figures.</p>
<p>In Dutch literary history, distinctions between &#x2018;maleness&#x2019; and &#x2018;femaleness&#x2019; thus seem to have a history of their own, similar to the way in which Ted Underwood and others have shown that gender demarcations in modern English novels changed &#x2013; becoming <italic>less</italic> pronounced, in their case &#x2013; in the nineteenth and twentieth centuries.<xref ref-type="fn" rid="fn42"><sup>42</sup></xref> Adding to the studies by Pipkin, Van Elk, and Ferket on the representation of sexual violence against female characters, and the representation of femininity in Dutch plays and female discourses on stage, our study reveals that from a comparativist perspective, male and female characters are surprisingly similar in the ways they spoke onstage. Instead of contradicting previous analyses of gendered stage roles, substantiated by qualitative textual evidence, the computational lens enriches the historiography of early modern gender performativity by showing that male and female characters communicated in similar, perhaps even equal ways, performing their gender roles through dialogue &#x2013; again, <italic>if</italic> female characters appeared on stage at all.</p>
<p>Finally, we should not forget that the fluidity of gender representations on the early modern stage sharply contrasted with the very real legal and social norms and boundaries that distinguished the position of men and women in early modern Dutch society. The symbolic domain of the theatre &#x2013; where gender roles could be complex and fluid &#x2013; thus challenged the everyday world around it &#x2013; where those roles were clearly separated. This contract opens up new questions about the long history of gender boundaries, both as representations in the symbolic domain of literature and as a social, legal, and material reality outside the theatres. Developing larger corpora and new computational approaches will remain crucial in finding answers to those comparativist questions about the <italic>longue dur&#x00E9;e</italic> in the history of gender.</p>
</sec>
</body>
<back>
<app-group>
<app id="a1">
<title>Appendix. List of Plays</title>
<table-wrap id="tb005">
<label>Appendix.</label>
<caption><p>List of plays from DutchDraCor, version 2024.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Author</th>
<th align="left" valign="top">Title</th>
<th align="left" valign="top">Year</th>
<th align="left" valign="top">DraCor ID</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Samuel Coster</td>
<td align="left" valign="top"><italic>Iphigenia</italic></td>
<td align="left" valign="top">1617</td>
<td align="left" valign="top">dut000219</td>
</tr><tr>
<td align="left" valign="top">Samuel Coster</td>
<td align="left" valign="top"><italic>Isabella</italic></td>
<td align="left" valign="top">1619</td>
<td align="left" valign="top">dut000220</td>
</tr><tr>
<td align="left" valign="top">Samuel Coster</td>
<td align="left" valign="top"><italic>Ithys</italic></td>
<td align="left" valign="top">1615</td>
<td align="left" valign="top">dut000222</td>
</tr><tr>
<td align="left" valign="top">Samuel Coster</td>
<td align="left" valign="top"><italic>Polyxena</italic></td>
<td align="left" valign="top">1619</td>
<td align="left" valign="top">dut000221</td>
</tr><tr>
<td align="left" valign="top">Pieter Cornelisz Hooft</td>
<td align="left" valign="top"><italic>Baeto</italic></td>
<td align="left" valign="top">1626</td>
<td align="left" valign="top">dut000218</td>
</tr><tr>
<td align="left" valign="top">Pieter Cornelisz Hooft</td>
<td align="left" valign="top"><italic>Geeraerdt van Velsen</italic></td>
<td align="left" valign="top">1613</td>
<td align="left" valign="top">dut000214</td>
</tr><tr>
<td align="left" valign="top">Jan Sijwertsz Kolm</td>
<td align="left" valign="top"><italic>Battaefsche vrienden-spieghel</italic></td>
<td align="left" valign="top">1615</td>
<td align="left" valign="top">dut000213</td>
</tr><tr>
<td align="left" valign="top">Jan Sijwertsz Kolm</td>
<td align="left" valign="top"><italic>Nederlants treur-spel</italic></td>
<td align="left" valign="top">1616</td>
<td align="left" valign="top">dut000212</td>
</tr><tr>
<td align="left" valign="top">Abraham de Koning</td>
<td align="left" valign="top"><italic>Achabs treur-spel</italic></td>
<td align="left" valign="top">1618</td>
<td align="left" valign="top">dut000209</td>
</tr><tr>
<td align="left" valign="top">Abraham de Koning</td>
<td align="left" valign="top"><italic>Iephthahs ende zijn eenighe dochters</italic></td>
<td align="left" valign="top">1615</td>
<td align="left" valign="top">dut000210</td>
</tr><tr>
<td align="left" valign="top">Abraham de Koning</td>
<td align="left" valign="top"><italic>Simsons treurspel</italic></td>
<td align="left" valign="top">1618</td>
<td align="left" valign="top">dut000211</td>
</tr><tr>
<td align="left" valign="top">Enoch Krook</td>
<td align="left" valign="top"><italic>De ondergang van Eigenbaat</italic></td>
<td align="left" valign="top">1707</td>
<td align="left" valign="top">dut000151</td>
</tr><tr>
<td align="left" valign="top">Jan Harmensz. Krul</td>
<td align="left" valign="top"><italic>Alcip en Amarillis</italic></td>
<td align="left" valign="top">1640</td>
<td align="left" valign="top">dut000074</td>
</tr><tr>
<td align="left" valign="top">Jan Harmensz. Krul</td>
<td align="left" valign="top"><italic>Mey spel van Cloris en Philida</italic></td>
<td align="left" valign="top">1631</td>
<td align="left" valign="top">dut000066</td>
</tr><tr>
<td align="left" valign="top">Jan Harmensz. Krul</td>
<td align="left" valign="top"><italic>Diana</italic></td>
<td align="left" valign="top">1627</td>
<td align="left" valign="top">dut000068</td>
</tr><tr>
<td align="left" valign="top">Jan Harmensz. Krul</td>
<td align="left" valign="top"><italic>Favstina</italic></td>
<td align="left" valign="top">1640</td>
<td align="left" valign="top">dut000097</td>
</tr><tr>
<td align="left" valign="top">Jan Harmensz. Krul</td>
<td align="left" valign="top"><italic>Helena</italic></td>
<td align="left" valign="top">1629</td>
<td align="left" valign="top">dut000064</td>
</tr><tr>
<td align="left" valign="top">Jan Harmensz. Krul</td>
<td align="left" valign="top"><italic>Musyk-spel, van Juliana, en Claudiaen</italic></td>
<td align="left" valign="top">1634</td>
<td align="left" valign="top">dut000069</td>
</tr><tr>
<td align="left" valign="top">Jan Harmensz. Krul</td>
<td align="left" valign="top"><italic>Rosilion en Rosanniere</italic></td>
<td align="left" valign="top">1641</td>
<td align="left" valign="top">dut000085</td>
</tr><tr>
<td align="left" valign="top">Jan Harmensz. Krul</td>
<td align="left" valign="top"><italic>Rozemond en Raniclis</italic></td>
<td align="left" valign="top">1632</td>
<td align="left" valign="top">dut000067</td>
</tr><tr>
<td align="left" valign="top">Jan Harmensz. Krul</td>
<td align="left" valign="top"><italic>Theodorus en Dianira</italic></td>
<td align="left" valign="top">1635</td>
<td align="left" valign="top">dut000070</td>
</tr><tr>
<td align="left" valign="top">Katharyne Lescailje</td>
<td align="left" valign="top"><italic>Ariadne</italic></td>
<td align="left" valign="top">1693</td>
<td align="left" valign="top">dut000204</td>
</tr><tr>
<td align="left" valign="top">Katharyne Lescailje</td>
<td align="left" valign="top"><italic>Genserik</italic></td>
<td align="left" valign="top">1685</td>
<td align="left" valign="top">dut000199</td>
</tr><tr>
<td align="left" valign="top">Katharyne Lescailje</td>
<td align="left" valign="top"><italic>Herkules en Dianira</italic></td>
<td align="left" valign="top">1688</td>
<td align="left" valign="top">dut000202</td>
</tr><tr>
<td align="left" valign="top">Katharyne Lescailje</td>
<td align="left" valign="top"><italic>Herodes en Mariamne</italic></td>
<td align="left" valign="top">1685</td>
<td align="left" valign="top">dut000200</td>
</tr><tr>
<td align="left" valign="top">Katharyne Lescailje</td>
<td align="left" valign="top"><italic>Kassandra</italic></td>
<td align="left" valign="top">1753</td>
<td align="left" valign="top">dut000198</td>
</tr><tr>
<td align="left" valign="top">Katharyne Lescailje</td>
<td align="left" valign="top"><italic>Nicom&#x00E9;des</italic></td>
<td align="left" valign="top">1692</td>
<td align="left" valign="top">dut000203</td>
</tr><tr>
<td align="left" valign="top">Katharyne Lescailje</td>
<td align="left" valign="top"><italic>Wensesla&#x00FC;ss, koning van Poolen</italic></td>
<td align="left" valign="top">1686</td>
<td align="left" valign="top">dut000201</td>
</tr><tr>
<td align="left" valign="top">Lucretia Wilhelmina van Merken</td>
<td align="left" valign="top"><italic>De camisards</italic></td>
<td align="left" valign="top">1774</td>
<td align="left" valign="top">dut000225</td>
</tr><tr>
<td align="left" valign="top">Lucretia Wilhelmina van Merken</td>
<td align="left" valign="top"><italic>Gelonide</italic></td>
<td align="left" valign="top">1786</td>
<td align="left" valign="top">dut000231</td>
</tr><tr>
<td align="left" valign="top">Lucretia Wilhelmina van Merken</td>
<td align="left" valign="top"><italic>Het beleg der stad Leyden</italic></td>
<td align="left" valign="top">1774</td>
<td align="left" valign="top">dut000226</td>
</tr><tr>
<td align="left" valign="top">Lucretia Wilhelmina van Merken</td>
<td align="left" valign="top"><italic>Jacob Simonszoon de Ryk</italic></td>
<td align="left" valign="top">1774</td>
<td align="left" valign="top">dut000227</td>
</tr><tr>
<td align="left" valign="top">Lucretia Wilhelmina van Merken</td>
<td align="left" valign="top"><italic>Louize d&#x2019;Arlac</italic></td>
<td align="left" valign="top">1786</td>
<td align="left" valign="top">dut000229</td>
</tr><tr>
<td align="left" valign="top">Lucretia Wilhelmina van Merken</td>
<td align="left" valign="top"><italic>Maria van Bourgondi&#x00C3;&#x00AB;n, gravinne van Holland</italic></td>
<td align="left" valign="top">1774</td>
<td align="left" valign="top">dut000228</td>
</tr><tr>
<td align="left" valign="top">Lucretia Wilhelmina van Merken</td>
<td align="left" valign="top"><italic>Sebille van Anjou</italic></td>
<td align="left" valign="top">1786</td>
<td align="left" valign="top">dut000230</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>Agrippa, koning van Alba</italic></td>
<td align="left" valign="top">1669</td>
<td align="left" valign="top">dut000139</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>Andromach&#x00E9;</italic></td>
<td align="left" valign="top">1678</td>
<td align="left" valign="top">dut000142</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>Astrate, koning van Tyrus</italic></td>
<td align="left" valign="top">1670</td>
<td align="left" valign="top">dut000162</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>Cinna</italic></td>
<td align="left" valign="top">1683</td>
<td align="left" valign="top">dut000146</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De Amsterdamsche dragonnade</italic></td>
<td align="left" valign="top">1714</td>
<td align="left" valign="top">dut000157</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De Bekeerde Alchimist</italic></td>
<td align="left" valign="top">1680</td>
<td align="left" valign="top">dut000144</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De belachchelyke s&#x00E9;renade</italic></td>
<td align="left" valign="top">1712</td>
<td align="left" valign="top">dut000155</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De gelukte list</italic></td>
<td align="left" valign="top">1689</td>
<td align="left" valign="top">dut000150</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De gelyke tw&#x00E9;lingen</italic></td>
<td align="left" valign="top">1682</td>
<td align="left" valign="top">dut000161</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De geschaakte bruid</italic></td>
<td align="left" valign="top">1690</td>
<td align="left" valign="top">dut000166</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De leevendige doode</italic></td>
<td align="left" valign="top">1716</td>
<td align="left" valign="top">dut000158</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De listige vryster</italic></td>
<td align="left" valign="top">1690</td>
<td align="left" valign="top">dut000159</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De malle wedding</italic></td>
<td align="left" valign="top">1690</td>
<td align="left" valign="top">dut000141</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De schaakingen</italic></td>
<td align="left" valign="top">1763</td>
<td align="left" valign="top">dut000160</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De schilder door liefde</italic></td>
<td align="left" valign="top">1682</td>
<td align="left" valign="top">dut000165</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De vermiste molenaar</italic></td>
<td align="left" valign="top">1713</td>
<td align="left" valign="top">dut000156</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De vrijer in de kist</italic></td>
<td align="left" valign="top">1678</td>
<td align="left" valign="top">dut000164</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De wanh&#x00E8;bbelyke liefde</italic></td>
<td align="left" valign="top">1678</td>
<td align="left" valign="top">dut000143</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>De verwaande Hollandsche Franschman</italic></td>
<td align="left" valign="top">1684</td>
<td align="left" valign="top">dut000148</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>Fielebout</italic></td>
<td align="left" valign="top">1680</td>
<td align="left" valign="top">dut000154</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>Het gedwongene huuwelyk</italic></td>
<td align="left" valign="top">1682</td>
<td align="left" valign="top">dut000153</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>Het huwelyk van Orondates en Statira</italic></td>
<td align="left" valign="top">1670</td>
<td align="left" valign="top">dut000140</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>Het spookend weeuwtje</italic></td>
<td align="left" valign="top">1670</td>
<td align="left" valign="top">dut000163</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>Ifigenia</italic></td>
<td align="left" valign="top">1678</td>
<td align="left" valign="top">dut000147</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>Loon naar w&#x00C3;&#x00A9;rk</italic></td>
<td align="left" valign="top">1709</td>
<td align="left" valign="top">dut000152</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>Roeland</italic></td>
<td align="left" valign="top">1686</td>
<td align="left" valign="top">dut000149</td>
</tr><tr>
<td align="left" valign="top">Nil Volentibus Arduum</td>
<td align="left" valign="top"><italic>Tieranny van Eigenbaat in het Eiland van Vryekeur</italic></td>
<td align="left" valign="top">1679</td>
<td align="left" valign="top">dut000145</td>
</tr><tr>
<td align="left" valign="top">Catharina Questiers</td>
<td align="left" valign="top"><italic>Casimier</italic></td>
<td align="left" valign="top">1656</td>
<td align="left" valign="top">dut000207</td>
</tr><tr>
<td align="left" valign="top">Catharina Questiers</td>
<td align="left" valign="top"><italic>Den geheymen minnaar</italic></td>
<td align="left" valign="top">1655</td>
<td align="left" valign="top">dut000206</td>
</tr><tr>
<td align="left" valign="top">Catharina Questiers</td>
<td align="left" valign="top"><italic>D&#x2019;ondanckbare Fulvius en getrouwe Octavia</italic></td>
<td align="left" valign="top">1665</td>
<td align="left" valign="top">dut000205</td>
</tr><tr>
<td align="left" valign="top">Theodoor Rodenburg</td>
<td align="left" valign="top"><italic>Casandra</italic></td>
<td align="left" valign="top">1617</td>
<td align="left" valign="top">dut000223</td>
</tr><tr>
<td align="left" valign="top">Theodoor Rodenburg</td>
<td align="left" valign="top"><italic>Rodomont en Isabella</italic></td>
<td align="left" valign="top">1618</td>
<td align="left" valign="top">dut000224</td>
</tr><tr>
<td align="left" valign="top">Catharina Verwers Dusart</td>
<td align="left" valign="top"><italic>Spaensche heydin</italic></td>
<td align="left" valign="top">1644</td>
<td align="left" valign="top">dut000208</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Adam in Ballingschap</italic></td>
<td align="left" valign="top">1664</td>
<td align="left" valign="top">dut000030</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Adonias</italic></td>
<td align="left" valign="top">1661</td>
<td align="left" valign="top">dut000028</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>De Amsteldamsche Hecvba</italic></td>
<td align="left" valign="top">1626</td>
<td align="left" valign="top">dut000007</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Batavische Gebroeders</italic></td>
<td align="left" valign="top">1663</td>
<td align="left" valign="top">dut000029</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Koning David In ballingschap</italic></td>
<td align="left" valign="top">1660</td>
<td align="left" valign="top">dut000024</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Elektra</italic></td>
<td align="left" valign="top">1639</td>
<td align="left" valign="top">dut000012</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Euripides Feniciaensche</italic></td>
<td align="left" valign="top">1668</td>
<td align="left" valign="top">dut000002</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Faeton</italic></td>
<td align="left" valign="top">1663</td>
<td align="left" valign="top">dut000003</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Gebroeders</italic></td>
<td align="left" valign="top">1640</td>
<td align="left" valign="top">dut000014</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Gysbreght van Aemstel</italic></td>
<td align="left" valign="top">1637</td>
<td align="left" valign="top">dut000001</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Sofokles Herkules in Trachin</italic></td>
<td align="left" valign="top">1668</td>
<td align="left" valign="top">dut000004</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Hiervsalem verwoest</italic></td>
<td align="left" valign="top">1620</td>
<td align="left" valign="top">dut000008</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Hippolytvs</italic></td>
<td align="left" valign="top">1628</td>
<td align="left" valign="top">dut000010</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>IfigenieT in Tauren</italic></td>
<td align="left" valign="top">1666</td>
<td align="left" valign="top">dut000031</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Iosef</italic></td>
<td align="left" valign="top">1635</td>
<td align="left" valign="top">dut000011</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Jeptha</italic></td>
<td align="left" valign="top">1659</td>
<td align="left" valign="top">dut000023</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Joseph in Dothan</italic></td>
<td align="left" valign="top">1640</td>
<td align="left" valign="top">dut000015</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Joseph In Egypten</italic></td>
<td align="left" valign="top">1640</td>
<td align="left" valign="top">dut000016</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Koning David Herstelt</italic></td>
<td align="left" valign="top">1660</td>
<td align="left" valign="top">dut000025</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Koning Edipus</italic></td>
<td align="left" valign="top">1660</td>
<td align="left" valign="top">dut000026</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Lucifer</italic></td>
<td align="left" valign="top">1654</td>
<td align="left" valign="top">dut000021</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Maeghden</italic></td>
<td align="left" valign="top">1639</td>
<td align="left" valign="top">dut000013</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Maria Stuart</italic></td>
<td align="left" valign="top">1646</td>
<td align="left" valign="top">dut000018</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Noah</italic></td>
<td align="left" valign="top">1667</td>
<td align="left" valign="top">dut000005</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Palamedes</italic></td>
<td align="left" valign="top">1625</td>
<td align="left" valign="top">dut000009</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Peter en Pauwels</italic></td>
<td align="left" valign="top">1641</td>
<td align="left" valign="top">dut000017</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Salmoneus</italic></td>
<td align="left" valign="top">1657</td>
<td align="left" valign="top">dut000022</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Salomon</italic></td>
<td align="left" valign="top">1648</td>
<td align="left" valign="top">dut000020</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Samson</italic></td>
<td align="left" valign="top">1660</td>
<td align="left" valign="top">dut000027</td>
</tr><tr>
<td align="left" valign="top">Joost van den Vondel</td>
<td align="left" valign="top"><italic>Zungchin</italic></td>
<td align="left" valign="top">1667</td>
<td align="left" valign="top">dut000006</td>
</tr>
</tbody>
</table>
</table-wrap>
</app>
</app-group>
<ref-list>
<title>Bibliography</title>
<ref id="r1"><mixed-citation>Amelang, James, &#x2018;Playing Gender. Toward a Quantitative Comparison of Female Roles in Lope de Vega and Shakespeare&#x2019;, <italic>Bulletin of the Comediantes</italic> 71 (2019/1-2) 119-134.</mixed-citation></ref>
<ref id="r2"><mixed-citation>Blom, Frans, <italic>Podium van Europa. Creativiteit en ondernemen in de Amsterdamse Schouwburg van de zeventiende eeuw</italic> (Amsterdam 2021).</mixed-citation></ref>
<ref id="r3"><mixed-citation>Blom, Frans, Harm Nijboer, and Rob van der Zalm, &#x2018;ONSTAGE, the Online Data System of Theatre in Amsterdam from the Golden Age to Today&#x2019;, <italic>Research Data Journal for the Humanities and Social Sciences</italic> 5 (2020/2) 27-40.</mixed-citation></ref>
<ref id="r4"><mixed-citation>Bloom, Gina, &#x2018;&#x201C;Thy Voice Squeaks&#x201D;. Listening for Masculinity on the Early Modern Stage&#x2019;, <italic>Renaissance Drama</italic> 29 (1998) 39-71.</mixed-citation></ref>
<ref id="r5"><mixed-citation>Bloom, Gina, <italic>Voice in Motion. Staging Gender, Shaping Sound in Early Modern England</italic> (Philadelphia 2007).</mixed-citation></ref>
<ref id="r6"><mixed-citation>Brown, Pamela Allen, &#x2018;&#x201C;Cattle of this colour&#x201D;. Boying the Diva in <italic>As You Like It</italic>&#x2019;, <italic>Early Theatre</italic> 15 (2012/1) 145-166.</mixed-citation></ref>
<ref id="r7"><mixed-citation>Burrows, John, &#x2018;&#x201C;Delta&#x201D;. A Measure of Stylistic Difference and a Guide to Likely Authorship&#x2019;, <italic>Literary and Linguistic Computing</italic> 17 (2002/3) 267-287.</mixed-citation></ref>
<ref id="r8"><mixed-citation>Conneau, Alexis, et al., &#x2018;Unsupervised Cross-lingual Representation Learning at Scale&#x2019;, in Dan Jurafsky, Joyce Chai, Natalie Schluter, and Joel Tetreault (eds.), <italic>Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics</italic> (2020) 8440-8451.</mixed-citation></ref>
<ref id="r9"><mixed-citation>Dabrowska, Monika, and Mar&#x00ED;a Teresa Santa Mar&#x00ED;a Fern&#x00E1;ndez, &#x2018;Gender relations in Spanish theatre during the Silver Age. A quantitative comparison of works in the <italic>Spanish Drama Corpus</italic>&#x2019;, <italic>Digital Scholarship in the Humanities</italic> 40 (2025/1) 3-17.</mixed-citation></ref>
<ref id="r10"><mixed-citation>Deijl, Lucas van der (ed.), <italic>DutchDraCor</italic>, (2024), <ext-link ext-link-type="uri" xlink:href="https://dracor.org/dutch">https://dracor.org/dutch</ext-link> (Accessed on 21 August 2025).</mixed-citation></ref>
<ref id="r11"><mixed-citation>Deijl, Lucas van der, &#x2018;Orde en rationalisme in het toneel van Nil Volentibus Arduum. Een computationele benadering van vroegmoderne verhaalmodellen&#x2019;, <italic>Spiegel der Letteren</italic> 66 (2024/1) 53-94.</mixed-citation></ref>
<ref id="r12"><mixed-citation>Dekker, Rudolf, and Lotte van de Pol, <italic>Vrouwen in mannenkleren. De geschiedenis van een tegendraadse traditie Europa, 1500-1800</italic> (Amsterdam 1989).</mixed-citation></ref>
<ref id="r13"><mixed-citation>Delobelle, Pieter, and Fran&#x00E7;ois Remy, &#x2018;RobBERT-2023. Keeping Dutch Language Models Up-To-Date at a Lower Cost Thanks to Model Conversion&#x2019;, <italic>Computational Linguistics in the Netherlands Journal</italic> 13 (2024) 193-203.</mixed-citation></ref>
<ref id="r14"><mixed-citation>Dietz, Feike, &#x2018;Twee schatkisten en hun erfenis. Dimensies van gender en age in de Nederlandse vroegmoderne letterkunde&#x2019;, <italic>Nederlandse letterkunde</italic> 26 (2021/2-3) 202-222.</mixed-citation></ref>
<ref id="r15"><mixed-citation>Dinan, Emily, et al., &#x2018;Multi-Dimensional Gender Bias Classification&#x2019;, in Bonnie Webber et al. (eds.), <italic>Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)</italic> (2020) 314-331.</mixed-citation></ref>
<ref id="r16"><mixed-citation>Eder, Maciej, Jan Rybicki, and Mike Kestemont, &#x2018;Stylometry with R. A Package for Computational Text Analysis&#x2019;, <italic>The R Journal</italic> 8 (2016/1) 107-121.</mixed-citation></ref>
<ref id="r17"><mixed-citation>Elk, Martine van, &#x2018;&#x201C;Before she ends up in a brothel&#x201D;. Public Femininity and the First Actresses in England and the Low Countries&#x2019;, <italic>Early Modern Low Countries</italic> 1 (2017/1) 30-50.</mixed-citation></ref>
<ref id="r19"><mixed-citation>Elk, Martine van, &#x2018;Women Writers and the Dutch Stage. Public Femininity in the Plays of Verwers and Questiers&#x2019;, in Amanda Pipkin and Sarah Joan Moran (eds.), <italic>Women and Gender in the Early Modern Low Countries, 1500-1750</italic> (Leiden 2019) 167-191.</mixed-citation></ref>
<ref id="r20"><mixed-citation>Evert, Stefan, et al., &#x2018;Understanding and explaining Delta measures for authorship attribution&#x2019;, <italic>Digital Scholarship in the Humanities</italic> 32 (2017/2) 4-16.</mixed-citation></ref>
<ref id="r21"><mixed-citation>Feldkamp, Pascale, et al., &#x2018;Canonical Status and Literary Influence. A Comparative Study of Danish Novels from the Modern Breakthrough (1870-1900)&#x2019;, in Mika H&#x00E4;m&#x00E4;l&#x00E4;inen et al. (eds.), <italic>Proceedings of the Joint 4th International Conference on Natural Language Processing for Digital Humanities</italic> (Miami 2024) 140-155.</mixed-citation></ref>
<ref id="r22"><mixed-citation>Ferket, Johanna, &#x2018;Women About Women in 17th-century Comic Theatre. Towards a Nuanced Understanding of the Representation of Female Characters&#x2019;, <italic>Journal of Dutch Literature</italic> 12 (2021/1) 1-24.</mixed-citation></ref>
<ref id="r23"><mixed-citation>Fischer, Frank, et al., &#x2018;Network Dynamics, Plot Analysis. Approaching the Progressive Structuration of Literary Texts&#x2019;, in Rhian Lewis et al. (eds), <italic>Digital Humanities 2017 Conference Abstracts</italic> (McGill University 2017), <ext-link ext-link-type="uri" xlink:href="https://dh2017.adho.org/abstracts/071/071.pdf">https://dh2017.adho.org/abstracts/071/071.pdf</ext-link> (Accessed on 2 September 2025).</mixed-citation></ref>
<ref id="r24"><mixed-citation>Fischer, Frank, et al., &#x2018;Programmable Corpora. Introducing DraCor, an Infrastructure for the Research on European Drama&#x2019;, in <italic>Digital Humanities 2019 Conference Abstracts</italic> (Utrecht University 2019), <ext-link ext-link-type="doi" xlink:href="10.5281/zenodo.4284002">https://doi.org/10.5281/zenodo.4284002</ext-link> (Accessed on 21 August 2025).</mixed-citation></ref>
<ref id="r25"><mixed-citation>Gemert, Lia van, &#x2018;Echte helden zie je zelden. Idolen in het Nederlandse epos&#x2019;, <italic>De Achttiende eeuw</italic> 38 (2006) 22-38.</mixed-citation></ref>
<ref id="r26"><mixed-citation>Grijp, Louis Peter, &#x2018;Boys and Female Impersonators in the Amsterdam Theatre of the Seventeenth Century&#x2019;, <italic>Medieval English Theatre</italic> 28 (2006) 131-170.</mixed-citation></ref>
<ref id="r27"><mixed-citation>Herrmann, J. Berenike, Karina van Dalen-Oskam, and Christof Sch&#x00F6;ch, &#x2018;Revisiting Style, a Key Concept in Literary Studies&#x2019;, <italic>Journal of Literary Theory</italic> 9 (2015/1) 25-52.</mixed-citation></ref>
<ref id="r28"><mixed-citation>Hicke, Rebecca M.M., and David Mimno, &#x2018;&#x201C;Let every word weigh heavy of her worth&#x201D;. Examining How Women Enact Power in Shakespeare&#x2019;s Comedies through Interactive Speech Pattern Visualizations&#x2019;, in Melanie Andresen and Nils Reiter (eds.), <italic>Computational Drama Analysis. Reflecting on Methods and Interpretations</italic> (Berlin 2024) 87-106.</mixed-citation></ref>
<ref id="r29"><mixed-citation>Hota, Sobhan, and Schlomo Argamon, &#x2018;Performing Gender. Automatic Stylistic Analysis of Shakespeare&#x2019;s Characters&#x2019;, in <italic>Proceedings of Digital Humanities 2006</italic> (Paris 2006) 82-88, <ext-link ext-link-type="uri" xlink:href="https://www.researchgate.net/publication/327945332_Performing_Gender_Automatic_Stylistic_Analysis_of_Shakespeare%27s_Characters_httpwwwcsdltamuedufuruta689dhdh06readingsDH06-082-088pdf">https://www.researchgate.net/publication/327945332_Performing_Gender_Automatic_Stylistic_Analysis_of_Shakespeare&#x2019;s_Characters_httpwwwcsdltamuedufuruta689dhdh06readingsDH06-082-088pdf</ext-link> (Accessed on 2 September 2025).</mixed-citation></ref>
<ref id="r30"><mixed-citation>Hoover, David L., &#x2018;Testing Burrows&#x2019;s Delta&#x2019;, <italic>Literary and Linguistic Computing</italic> 19 (2004/4) 453-475.</mixed-citation></ref>
<ref id="r31"><mixed-citation>Juola, Patrick, &#x2018;The Rowling Case. A Proposed Standard Analytic Protocol for Authorship Questions&#x2019;, <italic>Digital Scholarship in the Humanities</italic> 30 (2015/1) 100-113.</mixed-citation></ref>
<ref id="r32"><mixed-citation>Koolen, Corina, <italic>Reading beyond the Female. The Relationship between Perception of Author Gender and Literary Quality</italic>. PhD diss., University of Amsterdam, 2018.</mixed-citation></ref>
<ref id="r34"><mixed-citation>Manjavacas, Enrique, and Lauren Fonteyn, &#x2018;Non-Parametric Word Sense Disambiguation for Historical Languages&#x2019;, in Mika H&#x00E4;m&#x00E4;l&#x00E4;inen et al. (eds.), <italic>Proceedings of the 2nd International Workshop on Natural Language Processing for Digital Humanities</italic> (Taipei 2022) 123-134.</mixed-citation></ref>
<ref id="r35"><mixed-citation>Marion, Olga van, <italic>Gouden diva&#x2019;s. De eerste Nederlandse actrices en hun sporen in de literatuur</italic> (Leiden 2021).</mixed-citation></ref>
<ref id="r36"><mixed-citation>McManus, Clare, &#x2018;Women and English Renaissance Drama. Making and Unmaking the &#x201C;All-Male Stage&#x201D;&#x2019;, <italic>Literature Compass</italic> 4 (2007/3) 784-796.</mixed-citation></ref>
<ref id="r37"><mixed-citation>McManus, Clare, &#x201C;&#x201C;Sing it Like Poor Barbary&#x201D;. <italic>Othello</italic> and Early Modern Women&#x2019;s Performance&#x201D;, <italic>Shakespeare Bulletin</italic> 33 (2015/1) 99-120.</mixed-citation></ref>
<ref id="r38"><mixed-citation>Meeus, Hubert, <italic>Repertorium van het ernstige drama in de Nederlanden, 1600-1650</italic> (Leuven 1983).</mixed-citation></ref>
<ref id="r39"><mixed-citation>Meijer Drees, Marijke, &#x2018;Burgemeester Van der Werf als vaderlandse toneelheld. Een politieke autoriteit in belegeringsdrama&#x2019;s&#x2019;, <italic>De Zeventiende Eeuw</italic> 8 (1992) 167-173.</mixed-citation></ref>
<ref id="r40"><mixed-citation>Moretti, Franco, &#x2018;Network Theory, Plot Analysis&#x2019;, <italic>Stanford Literary Lab Pamphlets</italic> 2 (2011), <ext-link ext-link-type="uri" xlink:href="https://litlab.stanford.edu/LiteraryLabPamphlet2.pdf">https://litlab.stanford.edu/LiteraryLabPamphlet2.pdf</ext-link> (Accessed on 2 September 2025).</mixed-citation></ref>
<ref id="r41"><mixed-citation>Mukherjee, Arjun, and Bing Liu, &#x2018;Improving Gender Classification of Blog Authors&#x2019;, in Hang Li and Llu&#x00ED;s M&#x00E0;rquez (eds.), <italic>Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing</italic> (Cambridge, MA 2010) 207-217.</mixed-citation></ref>
<ref id="r43"><mixed-citation>Orgel, Stephen, <italic>Impersonations. The Performance of Gender in Shakespeare&#x2019;s England</italic> (Cambridge 1996).</mixed-citation></ref>
<ref id="r44"><mixed-citation>Pierik, Bob, &#x2018;Privacy, Publicity and Gender in Amsterdam&#x2019;s Early Modern Urban Space&#x2019;, <italic>Privacy Studies Journal</italic> 1 (2022/1) 1-22.</mixed-citation></ref>
<ref id="r45"><mixed-citation>Pipkin, Amanda C., <italic>Rape in the Republic, 1609-1725. Formulating Dutch Identity</italic> (Leiden 2013).</mixed-citation></ref>
<ref id="r46"><mixed-citation>Simpson, Erik, &#x2018;Networked Cross-Dressing. A Digital Refashioning of Shakespearean Gender Subversion&#x2019;, <italic>Digital Humanities Quarterly</italic> 16 (2022/3), <ext-link ext-link-type="uri" xlink:href="https://dhq-static.digitalhumanities.org/pdf/000562.pdf">https://dhq-static.digitalhumanities.org/pdf/000562.pdf</ext-link> (Accessed on 2 September 2025).</mixed-citation></ref>
<ref id="r47"><mixed-citation>Smeets, Roel, &#x2018;Emancipatie en de roman. Vrouwelijke personages in Nederlandstalige romans tussen 1960 en 2010&#x2019;, <italic>Nederlandse letterkunde</italic> 29 (2024/2) 180-210.</mixed-citation></ref>
<ref id="r48"><mixed-citation>Smits-Veldt, Mieke B., <italic>Het Nederlandse renaissance-toneel</italic> (Utrecht 1991).</mixed-citation></ref>
<ref id="r49"><mixed-citation>Stiller, James, Daniel Nettle, and Robin I.M. Dunbar, &#x2018;The Small World of Shakespeare&#x2019;s Plays&#x2019;, <italic>Human Nature</italic> 14 (2003/4) 397-408.</mixed-citation></ref>
<ref id="r50"><mixed-citation>Trilcke, Peer, Evgeniya Ustinova, Ingo B&#x00F6;rner, Frank Fischer and Carsten Milling, &#x2018;Detecting Small Worlds in a Corpus of Thousands of Theater Plays&#x2019;, in Melanie Adresen and Nils Reiter (eds.), <italic>Computational Drama Analysis. Reflecting on Methods and Interpretations</italic> (Berlin 2024) 7-34.</mixed-citation></ref>
<ref id="r51"><mixed-citation>Underwood, Ted, David Bamman, and Sabrina Lee, &#x2018;The Transformation of Gender in English-Language Fiction&#x2019;, <italic>Journal of Cultural Analytics</italic> 3 (2018/2), <ext-link ext-link-type="doi" xlink:href="10.22148/16.019">https://doi.org/10.22148/16.019</ext-link> (Accessed on 2 September 2025).</mixed-citation></ref>
<ref id="r52"><mixed-citation>Vashisth, Pradeep, and Kevin Meehan, &#x2018;Gender Classification using Twitter Text Data&#x2019;, in <italic>2020 31st Irish Signals and Systems Conference (ISSC)</italic> (Letterkenny 2020) 1-6.</mixed-citation></ref>
<ref id="r53"><mixed-citation>Vergeer, Tim, and Cornelis van der Haven, &#x2018;The Travesty of Egoism. Same-Gender Passion and Homosocial Desire in a Dutch Seventeenth-Century Morality Play&#x2019;, <italic>Interdisciplinary Literary Studies</italic> 19 (2017/4) 529-550.</mixed-citation></ref>
<ref id="r54"><mixed-citation>Vitse, Sven, &#x2018;De kracht van foute voorspellingen. Een methodische reflectie op machine learning aan de hand van onderzoek naar genderrepresentatie&#x2019;, <italic>Spiegel der Letteren</italic> 66 (2024/2) 169-199.</mixed-citation></ref>
<ref id="r55"><mixed-citation>Vries, Wietse de, et al., &#x2018;BERTje. A Dutch BERT Model&#x2019;, <italic>arXiv:1912.0958</italic>, 19 December 2019, <ext-link ext-link-type="uri" xlink:href="http://arxiv.org/abs/1912.09582">http://arxiv.org/abs/1912.09582</ext-link> (Accessed on 21 August 2025).</mixed-citation></ref>
<ref id="r56"><mixed-citation>Vries, Wietse de, Martijn Wieling, and Malvina Nissim, &#x2018;DUMB. A Benchmark for Smart Evaluation of Dutch Models&#x2019;, in Houda Bouamor, Juan Pino, and Kalika Bali (eds.), <italic>Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing</italic> (Singapore 2023) 7221-7241.</mixed-citation></ref>
<ref id="r57"><mixed-citation>Wang, Liang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, and Furu Wei, &#x2018;Multilingual E5 Text Embeddings. A Technical Report&#x2019;, <italic>arXiv:2402.05672</italic>, 8 February 2024, <ext-link ext-link-type="doi" xlink:href="10.48550/arXiv.2402.05672">https://doi.org/10.48550/arXiv.2402.05672</ext-link> (Accessed on 2 September 2025).</mixed-citation></ref>
</ref-list>
<fn-group>
<fn id="fn1"><label>1</label><p>Grijp, &#x2018;Boys and Female Impersonators&#x2019;, 134.</p></fn>
<fn id="fn2"><label>2</label><p>Orgel, <italic>Impersonations</italic>; Bloom, <italic>Voice in Motion</italic>; Brown, &#x2018;&#x201C;Cattle of this colour&#x2019;&#x201D;; McManus, &#x2018;Women and English Renaissance Drama&#x2019;; McManus &#x2018;&#x201C;Sing it like Poor Barbary&#x201D;&#x2019;; Simpson, &#x2018;Networked Cross-Dressing&#x2019;.</p></fn>
<fn id="fn3"><label>3</label><p>Bloom, &#x2018;&#x201C;Thy Voice Squeaks&#x201D;&#x2019;.</p></fn>
<fn id="fn4"><label>4</label><p>Grijp, &#x2018;Boys and Female Impersonators&#x2019;.</p></fn>
<fn id="fn5"><label>5</label><p>Vergeer and Van der Haven, &#x2018;The Travesty of Egoism&#x2019;.</p></fn>
<fn id="fn6"><label>6</label><p>Pipkin, <italic>Rape in the Republic</italic>; Van Elk, &#x2018;Women Writers and the Dutch Stage&#x2019;; Van Marion, <italic>Gouden diva&#x2019;s</italic>; Ferket, &#x2018;Women about Women&#x2019;; See also: Dekker and Van de Pol, Mannen in vrouwenkleren.</p></fn>
<fn id="fn7"><label>7</label><p>To ensure reproducibility, our code and data are available at <ext-link ext-link-type="uri" xlink:href="https://github.com/awlassche/emlc-performing-gender">https://github.com/awlassche/emlc-performing-gender</ext-link> (Accessed on 14 August 2025).</p></fn>
<fn id="fn8"><label>8</label><p>Van Marion, <italic>Gouden diva&#x2019;s</italic>, 41.</p></fn>
<fn id="fn9"><label>9</label><p>Van Elk, &#x2018;&#x201C;Before she ends up in a brothel&#x201D;&#x2019;, 40-41; Grijp, &#x2018;Boys and Female Impersonators&#x2019;, 133.</p></fn>
<fn id="fn10"><label>10</label><p>Grijp, &#x2018;Boys and Female Impersonators&#x2019;, 18.</p></fn>
<fn id="fn11"><label>11</label><p>Van Marion, <italic>Gouden diva&#x2019;s</italic>, 59.</p></fn>
<fn id="fn12"><label>12</label><p>Grijp, &#x2018;Boys and Female Impersonators&#x2019;, 141.</p></fn>
<fn id="fn13"><label>13</label><p>Van Marion, <italic>Gouden diva&#x2019;s</italic>, 41; Blom, <italic>Podium van Europa</italic>, 55, 281. On the ONSTAGE database, see Blom, Nijboer, and Van der Zalm, &#x2018;ONSTAGE&#x2019;; <ext-link ext-link-type="uri" xlink:href="https://www.vondel.humanities.uva.nl/onstage/">https://www.vondel.humanities.uva.nl/onstage/</ext-link> (Accessed on 14 August 2025).</p></fn>
<fn id="fn14"><label>14</label><p>Van Marion, <italic>Gouden diva&#x2019;s</italic>, 53.</p></fn>
<fn id="fn15"><label>15</label><p>Van Marion, <italic>Gouden diva&#x2019;s</italic>, 51-52.</p></fn>
<fn id="fn16"><label>16</label><p>Pipkin, <italic>Rape in the Republic</italic>, 207.</p></fn>
<fn id="fn17"><label>17</label><p>Van Elk, &#x2018;Women Writers and the Dutch Stage&#x2019;, 191.</p></fn>
<fn id="fn18"><label>18</label><p>Ferket, &#x2018;Women about Women&#x2019;, 10.</p></fn>
<fn id="fn19"><label>19</label><p>Cf. Dietz, &#x2018;Twee schatkisten en hun erfenis&#x2019;, 212.</p></fn>
<fn id="fn20"><label>20</label><p>Van Marion, <italic>Gouden diva&#x2019;s</italic>, 23-27.</p></fn>
<fn id="fn21"><label>21</label><p>Van der Deijl (ed.), &#x2018;DutchDraCor&#x2019;; <ext-link ext-link-type="uri" xlink:href="https://dracor.org/dutch">https://dracor.org/dutch</ext-link> (Accessed on 21 August 2025).</p></fn>
<fn id="fn22"><label>22</label><p>Fischer et al., &#x2018;Programmable Corpora&#x2019;.</p></fn>
<fn id="fn23"><label>23</label><p>Smits-Veldt, <italic>Het Nederlandse renaissance-toneel</italic>, 22.</p></fn>
<fn id="fn24"><label>24</label><p>Meeus, <italic>Repertorium van het ernstige drama</italic>, 9.</p></fn>
<fn id="fn25"><label>25</label><p><ext-link ext-link-type="uri" xlink:href="https://www.vondel.humanities.uva.nl/onstage/plays/?task=plays&#x0026;lang=nl&#x0026;oby=title">https://www.vondel.humanities.uva.nl/onstage/plays/?task&#x003D;plays&#x0026;lang&#x003D;nl&#x0026;oby&#x003D;title</ext-link>(Accessed on 21 August 2025).</p></fn>
<fn id="fn26"><label>26</label><p>Mukherjee and Liu, &#x2018;Improving Gender Classification of Blog Authors&#x2019;; Vashisth and Meehan, &#x2018;Gender Classification using Twitter Text Data&#x2019;; Dinan et al., &#x2018;Multi-Dimensional Gender Bias Classification&#x2019;.</p></fn>
<fn id="fn27"><label>27</label><p>Hota et al., &#x2018;Performing Gender&#x2019;.</p></fn>
<fn id="fn28"><label>28</label><p>Vitse, &#x2018;De kracht van foute voorspellingen&#x2019;; See also: Smeets, &#x2018;Emancipatie en de roman&#x2019;.</p></fn>
<fn id="fn29"><label>29</label><p><italic>GysBERT-v2</italic> is a historical Dutch language model that is trained on Delpher, a database of historical newspapers, books, and journals spanning from 1618 to the end of the twentieth century, and the Digital Library of Dutch Literature (<sc>dbnl</sc>), a comprehensive digital library of Dutch literature (see Manjavacas and Fonteyn, &#x2018;Non-Parametric Word Sense Disambiguation&#x2019;). <italic>robbert-2023-large</italic> and <italic>bert-base-dutch-cased</italic> are two models trained on contemporary Dutch (see Delobelle and Remy &#x2018;RobBERT-2023&#x2019;; De Vries et al., &#x2018;BERTje&#x2019;). <italic>xlm-roberta-large</italic> is the best performing multilingual model as evaluated in the Dutch Embedding Benchmark (see <sc>dumb</sc>, <ext-link ext-link-type="uri" xlink:href="http://www.dumbench.nl/">www.dumbench.nl</ext-link>; Conneau et al., &#x2018;Unsupervised Cross-lingual Representation Learning at Scale&#x2019;; De Vries, &#x2018;<sc>dumb</sc>&#x2019;). Recent studies have shown that <italic>m-e5-large</italic> performs well on historical texts in other languages: Feldkamp et al., &#x2018;Canonical Status and Literary Influence&#x2019;; Wang et al. &#x2018;Multilingual E5 Text Embeddings&#x2019;.</p></fn>
<fn id="fn30"><label>30</label><p>Herrmann et al., &#x2018;Revisiting Style&#x2019;, 44.</p></fn>
<fn id="fn31"><label>31</label><p>Cf. Juola, &#x2018;The Rowling Case&#x2019;, 102.</p></fn>
<fn id="fn32"><label>32</label><p>Burrows, &#x2018;Delta&#x2019;; Hoover, &#x2018;Testing Burrows&#x2019;s Delta&#x2019;; Evert et al., &#x2018;Understanding and Explaining&#x2019;, 8.</p></fn>
<fn id="fn33"><label>33</label><p>Evert et al., &#x2018;Understanding and Explaining&#x2019;, 9.</p></fn>
<fn id="fn34"><label>34</label><p>Eder et al. &#x2018;Stylometry with R&#x2019;.</p></fn>
<fn id="fn35"><label>35</label><p>Moretti, &#x2018;Network Theory, Plot Analysis&#x2019;; Fischer et al. &#x2018;Network Dynamics, Plot Analysis&#x2019;; Van der Deijl, &#x2018;Orde en rationalisme&#x2019;; Stiller, &#x2018;The Small World&#x2019;; Trilcke et al., &#x2018;Detecting Small Worlds&#x2019;.</p></fn>
<fn id="fn36"><label>36</label><p>For a similar application of degree centrality of gendered characters in Spanish plays, see Dabrowska et al., &#x2018;Gender relations in Spanish theatre&#x2019;, 6, 12.</p></fn>
<fn id="fn37"><label>37</label><p>Koolen, <italic>Reading beyond the Female</italic>, 159.</p></fn>
<fn id="fn38"><label>38</label><p>Amelang, &#x2018;Playing Gender&#x2019;, 133; Hicke et al. &#x2018;Let every word&#x2019;, 95-101; Hota and Argamon, &#x2018;Gender in Shakespeare&#x2019;.</p></fn>
<fn id="fn39"><label>39</label><p>Underwood, &#x2018;The Transformation of Gender&#x2019;; Vitse, &#x2018;De kracht van foute voorspellingen&#x2019;; Koolen, <italic>Reading Beyond the Female</italic>.</p></fn>
<fn id="fn40"><label>40</label><p>Cf. Pierik, &#x2018;Privacy, Publicity and Gender in Amsterdam&#x2019;s Early Modern Urban Space&#x2019;.</p></fn>
<fn id="fn41"><label>41</label><p>Van Gemert, &#x2018;Echte helden zie je zelden&#x2019;, 30; Meijer Drees, &#x2018;Burgemeester Van der Werf&#x2019;, 171-172.</p></fn>
<fn id="fn42"><label>42</label><p>Underwood, &#x2018;The Transformation of Gender&#x2019;.</p></fn>
</fn-group>
</back>
</article>