8 July 2021

Assessing Wargame Effectiveness: Using Natural Language Processing to Evaluate Wargaming Dynamics and Outcomes

Leah Windsor & Susan Allen

Summary: Group decision-making research, while well-established, is not applied in wargames with a win / lose focus. The deliberative data within wargaming can yield predictive metrics for game outcomes. Computational text analysis illuminates participant effects, such as status, gender, and experience. Analyzing participants’ language can provide insight into the intra-group and inter-group dynamics that exclude or invite potential solutions.

Text: The outcome of wargames reveals who wins and loses – but how do participants and strategists know if this is the optimal outcome from the range of potential outcomes? To understand why groups make particular decisions that lead to success or failure in wargames, the authors focus on the intra-group and inter-group communication that transpires during the wargame itself. The processes of group dynamics influence the outcomes of wargaming exercises, yet little attention is paid to these deliberations. Implicit biases manifest in language and other multimodal signals that influence participants and shape the process of negotiations [1][2].

A novel approach to analyzing wargames would include a process that informs the outcome and would model communicative interchanges computationally by examining linguistic features of participants’ deliberations. Participants’ exchanges and deliberations influence the dynamics within and across wargaming exercises and rounds of play. At present, the authors are aware of no computational models that assess the intra-group and inter-group deliberations in wargaming. A wealth of research using computational text-as-data approaches has established that language has predictive power in analyzing attributes like hierarchy, deception, and closeness [3][4][5].

Examining group dynamics is essential for understanding military and foreign policy decision-making because such choices are rarely made by individuals, particularly in democracies, but also within the winning coalition in autocracies. Although deliberative group dynamics are affected by emotions, pride, status, reputation, and communication failures, these dynamics are seldom studied[6]. Natural language processing (NLP) approaches can help reveal why teams arrive at various outcomes, how power structures evolve and change within groups during deliberations, what patterns of group deliberation emerge across iterations, and how biases, whether implicit or introduced through participant selection, affect the process of deliberations and outcomes. Because the dialogue patterns of participants have not been evaluated using the multimodal methods proposed, the authors anticipate that NLP will provide agenda-setting contributions to both the scientific and DoD communities.

To illustrate this point, the authors analyzed some of the communications from a wargaming exercise, Counter-Da’esh influence operations: Cognitive space narrative simulation insights[7]. Using computational linguistics techniques, the authors analyzed the use of language related to positive emotion over time, by rounds, across teams in this wargaming simulation. NLP can explore several aspects of between-group and within-group communications, as shown in Figure 1. First, NLP can compare the patterns of language between teams that lead to different outcomes, such as which team wins or loses.

Second, NLP can model the language relationship between teams to understand which team is leading, and which team is following. Lexical entrainment, semantic similarity, and linguistic style-matching all refer to the process of speakers aligning their language as they collaborate and interact more [8][9][10]. This is visible especially in Rounds 2 and 3 where the Red and Blue teams show similar patterns of positive emotion language use, although with different magnitudes.
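
As a minimal sketch of the style-matching idea, the Python below compares how often two teams use a few categories of function words. It assumes one commonly used formulation, in which each category is scored as 1 - |a - b| / (a + b + 0.0001) and the scores are averaged. The word lists, the function names, and the two example turns are hypothetical and are not drawn from the exercise discussed here.

```python
import re

# Hypothetical, heavily abbreviated function-word categories (assumption);
# a real analysis would use a validated dictionary.
FUNCTION_WORDS = {
    "pronouns": {"i", "we", "you", "they", "it"},
    "articles": {"a", "an", "the"},
    "prepositions": {"in", "on", "to", "of", "with"},
    "negations": {"no", "not", "never"},
}


def category_rates(text):
    """Return the share of tokens in `text` that fall in each category."""
    tokens = re.findall(r"[a-z']+", text.lower())
    total = len(tokens) or 1  # avoid dividing by zero on empty text
    return {
        name: sum(token in words for token in tokens) / total
        for name, words in FUNCTION_WORDS.items()
    }


def style_matching(text_a, text_b):
    """Average per-category similarity between two texts (1.0 = identical rates)."""
    rates_a = category_rates(text_a)
    rates_b = category_rates(text_b)
    scores = [
        1 - abs(rates_a[c] - rates_b[c]) / (rates_a[c] + rates_b[c] + 0.0001)
        for c in FUNCTION_WORDS
    ]
    return sum(scores) / len(scores)


# Invented example turns, one per team.
blue_turn = "We will not move to the border"
red_turn = "They never agreed to it in the talks"
score = style_matching(blue_turn, red_turn)
```

Computing this score for each round would give a rough trace of whether the teams' styles converge as play continues. Deciding which team leads and which follows would additionally require comparing each turn with the turns that precede it.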

Third, this analysis can be approached with more granularity to examine the individual participants within groups, over time, and between rounds, to determine who are the thought leaders, influencers, and idea entrepreneurs with the greatest power of persuasion. Sentiment analysis has been used to explain how leaders use emotionally evocative language to persuade followers, where positive emotion leads to improved public opinion ratings[11].
Figure 1. Positive emotion by round, over time, and across teams for ICONS wargaming exercise
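
A measure like the one plotted in Figure 1 can be approximated with a simple dictionary count. The sketch below is illustrative only: the word list, the function names, and the messages are invented, and the lexicon the authors actually used is not specified in this article.

```python
import re
from collections import defaultdict

# Hypothetical mini-lexicon of positive-emotion words (assumption);
# a real study would use a validated dictionary.
POSITIVE_WORDS = {"agree", "good", "great", "hope", "support", "thanks", "welcome"}


def positive_emotion_rate(text):
    """Return the share of tokens in `text` that appear in POSITIVE_WORDS."""
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return 0.0
    hits = sum(1 for token in tokens if token in POSITIVE_WORDS)
    return hits / len(tokens)


def rate_by_team_and_round(messages):
    """Pool messages per (team, round) and score each pooled text."""
    pooled = defaultdict(list)
    for team, round_number, text in messages:
        pooled[(team, round_number)].append(text)
    return {
        key: positive_emotion_rate(" ".join(texts))
        for key, texts in pooled.items()
    }


# Invented example messages: (team, round, text).
messages = [
    ("Blue", 1, "We agree and support the proposal"),
    ("Red", 1, "We reject the proposal"),
    ("Blue", 2, "Thanks, this is a good outcome"),
    ("Red", 2, "We hope talks continue"),
]
rates = rate_by_team_and_round(messages)
```

Plotting `rates` by round, with one line per team, would give a picture comparable in form to Figure 1. Normalizing by token count keeps talkative and quiet teams on the same scale.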

One of the critiques of wargaming has been that it is not always cross-culturally representative, which may introduce unintended cultural biases that lead to sub-optimal outcomes. Linguistic analysis of wargaming transcripts using cutting-edge natural language processing approaches like Bidirectional Encoder Representations from Transformers (BERT)[12] can help reveal how word meanings vary across issue area, culture, and context, and in doing so, provide objective metrics of language and cultural bias. Computational linguistics approaches can help reveal what people mean when they refer to particular concepts, and how this meaning is interpreted differently by other audiences. Figure 2 illustrates this point well: Windsor[13] plots the use of two semantically related terms, conflict and war, over time between 1900 and 2000 in six different languages. While the use of these terms generally follows similar patterns, it varies in three different ways: over time; by language; and by term.

In practice, war and conflict can be used interchangeably, but they also demonstrate remarkable differences over time and between languages. This means that when speakers use these terms, listeners may broadly share related interpretations of the words’ meanings, but room for misinterpretation clearly exists. The Sapir-Whorf Hypothesis suggests that language makes different interpretations of the world available based on the structure of language and lexicon available to speakers[14][15]. Using the BERT process on wargaming transcripts can help reveal instances where participants in the wargaming exercise misunderstand each other, and which concepts provide the most ambiguity and need the most clarification. In the field, understanding the opponent is part and parcel of the “winning hearts and minds” strategy. Gaps in cultural and linguistic understanding can create potentially dangerous, and unnecessary, chasms between people in conflict zones[16]. Computational linguistics approaches can help to identify these gaps so that military personnel, strategists, policymakers – and scholars – can better understand the optimal conditions for negotiating mutually beneficial outcomes.
Figure 2. Trends in Google NGram for “War” and “Conflict” by Language (1900-2018), taken from Windsor (2021)
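
To make the idea of detecting diverging word meanings concrete, the sketch below compares the words that surround a term in each team's utterances. It deliberately swaps in plain co-occurrence counts and cosine similarity for BERT's contextual embeddings so that it runs without a trained model. The function names, the window size, and the utterances are hypothetical.

```python
import math
import re
from collections import Counter


def context_counts(utterances, term, window=3):
    """Count words appearing within `window` tokens of `term` across utterances."""
    counts = Counter()
    for utterance in utterances:
        tokens = re.findall(r"[a-z']+", utterance.lower())
        for i, token in enumerate(tokens):
            if token == term:
                before = tokens[max(0, i - window):i]
                after = tokens[i + 1:i + 1 + window]
                counts.update(before + after)
    return counts


def cosine(a, b):
    """Cosine similarity between two count vectors (0.0 if either is empty)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Invented utterances in which each team uses "conflict" differently.
blue_talk = [
    "The conflict over water rights continues",
    "We want the conflict resolved through talks",
]
red_talk = [
    "Armed conflict means open fighting",
    "Any conflict invites a military response",
]
similarity = cosine(
    context_counts(blue_talk, "conflict"),
    context_counts(red_talk, "conflict"),
)
```

A low similarity for a shared term would flag it as a candidate for clarification between teams. Contextual embeddings would serve the same purpose while also capturing word order and synonyms, which raw counts ignore.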

Theories of group decision-making are becoming more sophisticated as scholars of international relations and foreign policy return to the foundations of behavioral psychology. While Janis[17] hypothesized about group-think a generation ago, more recently scholars focused on political psychology have highlighted the importance of experience, poly-think, and framing effects for groups[18]. While this research has advanced ideas about the nature of group decision-making, in practice the group dynamics that shape foreign policy decision-making are more opaque. Wargaming exercises provide a unique opportunity for exploring such theories. This approach builds on the extant literature on wargaming[19][20][21], and offers a path forward for advancing the study of wargaming using theoretically-grounded computational social science methods.
