Loading...
Thumbnail Image
Item

Introducing the Single Player Offline Game Corpus (SPOC): A corpus of seven registers from digital role-playing games

Dixon, Daniel
Citations
Altmetric:
Abstract

This paper describes the compilation and design of the Single Player Offline Game Corpus (SPOC), which is being made freely available for research and educational purposes. The SPOC was compiled by extracting the localization files from the digital directories of four popular commercial digital role-playing games: Divinity: Original Sin II, Fallout 4, the Elder Scrolls V: Skyrim, and the Witcher 3: Wild Hunt. The 3.7-million-word corpus contains more than 30,000 texts and is unique from other game corpora in that it has the following three characteristics: (1) the texts are categorized into seven registers using Biber and Conrad's (2019) register framework, (2) texts are systematically parsed into the smallest meaningful units of observation, and (3) all texts were compiled from the data files of the games themselves. Nearly all language use in the four games is accounted for and parsed into register categories based on their underlying situational characteristics, in particular the communicative purposes and the associated contexts in which the texts appear in the games.

Comments
This is an Author’s Original Manuscript of an article to be published by Edinburgh University Press in Corpora.
Description
Date
2023-01-01
Journal Title
Journal ISSN
Volume Title
Publisher
Research Projects
Organizational Units
Journal Issue
Keywords
game corpus, digital games, video games, register analysis, NLP data
Citation
Dixon, D. H. 2024. Introducing the Single Player Offline Game Corpus (SPOC): A corpus of seven registers from digital role-playing games. Corpora 19(1).
Embargo Lift Date
DOI
Embedded videos