A computer science academic vocabulary list

Master's thesis, Portland State University, 2020

Recommended citation: Roesler, D. (2020). A computer science academic vocabulary list [Master's thesis, Portland State University]. https://doi.org/10.15760/etd.7414

Abstract

This thesis documents the development of the Computer Science Academic Vocabulary List (CSAVL), a pedagogical tool intended for use by English for Specific Purposes educators and materials developers. A 3.5-million-word corpus of academic computer science textbooks and journal articles was compiled to produce the CSAVL. This study draws on the improved methodologies used in the creation of recent lemma-based word lists such as the Academic Vocabulary List (AVL) and the Medical Academic Vocabulary List (MAVL), which take into account the discipline-specific meanings of academic vocabulary. The CSAVL provides specific information for each entry, including part of speech and CS-specific meanings, to show users how each item is used in academic CS. Based on the comparative analyses performed in this study, the CSAVL was found to be a more efficient tool for reaching a minimal level of academic CS reading comprehension than the widely used Academic Word List (AWL) or the combination of the AWL with the Computer Science Word List (CSWL). Through coverage tests performed on a variety of corpora, the CSAVL was shown to be representative of the written language of academic computer science and focused on the lemmas most relevant to written academic CS.
