Posts by Collection

portfolio

Substance Use Stigma Detection System for Reddit Data (2022)

System that leverages contextual embeddings combined with affective, social, and behavioral features to classify instances of substance use stigma in Reddit posts.

English and Arabic Sarcasm Detection in Tweets (2022)

English and Arabic Twitter sarcasm detection systems. Group project for University of Washington 573: NLP Systems and Applications.

H.P. Lovecraft RNN Text Generator (2021)

A GRU-based character-level language model trained on a corpus of the fiction of H.P. Lovecraft.

Deep Averaging Network (2021)

A Pytorch implementation of the Deep Averaging Network introduced in Iyyer et al (2015). Performs binary sentiment classification on the IMDB reviews dataset.

K-Nearest Neighbors Classifier (2020)

This code was created to classify a subset of the 20 newsgroups text dataset from sci-kit learn. Posts drawn from the ‘talk.politics.guns’, ‘talk.politics.mideast’, and ‘talk.politics.misc’ were converted to a bag of words representation, and classified using a ‘from scratch’ kNN implementation.

Decision Tree Classifier (2020)

The script builds a decision tree ‘from scratch’ using the training data (a subset of the 20 newsgroups text dataset from sci-kit learn), classifies the training and test data, and calculates the accuracy.

Viterbi Implementation for HMM POS Tagging (2020)

This script reads an HMM file produced by the MALLET machine learning toolkit and uses an implementation of the Viterbi algorithm to find the most probable tag sequence for the text.

Naive Bayes Language Classifier (2020)

A ‘from scratch’ naive Bayes classifier implementation that classifies fragments of text according to language category.

DNA Sequence Search Trie (2020)

A search trie implementation that locates DNA sequences in the human genome chromosome dataset produced by UCSC.

publications

A computer science academic vocabulary list

Master's thesis, Portland State University, 2020

This thesis documents the development of the Computer Science Academic Vocabulary List (CSAVL), a pedagogical tool intended for use by English-for-specific-purpose educators and material developers.

Recommended citation: Roesler, D. (2020). A computer science academic vocabulary list [Master's thesis, Portland State University]. https://doi.org/10.15760/etd.7414

When a bug is not a bug: An introduction to the Computer Science Academic Vocabulary List

Journal of English for Academic Purposes, 2021

This article presents the Computer Science Academic Vocabulary List (CSAVL), a pedagogical tool intended for use by English-for-specific-purpose educators and material developers.

Recommended citation: Roesler, D. (2021). When a bug is not a bug: An introduction to the Computer Science Academic Vocabulary List, Journal of English for Academic Purposes, 101044. https://doi.org/10.1016/j.jeap.2021.101044

(pre-print) Leveraging Contextual Embeddings with Affective, Social, and Behavioral Features for Substance Use Stigma Detection

under review, 2022

Using an annotated Reddit corpus, we train a set of binary classifiers, in which each classifier detects one of three substance use stigma types: Internalized Stigma, Anticipated Stigma, and Enacted Stigma. By combining RoBERTa contextual embeddings and affective, social, and behavioral features, we produce systems that identify instances of substance use stigma for all three stigma types and outperform RoBERTa-only baselines by up to 6.45 macro F1.

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015