Writing and Language Studies Faculty Publications and Presentations

Document Type


Publication Date



This chapter explores the many components involved in creating a student-based sociolinguistic corpus. Sociolinguistic corpora can serve as tools for social justice: they promote local (or often stigmatized) varieties of language, and students who speak those varieties often experience heightened language pride or greater esteem for their own language. Using the Corpus del Español en el Sur de Arizona (Carvalho 2012-) and the Corpus Bilingüe del Valle (Christoffersen and Bessett 2019-) as models, this chapter first details how to build the corpus, including the documents needed, the interview protocol, the transcription protocol, and the creation of a website. Next, since transcription is the most daunting and time-consuming task, we report the results of field trials with various technologically aided transcription methods to help improve the process. Lastly, we explore the ways in which the corpus can be used to promote social justice and how to incorporate the corpus into the classroom. By providing and explaining the tools necessary to create a corpus, we hope this chapter inspires others to create student-based semi-open sociolinguistic corpora throughout the United States and around the world.


Original published version available at https://doi.org/10.1515/9783110799392-011

Publication Title

Digital Flux, Linguistic Justice and Minoritized Languages




