Keywords

Egyptian Arabic, language sources, register variation

Abstract

The limited availability of Egyptian Arabic (EA) corpus resources, especially speech corpora, has left open opportunity for research into such dialect phenomena as register. In this paper we introduce a new two-millionword EA corpus, CALM. We perform a register analysis on EA between two subcorpora of CALM (i.e. Movies and Blogs), showing several features that vary between the two. A discussion follows about how annotation was carried out automatically, how it was handcorrected, and what the prospects are for carrying out similar studies using CALM.

Original Publication Citation

Michael Grant White and Deryle W. Lonsdale (2019). Verbs in Egyptian Arabic: a case for register variation. In: Mahmoud El-haj, Paul Rayson, Eric Atwell, and Lama Alsudias (Eds.),Proceedings of the 3rd Workshop on Arabic Corpus Linguistics (WACL-3); ACL AnthologyW19-5608; Association for Computational Linguistics, pp. 60-71. ISBN 978-1-950737-32-1.

Document Type

Peer-Reviewed Article

Publication Date

2019

Publisher

Association for Computational Linguistics

Language

English

College

Humanities

Department

Linguistics

University Standing at Time of Publication

Associate Professor

Included in

Linguistics Commons

Share

COinS