Text Variability Measures in Corpus Design for Setswana Lexicograph PDF ePub eBook

Books Info:

Text Variability Measures in Corpus Design for Setswana Lexicograph free pdf This book is about the design of a Setswana corpus for lexicography. While various corpora have been compiled and a variety of corpora-based researches attempted in African languages, no effort has been made towards corpus design. Additionally, although extensive analysis of the Setswana language has been done by missionaries, grammarians and linguists since the 1800s, none of such research is in corpus design. Most research has been largely on the grammatical study of the language. The recent corpora research in African languages in general has been on the use of corpora for the compilation of dictionaries and little of it is in corpus design. Pioneers of this kind of corpora research in African languages are Prinsloo and De Schryver (1999), De Schryver and Prisloo (2000 and 2001) and Gouws and Prisloo (2005). Because of a lack of research in corpora design particularly in African languages, this book is an attempt at filling that gap, especially for Setswana. It is hoped that the finding of this study will inspire similar designs in other languages comparable to Setswana. We explore corpus design by focusing on measuring a variety of text types for lexical richness at comparable token points. The study explores the question of whether a corpus compiled for lexicography must comprise a variety of texts drawn from different text types or whether the quality of retrieved information for lexicographic purposes from a corpus comprising diverse text varieties could be equally extracted from a corpus with a single text type. This study therefore determines whether linguistic variability is crucial in corpus design for lexicography.

About Thapelo J. Otlogetswe

Dr. Otlogetswe is a Senior Lecturer in English linguistics and lexicography in the Department of English. He holds Bachelor of Arts and a Post Graduate Diploma in Education from the University of Botswana. He read for M.Phil in Comparative Linguistics and Philology at the University of Oxford. His doctoral studies in Corpus Linguistics were done at the University of Brighton and the University of Pretoria. His research is in lexical computing and corpus lexicography, particularly that of the Setswana language which he is passionate about. His research also includes computational and statistical genre and text type analysis, Setswana names, and Setswana rhyming patterns. He has been involved in the development of a Setswana spellchecker (for OpenOffice) and the compilation of a multi-million token Setswana corpus. Dr. Otlogetswe has published a number of books amongst these: 'English-Setswana Dictionary' and 'Poeletso-medumo ya Setswana: a Setswana Rhyming dictionary' and 'MLA Kgasa: a pioneer Setswana lexicographer'. He has also co-authored a Setswana orthography book: 'Mokwalo o o lolameng wa Setswana'. Dr. Otlogetswe led the groundbreaking translation work on the Setswana Google Search which has made it possible for people to access the Google search interface in the Setswana language. He is a member of the African Association for Lexicography (Afrilex) as well as a commissioner of the Setswana Commission established by the Academy of African Languages (ACALAN), a language arm of the African Union.

Details Book

Author : Thapelo J. Otlogetswe
Publisher : Cambridge Scholars Publishing
Data Published : 01 January 2011
ISBN : 1443826375
EAN : 9781443826372
Format Book : PDF, Epub, DOCx, TXT
Number of Pages : 330 pages
Age + : 15 years
Language : English
Rating :

Reviews Text Variability Measures in Corpus Design for Setswana Lexicograph



17 Comments Add a comment




Related eBooks Download


  • Doing Corpus Linguistics free pdfDoing Corpus Linguistics

    Doing Corpus Linguistics offers a practical step-by-step introduction to corpus linguistics. making use of widely available corpora and of a register analysis-based theoretical framework to provide students in Applied Linguistics and TESOL with the understanding and skills necessary to meaningfully analyze corpora and carry out successful corpus-based research..


  • A  Taste for Corpora free pdfA Taste for Corpora

    The eleven contributions to this volume. written by expert corpus linguists. tackle corpora from a wide range of perspectives and aim to shed light on the numerous linguistic and pedagogical uses to which corpora can be put..


  • Corpus Linguistics free pdfCorpus Linguistics

    A corpus is a collection of specimens of a language as used in real life. in writing and/or speech. Corpus lingustics is research. carried out in university linguistics departments and computing departments (and nowadays in industrial research labs too)..


  • Language, Corpus and Gesture free pdfLanguage, Corpus and Gesture

    This book proposes the use of multimodal corpora in order to examine spoken discourse more effectively and with greater accuracy. Current corpora are invaluable resources for generating accurate and objective analyses of patterns of language use..


  • Linguistic Variation in the Shakespeare Corpus free pdfLinguistic Variation in the Shakespeare Corpus

    This study investigates the morpho-syntactic variability of the second person pronouns in the Shakespeare Corpus. seeking to elucidate the factors that underlie their choice..


  • Text Variability Measures in Corpus Design for Setswana Lexicograph free pdfText Variability Measures in Corpus Design for Setswana Lexicograph

    . This book is about the design of a Setswana corpus for lexicography. While various corpora have been compiled and a variety of corpora-based researches attempted in African languages, no effort has