History, Features, and Typology of Language Corpora

This book discusses key issues of corpus linguistics like the definition of the corpus, primary features of a corpus, and utilization and limitations of corpora. It presents a unique classification scheme of language corpora to show how they can be studied from the perspective of genre, nature, text...

Full description

Bibliographic Details
Main Authors: Dash, Niladri Sekhar, Arulmozi, S. (Author)
Format: eBook
Language:English
Published: Singapore Springer Nature Singapore 2018, 2018
Edition:1st ed. 2018
Subjects:
Online Access:
Collection: Springer eBooks 2005- - Collection details see MPG.ReNa
LEADER 03147nmm a2200325 u 4500
001 EB001763570
003 EBX01000000000000000969474
005 00000000000000.0
007 cr|||||||||||||||||||||
008 180302 ||| eng
020 |a 9789811074585 
100 1 |a Dash, Niladri Sekhar 
245 0 0 |a History, Features, and Typology of Language Corpora  |h Elektronische Ressource  |c by Niladri Sekhar Dash, S. Arulmozi 
250 |a 1st ed. 2018 
260 |a Singapore  |b Springer Nature Singapore  |c 2018, 2018 
300 |a XXIX, 293 p. 75 illus., 17 illus. in color  |b online resource 
505 0 |a 1. Definition of Corpus -- 2. Features of Corpus -- 3. Genre of Text -- 4. Nature of Data -- 5. Type and Purpose of Text -- 6. Nature of Text Application -- 7. Parallel Translation Corpus -- 8. Web Text Corpus -- 9. Pre-Digital Corpora (Part-I) -- 10. Pre-Digital Language Corpora (Part-2) -- 11. Digital Text Corpora (Part-I) -- 12. Digital Text Corpora (Part-II) -- 13. Digital Speech Corpora -- 14. Utilization of Language Corpora -- 15. Limitations of Language Corpora 
653 |a Language Teaching and Learning 
653 |a Computational Linguistics 
653 |a Computational linguistics 
653 |a Language and languages / Study and teaching 
653 |a Natural Language Processing (NLP) 
653 |a Natural language processing (Computer science) 
700 1 |a Arulmozi, S.  |e [author] 
041 0 7 |a eng  |2 ISO 639-2 
989 |b Springer  |a Springer eBooks 2005- 
028 5 0 |a 10.1007/978-981-10-7458-5 
856 4 0 |u https://doi.org/10.1007/978-981-10-7458-5?nosfx=y  |x Verlag  |3 Volltext 
082 0 |a 410.285 
520 |a This book discusses key issues of corpus linguistics like the definition of the corpus, primary features of a corpus, and utilization and limitations of corpora. It presents a unique classification scheme of language corpora to show how they can be studied from the perspective of genre, nature, text type, purpose, and application. A reference to parallel translation corpus is mandatory in the discussion of corpus generation, which the authors thoroughly address here, with a focus on Indian language corpora and English. Web-text corpus, a new development in corpus linguistics, is also discussed with elaborate reference to Indian web text corpora. The book also presents a short history of corpus generation and provides scenarios before and after the advent of computer-generated digital corpora. This book has several important features: it discusses many technical issues of the field in a lucid manner; contains extensive new diagrams and chartsfor easy comprehension; and presents discussions in simplified English to cater to the needs of non-native English readers. This is an important resource authored by academics who have many years of experience teaching and researching corpus linguistics. Its focus on Indian languages and on English corpora makes it applicable to students of graduate and postgraduate courses in applied linguistics, computational linguistics and language processing in South Asia and across countries where English is spoken as a first or second language.