72 страниц. 2010 год. LAP Lambert Academic Publishing For determining a language's morphological specialties, it is needed to generate a corpus that represents the language. If there is a large scale Turkish corpus that involves all specialties of the language, some statistical properties of the Turkish language depending on the words can also be investigated. In this book, how must a large scale, comprehensive, understandable, easily used Turkish corpus be generated and determining an appropriate method to generate it, and also determining an efficient method to determine stem, root and suffixes of the words that are used to form this corpus are explained. This book is written to guide the researchers who work in the Natural Language Processing area and specially on generating large scale corpus.