Machine Learning in Translation Corpora Processing, Wolk, Krzysztof
Author: Inguna Skadiņa; Robert Gaizauskas; Bogdan Babych Title: Using Comparable Corpora for Under-Resourced Areas of Machine Translation ISBN: 3319990039 ISBN-13(EAN): 9783319990033 Publisher: Springer Price: 18167.00 RUB Availability: Available from supplier. Delivered to order.
Description: This book provides an overview of how comparable corpora can be used to overcome the lack of parallel resources when building machine translation systems for under-resourced languages and domains. It presents a wealth of methods and open tools for building comparable corpora from the Web, evaluating comparability and extracting parallel data that can be used for the machine translation task. It is divided into several sections, each covering a specific task such as building, processing, and using comparable corpora, focusing particularly on under-resourced language pairs and domains. The book is intended for anyone interested in data-driven machine translation for under-resourced languages and domains, especially for developers of machine translation systems, computational linguists and language workers. It offers a valuable resource for specialists and students in natural language processing, machine translation, corpus linguistics and computer-assisted translation, and promotes the broader use of comparable corpora in natural language processing and computational linguistics.
Author: Jean Véronis Title: Parallel Text Processing ISBN: 904815555X ISBN-13(EAN): 9789048155552 Publisher: Springer Price: 35218.00 RUB Availability: Available from supplier. Delivered to order.
Description: As I recalled in the introduction, the Rosetta stone won eternal fame as the prototype of parallel texts, but such texts are probably almost as old as the invention of writing.
Description: The series serves to propagate investigations into language usage, especially with respect to computational support. This includes all forms of text handling activity, not only interlingual translations, but also conversions carried out in response to different communicative tasks. Among the major topics are problems of text transfer and the interplay between human and machine activities.
Title: Approaching language variation through corpora ISBN: 3034312644 ISBN-13(EAN): 9783034312646 Publisher: Peter Lang Price: 19028.00 RUB Availability: Available from supplier. Delivered to order.
Description: A collection of papers using samples of real language data (corpora) to explore variation in the use of English. It celebrates the achievements of Toshio Saito, a pioneer in corpus linguistics within Japan and founder of the Japan Association for English Corpus Studies (JAECS).
Description: This book adopts a corpus-based critical discourse analysis approach and examines a corpus of newspaper articles from Pakistani and Indian publications to gain comparative insights into the ideological construction of China’s Belt and Road Initiative (BRI) and the China-Pakistan Economic Corridor (CPEC) within news discourses. This book contributes to the works on perceptions of BRI in English newspapers of India and Pakistan. A multi-billion-dollar project of BRI or the “One Belt One Road” (OBOR), CPEC symbolizes a vision for regional revival under China’s economic leadership and clout. Propelled by the Chinese Premier’s dream to revive the Chinese economy as well as to restructure and catalyze infrastructural development in Asia, BRI is aimed at connecting Asia via land and sea routes with Europe, Africa, and the Middle Eastern states.
Description: Posthumanism and Deconstructing Arguments presents a new and practical approach in Critical Discourse Studies, providing a data-driven method for the examination of arguments in the public sphere. This ground-breaking book shows how the reader can evaluate arguments from points of view other than their own, using digital
Author: Corrigan Title: Creating and Digitizing Language Corpora ISBN: 1137386444 ISBN-13(EAN): 9781137386441 Publisher: Springer Price: 13974.00 RUB Availability: Available from supplier. Delivered to order.
Description: This book unites a range of approaches to the collection and digitization of diverse language corpora. Its specific focus is on best practices identified in the exploitation of these resources in landmark impact initiatives across different parts of the globe. The development of increasingly accessible digital corpora has coincided with improvements in the standards governing the collection, encoding and archiving of ‘Big Data’. Less attention has been paid to the importance of developing standards for enriching and preserving other types of corpus data, such as data that captures the nuances of regional dialects. This book takes these best practices another step forward by addressing innovative methods for enhancing and exploiting specialized corpora so that they become accessible to wider audiences beyond the academy.
Author: Sandra Kuebler, Heike Zinsmeister Title: Corpus Linguistics and Linguistically Annotated Corpora ISBN: 1441164472 ISBN-13(EAN): 9781441164476 Publisher: Bloomsbury Academic Price: 25344.00 RUB Availability: Available from supplier. Delivered to order.
Description: Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but with the annotation come problems of the initial complexity of queries and query tools. This book gives a full, pedagogic account of this burgeoning field. Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, part-of-speech, syntactic, semantic and discourse-level, as well as advantages and challenges for such annotations. It covers the main annotated corpora for English (the Penn Treebank, the International Corpus of English, and OntoNotes) as well as a wide range of corpora for other languages. In its third part, search strategies required for different types of data are explored. All chapters are accompanied by exercises and by sections on further reading.
Author: Niladri Sekhar Dash; L. Ramamoorthy Title: Utility and Application of Language Corpora ISBN: 9811346887 ISBN-13(EAN): 9789811346880 Publisher: Springer Price: 9781.00 RUB Availability: Available from supplier. Delivered to order.
Description: This book discusses some of the basic issues relating to corpus generation and the methods normally used to generate a corpus. Since corpus-related research goes beyond corpus generation, the book also addresses other major topics connected with the use and application of language corpora, namely, corpus readiness in the context of corpus sanitation and pre-editing of corpus texts; the application of statistical methods; and various text processing techniques. Importantly, it explores how corpora can be used as a primary or secondary resource in English language teaching, in creating dictionaries, in word sense disambiguation, in various language technologies, and in other branches of linguistics. Lastly, the book sheds light on the status quo of corpus generation in Indian languages and identifies current and future needs. Discussing various technical issues in the field in a lucid manner, providing extensive new diagrams and charts for easy comprehension, and using simplified English, the book is an ideal resource for non-native English readers. Written by academics with many years of experience teaching and researching corpus linguistics, its focus on Indian languages and on English corpora makes it applicable to graduate and postgraduate students of applied linguistics, computational linguistics and language processing in South Asia and across countries where English is spoken as a first or second language.
Author: Liu Kanglong Title: Corpus-Assisted Translation Teaching: Issues and Challenges ISBN: 9811589976 ISBN-13(EAN): 9789811589973 Publisher: Springer Price: 15372.00 RUB Availability: Available from supplier. Delivered to order.
Description: This book sheds new light on corpus-assisted translation pedagogy, an intersection of three distinct but cognate disciplines: corpus linguistics, translation and pedagogy.
Author: S. Armstrong; Kenneth W. Church; Pierre Isabelle Title: Natural Language Processing Using Very Large Corpora ISBN: 0792360559 ISBN-13(EAN): 9780792360551 Publisher: Springer Price: 27251.00 RUB Availability: Available from supplier. Delivered to order.
Description: Intended for researchers who want to keep abreast of developments in corpus-based natural language processing. This work captures the essence of a series of successful workshops. It contains papers that cover a range of research topics in this field, including part-of-speech tagging, word sense disambiguation, parsing of real-life texts, and more.
Author: S. Armstrong; Kenneth W. Church; Pierre Isabelle Title: Natural Language Processing Using Very Large Corpora ISBN: 9048153492 ISBN-13(EAN): 9789048153497 Publisher: Springer Price: 27251.00 RUB Availability: Available from supplier. Delivered to order.
Description: The demand for international forums on corpus-based NLP has been expanding so rapidly that in 1995 SIGDAT was led to organize not only the Third Workshop on Very Large Corpora (Cambridge, Mass.
Logosfera LLC Tel: +7 (495) 980-12-10 www.logobook.ru