000 05137nam a22005895i 4500
001 978-3-319-99004-0
003 DE-He213
005 20240423125058.0
007 cr nn 008mamaa
008 190206s2019 sz | s |||| 0|eng d
020 _a9783319990040
_9978-3-319-99004-0
024 7 _a10.1007/978-3-319-99004-0
_2doi
050 4 _aQA76.9.N38
072 7 _aUYQL
_2bicssc
072 7 _aCOM073000
_2bisacsh
072 7 _aUYQL
_2thema
082 0 4 _a006.35
_223
245 1 0 _aUsing Comparable Corpora for Under-Resourced Areas of Machine Translation
_h[electronic resource] /
_cedited by Inguna Skadiņa, Robert Gaizauskas, Bogdan Babych, Nikola Ljubešić, Dan Tufiş, Andrejs Vasiļjevs.
250 _a1st ed. 2019.
264 1 _aCham :
_bSpringer International Publishing :
_bImprint: Springer,
_c2019.
300 _aVI, 323 p. 63 illus., 39 illus. in color.
_bonline resource.
336 _atext
_btxt
_2rdacontent
337 _acomputer
_bc
_2rdamedia
338 _aonline resource
_bcr
_2rdacarrier
347 _atext file
_bPDF
_2rda
490 1 _aTheory and Applications of Natural Language Processing,
_x2192-0338
505 0 _aIntroduction -- Cross-language comparability and its Applications for MT (Bogdan Babych, Fangzhong Su, Anthony Hartley, Ahmet Aker, Monica Lestari Paramita, Paul Clough, Robert Gaizauskas) -- Collecting comparable corpora (Monica Lestari Paramita, Ahmet Aker, Paul Clough, Robert Gaizauskas, Nikos Glaros, Nikos Mastropavlos, Olga Yannoutsou, Radu Ion, Dan Ștefănescu, Alexandru Ceauşu, Dan Tufiș and Judita Preiss) -- Extracting data from comparable corpora (Mārcis Pinnis, Nikola Ljubešić, Dan Ştefănescu, Inguna Skadiņa, Marko Tadić, Tatjana Gornostaja, Špela Vintar, Darja Fišer) -- Mapping and aligning units from comparable corpora (Ahmet Aker, Alexandru Ceaușu, Yang Feng, Robert Gaizauskas, Sabine Hunsicker, Radu Ion, Elena Irimia, Dan Ștefănescu, Dan Tufiș) -- Training, enhancing, evaluating and using MT-Systems with comparable data (Bogdan Babych, Yu Chen, Andreas Eisele, Sabine Hunsicker, Mārcis Pinnis, Inguna Skadiņa, Raivis Skadiņš, Gregor Thurmair, Andrejs Vasiļjevs, Mateja Verlic, Xiaojun Zhang) -- New areas of application of comparable corpora (Reinhard Rapp, Vivian Xu, Michael Zock, Serge Sharoff, Richard Forsyth, Bogdan Babych, Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi) -- Appendices (Ahmet Aker, Radu Ion, Nikos Mastropavlos, Monica Paramita, Mārcis Pinnis, Dan Ştefănescu, Fangzhong Su, Gregor Thurmair, Elena Irimia, Nikola Ljubešić, Evangelos Kanoulas, Judita Preiss, Rob Gaizauskas, Paul Clough, Emma Barker, Nikos Glaros, Tiberiu Boroș, Inguna Skadiņa, Andrejs Vasiļjevs).
520 _aThis book provides an overview of how comparable corpora can be used to overcome the lack of parallel resources when building machine translation systems for under-resourced languages and domains. It presents a wealth of methods and open tools for building comparable corpora from the Web, evaluating comparability and extracting parallel data that can be used for the machine translation task. It is divided into several sections, each covering a specific task such as building, processing, and using comparable corpora, focusing particularly on under-resourced language pairs and domains. The book is intended for anyone interested in data-driven machine translation for under-resourced languages and domains, especially for developers of machine translation systems, computational linguists and language workers. It offers a valuable resource for specialists and students in natural language processing, machine translation, corpus linguistics and computer-assisted translation, and promotes the broader use of comparable corpora in natural language processing and computational linguistics.
650 0 _aNatural language processing (Computer science).
650 0 _aComputational linguistics.
650 0 _aData mining.
650 1 4 _aNatural Language Processing (NLP).
650 2 4 _aComputational Linguistics.
650 2 4 _aData Mining and Knowledge Discovery.
700 1 _aSkadiņa, Inguna.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
700 1 _aGaizauskas, Robert.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
700 1 _aBabych, Bogdan.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
700 1 _aLjubešić, Nikola.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
700 1 _aTufiş, Dan.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
700 1 _aVasiļjevs, Andrejs.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
710 2 _aSpringerLink (Online service)
773 0 _tSpringer Nature eBook
776 0 8 _iPrinted edition:
_z9783319990033
776 0 8 _iPrinted edition:
_z9783319990057
830 0 _aTheory and Applications of Natural Language Processing,
_x2192-0338
856 4 0 _uhttps://doi.org/10.1007/978-3-319-99004-0
912 _aZDB-2-SCS
912 _aZDB-2-SXCS
942 _cSPRINGER
999 _c173991
_d173991