Professional Documents
Culture Documents
SDL Learning and Development: Post-Editing Certification SDL Imt
SDL Learning and Development: Post-Editing Certification SDL Imt
DEVELOPMENT
Post-Editing Certification
SDL iMT
Over 20 customers,
more than 120
language pair
Dramatic
MT Group set up customisations
increase in the
to use RBMT in Team is re-branded number of words
a high quality Partnership with 2010 iMT (intelligent post-edited in
translation Language Machine Translation) SDL by trained
process Weaver and post-editors
Acquisition of training on how First post-editing
Rules Based to customise projects using SMT
2004 2012
Machine go into production
SMT engines
Translation
(RBMT) engine
2009 2011
2000
6
Definition of Post-Editing
• The “term used for the correction of
machine translation output by human
linguists/editors” (Veale and Way 1997)
• “Checking, proof-reading and revising
translations carried out by any kind of
translating automaton” (Gouadec 2007)
• “In basic terms, the task of the post-editor
is to edit, modify and/or correct pre-
translated text that has been processed by
a machine translation system from a
source language (a) in to target
language(s).“ (Allen 2003)
• This […] usually consists in automatically
translating the source file and providing the
result to the translator as a first version of
the text to be post-edited. This practice is
usually referred as MT post-editing and is
considerably promising in enhancing the
translation process both in terms of time
and quality (Vieira. L.; Specia, L. 2011)
• Following guidelines, the translators
correct the output from translation
memories and machine translation to
produce different levels of quality.
Gradually this activity, post-editing, is
becoming a more frequent activity in
localisation, as opposed to the full
translation of new texts. (Guerberof, Ana.
2009)
Why Use
Machine
Translation
Today’s Reality
• Multi-directional communication
• Proliferation of channels and
devices
• Channel relevance critical
• The ‘empowered’ consumer
• Cyclical and multi-directional
engagement process
9
Need for a MT Solution
10
Language growth on internet 2000 – 2010
Arabic +20x
Chinese +20x
Portuguese +9x
Spanish +7x
French +6x
EN +3x
11
Why the Need for Machine Translation?
12
Machine
Translation
Technologies
Understanding the Challenges of MT
14
Challenges of MT
Same sentence can be translated completely differently depending on context.
15
Different MT technologies
16
Rules-Based Machine Translation - RBMT
17
Challenges of RBMT
• Grammar is ambiguous
– I saw a man with a telescope
19
Statistical Machine Translation
Better known as Data-Driven Machine Translation
20
Statistical versus Rules-based
Statistical Rules-based
• Starting point for a new language direction is an • Each language pair is built by looking at
aligned corpus of 200 to 300 million words (uses construction of both source and target language
existing translation databases to build up (taking into account source and target grammar
language pairs) and vocabulary)
• Language pairs will differ in quality depending on • Rules and information encoded in the dictionaries
the quality and extent of the databases are used to analyse the source and generate the
translation
• System learns how to translate by analysing
statistical relationships between large volumes of • It takes a number of man years to develop a new
aligned source and target data language pair
• New language directions can be created quickly • Source content needs to be well written to
generate good output
• Client customisations possible (recommendation
is an aligned corpus of 1 to 5 million words of • End result will not flow like statistical output, no
relevant customer data) context-sensitivity
• Source content should be of acceptable quality • Easy to set terminology per client (terminology for
the dictionaries is translated to ensure
• More fluent output and some context-sensitivity
consistency)
• Little control on terminology apart from TM
21
Creating
Statistical
Machine
Translation
Output
How do you get high quality from MT?
23
SMT Engine Training
24
SDL iMT Baselines
26
SDL iMT Customised Engines
27
Training and
Testing
Process
SDL iMT System – Process Summary
Content MT Customisation
Production QA
Evaluation
Evaluate MT output
Post-Edit Update
Refine training Translation
or deploy for Memory
production
Integrate MT on
Translation process
SDL MT Translation
Server Memory
29
How to measure the quality of an engine?
Human evaluation Automatic evaluation
31
Improvement through Feedback
SDL iMT
Post-Editing
Post-Editing as an Opportunity for Translators
Technology changes are not a threat to the Post-editing brings MT and human
role of the translator, but can help make skills together
translators more efficient and competitive in
the market MT will not replace human translation!
35
The translation landscape is changing
36
Post-editing Integration into Production Environment
Post-Edit
Update
Translation Memory
37
Effective
Post-Editing
How to be a Good Post-Editor
39
Post-Editing vs. Review
40
Degrees of Post-Editing
41
Post-Editing to Understandable Quality
42
Typical Guidelines in Post-Editing to
Understandable Quality
43
Post-Editing and Conventional Translation
Make use of MT
output as much as
possible
Do not over-edit or
under-edit the machine
translation output
45
Post-editing Process
Ready Set Go
• Read source • Build round • Ensure all
text, then MT MT output elements
output • Don’t under or present
• Determine over edit • Correct
usable • Focus on grammar
elements accuracy • Check
terminology
46
Patterns to watch out for in SMT
Compound Context-dependent
Proper nouns
formation and terminology translated
hyphenation
Wrong preposition,
Capitalisation gender, agreement Antonyms
or verb inflection
47
Quality Expectations in Post-Editing
• Is post-edited MT capable of producing high quality documents for
publication?
• Yes
48
Use of SDL
BeGlobal
Baselines in
SDL Studio
SMT at your fingertips
51
Sources of
Reference
REFERENCES
• Slide 10
– MT: The New Lingua Franca, TAUS https://
www.taus.net/articles/mt-the-new-lingua-franca
• Slide 36
– Trends in Translation Pricing, Copyright © 2012 by Common Sense
Advisory, Inc., September 2012
53
Copyright © 2008-2013 SDL plc. All rights reserved. All company names, brand names, trademarks, service marks,
This presentation and its content are SDL confidential unless otherwise specified, and may not be copied, used or