A Comparison of Machine Translation Paradigms for Use in Black-Box Fuzzy-Match Repair

Similar documents
Overseas Market Access Requirements Notification - Animal Products Act 1999

Perplexity of n-gram and dependency language models

King Fahd University of Petroleum & Minerals College of Industrial Management

Exploring Food Aggression in Shelter Dogs

Genera&on of Image Descrip&ons. Tambet Ma&isen

Connecting Literature and Math - Component of STEM Curriculum

Adaptations of Turtles Lesson Plan (Level 1 Inquiry Confirmation)

[Boston March for Science 2017 photo Hendrik Strobelt]

ATLAS DE ANATOMIA HUMANA / ATLAS OF HUMAN ANATOMY (SPANISH EDITION) BY MARK NIELSEN, SHAWN MILLER

Dasher Web Service USER/DEVELOPER DOCUMENTATION June 2010 Version 1.1

JEAN K SOLER MALTA WICC TURKU Update on the ICPC-2-ICD-10 Thesaurus, the TRANSFoRm Project and the Archetype (Content) Model

Animal Language, Top 5 Most Amazing Examples of Animal Communication by Andrew Latham By Mark Henderson, Science Editor

NCHRP Project Production of a Major Update to the Highway Capacity Manual 2010

COMMISSION (2003/708/EC)

Grade 2 English Language Arts

Environmental vs Genetic Factors Argumentation (CER) Prompts

Dynamic Programming for Linear Time Incremental Parsing

Egg laying vs. Live Birth

Dog Off Leash Strategy

Dunbia 2017 Dunbia 2017

Machine Learning.! A completely different way to have an. agent acquire the appropriate abilities to solve a particular goal is via machine learning.

Use of monthly collected milk yields for the early detection of vector-borne emerging diseases.

Overview of the OIE PVS Pathway

Advanced Uses of Earned Value Management in Projects, Programmes and Portfolios

HCM 6: Highway Capacity Manual: A Guide for Multimodal Mobility Analysis

OIE Regional Commission for Europe Regional Work Plan Framework Version adopted during the 85 th OIE General Session (Paris, May 2017)

Limited English Proficiency Plan. Northern Oklahoma Development Authority. DBA: Cherokee Strip Transit. June 2017

Memorandum. To: Tim Walsh Date: April 16, From: Michael D. Loberg cc: MVCHI Review Team

The Dominant Animal Human Evolution And Environment Paul R Ehrlich

Effective Vaccine Management (EVM) Global Data Analysis

Genetics, a tool to prevent mastitis in dairy cows

4-H Dog Obedience Proficiency Program A Member s Guide

Eating Your Own Dog Food

Navajo gophers (15 marks)

288 Seymour River Place North Vancouver, BC V7H 1W6

Answers to Questions about Smarter Balanced 2017 Test Results. March 27, 2018

The Emergency Shelter Learning Series. Low-Barrier Access to Shelters for People and Their Animals

PAWS FOR INDEPENDENCE SCHOLARSHIPS

Managing AMR at the Human-Animal Interface. OIE Contributions to the AMR Global Action Plan

Promoting One Health : the international perspective OIE

The Scarlet Pimpernel (Webster's Spanish Thesaurus Edition) By Baroness Emmuska Orczy

151 West 26 th Street New York, NY, STUDY GUIDE

Grade 5 English Language Arts

Entailment above the word level in distributional semantics

The Netherlands and Katje the Windmill Cat

OIE Collaborating Centres Reports Activities

Original Cartoons: The Frederator Studio Postcards By Fred Seibert;Eric Homan READ ONLINE

A systematic review of zoonoses transmission and livestock/wildlife interactionspreliminary

FDA S ANTIPARASITIC RESISTANCE MANAGEMENT STRATEGY (ARMS)

Bambino By Desiree Granger

SYTLE FORMAL : The Online Dog Trainer In-Depth Review

4--Why are Community Documents So Difficult to Read and Revise?

Media Release 11 May 2017

It s that time of year again the Sacramento Area Animal Coalition (SAAC) will organize SPAY DAY SACRAMENTO 2012 on Sunday, February 26!

Humane Handling GMPs. A Regulatory Perspective. Craig Shultz, DVM Food Safety and Inspection Service Cargill-Taylor Beef Wyalusing, PA

Standard operating procedure

Grade 3, Prompt for Opinion Writing

REPORT FROM THE COMMISSION TO THE EUROPEAN PARLIAMENT AND THE COUNCIL. on systems restraining bovine animals by inversion or any unnatural position

GUIDELINES. Ordering, Performing and Interpreting Laboratory Tests in Veterinary Clinical Practice

Prevention Concepts & Solutions Inc.

Veterinary Education in Europe 2009 and beyond

If you are searching for the ebook by Francisco Alarcón Iguanas in the Snow: And Other Winter Poems / Iguanas en la Nieve: Y Otros Poemas de Invierno

INTRODUCTION & MEASURING ANIMAL BEHAVIOR

Netherland Dwarf Rabbits, The Complete Owner's Guide To Netherland Dwarf Bunnies, How To Care For Your Netherland Dwarf, Including Health, Breeding,

Antimicrobial Resistance Surveillance in the Americas

Creating Strategic Capital for EVM. EVA th June 2012 Andrew Hill PROJECT CONTROLS CONSULTING

Telemundo 23 KMUV-TV. Para La Costa Central. Telemundo for the Central Coast Monterey/Salinas

World Organization for Animal Health (OIE) and Animal Welfare Presentation to the National Farm Animal Care Council May 13, 2010

Dog training and behaviour skills: program overview

Grade 5, Prompt for Opinion Writing Common Core Standard W.CCR.1

SMT FINGER PRODUCT. Many different shape of SMT metal parts :

Learn more at LESSON TITLE: BRINGING UP BIRDY GRADE LEVEL: 2-3. TIME ALLOTMENT: One to two 45-minute class periods OVERVIEW:

News English.com Ready-to-use ESL / EFL Lessons

Expanded noun phrases and verbs to describe an underwater world

Muse Teacher Guide: February 2018

Lacey Blocker Vernon Parish Teacher Leader NBCT

Indigo Sapphire Bear. Newfoundland. Indigo Sapphire Bear. January. Dog's name: DR. NEALE FRETWELL. R&D Director

Handling missing data in matched case-control studies using multiple imputation

AUTOMATIC MILKING SYSTEMS AND MASTITIS

Effective Vaccine Management (EVM) Global Data Analysis

Grasshopper Dissection

LABRADOR RETRIEVER: LABRADOR RETRIEVER TRAINING - COMPLETE LABRADOR PUPPY TRAINING GUIDE, OBEDIENCE, POTTY TRAINING, AND CARE TIPS (RETRIEV

101 Uses For A Golden Retriever By Denver Bryan READ ONLINE

Meet the Larvae BROWARD COUNTY ELEMENTARY SCIENCE BENCHMARK PLAN. SC.F The student knows the basic needs of all living things FOR PERSONAL USE

2013 Holiday Lectures on Science Medicine in the Genomic Era

European Association of Establishments for Veterinary Document approved by the Executive Committee on January Education

Going Buggy by Guy Belleranti

Controllability of Complex Networks. Yang-Yu Liu, Jean-Jacques Slotine, Albert-Laszlo Barbasi Presented By Arindam Bhattacharya


Shepherding Behaviors with Multiple Shepherds

MOON PHASES FOR 2018, at Kitt Peak Times and dates are given in local time, zone = 7 hr West. They are generally better than +- 2 minutes.

MOON PHASES FOR 2019, at Kitt Peak Times and dates are given in local time, zone = 7 hr West. They are generally better than +- 2 minutes.

international news RECOMMENDATIONS

Testing Phylogenetic Hypotheses with Molecular Data 1

Application of Fuzzy Logic in Automated Cow Status Monitoring

Taking Care of a fish

Recommendation for the basic surveillance of Eudravigilance Veterinary data

My signature confirms that I will not discuss the content of the test with anyone until the end of the 5 day test window.

NEWS ENGLISH LESSONS.com

An Esterel Virtual Machine (EVM) Aruchunan Vaseekaran

Transcription:

A Comparison of Machine Translation Paradigms for Use in Black-Box Fuzzy-Match Repair AMTA 2018, Boston, March 21st, 2018 Rebecca Knowles John E. Ortega Philipp Koehn Johns Hopkins University Universitat d'alacant Johns Hopkins University

Overview Fuzzy-Match Repair Comparison of MT Paradigms Results & Analysis Future Work

Introduction to Fuzzy-Match Repair 01 02 03 The Source Sentence (s') The cat blinks when the dog arrives The TM Source (s) The cat runs when the dog arrives The TM Target (t) El gato corre cuando llega el perro Our Fuzzy-Match Repair algorithm will repair proposals from the TM and propose translation hypotheses closer to the source sentence

Introduction to Fuzzy-Match Repair The Translator When working with fuzzy matches, the translator has to make changes to transform t into an adequate translation of s'. Translation Proposals Our goal is to repair fuzzy matches and provide translation proposals so that the amount of post-editing by the translator is kept to a minimum.

FMR Algorithm 01 Align input source (s') to TM source (s) 02 Translate mismatches 03 Match translations to their TM target (t) 04 05 Build pairs of repair operators (, )(, ) Generate hypotheses (t*)

FMR Algorithm The blue dog barks (s source) The red dog barks (s tm-source) El perro rojo ladra (t tm-target) σ - The blue dog, blue dog, blue σ - The red dog, red dog, red - el perro azul, perro azul, azul - el perro rojo, perro rojo, rojo El perro azul ladra (t* the best (oracle) of many hypotheses)

Oracle Evaluation (for FMR) Get TUs that meet fuzzy-match threshold If no TU meets threshold, use MT. Otherwise, get highest scoring TU and produce all possible hypotheses. Select repair with minimum edit distance.

FMR Requirements Black-Box Translation Our approach to fuzzy-match repair allows the use of any external source of bilingual information (SBI) such as rule-based, statistical, or neural machine translation systems, dictionaries, and more...

Introduction to Fuzzy-Match Repair Previous Work Current Work FMR introduced Oracle evaluation on 3 language pairs 3 MT Paradigms Oracle performance eval. & sub-segment analysis

Machine Translation Paradigms Rule-Based (RB) Apertium Statistical (SMT) Moses Training: Europarl, News Commentary, DGT-TM 2011-13 + Large LM Neural (NMT) Nematus Training: Europarl, News Commentary + DGT-TM 2011-13

Results & Analysis Compare System Performance: Translation & Oracle FMR Direct Comparison of Two Best Systems Analysis of Sub-Segment Translations

System Performance SMT performs best for translation

System Performance SMT performs best for translation...but NMT performs best for FMR.

Direct Comparison: SMT vs. NMT NMT is able to repair more segments And it produces more repair options per segment On a subset of segments that NMT & SMT both repair: FMR performance is very similar between SMT and NMT More repair options (NMT) gives better FMR performance True under the oracle evaluation, but with a pessimal oracle, NMT suffers a greater performance drop than SMT

Sub-Segment Translations Source SMT NMT annex 3 ; it cannot be furnished 's authorities shall within el anexo 3 ; no podrán aportarse dentro de las autoridades de anexo 3 las autoridades de los estados miembros dispondrán de las autoridades nacionales competentes en el ( place and date ) ( lugar y fecha ) ( lugar y fecha ) ( 2 ( 2, ( 2

Future Work Fuzzy-match Repair paper presented at AMTA with initial idea and concept 2014 Black-Box MT paradigms and sub-segment analysis presented at AMTA 2018 2016 Idea formalized and algorithm released to the MT community at AMTA 2016 2018+ Formalize features for Quality Estimation in FMR to rank hypotheses with unseen reference.

Thank you! Rebecca Knowles (rknowles@jhu.edu) John E. Ortega (jeo10@alu.ua.es) Philipp Koehn (phi@jhu.edu)

This work was partially supported by a National Science Foundation Graduate Research Fellowship under Grant No. DGE-1232825 (to the first author) and by the Spanish government through the EFFORTUNE (TIN2015-69632-R) project (the second author). Any opinion, findings, and conclusions or recommendations expressed in this material are those of the authors(s) and do not necessarily reflect the views of the National Science Foundation.