<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Publications | Samuel Larkin</title><link>https://samuellarkin.github.io/publications/</link><atom:link href="https://samuellarkin.github.io/publications/index.xml" rel="self" type="application/rss+xml"/><description>Publications</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Sat, 01 Nov 2025 00:00:00 +0000</lastBuildDate><image><url>https://samuellarkin.github.io/media/icon_hu7729264130191091259.png</url><title>Publications</title><link>https://samuellarkin.github.io/publications/</link></image><item><title>MSLC25: Metric Performance on Low-Quality Machine Translation, Empty Strings, and Language Variants</title><link>https://samuellarkin.github.io/publications/knowles-etal-2025-mslc25/</link><pubDate>Sat, 01 Nov 2025 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/knowles-etal-2025-mslc25/</guid><description/></item><item><title>NRC Systems for the WMT2025-LRSL Shared Task</title><link>https://samuellarkin.github.io/publications/larkin-etal-2025-nrc/</link><pubDate>Sat, 01 Nov 2025 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/larkin-etal-2025-nrc/</guid><description/></item><item><title>Challenges in Technical Regulatory Text Variation Detection</title><link>https://samuellarkin.github.io/publications/chikati-etal-2025-challenges/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/chikati-etal-2025-challenges/</guid><description/></item><item><title>Speech Generation for Indigenous Language Education</title><link>https://samuellarkin.github.io/publications/pine-2025101723/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/pine-2025101723/</guid><description/></item><item><title>MSLC24 Submissions to the 
General Machine Translation Task</title><link>https://samuellarkin.github.io/publications/larkin-etal-2024-mslc-24/</link><pubDate>Sun, 01 Dec 2024 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/larkin-etal-2024-mslc-24/</guid><description/></item><item><title>MSLC24: Further Challenges for Metrics on a Wide Landscape of Translation Quality</title><link>https://samuellarkin.github.io/publications/knowles-etal-2024-mslc-24/</link><pubDate>Fri, 01 Nov 2024 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/knowles-etal-2024-mslc-24/</guid><description/></item><item><title>Some Tradeoffs in Continual Learning for Parliamentary Neural Machine Translation Systems</title><link>https://samuellarkin.github.io/publications/knowles-etal-2024-tradeoffs/</link><pubDate>Sun, 01 Sep 2024 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/knowles-etal-2024-tradeoffs/</guid><description/></item><item><title>Speech Generation for Indigenous Language Education</title><link>https://samuellarkin.github.io/publications/pine-2024101723/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/pine-2024101723/</guid><description/></item><item><title>Metric Score Landscape Challenge (MSLC23): Understanding Metrics’ Performance on a Wider Landscape of Translation Quality</title><link>https://samuellarkin.github.io/publications/lo-etal-2023-metric/</link><pubDate>Fri, 01 Dec 2023 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/lo-etal-2023-metric/</guid><description/></item><item><title>Long to reign over us: A Case Study of Machine Translation and a New Monarch</title><link>https://samuellarkin.github.io/publications/knowles-larkin-2023-long/</link><pubDate>Sat, 01 Jul 2023 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/knowles-larkin-2023-long/</guid><description/></item><item><title>Terminology in Neural Machine 
Translation: A Case Study of the Canadian Hansard</title><link>https://samuellarkin.github.io/publications/knowles-etal-2023-terminology/</link><pubDate>Thu, 01 Jun 2023 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/knowles-etal-2023-terminology/</guid><description/></item><item><title>NRC-CNRC Systems for Upper Sorbian-German and Lower Sorbian-German Machine Translation 2021</title><link>https://samuellarkin.github.io/publications/knowles-larkin-2021-nrc/</link><pubDate>Mon, 01 Nov 2021 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/knowles-larkin-2021-nrc/</guid><description/></item><item><title>Like Chalk and Cheese? On the Effects of Translationese in MT Training</title><link>https://samuellarkin.github.io/publications/larkin-etal-2021-like/</link><pubDate>Sun, 01 Aug 2021 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/larkin-etal-2021-like/</guid><description/></item><item><title>NRC-CNRC Machine Translation Systems for the 2021 AmericasNLP Shared Task</title><link>https://samuellarkin.github.io/publications/knowles-etal-2021-nrc/</link><pubDate>Tue, 01 Jun 2021 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/knowles-etal-2021-nrc/</guid><description/></item><item><title>Machine Translation Reference-less Evaluation using YiSi-2 with Bilingual Mappings of Massive Multilingual Language Model</title><link>https://samuellarkin.github.io/publications/lo-larkin-2020-machine/</link><pubDate>Sun, 01 Nov 2020 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/lo-larkin-2020-machine/</guid><description/></item><item><title>NRC Systems for Low Resource German-Upper Sorbian Machine Translation 2020: Transfer Learning with Lexical Modifications</title><link>https://samuellarkin.github.io/publications/knowles-etal-2020-nrc-systems/</link><pubDate>Sun, 01 Nov 2020 00:00:00 
+0000</pubDate><guid>https://samuellarkin.github.io/publications/knowles-etal-2020-nrc-systems/</guid><description/></item><item><title>NRC Systems for the 2020 Inuktitut-English News Translation Task</title><link>https://samuellarkin.github.io/publications/knowles-etal-2020-nrc/</link><pubDate>Sun, 01 Nov 2020 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/knowles-etal-2020-nrc/</guid><description/></item><item><title>The Nunavut Hansard Inuktitut–English Parallel Corpus 3.0 with Preliminary Machine Translation Results</title><link>https://samuellarkin.github.io/publications/joanis-etal-2020-nunavut/</link><pubDate>Fri, 01 May 2020 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/joanis-etal-2020-nunavut/</guid><description/></item><item><title>Bursty event detection on social media</title><link>https://samuellarkin.github.io/publications/cai-2019/</link><pubDate>Thu, 01 Aug 2019 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/cai-2019/</guid><description/></item><item><title>Multi-Source Transformer for Kazakh-Russian-English Neural Machine Translation</title><link>https://samuellarkin.github.io/publications/littell-etal-2019-multi/</link><pubDate>Thu, 01 Aug 2019 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/littell-etal-2019-multi/</guid><description/></item><item><title>An example preprint / working paper</title><link>https://samuellarkin.github.io/publications/preprint/</link><pubDate>Sun, 07 Apr 2019 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/preprint/</guid><description>&lt;p>This work is driven by the results in my &lt;a href="https://samuellarkin.github.io/publication/conference-paper/">previous paper&lt;/a> on LLMs.&lt;/p>
&lt;p>Add the publication&amp;rsquo;s &lt;strong>full text&lt;/strong> or &lt;strong>supplementary notes&lt;/strong> here. You can use rich formatting such as including &lt;a href="https://docs.hugoblox.com/content/writing-markdown-latex/" target="_blank" rel="noopener">code, math, and images&lt;/a>.&lt;/p></description></item><item><title>Accurate semantic textual similarity for cleaning noisy parallel corpora using semantic machine translation evaluation metric: The NRC supervised submissions to the Parallel Corpus Filtering task</title><link>https://samuellarkin.github.io/publications/lo-etal-2018-accurate/</link><pubDate>Mon, 01 Oct 2018 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/lo-etal-2018-accurate/</guid><description/></item><item><title>Measuring sentence parallelism using Mahalanobis distances: The NRC unsupervised submissions to the WMT18 Parallel Corpus Filtering shared task</title><link>https://samuellarkin.github.io/publications/littell-etal-2018-measuring/</link><pubDate>Mon, 01 Oct 2018 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/littell-etal-2018-measuring/</guid><description/></item><item><title>EuroGames16: Evaluating Change Detection in Online Conversation</title><link>https://samuellarkin.github.io/publications/goutte-etal-2018-eurogames-16/</link><pubDate>Tue, 01 May 2018 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/goutte-etal-2018-eurogames-16/</guid><description/></item><item><title>NRC Machine Translation System for WMT 2017</title><link>https://samuellarkin.github.io/publications/lo-etal-2017-nrc/</link><pubDate>Fri, 01 Sep 2017 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/lo-etal-2017-nrc/</guid><description/></item><item><title>Cost Weighting for Neural Machine Translation Domain Adaptation</title><link>https://samuellarkin.github.io/publications/chen-etal-2017-cost/</link><pubDate>Tue, 01 Aug 2017 00:00:00 
+0000</pubDate><guid>https://samuellarkin.github.io/publications/chen-etal-2017-cost/</guid><description/></item><item><title>An example journal article</title><link>https://samuellarkin.github.io/publications/journal-article/</link><pubDate>Tue, 01 Sep 2015 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/journal-article/</guid><description>
&lt;p>Add the publication&amp;rsquo;s &lt;strong>full text&lt;/strong> or &lt;strong>supplementary notes&lt;/strong> here. You can use rich formatting such as including &lt;a href="https://docs.hugoblox.com/content/writing-markdown-latex/" target="_blank" rel="noopener">code, math, and images&lt;/a>.&lt;/p></description></item><item><title>Transferring markup tags in statistical machine translation: a two-stream approach</title><link>https://samuellarkin.github.io/publications/joanis-etal-2013-transferring/</link><pubDate>Sun, 01 Sep 2013 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/joanis-etal-2013-transferring/</guid><description/></item><item><title>An example conference paper</title><link>https://samuellarkin.github.io/publications/conference-paper/</link><pubDate>Mon, 01 Jul 2013 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/conference-paper/</guid><description>
&lt;p>Add the publication&amp;rsquo;s &lt;strong>full text&lt;/strong> or &lt;strong>supplementary notes&lt;/strong> here. You can use rich formatting such as including &lt;a href="https://docs.hugoblox.com/content/writing-markdown-latex/" target="_blank" rel="noopener">code, math, and images&lt;/a>.&lt;/p></description></item><item><title>PORT: a Precision-Order-Recall MT Evaluation Metric for Tuning</title><link>https://samuellarkin.github.io/publications/chen-etal-2012-port/</link><pubDate>Sun, 01 Jul 2012 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/chen-etal-2012-port/</guid><description/></item><item><title>Lessons from NRC’s Portage System at WMT 2010</title><link>https://samuellarkin.github.io/publications/larkin-etal-2010-lessons/</link><pubDate>Thu, 01 Jul 2010 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/larkin-etal-2010-lessons/</guid><description/></item><item><title>Incorporating Knowledge of Source Language Text in a System for Dictation of Document Translations</title><link>https://samuellarkin.github.io/publications/reddy-etal-2009-incorporating/</link><pubDate>Sat, 01 Aug 2009 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/reddy-etal-2009-incorporating/</guid><description/></item><item><title>PortageLive: delivering machine translation technology via virtualization</title><link>https://samuellarkin.github.io/publications/paul-etal-2009-portagelive/</link><pubDate>Sat, 01 Aug 2009 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/paul-etal-2009-portagelive/</guid><description/></item><item><title>Tightly Packed Tries: How to Fit Large Models into Memory, and Make them Load Fast, Too</title><link>https://samuellarkin.github.io/publications/germann-etal-2009-tightly/</link><pubDate>Mon, 01 Jun 2009 00:00:00 
+0000</pubDate><guid>https://samuellarkin.github.io/publications/germann-etal-2009-tightly/</guid><description/></item><item><title>PORTAGE in the NIST 2009 MT Evaluation</title><link>https://samuellarkin.github.io/publications/foster-09-portagein/</link><pubDate>Thu, 01 Jan 2009 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/foster-09-portagein/</guid><description/></item><item><title>NRC’s PORTAGE System for WMT 2007</title><link>https://samuellarkin.github.io/publications/ueffing-etal-2007-nrcs/</link><pubDate>Fri, 01 Jun 2007 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/ueffing-etal-2007-nrcs/</guid><description/></item><item><title>Manageable Phrase-based Statistical Machine Translation Models with Pseudo-code and Proofs</title><link>https://samuellarkin.github.io/publications/unknown/</link><pubDate>Mon, 01 Jan 2007 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/unknown/</guid><description/></item><item><title>PORTAGE: with Smoothed Phrase Tables and Segment Choice Models</title><link>https://samuellarkin.github.io/publications/johnson-etal-2006-portage/</link><pubDate>Thu, 01 Jun 2006 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/johnson-etal-2006-portage/</guid><description/></item><item><title>PORTAGE Phrase-Based System for Chinese-to-English Translation</title><link>https://samuellarkin.github.io/publications/article/</link><pubDate>Sun, 01 Jan 2006 00:00:00 +0000</pubDate><guid>https://samuellarkin.github.io/publications/article/</guid><description/></item></channel></rss>