Transformers for 🇻🇦→🇬🇧 NMT

Started by Geremia, April 11, 2026, 01:52:05 AM


Geremia

Excellent explanation of Transformers: the "3Blue1Brown visualizations and explanations by Grant Sanderson," cited in Natural Language Processing in Action §9.2.2 as "a mind-expanding walk through the modern GPT architecture":
(from the full Neural Nets playlist)
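
The centerpiece those videos build up to is scaled dot-product attention, Attention(Q, K, V) = softmax(QKᵀ/√d_k)·V, as defined in the Vaswani et al. paper cited below. Here is a minimal numpy sketch of just that operation; the token count, dimensions, and random inputs are toy values of my own choosing, not anything from the videos:

Code:
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row max before exponentiating, for numerical stability.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)      # (n_queries, n_keys) similarity matrix
    return softmax(scores, axis=-1) @ V  # each output row is a weighted mix of V's rows

# Toy sizes (illustrative only): 4 tokens, d_k = 8, d_v = 16.
rng = np.random.default_rng(0)
Q = rng.normal(size=(4, 8))
K = rng.normal(size=(4, 8))
V = rng.normal(size=(4, 16))
print(attention(Q, K, V).shape)  # -> (4, 16)

In a full Transformer, Q, K, and V are learned linear projections of the token embeddings, and many such attention heads run in parallel per layer.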

Geremia

AquinasLatinEnglish / AquinasLatinEnglishModel uses Transformers and Byte-Pair Encoding.

original Transformers paper:
  • Vaswani, Ashish, Noam Shazeer, Niki Parmar, et al. "Attention Is All You Need." arXiv:1706.03762. Preprint, arXiv, August 2, 2023 [1st ed.: 2017].

original byte-pair encoding (BPE) paper:
  • Gage, Philip. "A New Algorithm for Data Compression." The C Users Journal 12, no. 2 (February 1994).
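
For orientation: BPE builds a subword vocabulary by starting from single characters and repeatedly merging the most frequent adjacent symbol pair into a new symbol. A minimal sketch of that merge-learning loop; the tiny Latin corpus and merge count are illustrative stand-ins, not taken from the AquinasLatinEnglish code:

Code:
from collections import Counter

def bpe_merges(words, num_merges):
    """Learn BPE merges: repeatedly fuse the most frequent adjacent symbol pair."""
    # Each word is a tuple of symbols, starting as single characters.
    vocab = Counter(tuple(w) for w in words)
    merges = []
    for _ in range(num_merges):
        # Count adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for word, freq in vocab.items():
            for a, b in zip(word, word[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        best = max(pairs, key=pairs.get)
        merges.append(best)
        # Rewrite every word with the chosen pair fused into one symbol.
        new_vocab = Counter()
        for word, freq in vocab.items():
            out, i = [], 0
            while i < len(word):
                if i + 1 < len(word) and (word[i], word[i + 1]) == best:
                    out.append(word[i] + word[i + 1])
                    i += 2
                else:
                    out.append(word[i])
                    i += 1
            new_vocab[tuple(out)] += freq
        vocab = new_vocab
    return merges

# Toy Latin corpus (illustrative only).
corpus = "dominus deus sanctus spiritus filius".split()
print(bpe_merges(corpus, 5))

On this corpus the first merge is ('u', 's'), since every word ends in -us; a real tokenizer learns thousands of merges over a full corpus.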

Geremia

Quote from: Geremia on April 18, 2026, 11:25:11 PM
original Transformers paper:
  • Vaswani, Ashish, Noam Shazeer, Niki Parmar, et al. "Attention Is All You Need." arXiv:1706.03762. Preprint, arXiv, August 2, 2023 [1st ed.: 2017].
Another good explanation of Transformers: