Skip to main content
Cornell University
Learn about arXiv becoming an independent nonprofit.
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs > arXiv:2510.15511

Help | Advanced Search

Computer Science > Machine Learning

(cs)
[Submitted on 17 Oct 2025 (v1), last revised 13 Mar 2026 (this version, v4)]

Title:Language Models are Injective and Hence Invertible

Authors:Giorgos Nikolaou, Tommaso Mencattini, Donato Crisostomi, Andrea Santilli, Yannis Panagakis, Emanuele Rodolà
View a PDF of the paper titled Language Models are Injective and Hence Invertible, by Giorgos Nikolaou and 5 other authors
View PDF HTML (experimental)
Abstract:Transformer components such as non-linear activations and normalization are inherently non-injective, suggesting that different inputs could map to the same output and prevent exact recovery of the input from a model's representations. In this paper, we challenge this view. First, we prove mathematically that transformer language models mapping discrete input sequences to their corresponding sequence of continuous representations are injective and therefore lossless, a property established at initialization and preserved during training. Second, we confirm this result empirically through billions of collision tests on six state-of-the-art language models, and observe no collisions. Third, we operationalize injectivity: we introduce SipIt, the first algorithm that provably and efficiently reconstructs the exact input text from hidden activations, establishing linear-time guarantees and demonstrating exact invertibility in practice. Overall, our work establishes injectivity as a fundamental and exploitable property of language models, with direct implications for transparency, interpretability, and safe deployment.
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as: arXiv:2510.15511 [cs.LG]
  (or arXiv:2510.15511v4 [cs.LG] for this version)
  https://doi.org/10.48550/arXiv.2510.15511
arXiv-issued DOI via DataCite

Submission history

From: Giorgos Nikolaou [view email]
[v1] Fri, 17 Oct 2025 10:25:30 UTC (3,980 KB)
[v2] Mon, 20 Oct 2025 07:29:02 UTC (3,980 KB)
[v3] Tue, 21 Oct 2025 14:44:49 UTC (3,980 KB)
[v4] Fri, 13 Mar 2026 15:58:05 UTC (3,978 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled Language Models are Injective and Hence Invertible, by Giorgos Nikolaou and 5 other authors
  • View PDF
  • HTML (experimental)
  • TeX Source
license icon view license
Current browse context:
cs.LG
< prev   |   next >
new | recent | 2025-10
Change to browse by:
cs
cs.AI

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
export BibTeX citation Loading...

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status