XZ Utils: Infiltrating Open Source Through Social Engineering

May 8, 2025

Imagine waking up to discover that a seemingly harmless open-source tool, deeply integrated within your systems, has covertly turned into a threat.

XZ Utils: The Quiet Pillar

XZ Utils might not be well-known, but it’s omnipresent: SSH sessions, Linux distributions (96.3% of The top 1,000,000 web servers use Linux) , package managers, and system libraries rely heavily on this ultra-efficient compression tool, akin to zip.

Thanks for reading ThreatLink! Subscribe for free to receive new posts and support my work.Subscribed

This widespread integration made XZ Utils an ideal target. Attackers recognized that compromising XZ Utils could potentially mean compromising thousands of systems, applications, and third-party vendors.

You can easily test this tool yourself in your terminal. For example, compressing or decompressing any file:

Andres Freund

In March 2024, Microsoft engineer Andres Freund noticed an unusual latency (500 milliseconds) during SSH connections on a Debian machine.

This seemingly minor detail led him to discover malicious code. Identifying such subtle anomalies isn't within everyone's skill set.

Understanding the Hack

The attack traces back to a contributor named "Jia Tan," who gradually became involved with the XZ Utils project starting in 2021. Through regular, valuable contributions, Jia gained the community’s trust.

By 2023, he had earned commit rights and released versions 5.6.0 and 5.6.1 of XZ Utils, each hiding a meticulously concealed backdoor.

Publishing these versions meant all tools dependent on XZ Utils—a vast majority—were alerted about a new "stable" version, encouraged to upgrade from previous versions.

In simple terms, here’s what’s going on: typically, when writing code, we create tests to check if the application behaves as expected. In this case (XZ), it often involves using intentionally corrupted test files to see how the library reacts.

To avoid detection, given the open-source nature of the code, the attacker devised a multi-stage approach, which can be simplified as follows

Malicious code was hidden inside a seemingly corrupted test file named bad-3-corrupt_lzma2.xz.
During compilation, a macro (build-to-host) would "repair" this corrupted file.
The script then extracted and decrypted another file (good-large_compressed.lzma) and embedded it into the final compiled binary, activating the backdoor.

Everything was conducted in plain sight, open-source:

Of course, the vulnerability was rated with the maximum severity score of 10/10.

See the end of the article directly on ThreatLink

<The 5 steps to integrate AI into your cyber processes

Exploring the Galink Assessment in Depth>