Wind River Support Network

HomeDefectsLIN1023-13091
Acknowledged

LIN1023-13091 : Security Advisory - linux - CVE-2025-37964

Created: May 21, 2025    Updated: May 30, 2025
Found In Version: 10.23.30.1
Severity: Standard
Applicable for: Wind River Linux LTS 23
Component/s: Kernel

Description

In the Linux kernel, the following vulnerability has been resolved:

x86/mm: Eliminate window where TLB flushes may be inadvertently skipped

tl;dr: There is a window in the mm switching code where the new CR3 is
set and the CPU should be getting TLB flushes for the new mm.  But
should_flush_tlb() has a bug and suppresses the flush.  Fix it by
widening the window where should_flush_tlb() sends an IPI.

Long Version:

=== History ===

There were a few things leading up to this.

First, updating mm_cpumask() was observed to be too expensive, so it was
made lazier.  But being lazy caused too many unnecessary IPIs to CPUs
due to the now-lazy mm_cpumask().  So code was added to cull
mm_cpumask() periodically2].  But that culling was a bit too aggressive
and skipped sending TLB flushes to CPUs that need them.  So here we are
again.

=== Problem ===

The too-aggressive code in should_flush_tlb() strikes in this window:

\t// Turn on IPIs for this CPU/mm combination, but only
\t// if should_flush_tlb() agrees:
\tcpumask_set_cpu(cpu, mm_cpumask(next));

\tnext_tlb_gen = atomic64_read(&next->context.tlb_gen);
\tchoose_new_asid(next, next_tlb_gen, &new_asid, &need_flush);
\tload_new_mm_cr3(need_flush);
\t// ^ After 'need_flush' is set to false, IPIs *MUST*
\t// be sent to this CPU and not be ignored.

        this_cpu_write(cpu_tlbstate.loaded_mm, next);
\t// ^ Not until this point does should_flush_tlb()
\t// become true!

should_flush_tlb() will suppress TLB flushes between load_new_mm_cr3()
and writing to 'loaded_mm', which is a window where they should not be
suppressed.  Whoops.

=== Solution ===

Thankfully, the fuzzy "just about to write CR3" window is already marked
with loaded_mm==LOADED_MM_SWITCHING.  Simply checking for that state in
should_flush_tlb() is sufficient to ensure that the CPU is targeted with
an IPI.

This will cause more TLB flush IPIs.  But the window is relatively small
and I do not expect this to cause any kind of measurable performance
impact.

Update the comment where LOADED_MM_SWITCHING is written since it grew
yet another user.

Peter Z also raised a concern that should_flush_tlb() might not observe
'loaded_mm' and 'is_lazy' in the same order that switch_mm_irqs_off()
writes them.  Add a barrier to ensure that they are observed in the
order they are written.

CREATE(Triage):(User=admin) [CVE-2025-37964 (https://nvd.nist.gov/vuln/detail/CVE-2025-37964)
Live chat
Online