Spinlocks vs. Semaphores: Understanding Synchronization Mechanisms

Synchronization mechanisms are essential for managing concurrent access to shared resources in modern computing. Two of the most commonly discussed synchronization primitives are spinlocks and semaphores. While they may serve overlapping purposes, their behavior, implementation, and use cases differ significantly.

This article explores the key differences between spinlocks and semaphores, diving into their low-level implementations on architectures like ARM and x86, and addressing the historical concepts of P and V operations in semaphores.

Disclaimer: The code snippets provided in this article are for quick reference and conceptual understanding only. They have not been compiled or tested and should be treated as high-level pseudocode rather than production-ready implementations. Use them as a guide to understand the underlying concepts, not as exact implementations.

What Are Spinlocks?

Spinlocks are lightweight synchronization mechanisms primarily used for protecting critical sections. They work by continuously checking (spinning) a shared lock variable until it becomes available.

Key Characteristics of Spinlocks:

  1. Busy-Waiting: A thread that tries to acquire a spinlock repeatedly checks the lock in a loop, consuming CPU cycles.
  2. Low Overhead: Spinlocks avoid context switches, making them suitable for short critical sections where the lock is expected to be held briefly.
  3. No Blocking: Threads are not put to sleep; instead, the CPU core remains actively engaged in the spinning loop.
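The busy-wait behaviour described above can be sketched with C11 atomics. This is a minimal, illustrative lock only (no backoff, no fairness, and the `spinlock_t` type and function names are made up for this example), in keeping with the article's disclaimer:

```c
#include <stdatomic.h>

/* A hypothetical minimal spinlock built on C11 atomics. */
typedef struct { atomic_int locked; } spinlock_t;

void spinlock_acquire(spinlock_t *l) {
    /* atomic_exchange returns the previous value: 0 means we got the lock. */
    while (atomic_exchange_explicit(&l->locked, 1, memory_order_acquire) != 0)
        ;  /* busy-wait (spin) until the holder releases the lock */
}

void spinlock_release(spinlock_t *l) {
    /* Release ordering makes critical-section writes visible to the next owner. */
    atomic_store_explicit(&l->locked, 0, memory_order_release);
}
```

The acquire/release memory orderings play the same role as the DMB barriers in the ARM assembly below: they keep the critical section from leaking outside the lock.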

Modern Optimizations on ARM:

To reduce wasted CPU resources, modern ARM architectures use the WFE (Wait For Event) instruction to put the core into a low-power state while the lock is unavailable. This optimization:

  • Minimizes power consumption during contention.
  • Wakes the core only when another core signals the release of the lock using SEV (Send Event).

Example Spinlock Implementation (ARM):

acquire_spinlock:
    MOV     r2, #1              // Value meaning "locked"
    LDREX   r0, [lock_var]      // Exclusively load the lock value
    CMP     r0, #0              // Check if the lock is free
    BNE     wait_for_event      // If not free, wait
    STREX   r1, r2, [lock_var]  // Try to acquire the lock
    CMP     r1, #0              // STREX writes 0 on success
    BNE     acquire_spinlock    // Retry if the exclusive store failed
    DMB                         // Memory barrier before entering the critical section
    BX      lr                  // Return with the lock held

wait_for_event:
    WFE                         // Low-power wait until another core signals an event
    B       acquire_spinlock    // Retry acquiring the lock

release_spinlock:
    MOV     r0, #0
    DMB                         // Make critical-section writes visible first
    STR     r0, [lock_var]      // Clear the lock
    DSB                         // Complete the store before signalling
    SEV                         // Wake cores waiting in WFE
    BX      lr                  // Return
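The introduction also mentions x86, so for comparison here is a similarly untested sketch of a test-and-test-and-set spinlock in x86 assembly (NASM-style syntax; `lock_var` is assumed to be a 32-bit variable). x86 has no direct WFE/SEV pair for spinlocks; instead, the PAUSE instruction hints to the core that it is in a spin-wait loop, reducing power draw and pipeline flushes:

```
acquire_spinlock:
    mov     eax, 1
.retry:
    xchg    eax, [lock_var]       ; atomic swap; old value comes back in eax
    test    eax, eax              ; was the lock free (0)?
    jz      .acquired
.spin:
    pause                         ; spin-wait hint to the CPU
    cmp     dword [lock_var], 0   ; read-only spin until the lock looks free
    jne     .spin
    jmp     .retry                ; then retry the atomic exchange
.acquired:
    ret
```

Spinning on a plain read and only retrying the atomic `xchg` once the lock looks free keeps the cache line in a shared state during contention, which is the same contention-reduction idea WFE serves on ARM.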

What Are Semaphores?

Semaphores are more versatile synchronization primitives that use a counter to manage access to shared resources. They can block threads if a resource is unavailable.

Key Characteristics of Semaphores:

  1. Counting Mechanism: The semaphore counter tracks the number of available resources. Threads can decrement (acquire) or increment (release) the count.
  2. Thread Blocking: Unlike spinlocks, semaphores block threads when the counter is zero, allowing the CPU to execute other tasks.
  3. Scheduler Involvement: The operating system scheduler handles blocking and waking threads waiting on a semaphore, ensuring efficient CPU utilization.
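To make the counting behaviour concrete, here is a small sketch using the POSIX semaphore API (`sem_init`, `sem_wait`, `sem_trywait`, `sem_post`). The function name and the counts chosen are illustrative only:

```c
#include <semaphore.h>
#include <errno.h>

/* Demonstrates the counter semantics described above: each wait takes a
 * slot, each post returns one, and an exhausted semaphore would block. */
int demo_semaphore_counting(void) {
    sem_t sem;
    sem_init(&sem, 0, 2);      /* start with 2 resource slots */

    sem_wait(&sem);            /* counter 2 -> 1 */
    sem_wait(&sem);            /* counter 1 -> 0 */

    /* A third sem_wait would block; sem_trywait fails instead. */
    int would_block = (sem_trywait(&sem) == -1 && errno == EAGAIN);

    sem_post(&sem);            /* release one slot: counter 0 -> 1 */
    sem_destroy(&sem);
    return would_block;        /* 1 if the semaphore was exhausted */
}
```

In a real multithreaded program, the blocked `sem_wait` caller would be parked by the scheduler until another thread calls `sem_post`, which is exactly the blocking behaviour that distinguishes semaphores from spinlocks.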

Semaphore Operations: P and V

The fundamental operations on a semaphore are often referred to as P and V, terms introduced by Edsger Dijkstra:

  1. P (Proberen, Dutch for "to test"): attempts to decrement the counter; if the counter is already zero, the calling thread blocks until a resource becomes available.
  2. V (Verhogen, Dutch for "to increase"): increments the counter and wakes a blocked thread, if any is waiting.

These terms, though less common in modern APIs, reflect the theoretical underpinnings of semaphores and emphasize their purpose.

Example Semaphore Wait and Signal:

void semaphore_wait(int *sem) {
    while (1) {
        int val = ldrex(sem);              // Exclusively load the counter
        if (val > 0) {
            if (strex(sem, val - 1) == 0)  // strex returns 0 on success
                break;                     // Counter decremented: resource acquired
        } else {
            block_thread();                // Counter is zero: scheduler blocks the thread
        }
    }
}

void semaphore_signal(int *sem) {
    while (1) {
        int val = ldrex(sem);              // Exclusively load the counter
        if (strex(sem, val + 1) == 0)      // Increment the count
            break;
    }
    wake_thread();                         // Wake a blocked thread, if any
}

Key Differences Between Spinlocks and Semaphores

  Aspect                Spinlock                              Semaphore
  Waiting strategy      Busy-waits (spins) on the lock        Blocks the thread
  CPU while waiting     Consumes cycles (or WFE low-power)    Freed for other work
  Scheduler involved    No                                    Yes
  Counting              Binary (locked / unlocked)            Counter tracks multiple resources
  Best suited for       Very short critical sections          Longer waits, resource pools

When to Use Spinlocks vs. Semaphores

  1. Spinlocks: Use when critical sections are very short, and contention is rare. Examples: Low-level kernel synchronization, interrupt handling.
  2. Semaphores: Use when contention is expected or when threads may need to wait for extended periods. Examples: Managing access to a bounded buffer (producer-consumer problem), synchronizing multiple threads.
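The bounded-buffer (producer-consumer) example mentioned above can be sketched with POSIX semaphores and a mutex. This is an illustrative sketch under assumed names (`run_producer_consumer`, `SLOTS`, `ITEMS`), not a canonical implementation:

```c
#include <pthread.h>
#include <semaphore.h>

#define SLOTS 4     /* bounded buffer capacity */
#define ITEMS 100   /* items to produce */

static int buffer[SLOTS];
static int head, tail;
static sem_t empty_slots, full_slots;
static pthread_mutex_t buf_mutex = PTHREAD_MUTEX_INITIALIZER;
static long consumed_sum;

static void *producer(void *arg) {
    (void)arg;
    for (int i = 1; i <= ITEMS; i++) {
        sem_wait(&empty_slots);        /* block until a slot is free */
        pthread_mutex_lock(&buf_mutex);
        buffer[head] = i;
        head = (head + 1) % SLOTS;
        pthread_mutex_unlock(&buf_mutex);
        sem_post(&full_slots);         /* signal one item available */
    }
    return NULL;
}

static void *consumer(void *arg) {
    (void)arg;
    for (int i = 0; i < ITEMS; i++) {
        sem_wait(&full_slots);         /* block until an item exists */
        pthread_mutex_lock(&buf_mutex);
        consumed_sum += buffer[tail];
        tail = (tail + 1) % SLOTS;
        pthread_mutex_unlock(&buf_mutex);
        sem_post(&empty_slots);        /* free the slot */
    }
    return NULL;
}

long run_producer_consumer(void) {
    pthread_t p, c;
    sem_init(&empty_slots, 0, SLOTS);  /* all slots start empty */
    sem_init(&full_slots, 0, 0);       /* no items yet */
    pthread_create(&p, NULL, producer, NULL);
    pthread_create(&c, NULL, consumer, NULL);
    pthread_join(p, NULL);
    pthread_join(c, NULL);
    sem_destroy(&empty_slots);
    sem_destroy(&full_slots);
    return consumed_sum;               /* sum of 1..ITEMS if all items arrive */
}
```

Note the division of labour: the two counting semaphores track free and used slots (the resource-management role), while the mutex protects the buffer indices (the short-critical-section role), matching the guidance in the list above.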


Conclusion

Both spinlocks and semaphores are critical tools in concurrent programming, but their use depends on the context:

  • Spinlocks are lightweight and efficient for short critical sections, especially with modern optimizations like ARM’s WFE.
  • Semaphores excel in scenarios involving longer waits or managing shared resources, offering better CPU utilization and fairness.

Understanding the trade-offs between these synchronization primitives—and the conceptual meaning of operations like P and V—is essential for designing efficient and scalable systems. Whether you're working in embedded systems, operating system design, or application-level concurrency, knowing when to use each mechanism is key to optimizing performance.

Shouldn't it be Mutexes vs. Semaphores and sleep wait vs. busy wait?

Torsten Robitzki, Torrox GmbH & Co. KG (3 months ago):

Simply: don't use spinlocks unless you really, really know what you are doing. If you use a spinlock in an IRQ handler to ensure that a critical section cannot be observed with its invariants broken, who is going to release that lock if it is found to be locked? On a single-CPU system it could only be a higher-priority IRQ, so the use for spinlocks there is very limited, if any. Semaphores, by contrast, require at least some kind of scheduler. I also would not bet on a semaphore implementation being "fair", as fairness usually increases implementation complexity and reduces performance.
