
How do memory barriers/fences work in a multicore environment?

I've been trying to understand how Java volatile works internally and came across memory fences. The following two articles by Martin Thompson talk about using store fences (sfence) and load fences (lfence) to preserve happens-before semantics for volatile.

Memory Barriers/Fences

CPU Cache Flushing Fallacy

What I find hard to understand is whether these fencing instructions apply across the entire set of cores (or sockets) or only take effect on a single core. It would really help me if someone could explain how these fences work in a multi-core processor.

What I find hard to understand is whether these fencing instructions apply across the entire set of cores (or sockets) or only take effect on a single core.

A fence issued on one thread takes effect on the single core executing that thread. Fences are not only instructions executed by the CPU but also a signal to the compiler not to reorder accesses around them.

It would really help me if someone could explain how these fences work in a multi-core processor.

They work in pairs. The writing thread orders all its writes before a release, and the reading thread orders dependent reads after an acquire. If they are not paired properly, you can still get races: one thread may reorder its accesses, and the other thread can then observe that reordering.
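In Java, a volatile write acts as the release and a volatile read as the acquire. A minimal sketch of this pairing (class and field names are hypothetical):

```java
// Message-passing via a volatile flag: the volatile write releases,
// the volatile read acquires, pairing the two threads' orderings.
class MessagePassing {
    int data = 0;                   // plain field, published via the flag
    volatile boolean ready = false; // volatile: write = release, read = acquire

    void writer() {
        data = 42;    // ordered before the volatile write below
        ready = true; // release: publishes all prior writes
    }

    Integer reader() {
        if (ready) {      // acquire: orders the following read after it
            return data;  // if ready was seen as true, data is guaranteed to be 42
        }
        return null;      // flag not observed yet
    }
}
```

If `ready` were a plain field, both the compiler and the CPU would be free to reorder the two writes (or the two reads), and the reader could observe `ready == true` with a stale `data`.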

Note that fences are stronger constructs than atomic writes and reads: a fence orders all memory accesses, while an ordered access only orders accesses around the same memory location. Because of that, fences may translate to different CPU instructions than ordered atomics.
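Java 9's `VarHandle` exposes such standalone fences directly. A sketch of publishing through plain fields plus explicit fences, which order all accesses rather than accesses to one location (class and field names are hypothetical):

```java
import java.lang.invoke.VarHandle;

// Publication via plain fields and standalone fences instead of a
// volatile field: the fences order ALL surrounding accesses.
class FencedPublish {
    int payload; // plain field
    int flag;    // plain field, used as the publication flag

    void writer() {
        payload = 7;
        VarHandle.releaseFence(); // all prior writes ordered before later writes
        flag = 1;
    }

    int reader() {
        int f = flag;
        VarHandle.acquireFence(); // later reads ordered after the prior read
        return f == 1 ? payload : -1; // -1 if the flag was not observed yet
    }
}
```

With a volatile `flag`, only accesses ordered relative to that one location get the guarantee; the standalone fences here impose the ordering on every access around them.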

How this translates to machine instructions depends on the architecture. x86, for example, provides fairly strong ordering out of the box, so all but one fence type (the store-load fence) translate to no-ops at the CPU level and only need to inhibit reorderings performed by the compiler. ARM, on the other hand, has a weaker memory model and needs explicit store and load fence instructions in addition to compiler-level barriers.
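The one fence that is not free on x86 is the store-load barrier, which Java exposes as `VarHandle.fullFence()`. A sketch of the classic Dekker-style store-then-load pattern where only a full fence prevents both threads from reading stale values (class and field names are hypothetical; run single-threaded here just to show the access pattern):

```java
import java.lang.invoke.VarHandle;

// Dekker-style store-then-load: each thread writes its own flag,
// then reads the other's. Without a full (store-load) fence, both
// reads could be reordered before the writes and both see 0.
class StoreLoadExample {
    int a, b;   // flags written by thread1 / thread2
    int r1, r2; // values each thread read from the other's flag

    void thread1() {
        a = 1;
        VarHandle.fullFence(); // store-load barrier (e.g. mfence on x86)
        r1 = b;
    }

    void thread2() {
        b = 1;
        VarHandle.fullFence();
        r2 = a;
    }
}
```

Weaker release/acquire fences do not forbid the store-load reordering, which is why this is the one case where even x86 needs a real instruction.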

How those instructions are implemented at the hardware level depends not only on the architecture but on individual processor families. It generally involves the cache-coherency protocol and additional constraints on out-of-order pipelines. See this answer for an example of how it works in current x86 processors.
