Does time progress at the same rate everywhere?
We all have an intuitive concept of time based on our own experience as individuals. Unfortunately, that intuitive notion of time makes it easier to picture total order rather than partial order. It's easier to picture a sequence in which things happen one after another, rather than concurrently. It is easier to reason about a single order of messages than to reason about messages arriving in different orders and with different delays.
However, when implementing distributed systems we want to avoid making strong assumptions about time and order, because the stronger the assumptions, the more fragile the system is to problems with its "time sensor" - the onboard clock. Furthermore, imposing an order carries a cost. The more temporal nondeterminism we can tolerate, the more we can take advantage of distributed computation.
There are three common answers to the question "does time progress at the same rate everywhere?": a "global clock" assumption, a "local clock" assumption, and a "no clock" assumption.
The synchronous system model has a global clock, the partially synchronous model has a local clock, and in the asynchronous system model one cannot use clocks at all. Let's look at these in more detail.
Time with a "global-clock" assumption
The global clock assumption is that there is a global clock of perfect accuracy, and that everyone has access to that clock. This is the way we tend to think about time, because in human interactions small differences in time don't really matter.
The global clock is basically a source of total order (exact order of every operation on all nodes even if those nodes have never communicated).
However, this is an idealized view of the world: in reality, clock synchronization is only possible to a limited degree of accuracy. This is limited by the lack of accuracy of clocks in commodity computers, by latency if a clock synchronization protocol such as NTP is used, and fundamentally by the nature of spacetime.
Assuming that clocks on distributed nodes are perfectly synchronized means assuming that clocks start at the same value and never drift apart. It's a nice assumption because you can use timestamps freely to determine a global total order - bound by clock drift rather than latency - but keeping clocks synchronized is a nontrivial operational challenge and a potential source of anomalies. There are many scenarios where a simple failure - such as a user accidentally changing the local time on a machine, an out-of-date machine joining a cluster, or synchronized clocks drifting at slightly different rates - can cause hard-to-trace anomalies.
Nevertheless, there are some real-world systems that make this assumption. Facebook's Cassandra is an example of a system that assumes clocks are synchronized. It uses timestamps to resolve conflicts between writes - the write with the newer timestamp wins. This means that if clocks drift, new data may be ignored or overwritten by old data; again, this is an operational challenge (and from what I've heard, one that people are acutely aware of). Another interesting example is Google's Spanner: the paper describes their TrueTime API, which synchronizes time but also estimates worst-case clock drift.
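To make the failure mode concrete, here is a minimal sketch (in Python, with names I made up for illustration, not taken from Cassandra) of timestamp-based last-write-wins conflict resolution. If a writer's clock is behind, its write silently loses, even though it happened later in real time:

```python
import time

def lww_merge(existing, incoming):
    """Last-write-wins: keep whichever version carries the newer timestamp.
    If a writer's clock lags, its write is silently discarded."""
    return incoming if incoming["ts"] > existing["ts"] else existing

# Node A writes with an accurate clock.
a = {"value": "old address", "ts": time.time()}
# Node B overwrites later in real time, but its clock is 60 seconds slow.
b = {"value": "new address", "ts": time.time() - 60}

print(lww_merge(a, b)["value"])  # prints "old address" - the later write is lost
```

This is exactly the "new data overwritten by old data" anomaly described above, reduced to a comparison of two numbers.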
Time with a "Local-clock" assumption
The second, and perhaps more plausible, assumption is that each machine has its own clock, but there is no global clock. This means that you cannot use the local clock to determine whether a remote timestamp occurred before or after a local timestamp; in other words, you cannot meaningfully compare timestamps from two different machines.
The local clock assumption corresponds more closely to the real world. It assigns a partial order: events on each system are ordered but events cannot be ordered across systems by only using a clock.
However, you can use timestamps to order events on a single machine; and you can use timeouts on a single machine as long as you are careful not to allow the clock to jump around. Of course, on a machine controlled by an end-user this is probably assuming too much: for example, a user might accidentally change their date to a different value while looking up a date using the operating system's date control.
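As an example of the "careful not to allow the clock to jump around" point, here is a sketch (using only Python's standard library; the `poll` callback is hypothetical) of a timeout driven by a monotonic clock, which cannot move backwards when a user or NTP adjusts the wall clock:

```python
import time

def wait_with_timeout(poll, timeout_seconds):
    """Poll until poll() returns a result or the timeout elapses.
    time.monotonic() never jumps backwards, unlike time.time(),
    so changing the system date cannot stall or shorten the wait."""
    deadline = time.monotonic() + timeout_seconds
    while time.monotonic() < deadline:
        result = poll()
        if result is not None:
            return result
        time.sleep(0.05)  # back off briefly between polls
    raise TimeoutError("no result within %.1f seconds" % timeout_seconds)
```

The timeout here is purely a local measurement of elapsed time on one machine; it says nothing about ordering relative to events elsewhere.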
Time with a "No-clock" assumption
Finally, there is the notion of logical time. Here, we don't use clocks at all and instead track causality in some other way. Remember, a timestamp is simply a shorthand for the state of the world up to that point - so we can use counters and communication to determine whether something happened before, after or concurrently with something else.
This way, we can determine the order of events between different machines, but cannot say anything about intervals and cannot use timeouts (since we assume that there is no "time sensor"). This is a partial order: events can be ordered on a single system using a counter and no communication, but ordering events across systems requires a message exchange.
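Here is a minimal sketch of such a counter, a Lamport-style logical clock (the class and method names are mine, for illustration only): each node increments its counter on local events, attaches it to outgoing messages, and on receipt jumps ahead of the value in the message.

```python
class LamportClock:
    """Logical clock: a counter plus a merge rule applied on message receipt."""

    def __init__(self):
        self.counter = 0

    def tick(self):
        # Local event: just advance the counter.
        self.counter += 1
        return self.counter

    def send(self):
        # Attach the current counter value to an outgoing message.
        return self.tick()

    def receive(self, remote_counter):
        # Merge: jump ahead of everything the sender has seen.
        self.counter = max(self.counter, remote_counter) + 1
        return self.counter

a, b = LamportClock(), LamportClock()
t1 = a.send()       # event on machine A
t2 = b.receive(t1)  # B receives A's message
assert t2 > t1      # ordered across machines only because a message was exchanged
```

Note that two events on different machines that never exchange messages (directly or indirectly) simply cannot be ordered this way - which is the partial order described above.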
One of the most cited papers in distributed systems is Lamport's paper on time, clocks and the ordering of events. Vector clocks, a generalization of that concept (which I will cover in more detail), are a way to track causality without using clocks. Cassandra's cousins Riak (Basho) and Voldemort (LinkedIn) use vector clocks rather than assuming that nodes have access to a global clock of perfect accuracy. This allows those systems to avoid the clock accuracy issues mentioned earlier.
When clocks are not used, the maximum precision at which events can be ordered across distant machines is bound by communication latency.
I hope you found the article useful.
Happy Coding :)