[chrony-dev] new feature request: add "fast" and "slow" to "clock wrong" and "clock stepped" log messages

Discussion:

James Feeney

2017-10-27 16:31:16 UTC

On start-up, in the log file, there are these informational messages, for instance:

chronyd[622]: System clock wrong by 1.693005 seconds, adjustment started
chronyd[622]: System clock was stepped by 1.693005 seconds

Without some foreknowledge, it is unclear if the quantity "1.693005 seconds" is referring to the system clock with respect to the NTP server clock, or is referring to the NTP server clock with respect to the system clock, and so, the idea of "plus" and "minus" is ambiguous in this context.

How about, rather than using the term "wrong", instead use the terms "fast" and "slow" to describe this quantity "1.693005 seconds"? Then the log message might read:

chronyd[622]: System clock fast by 1.693005 seconds, adjustment started

or

chronyd[622]: System clock slow by 1.693005 seconds, adjustment started

Similarly, the "clock was stepped" message could include "+" and "-", or better, "forward" and "back", to clarify the "direction" of the error. Consider:

chronyd[622]: System clock was stepped back by 1.693005 seconds

or

chronyd[622]: System clock was stepped forward by 1.693005 seconds

As it is now, I literally do not know whether my system clock runs fast or slow, or whether the system clock was stepped forward in time, or back in time.

I'm sure that the meaning of "wrong" and "stepped" is obvious to some of you, simply because you already know how the code is written. But the log message could make this clear to the uninitiated user. Ha!

--
To unsubscribe email chrony-dev-***@chrony.tuxfamily.org with "unsubscribe" in the subject.
For help email chrony-dev-***@chrony.tuxfamily.org with "help" in the subject.
Trouble? Email ***@chrony.tuxfamily.org.

Bill Unruh

2017-10-27 17:10:32 UTC

Permalink

The sign of that offset is already there. Ie, it could also have said
System clock wrong by -1.693005 seconds
iIe, that report is no ambiguous about whether the clock is fast or slow.
Of course the sentence does not indicate what convention chrony uses (ie does
the plus sign mean that the system clock is ahead or behind the true time)
I can never remember which convetion is used but I think it is
UTC-systemclock.
In the tracking report, one has the terminology

System time : 0.000062501 seconds fast of NTP time

which would not be ambigous as to the sign convention.

Post by James Feeney
chronyd[622]: System clock wrong by 1.693005 seconds, adjustment started
chronyd[622]: System clock was stepped by 1.693005 seconds
Without some foreknowledge, it is unclear if the quantity "1.693005 seconds" is referring to the system clock with respect to the NTP server clock, or is referring to the NTP server clock with respect to the system clock, and so, the idea of "plus" and "minus" is ambiguous in this context.
chronyd[622]: System clock fast by 1.693005 seconds, adjustment started
or
chronyd[622]: System clock slow by 1.693005 seconds, adjustment started
chronyd[622]: System clock was stepped back by 1.693005 seconds
or
chronyd[622]: System clock was stepped forward by 1.693005 seconds
As it is now, I literally do not know whether my system clock runs fast or slow, or whether the system clock was stepped forward in time, or back in time.
I'm sure that the meaning of "wrong" and "stepped" is obvious to some of you, simply because you already know how the code is written. But the log message could make this clear to the uninitiated user. Ha!
--

James Feeney

2017-10-27 17:43:45 UTC

Permalink

... Of course the sentence does not indicate what convention chrony uses (ie does
the plus sign mean that the system clock is ahead or behind the true time) I> can never remember which convetion is used but I think it is UTC-systemclock.

Ha! Yes, difficult to remember, on occasion. And then, it seems simple enough to add a single word to clarify the "direction" of the time offsets.

In the tracking report, one has the terminology
System time : 0.000062501 seconds fast of NTP time
which would not be ambigous as to the sign convention.

Including "fast" there does make it much easier to read. I'm thinking that it would be nice to include this sort of thing also in the log messages.

Miroslav Lichvar

2017-10-30 11:07:23 UTC

Permalink

Post by James Feeney
chronyd[622]: System clock fast by 1.693005 seconds, adjustment started
or
chronyd[622]: System clock slow by 1.693005 seconds, adjustment started

I agree this would be much clearer for the user. I never remember
which sign is for fast and slow in what context (it's not consistent
unfortunately). The trouble is that it would break existing scripts
that parse the log and the parsing itself would be more difficult if
it had to look for the word "slow" or "fast" instead of the sign. I'm
not sure how important that really is.

Do you think it would make sense to keep the sign and indicate whether
it's fast or slow in parentheses?

System clock wrong by +/-?.??????? (slow/fast) ?

System clock stepped by +/-?.??????? (forward/backward) ?

There are other messages that print an offset, so maybe they could be
all changed at once to keep it consistent.

What do others think?

--
Miroslav Lichvar
--
To unsubscribe email chrony-dev-***@chrony.tuxfamily.org with "unsubscribe" in the subject.
For help email chrony-dev-***@chrony.tuxfamily.org with "help" in the subject.
Trouble? Email ***@chrony.tuxfamily.org.

Denny Page

2017-10-30 18:42:14 UTC

Permalink

FWIW, I believe “ahead” and “behind” are the most clear in speaking to the condition before initiating a correct. “Fast” and “Slow” somewhat imply an ongoing condition that may or may not be true. It could be the accuracy of setting the RTC before reboot for instance.

System clock ahead by ?.????????
System clock behind by ?.????????

Denny

Post by Miroslav Lichvar

Post by James Feeney
chronyd[622]: System clock fast by 1.693005 seconds, adjustment started
or
chronyd[622]: System clock slow by 1.693005 seconds, adjustment started

I agree this would be much clearer for the user. I never remember
which sign is for fast and slow in what context (it's not consistent
unfortunately). The trouble is that it would break existing scripts
that parse the log and the parsing itself would be more difficult if
it had to look for the word "slow" or "fast" instead of the sign. I'm
not sure how important that really is.
Do you think it would make sense to keep the sign and indicate whether
it's fast or slow in parentheses?
System clock wrong by +/-?.??????? (slow/fast) ?
System clock stepped by +/-?.??????? (forward/backward) ?
There are other messages that print an offset, so maybe they could be
all changed at once to keep it consistent.
What do others think?
--
Miroslav Lichvar
--

FUSTE Emmanuel

2017-10-31 09:19:37 UTC

Permalink

Post by Denny Page
FWIW, I believe “ahead” and “behind” are the most clear in speaking to the condition before initiating a correct. “Fast” and “Slow” somewhat imply an ongoing condition that may or may not be true. It could be the accuracy of setting the RTC before reboot for instance.
System clock ahead by ?.????????
System clock behind by ?.????????
Denny

I second you:

For system logs messages
"ahead" / "behind" for offset
"fast" / "slow" for freq

measurements.log are already totally unambiguous/perfectly specified.

Emmanuel.

Post by Denny Page

Post by Miroslav Lichvar

Post by James Feeney
chronyd[622]: System clock fast by 1.693005 seconds, adjustment started
or
chronyd[622]: System clock slow by 1.693005 seconds, adjustment started

I agree this would be much clearer for the user. I never remember
which sign is for fast and slow in what context (it's not consistent
unfortunately). The trouble is that it would break existing scripts
that parse the log and the parsing itself would be more difficult if
it had to look for the word "slow" or "fast" instead of the sign. I'm
not sure how important that really is.
Do you think it would make sense to keep the sign and indicate whether
it's fast or slow in parentheses?
System clock wrong by +/-?.??????? (slow/fast) ?
System clock stepped by +/-?.??????? (forward/backward) ?
There are other messages that print an offset, so maybe they could be
all changed at once to keep it consistent.
What do others think?
--
Miroslav Lichvar
--

��칻�&�zf��k�|�z�ު笵�k�|��ښ)r��0��n�˛��m觶��r�h��隊W!��u��z��!��_jh�ʊ��+a��i�{az˛��-

Miroslav Lichvar

2017-10-31 11:12:54 UTC

Permalink

Post by FUSTE Emmanuel

For system logs messages
"ahead" / "behind" for offset
"fast" / "slow" for freq

That looks good to me.

Another question is whether it's correct to say "System clock". The
messages don't actually print the offset of the system clock, but
rather the change in the offset relative to the NTP time that chronyd
is tracking. On start that is the same as the offset of the system
clock, but if there is an unfinished correction, it's the difference
between the old offset and new offset.

There are at least the following options:
1. change the message to better describe what is behind or ahead
2. change the message to print the real offset of the system clock
a) print it on each update as long as the offset is larger than
logchange
b) print it only when the change in the offset is larger than
logchange

Thoughts?

James Feeney

2017-10-31 16:41:16 UTC

Permalink

Post by Miroslav Lichvar

Post by FUSTE Emmanuel
For system logs messages
"ahead" / "behind" for offset
"fast" / "slow" for freq

That looks good to me.

"ahead" and "behind" also will work for both the offset message and the "was stepped" message.

Post by Miroslav Lichvar
Another question is whether it's correct to say "System clock". The
messages don't actually print the offset of the system clock, but
rather the change in the offset relative to the NTP time that chronyd
is tracking. On start that is the same as the offset of the system
clock, but if there is an unfinished correction, it's the difference
between the old offset and new offset.

I'm sorry, I don't understand. What does that mean, "the change in the offset"?

The quantity ( system_clock_time - NTP_server_time ), I think I understand.

Now, are you describing something like
(( new_system_clock_time - new_NTP_server_time ) -
( old_system_clock_time - old_NTP_server_time ))?

Are you saying that there is a system log message that would be saying "the system clock was behind by 5 seconds, but now, after some correction, the system clock is behind by only 3 seconds, and the offset has been 'reduced' by 2 seconds"?

That is what "the difference between the old offset and new offset" suggests to me. But then - really? Is chrony reporting that? Why would anyone want to see a report about that? Instead, it might be useful to know, in that example, that there was still 3 seconds of correction remaining. But that quantity would be exactly the same as saying "the system clock is now behind by 3 seconds".

So, I need some help there.

Post by Miroslav Lichvar
1. change the message to better describe what is behind or ahead
2. change the message to print the real offset of the system clock
a) print it on each update as long as the offset is larger than
logchange
b) print it only when the change in the offset is larger than
logchange
Thoughts?

If I am understanding, I think I prefer option '2.a)'.

Option '2.b)' does not tell me what event is going to compel a system log message - at the first state-change? And only once? Where option '2.a)' will print a log message "on each update" - which I don't know when chrony normally makes those updates - but then, the log messages would inform me. That sounds like a good thing.

I am suspicious that option '1.' means having to describe some really complicated concept, this "change in the offset". But I'm only guessing, since I don't actually understand the "what". Could you give an example?

In common speaking, I still tend to speak in terms of frequency. I might say something like "That clock is running fast" and "That clock is running slow", where "running" and "slow" are adjectives, as in a "fast running clock" and a "slow running clock". Somehow, instinctively, that is the information for which I am searching. I can infer a "slow clock" when the system clock offset is "behind", but in my mind, I want to reduce the concept to an effective relative frequency. Of course, subsequently, I will want to quantify the resulting offset.

I don't know that other people do that in their head. But it might be something to consider, in the printing of system log messages. A "complicated" log message might say something like "The system clock is slow and is behind by 3.000000 seconds." That's a bit more like natural human speech - though it may not be as "manly" as a terse and cryptic log message. Ha!

Of course, there are also assumptions there,
1) that there exists some local reference clock, and
2) that local clock had been set to the correct time in the past, and
3) that that local clock was used to set the system clock, and
4) the system clock is an analog of the local reference clock.

Assumption 4 is not actually valid if, for instance, the local reference clock and the system clock operate from different crystals, which is common. The system clock might actually run fast while the local clock runs slow. But, as you suggest, the casual assumption is also that "the offset is larger than logchange", which, I think, leaves us pretty much in the realm of setting the system clock at boot-up, or at wake-up. I suspect that the casual user understands this. Once the system clock has been set, tweaking the system clock drift is at a very different scale, and no system log message is printed for that change.

I'm also assuming that booting a device that has no local reference clock, like a network appliance, a router or a switch or an access point, there is not going to be any "offset" message or any "stepped by" message printed in the system log. Or, maybe the system clock time simply defaults to the epic, "0", and the offset to the current time is shown?

James Feeney

2017-10-31 18:09:07 UTC

Permalink

"ahead" and "behind" also will work for both the offset message and the> "was stepped" message.

Also:
forward / back
advanced / delayed
leading / lagging These are more traditional technical terms describing offsets.
incremented / decremented
increased / decreased "stepped forward by" -> "increased by", etc.
raised / lowered Ha! Our concept of time is so "horizontal".
early / late

Hmm - on second thought, "was stepped ahead" seems natural, but "was stepped behind", not so much. And then, "clock behind by 3.000000 seconds" followed by "was stepped ahead by 3.000000 seconds" might seem to mix-together "behind" and "ahead" in a contradictory way. Different word pairs for each message might be easier to read and understand.

For reference, from my original example:

chronyd[622]: System clock wrong by 1.693005 seconds, adjustment started
chronyd[622]: System clock was stepped by 1.693005 seconds

I might lean more toward "leading / lagging" and "increased / decreased".

Miroslav Lichvar

2017-11-06 16:17:38 UTC