Time transitions should be explicit

time to read 3 min | 412 words

Let us talk about time for a second, okay? We deal with in just about every application we write, but we treat it quite dismissively. But let me give an example first. We need to build a notification system, the system is based on timed notifications that should be displayed in a web page.

Thinking about it, I came up with the following design:

And this query:

SELECT TOP 3 Id, PublishAt, Title, Content FROM Notifications
WHERE PublishAt > GETDATE()
ORDER BY PublishAt DESC

That seems to satisfy the requirements, it is simple and it works. Done.

Not quite, this system design suffer from a pretty important problem, the time transitions are implicit. But why is that important?

Because the state transition from waiting-to-be-published and published is a meaningful transition in our domain. As a simple example, I can’t post a notification to Twitter when a notification is published, simply because I have absolutely no idea when that is going to happen. In many real applications, silent state transitions are going to lead to a lot of hacks. Likely something like adding WasPublished flag that we can check and then do some action if we get a notification that wasn’t published yet.

A much better plan is to model things so that time is an explicit state transition, instead of just checking for PublishAt, we will check the IsPublished flag, and we have a background process that will check for the PublishAt and the current date and explicitly set the IsPublished flag. That is also the place where we will place logic relating to the state transition. It also means that we aren’t depending on a side affect (someone viewing the page to cause the publication process) to make something important happen in our application.

You might have noticed a theme here, I like making things explicit, it means that it is easier to handle them.

Tweet Share Share 22 comments

Tags:

Design

Comments

08 Dec 2009
12:14 PM

Dennis van der Stelt

"we will check was IsPublished" doesn't seem like a logical sentence. And I also don't really get the context of the story. Is the website gathering the top 3 of items it should show? Than what does the background process do?

08 Dec 2009
12:21 PM

Dave

In code I do the same. I always specify the visiblity of a member explicit. It makes reading the code much easier. In Resharper I disabled the 'redudant qualifier' rule. It makes reading back the code much easier after a year. However I usually don't work with IsXXX field in my databases. I often need more states than just true or false.

For example we have an mail system that send out personalized mail based on a query that define the template fields. Let's say the appliction database needed to execute the mail query is down (someone stripped over a network cable or the database server needs a reboot after installing updates). If I keep the state IsSend false, than I have to try every message at every run. If I give it a more specified state (ApplicationDatabaseUnreachable) I can skip the message until anoter message from the same application was send succesfully or the (error) state is older than 30 minutes.

These states are defined in enumerations which makes it also easier to read code. Readable code really brings back the number of bugs.

08 Dec 2009
12:38 PM

Rafal

Errr, just by looking at the sql query I can't see if the system has any problem or not. If the query is used only for displaying some item it is perfectly OK. You can always add some background processing if you need to run some logic exactly at the publish date, but this query doesn't have to be changed at all.

08 Dec 2009
12:51 PM

Ayende Rahien

Dennis,

Thanks for catching that, I edited the statement and now it should be clearer.

08 Dec 2009
12:52 PM

Ayende Rahien

Rafal,

The problem is the implicit state transitions, not the query.

The query works, but it results in hacks elsewhere.

08 Dec 2009
14:58 PM

Rafal

Ok, I just pointed out that you are talking about problems without showing them, the query you gave as an illustration has no problems at all. For example, if you used RSB and scheduled messages to handle time transitions explicitly, you wouldn't need IsPublished column at all - all information would be in a message.

08 Dec 2009
15:32 PM

Ayende Rahien

Rafal,

Read the paragraph after: "But why is that important?", it explains the problem.

08 Dec 2009
16:20 PM

Damien Guard

This is fine if your server and all your users are in a single timezone...

[)amien

08 Dec 2009
16:28 PM

Stephen

IsPublished is a bit more like 'HasBeenProcessedByTwitterNotificationService' right? what if you had multiple independent services that wanted to process the items?

Would you just keep adding columns to the table? (ie, 'HasBeenProcessedByFacebookNotificationService') is there standard way to do this where the notification service remembers which items have been processed? such as remembering the last processed pk, or date of last processed item?

08 Dec 2009
17:14 PM

firefly

The post make sense. It took me a couple read but I like it. I think this is going back to the design phase of where to put your application logic and how to invoke it.

Rather than depend on the view to invoke our application logic it make sense that we are doing it ourselves. It is also much easier to scale which is vital in an enterprise application.

I don't see why you would have multiple service wanting to process the item? I thought the idea is that to have one service process the item so all the other services can use it. Once IsPublished is set all the other service can check that flag. It shouldn't matter which service set the flag in the first place.

08 Dec 2009
17:57 PM

mattmc3

One other thing to note is that if you live in a place where they recognize daylight savings time, your publication could trip twice since the time between 1 AM and 2 AM occurs twice in the fall. I've seen more than one person get tripped up by DST by having a nightly process scheduled between 1 and 2 on Sunday mornings.

08 Dec 2009
22:49 PM

Frank Quednau

There is another reason for explicitness. The ability to differ between published and not is a feature of your system. Making this implicit means that any maintainer can easily miss that this is actually a feature and might e.g. inadvertedly destroy it. Which, I admit, equates to making things simpler :)

08 Dec 2009
23:10 PM

Frank

Nice post, Ayende. Essentially, using PublishedAt in queries and reports could also be seen as duplicating business logic. And that logic keeps on keeping duplicated everytime a new query or report gets introduced, thus introducing maintainability problems later on.

But I see a likewise problem for other things than time related stuff. Many times certain actions on an entity are only possible when all kinds of conditions are met on that entity and/or related entities. I see that kind of logic being replicated in queries and reports, introducing the same maintainability problem when the logic changes even so slightly.

I am more and more leaning towards exposing those computed 'states' (can't think of a better word at the moment) in the database, instead of duplicating the logic. In essence something that CQRS seems to be advocating.

09 Dec 2009
00:11 AM

Stan

Ayende,

I think you have a typo in the WHERE clause:

WHERE PublishAt > GETDATE()

should probably be:

WHERE PublishAt < GETDATE()

09 Dec 2009
00:40 AM

We run into the same problem at work all the time and we usually deal with it in a similar way.

One of the biggest problems with having just a publishAt field is that if the time on the computer changes the action can be performed twice. This will also effectively happen at the point the system changes between daylight savings time and 'normal' time. Obviously this is more of a problem when the time you care about is the time on the user's computer rather than a server time.

09 Dec 2009
10:28 AM

Ayende Rahien

Damien,

Presumably you use UTC for that. But that still doesn't help with the implicit state transitions

09 Dec 2009
10:29 AM

Ayende Rahien

Stephen,

No, that is not the same.

IsPublished means that the background service run and EVERYTHING that was interested in this run.

09 Dec 2009
10:30 AM

Ayende Rahien

Frank,

Great point, in 2 days a post about just that topic is going to show up.

09 Dec 2009
10:31 AM

Ayende Rahien

Stan,

You see how dangerous this is :-)

09 Dec 2009
16:59 PM

John Chapman

I agree with the comments about state transitions.

But as far as talking about time you could take things so much further.

First things first here we aren't using UTC (which I know was already discussed), but in a lot of of time zones a given date/time combination can occur MULTIPLE times.

You may want something published at 2:30 AM when a timezone rolls its clock back. When should it be published? Just checking the time would of course be broken and one of the reason an explicit state transition is so important.

Don't even get me started on dates versus times.

14 Dec 2009
14:01 PM

Seth Petry-Johnson

I agree. When an entity's state depends on something external like the time, whether or not a 3rd party has responded to a message, or a user action elsewhere in the system then i'd rather have a background process make an explicit state change than check for those things in the entity itself.

06 Jan 2010
15:50 PM

Tiz Zaqyah

I think the transition is good. But as John said, we could bring it further.

Comment preview

Comments have been closed on this topic.

Markdown turns plain text formatting into fancy HTML formatting.

Phrase Emphasis

*italic*   **bold**
_italic_   __bold__

Links

Inline:

An [example](http://url.com/ "Title")

Reference-style labels (titles are optional):

An [example][id]. Then, anywhere
else in the doc, define the link:
  [id]: http://example.com/  "Title"

Images

Inline (titles are optional):

![alt text](/path/img.jpg "Title")

Reference-style:

![alt text][id]
[id]: /url/to/img.jpg "Title"

Headers

Setext-style:

Header 1
========
Header 2
--------

atx-style (closing #'s are optional):

# Header 1 #
## Header 2 ##
###### Header 6

Lists

Ordered, without paragraphs:

1.  Foo
2.  Bar

Unordered, with paragraphs:

*   A list item.
    With multiple paragraphs.
*   Bar

You can nest them:

*   Abacus
    * answer
*   Bubbles
    1.  bunk
    2.  bupkis
        * BELITTLER
    3. burper
*   Cunning

Blockquotes

> Email-style angle brackets
> are used for blockquotes.
> > And, they can be nested.
> #### Headers in blockquotes
> 
> * You can quote a list.
> * Etc.

Horizontal Rules

Three or more dashes or asterisks:

---
* * *
- - - -

Manual Line Breaks

End a line with two or more spaces:

Roses are red,   
Violets are blue.

Fenced Code Blocks

Code blocks delimited by 3 or more backticks or tildas:

```
This is a preformatted
code block
```

Header IDs

Set the id of headings with {#<id>} at end of heading line:

## My Heading {#myheading}

Tables

Fruit    |Color
---------|----------
Apples   |Red
Pears	 |Green
Bananas  |Yellow

Definition Lists

Term 1
: Definition 1
Term 2
: Definition 2

Footnotes

Body text with a footnote [^1]
[^1]: Footnote text here

Abbreviations

MDD <- will have title
*[MDD]: MarkdownDeep

Oren Eini

Oren Eini

CEO of RavenDB