The case of the queue in the pool
Today we found a bug in the way our connection pool works. The code in question?
public HttpClient GetClient(TimeSpan timeout, OperationCredentials credentials, Func<HttpMessageHandler> handlerFactory)
{
    var key = new HttpClientCacheKey(timeout, credentials);
    var queue = cache.GetOrAdd(key, i => new ConcurrentQueue<Tuple<long, HttpClient>>());

    Tuple<long, HttpClient> client;
    while (queue.TryDequeue(out client))
    {
        if (Stopwatch.GetTimestamp() - client.Item1 >= _maxIdleTimeInStopwatchTicks)
        {
            client.Item2.Dispose();
            continue;
        }
        client.Item2.CancelPendingRequests();
        client.Item2.DefaultRequestHeaders.Clear();
        return client.Item2;
    }

    return new HttpClient(handlerFactory())
    {
        Timeout = timeout
    };
}

public void ReleaseClient(HttpClient client, OperationCredentials credentials)
{
    var key = new HttpClientCacheKey(client.Timeout, credentials);
    var queue = cache.GetOrAdd(key, i => new ConcurrentQueue<Tuple<long, HttpClient>>());

    queue.Enqueue(Tuple.Create(Stopwatch.GetTimestamp(), client));
}
Can you see the bug?
I'll let you think about it for a while.
Done?
Okay, here we go!
The basic idea is that we'll keep a bunch of clients connected, and free them once they have been idle for too long.
The issue is with the use of the concurrent queue. Consider the case of a sudden spike in traffic, where we suddenly need 100 connections. This code will generate them, and then put them back in the pool.
We then go back to a normal level, handling only 20 concurrent requests, but because we are using a queue, we'll keep cycling all 100 clients through it. Each client's timestamp gets refreshed on every pass, so none of them ever reaches the max idle time, keeping all of them open.
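To make that concrete, here is a minimal sketch of one possible direction, not necessarily the fix that was actually applied: reuse clients LIFO, so the surplus clients genuinely accumulate idle time instead of having their timestamps refreshed on every cycle, and add a periodic scavenge, since under steady load the bottom of a stack is never examined. The sketch drops the credentials/timeout keying from the real code for brevity, and all names in it are illustrative.

using System;
using System.Collections.Concurrent;
using System.Collections.Generic;
using System.Diagnostics;
using System.Net.Http;
using System.Threading;

public class LifoHttpClientPool : IDisposable
{
    private readonly ConcurrentStack<(long ReleasedAt, HttpClient Client)> _idle =
        new ConcurrentStack<(long ReleasedAt, HttpClient Client)>();
    private readonly long _maxIdleTimeInStopwatchTicks;
    private readonly Timer _scavenger;

    public LifoHttpClientPool(TimeSpan maxIdleTime)
    {
        _maxIdleTimeInStopwatchTicks = (long)(maxIdleTime.TotalSeconds * Stopwatch.Frequency);
        // Periodically sweep the pool so stale clients get disposed even if nobody pops them.
        _scavenger = new Timer(_ => Scavenge(), null, maxIdleTime, maxIdleTime);
    }

    public HttpClient GetClient(TimeSpan timeout, Func<HttpMessageHandler> handlerFactory)
    {
        // LIFO: the most recently released (hottest) client is reused first, so under
        // steady load the surplus clients stop having their timestamps refreshed.
        while (_idle.TryPop(out var entry))
        {
            if (Stopwatch.GetTimestamp() - entry.ReleasedAt >= _maxIdleTimeInStopwatchTicks)
            {
                entry.Client.Dispose();
                continue;
            }
            entry.Client.CancelPendingRequests();
            entry.Client.DefaultRequestHeaders.Clear();
            return entry.Client;
        }
        return new HttpClient(handlerFactory()) { Timeout = timeout };
    }

    public void ReleaseClient(HttpClient client)
    {
        _idle.Push((Stopwatch.GetTimestamp(), client));
    }

    private void Scavenge()
    {
        // Drain the stack, keep the fresh clients, dispose the stale ones.
        var keep = new List<(long ReleasedAt, HttpClient Client)>();
        while (_idle.TryPop(out var entry))
        {
            if (Stopwatch.GetTimestamp() - entry.ReleasedAt >= _maxIdleTimeInStopwatchTicks)
                entry.Client.Dispose();
            else
                keep.Add(entry);
        }
        // Push back oldest-first so the most recently released ends up on top again.
        for (var i = keep.Count - 1; i >= 0; i--)
            _idle.Push(keep[i]);
    }

    public void Dispose()
    {
        _scavenger.Dispose();
        while (_idle.TryPop(out var entry))
            entry.Client.Dispose();
    }
}

The scavenge races benignly with Get/Release (a client may be popped and pushed back in a slightly different order), but nothing is ever dropped or double-disposed.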
Comments
This is a fun bug, I didn't spot it.
_maxIdleTimeInStopwatchTicks might be a bug in itself, because Stopwatch timestamps are not TimeSpan ticks. They are not a constant unit across machines.
@mark That's disconcerting, since I tend to use Stopwatch for benchmarking.
Mark, yes, the name is a bit misleading, it uses:
_maxIdleTimeInStopwatchTicks = (long)((value / 1000.0) * Stopwatch.Frequency);
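In other words, it converts a value in milliseconds into Stopwatch units: Stopwatch.GetTimestamp() advances at Stopwatch.Frequency units per second, which is generally not the same as TimeSpan's fixed 100ns ticks. A quick self-contained illustration (the millisecond value here is just an example):

using System;
using System.Diagnostics;
using System.Threading;

class StopwatchUnitsDemo
{
    static void Main()
    {
        // Stopwatch timestamps advance at Stopwatch.Frequency units per second,
        // which is generally not the same as TimeSpan's fixed 100ns ticks.
        Console.WriteLine("Stopwatch.Frequency:     " + Stopwatch.Frequency + " units/second");
        Console.WriteLine("TimeSpan.TicksPerSecond: " + TimeSpan.TicksPerSecond + " ticks/second");

        // The conversion from the line above: a max idle time given in
        // milliseconds, expressed in Stopwatch units.
        long value = 30000; // 30 seconds, as an example
        long maxIdleTimeInStopwatchTicks = (long)((value / 1000.0) * Stopwatch.Frequency);
        Console.WriteLine("Max idle in Stopwatch units: " + maxIdleTimeInStopwatchTicks);

        // Going the other way: a measured delta back into milliseconds.
        long start = Stopwatch.GetTimestamp();
        Thread.Sleep(50);
        double elapsedMs = (Stopwatch.GetTimestamp() - start) * 1000.0 / Stopwatch.Frequency;
        Console.WriteLine("Measured elapsed: " + elapsedMs.ToString("F1") + " ms");
    }
}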
I'm curious why you're pooling HttpClient instances in the first place. It seems like it was designed with the intention that you would have one shared instance - the interesting methods are all threadsafe and each instance has an internal connection pool. Did you benchmark that out and find it was faster to do it this way rather than letting the HttpClient connection pool do its job?
Brian, I'm tracking down the original reasoning, but I think this was done because we have some code that allows customizing the request, and this was the most straightforward way to handle that when upgrading from WebRequest.
We do have some code that modifies the client, but we might be able to modify the HttpMessage we post instead.
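For reference, a rough sketch of what that alternative might look like: a single shared HttpClient, with the per-operation customization applied to the request message rather than to the client. The header name here is made up for the example.

using System;
using System.Net.Http;
using System.Threading.Tasks;

public static class SharedClientExample
{
    // HttpClient is meant to be shared; its send methods are thread safe and it
    // maintains its own connection pooling underneath.
    private static readonly HttpClient Client = new HttpClient
    {
        Timeout = TimeSpan.FromSeconds(100)
    };

    public static async Task<string> SendAsync(Uri url, string operationId)
    {
        var request = new HttpRequestMessage(HttpMethod.Get, url);

        // Per-request customization goes on the message, not on the shared client,
        // so concurrent requests cannot observe each other's headers.
        request.Headers.Add("X-Operation-Id", operationId);

        using (var response = await Client.SendAsync(request))
        {
            response.EnsureSuccessStatusCode();
            return await response.Content.ReadAsStringAsync();
        }
    }
}

Anything that has to live on the handler (credentials, for example) would still need different handling, e.g. one shared client per credential set.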