Debugging memory issues with RavenDB using WinDBG
We got a database export that showed a pretty rough case for RavenDB: a small set of documents (around 7K) that fan out through multiple map/reduce indexes into roughly 70K to 180K entries (depending on which index is used).
As you can imagine, this puts quite a load on the system. I tried the usual methods (dotTrace, hand picking, etc.) and we did get some good results from that; we found some pretty problematic issues along the way, which was good. But the RavenDB process was still taking way too much memory, which meant it was time to pull out the big gun: WinDBG.
I took a dump of the process when it was using about 1 GB of memory, then I loaded that dump into WinDBG (6.2.9200.20512 AMD64).
I loaded SOS:
.loadby sos clr
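If symbols are not set up yet, something along these lines takes care of that first (pointing WinDBG at the public Microsoft symbol server and reloading):
.symfix
.reload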
Then, the first thing to do was to try to see what is going on with the threads:
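The SOS !threads command (or plain ~ for the native thread list) is the usual way to get that overview:
!threads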
As you can see, we have a small number of threads, but nothing to write home about. The next step is to see if we have anything very big in the heap:
!dumpheap -stat
The first number is the method table address, the second is the count, and the third is the total size. The problem is that all of that combined doesn't come anywhere near the amount of memory the process is actually taking.
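A quick sanity check, rather than summing the !dumpheap output by hand, is to ask SOS for the GC heap totals and compare them against the overall committed memory for the process, along these lines:
!eeheap -gc
!address -summary
If the GC heap total is far below the committed figure, the difference is sitting outside the managed heap (native allocations, thread stacks, memory mapped files, and so on).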
I guess it is possible that we hold a lot of data in the stack, especially since the problem is likely caused by indexing.
I decided to map all of the threads and see what they are doing.
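Switching to a thread is done with the ~ command (the thread index followed by s), and the stack comes from the SOS !clrstack command (or kb for the native frames). For the first thread that is roughly:
~1s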
Switches me to thread #1, and we can see that it is currently waiting. Dumping the stack reveals:
This seems to be the debugger thread. Let us look at the rest:
- 2 – Finalizer
- 3 – Seems to be an inactive thread pool thread.
- 6 – appears to be an esent thread:
I am not sure what this is doing, and I am going to ignore it for now.
- 7 – also an esent thread:
And… I got tired of this, and decided to do something more productive: selectively disabling things in RavenDB until I find something that drops the memory usage enough to be worthwhile.
Whack-a-mole might not be a great debugging tool, but it is the essence of binary search. Or so I tell myself to make my conscience sleep more easily.
For reference, you can look at the dump file here.
Comments
http://memprofiler.com is a great memory profiler. See also http://stackoverflow.com/questions/399847/net-memory-profiling-tools and http://www.apress.com/9781430244585 for alternatives.
windbg is not really user friendly. Even helped by Tess's blog here http://blogs.msdn.com/b/tess/archive/tags/memory+issues/default.aspx?PageIndex=1 I can't figure it out.
Keep in mind !dumpheap shows the managed heap. It won't show you the unmanaged memory used by the process, and if you're using esent that points more fingers toward the issue being in unmanaged memory.
Loading the dmp and using !address -summary shows 1.15 GB of committed memory, but as you note the problem does not seem to be in a managed heap since !dumpheap, !eeheap and !heapstat all show memory consumption less than 100 MB.
This seems to indicate most memory is being allocated and used by unmanaged code.
You mentioned that you are using ESENT. Did you see another user ran into a problem with Managed ESENT that caused over 1 GB of memory to be consumed? http://managedesent.codeplex.com/discussions/276175
As your managed heap is very low, have you considered using DebugDiag's LeakTrack to try and catch native allocations that leak?
My immediate thought when I saw this was Esent too. Setting the cache parameters solved a similar problem for me. I believe there are actually some config options for this in RavenDB, but you probably know that ;)
Roman, I am pretty sure that the problem is not in managed code, so memprofiler isn't likely to help me here.
Chris, I am pretty sure it is an Esent issue, but I am also pretty sure that it isn't the cache. We added some diagnostics, and the cache takes ~400MB, so that isn't counting for a lot of the data, still investigating.
A couple of tools I've found helpful (fast/easy/simple to use) for memory investigation:
http://www.microsoft.com/en-us/download/details.aspx?id=16273
Much like GitHub's situation with "the bridge loop problem that wasn't there" (https://github.com/blog/1346-network-problems-last-friday), I find it important to use the coarse-granularity tools available to help narrow down the problem (even if it's just to verify a 'hunch') before bringing out tools like windbg :)
Of course, that's exactly what Ayende did here; just reiterating that it's a great idea. :)
Ayende, I had to mention that memprofiler partially analyses non-managed code. See http://memprofiler.com/OnlineDocs, "Native Memory Page" section. There is no guarantee that it will solve your particular issue, but it is free to try.
As for the book, it doesn't cover memprofiler in any detail, but provides several good strategies for memory profiling in general. Not sure they will be new for you, though.
Note that profiling your process with memprofiler's unmanaged resources tracker enabled will give you more information than using it to analyze a dump file.
You should create several dumps while the memory is growing, to spot a pattern in one of the threads and see where it is actually allocating something. Looking at the threads only while they are not allocating will not lead the way. With VMMap from SysInternals you can also have a look at the contents and the newly allocated regions; that can give some clue as to what the stuff taking up so much space actually is.
Ayende, have you tried to compare memory usage of ESENT vs Munin?