RavenDB Sharding: Enabling shards for an existing database
A question came up in the mailing list: how do we enable sharding for an existing database? I'll deal with data migration in this scenario in a later post.
The scenario is that we have a very successful application, and we are starting to feel the need to move the data to multiple shards. Currently all the data is sitting on the RVN1 server. We want to add RVN2 and RVN3 to the mix. For this post, we'll assume that we have the notion of Customers and Invoices.
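For reference, here is a minimal sketch of those two classes as assumed throughout this post (the property names match the sharding configuration shown below; everything else is illustrative):

public class Customer
{
    public string Id { get; set; }
    public string Name { get; set; }
    public string Region { get; set; } // used as the shard key below
}

public class Invoice
{
    public string Id { get; set; }
    public string Customer { get; set; } // the customer document id, e.g. "customers/1"
    public decimal Amount { get; set; }
}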
Previously, we accessed the database using a simple document store:
var documentStore = new DocumentStore
{
    Url = "http://RVN1:8080",
    DefaultDatabase = "Shop"
};
Now, we want to move to a sharded environment, so we want to write it like this. Existing data is going to stay where it is, and new data will be sharded according to geographical location.
var shards = new Dictionary<string, IDocumentStore>
{
    {"Origin", new DocumentStore {Url = "http://RVN1:8080", DefaultDatabase = "Shop"}}, // existing data
    {"ME", new DocumentStore {Url = "http://RVN2:8080", DefaultDatabase = "Shop_ME"}},
    {"US", new DocumentStore {Url = "http://RVN3:8080", DefaultDatabase = "Shop_US"}},
};

var shardStrategy = new ShardStrategy(shards)
    .ShardingOn<Customer>(c => c.Region)
    .ShardingOn<Invoice>(i => i.Customer);

var documentStore = new ShardedDocumentStore(shardStrategy).Initialize();
This wouldn't actually work; we are going to have to do a bit more. To start with, what happens when we don't have a 1:1 match between region and shard? That is where the translator becomes relevant:
.ShardingOn<Customer>(c => c.Region, region =>
{
    switch (region)
    {
        case "Middle East":
            return "ME";
        case "USA":
        case "United States":
        case "US":
            return "US";
        default:
            return "Origin";
    }
})
We basically say that we map several region values into a single shard. But that isn't enough. Newly saved documents are going to have the shard prefix, so saving a new customer and invoice in the US shard will show up with ids like US/customers/1 and US/invoices/1.
But existing data doesn't have this prefix, since it was created without sharding.
So we need to take some extra steps to let RavenDB know about them. We do this using the following two functions:
Func<string, string> potentialShardToShardId = val =>
{
    var start = val.IndexOf('/');
    if (start == -1)
        return val;
    var potentialShardId = val.Substring(0, start);
    if (shards.ContainsKey(potentialShardId))
        return potentialShardId;
    // this is probably an old id, let us use it.
    return "Origin";
};

Func<string, string> regionToShardId = region =>
{
    switch (region)
    {
        case "Middle East":
            return "ME";
        case "USA":
        case "United States":
        case "US":
            return "US";
        default:
            return "Origin";
    }
};
We can then register our sharding configuration like so:
var shardStrategy = new ShardStrategy(shards)
    .ShardingOn<Customer, string>(c => c.Region, potentialShardToShardId, regionToShardId)
    .ShardingOn<Invoice, string>(x => x.Customer, potentialShardToShardId, regionToShardId);
That takes care of handling both new and old ids, and lets RavenDB understand how to query things in an optimal fashion. For example, a query on all invoices for 'customers/1' will only hit the RVN1 server.
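For instance, a minimal sketch of such a query (assuming the Customer/Invoice classes from above):

using (var session = documentStore.OpenSession())
{
    // Invoice is sharded on its Customer property, so RavenDB can resolve
    // "customers/1" (no shard prefix) to the Origin shard and send this
    // query to RVN1 only, instead of fanning out to all three servers.
    var invoices = session.Query<Invoice>()
        .Where(i => i.Customer == "customers/1")
        .ToList();
}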
However, we aren't done yet. New customers that don't belong to the Middle East or the US will still go to the old server, and we don't want any modification to their ids there. We can tell RavenDB how to handle that like so:
var defaultModifyDocumentId = shardStrategy.ModifyDocumentId;
shardStrategy.ModifyDocumentId = (convention, shardId, documentId) =>
{
    if (shardId == "Origin")
        return documentId;
    return defaultModifyDocumentId(convention, shardId, documentId);
};
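To illustrate the effect (a sketch; the exact generated ids depend on your id generation conventions):

using (var session = documentStore.OpenSession())
{
    session.Store(new Customer { Region = "US" });     // goes to Shop_US, id like US/customers/2
    session.Store(new Customer { Region = "Europe" }); // falls through to Origin, id stays like customers/3
    session.SaveChanges();
}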
That is almost the end. There is one final issue we need to deal with: the old documents, created before we used sharding, don't have the required sharding metadata. We can fix that using a store listener. So we have:
var documentStore = new ShardedDocumentStore(shardStrategy);
documentStore.RegisterListener(new AddShardIdToMetadataStoreListener());
documentStore.Initialize();
Where the listener looks like this:
public class AddShardIdToMetadataStoreListener : IDocumentStoreListener
{
    public bool BeforeStore(string key, object entityInstance, RavenJObject metadata, RavenJObject original)
    {
        if (metadata.ContainsKey(Constants.RavenShardId) == false)
        {
            metadata[Constants.RavenShardId] = "Origin"; // the default shard id
        }
        return false; // we didn't modify the entity itself, only the metadata
    }

    public void AfterStore(string key, object entityInstance, RavenJObject metadata)
    {
    }
}
And that is it. I know that there seems to be quite a lot going on here, but it can basically be broken down into three actions that we take:
- Modify the existing metadata to add the sharding server id via the listener.
- Modify the document id convention so documents on the old server won't get a shard prefix (optional).
- Modify the sharding configuration so we’ll understand that documents without a shard prefix actually belong to the Origin shard.
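Putting it all together, the final setup reads roughly like this (just a condensed recap of the snippets above, no new API):

var shardStrategy = new ShardStrategy(shards)
    .ShardingOn<Customer, string>(c => c.Region, potentialShardToShardId, regionToShardId)
    .ShardingOn<Invoice, string>(x => x.Customer, potentialShardToShardId, regionToShardId);

var defaultModifyDocumentId = shardStrategy.ModifyDocumentId;
shardStrategy.ModifyDocumentId = (convention, shardId, documentId) =>
    shardId == "Origin" ? documentId : defaultModifyDocumentId(convention, shardId, documentId);

var documentStore = new ShardedDocumentStore(shardStrategy);
documentStore.RegisterListener(new AddShardIdToMetadataStoreListener());
documentStore.Initialize();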
And that is pretty much it.
More posts in "RavenDB Sharding" series:
- (22 May 2015) Adding a new shard to an existing cluster, splitting the shard
- (21 May 2015) Adding a new shard to an existing cluster, the easy way
- (18 May 2015) Enabling shards for existing database
Comments
Idsa,
1) You are correct, I implicitly assumed that in the post.
2) I absolutely agree that it is much simpler to import/export everything that way. It would result in a cleaner system, without legacy patches. When you can't take the system offline, however, you have the option of doing it the hard way.
3) That is great.
4) The Load issue you found is a pretty obscure thing: a transformer that returns an id with the same value. This is fixed now, but it isn't something that you would generally notice, which is why we missed it.
5) It is possible to add new nodes to a shard, yes. And we have plans to do sharding at a deeper level in a future version.
We had an established database in production. Our large documents and expensive indexes (the fault of our less than ideal architecture, not Raven's) were choking the server. (Why is another story.) We have a very bursty load pattern once a week, and we gained so many new users last year that the one RavenDB server couldn't keep up.
Last fall (2014) we did what was described here, plus wrote a custom ShardResolutionStrategy (because we didn't have a good natural key that was evenly split across the shards). It took 3 developers 3 days, plus one day of ops—creating the new shard instances, manually moving documents to the new shards, etc. It was quick enough that we pulled it off before the next load spike, and it worked beautifully; the shards have been running smoothly ever since.
I was very impressed with the process, overall. It ended up being easier than I thought it would be.
That being said, there are some rough spots that, if addressed, would make the process a lot nicer.
All that said, RavenDB is one of the better single-to-sharded data store stories I've been involved with. So kudos Hibernating Rhinos, and thanks for continuing to improve it even more.
1) Can you give us some examples, so we can create something that is a bit more real world?
2) That is basically what the smuggler is doing here. And the idea is that you can use a transform script to move and transform the data. Having a custom tool for something like this doesn't make a lot of sense, since the logic in the resharding is usually different from the actual sharding logic.
3) That is likely not going to happen. The sharding resolution is in your code, and the studio isn't aware that this exists. We have plans for a more complete sharding impl on the server side, which may have this, but that would be for 4.0 only, and isn't much beyond some rough drafts now.
4) That is actually by design. The example about the version is a very good one. We don't have a way to report that in a sane manner. A server has a single version, but a cluster may have many. How do you report that on the same interface?
5) What tools do you refer to? Usually they can be made to work by using IDocumentStore, but there are things that work directly with the internal state, which obviously cannot be made to work.
RE example shard resolution strategy: The PotentialShardsFor method has to (potentially) check whether it is a query- or key-based lookup, then whether it is a lookup by a known EntityType or not, whether it is a direct load versus a query with an IN operator, formatting differences between a single value versus CSV values when loading with an array of IDs, etc. So I mean the tree of possible (or at least common) scenarios for all the ways a document or query can be requested over the client API and put into the fairly loosely structured ShardRequestData object.
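To make that concrete, here is a rough sketch of the kind of branching being described, as a fragment of an IShardResolutionStrategy implementation (the ShardRequestData members used here are from memory of the 3.x client and should be treated as assumptions; potentialShardToShardId is the prefix parser from the post above):

public IList<string> PotentialShardsFor(ShardRequestData requestData)
{
    // Key-based lookup: session.Load("US/customers/1"), or Load with an array of ids.
    if (requestData.Keys != null && requestData.Keys.Count > 0)
    {
        return requestData.Keys
            .Select(potentialShardToShardId) // parse the "US/"-style prefix, if any
            .Distinct()
            .ToList();
    }

    // Query-based lookup: all we have is the entity type and the query itself,
    // so unless we can parse something useful out of it, we fan out to every shard.
    return shards.Keys.ToList();
}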
RE resharding / moving documents between shards and "the logic in the resharding is usually different from the actual sharding logic": I figured this tool, let's call it a 'rebalancing' tool, would iterate over all documents in every shard, apply the new ShardResolutionStrategy to each, check the current document's metadata for the current shard key, and if it is missing or different from the new shard key for that document, atomically (via DTC) move the document to the new shard. In that case, it wouldn't matter what the old logic was; it only would need to know the new logic. (To be a little safer, it might require that the shards be offline so the clients don't have to deal with the mess of supporting both shard strategies at the same time. Same with the 'duplicate document' client exception during the move.) It might not be the most perfect design to check the shard strategy for every document, but if it was done while offline (possibly at the server level? something it can't do yet, but see #3 below...) it could be a decent trade-off for a very useful sharding tool that would make many scenarios a lot easier.
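In pseudocode, the proposed rebalancing pass would look something like this (no such tool exists; IterateAllDocuments, ShardIdFor, and MoveDocument are made-up helper names standing in for the steps described above):

foreach (var shard in shards)
{
    foreach (var doc in IterateAllDocuments(shard.Value)) // hypothetical: stream every document in the shard
    {
        var currentShardId = doc.Metadata.Value<string>(Constants.RavenShardId);
        var newShardId = newStrategy.ShardIdFor(doc);      // hypothetical: apply only the *new* strategy
        if (newShardId != currentShardId)
            MoveDocument(doc, shard.Value, shards[newShardId]); // hypothetical: the atomic move
    }
}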
RE client admin UI knowing about shards: Server-based shard knowledge could help in v4. But for now, ignoring that possible feature, in v3 there is a disconnect between the two clients' knowledge: code clients have to know about shards, but the UI client can't know about the shard strategy. Perhaps the shard strategy code needs to be either on the server (as you suggested might come in v4) or written in a language that all systems can understand, like the Patch Requests do with their limited but RavenDB-defined set of JavaScript. That way the shard code can be added to the UI as an upload, a server-based configuration, or a server plugin DLL--any way for the admin UI to understand the relationship of the shards just like the clients do.
The way my company has been working around this for now is to build our own admin tools directly in the app. But doesn't that defeat the point of the RavenDB admin UI then? Shouldn't it know about and be able to operate with core RavenDB features to start with? Why do client RavenDB libraries have a ShardedDocumentStore, but the admin UI does not? I know the UI is currently written with the concept that it administers a single server in isolation. If that's the final purpose of the admin UI, then I guess you can just shrug off this problem. But if you want to treat the RavenDB admin UI as an enterprise level administration and query/debug access tool for a RavenDB "system" then it will need to start thinking of servers as nodes in a larger system, instead of just being an HTML/JS web page hosted under and connected to a single isolated server.
Switching the admin UI to think about something beyond a single Raven instance is a big task and possibly outside your business plan, so I'm not sure if you want to do that or not. But it is the next step in the enterprise tool chain in my opinion.
RE enabling all features when sharding: I imagine the client API would have to be updated to support returning an array or map of results from each shard. It would require a little bit more work on the consuming client code to process those results (unlike the current way where queries across shards are automatically re-assembled from separate HTTP requests into one data list returned from the RavenDB client API--which I also have a few problems with when it comes to debugging and paging... but that's another topic). But it would still be easier to consume than dealing with the internal ShardedDocumentStore shard list and state (and lack of individual shard DocumentStore initialization) that any consumer code has to use now.
RE tooling that breaks when using ShardedDocumentStore: The specific tools that broke for us were New Relic monitoring and Glimpse.RavenDB for debugging/profiling. Both of those tools can only be initialized with ProductName.SomeMethodToInitialize(DocumentStore). I assume many others would be in the same boat, because IDocumentStore doesn't define several properties, methods, and listeners that are provided in the DocumentStore only but are needed for various inspection tools like these.
I tried to switch Glimpse.RavenDB (http://www.nuget.org/packages/Glimpse.RavenDb) to use IDocumentStore, but the lack of some of the RavenDB listener hooks by a ShardedDocumentStore was a blocker. I think JsonRequestFactory.LogRequest, SessionCreatedInternal, and a couple others were some of the ones that could be attached to but were never called by ShardedDocumentStore--only by DocumentStores but not the ones inside the shard list in a ShardedDocumentStore. (I forget exactly which listeners still worked and which didn't.) So in this particular case, adding the missing listeners to the ShardedDocumentStore would make it a lot easier. But the point remains that it is a significant barrier to working with the RavenDB client stores abstractly when IDocumentStore interface doesn't define enough to cover all the available features.
Also, internal tools broke that we wrote to issue database commands to update indexes and return status information. Of course, I would assume the status information would still need to be per server and not aggregated across the shards. But since the indexes have to exist on every shard--otherwise the entire request fails for all shards (another quirk that has bitten us, and one I still haven't decided is a good safe-by-default rule or not)--aggregating index updates across shards, for example, would make complete sense to do with one command that is automatically applied to all applicable shards. Going through every database command and deciding which make sense across shards and which don't may get a little awkward though, so I'm sure there is a lot of room for discussion on this suggestion.
This comment is getting too long... We can move this to the discussion group if you have more questions about my thoughts; let me know if you would like me to.
Oops, regarding the "tools that broke when using a ShardedDocumentStore", I meant just Glimpse.RavenDB and our custom admin tools to run various database commands -- not New Relic.
1) We will probably change that, see: http://issues.hibernatingrhinos.com/issue/RavenDB-3488
2) We are NOT going to use DTC. To start with, it is going to create locks and cause a whole lot of issues. The problem with doing things this way is that you are going to generate a LOT of work on large systems, and you want to shard when you have a large system, after all. That means a big downtime.
3) That is a 4.0 feature that we are considering.
Note that tools like Glimpse.RavenDB need to be initialized for _each individual document store_, not on the ShardedDocumentStore.
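In other words (a sketch, using the same placeholder initializer name as the comment above; the cast assumes each shard entry is a concrete DocumentStore, as in the post's setup):

foreach (IDocumentStore shard in shards.Values)
{
    // Attach the profiler to each concrete shard store; the ShardedDocumentStore
    // wrapper doesn't raise the hooks these tools rely on.
    ProductName.SomeMethodToInitialize((DocumentStore)shard);
}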
And yes, I agree that this should probably go to the mailing list.