Raven Replication: Scenarios
I am currently writing the tests for the replication bundle. I managed to overcome the biggest problem (my stupidity), and we now have some passing tests :-)
What I am looking for is more scenarios like this, so we would have as many tested scenarios as possible. I don't want code, or anything like that, just give me a set of scenarios to try against the replication system.
Feel free to make them as complex as you wish.
Here is an example of how I think about the tests that we currently have:
Scenario | Command | Result
Replicating PUT | PUT raven1/docs/ayende | 200
 | GET raven2/docs/ayende | 200
Replicating DELETE | PUT raven1/docs/ayende | 200
 | GET raven2/docs/ayende | 200
 | DELETE raven1/docs/ayende | 201
 | GET raven2/docs/ayende | 404
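To make the table above concrete, here is a rough sketch of how these two scenarios could be driven over HTTP. This is only an illustration in Python using requests; the server URLs and the polling loop that waits for replication to catch up are assumptions, not the actual test code.

```python
# Illustration only: the raven1/raven2 base URLs and the polling interval
# are assumptions, not taken from the real test suite.
import time
import requests

RAVEN1 = "http://raven1:8080"  # assumed address of the source server
RAVEN2 = "http://raven2:8080"  # assumed address of the destination server

def wait_for_status(url, expected, attempts=20, delay=0.25):
    """Poll the destination server until replication catches up (or give up)."""
    for _ in range(attempts):
        if requests.get(url).status_code == expected:
            return True
        time.sleep(delay)
    return False

# Replicating PUT: the document should show up on the second server
requests.put(f"{RAVEN1}/docs/ayende", json={"name": "ayende"})
assert wait_for_status(f"{RAVEN2}/docs/ayende", 200)

# Replicating DELETE: the delete should be replicated as well
requests.delete(f"{RAVEN1}/docs/ayende")
assert wait_for_status(f"{RAVEN2}/docs/ayende", 404)
```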
Comments
I'd probably test the usual 'bank transfer' scenario, ensuring transactions are atomic across replication too?
What about concurrent transactions, does replication update correctly there?
Andrew,
That is pretty meaningless, since Raven is transactional, and the replication system only operates on committed data.
Are you sure you have defeated your stupidity? I've been trying for thirty years with little effect...
Have you tested replication of indexes as well? Are they stored as data?
Robert,
No, indexes aren't replicated. This is because it is easier to just compute them than to send them on the wire
With CouchDB, I've been looking at using replication in two ways.
1) Using HAProxy as a load balancer, then using CouchDB to continuously replicate between the two nodes in a master-master approach. This in effect creates a database cluster in a very cost effective fashion - try doing that in SQL Server...
2) Similar fashion to above by using HAProxy, but in a master-child approach. The master holds all the data, then each child holds a subset done via the Replication Filters. Thinking about using this for geo-location of catalog data between our different stores... Having everything geo-located would be expensive; having just the subset would solve that problem. If a node goes down, then HAProxy falls back to the master, or more ideally to another node which holds the data.
I've been meaning to write a blog post about it....
Are these the kind of scenarios you are looking for?
Ben
Ben,
That helps, yes, although I was thinking about stuff lower down the stack.
With the master / master approach, don't you find that you get conflicts?
The expectation is that you'll set up the indexes on all machines as part of your initial setup.
Raven's indexes are the closest thing to a schema that it has. It doesn't make sense to replicate the indexes, because you might not want to pay the indexing cost on a backup only copy, or might want to have different projections.
Moving the index data is too costly when we can just recalculate it.
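For example, a setup script that pushes the same index definitions to every node keeps the machines identical without any index replication. A minimal sketch, assuming a hypothetical /indexes/<name> endpoint and payload shape (not necessarily the real HTTP API):

```python
# Hypothetical sketch: apply one index definition to every server at setup time.
# The /indexes/<name> endpoint and the Map payload are assumptions about the API.
import requests

SERVERS = ["http://raven1:8080", "http://raven2:8080"]  # assumed node URLs

INDEX_NAME = "DocsByName"
INDEX_DEFINITION = {"Map": "from doc in docs select new { doc.name }"}  # assumed shape

for server in SERVERS:
    response = requests.put(f"{server}/indexes/{INDEX_NAME}", json=INDEX_DEFINITION)
    response.raise_for_status()  # fail the setup script if any node rejects the index
```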
Why not replicate just the index definitions?
Jesus,
Because it removes the ability to have different indexes on different machines.
It complicates things because we now have to track whether an index was changed or not.
It means that we need to track when an index was changed, in case the user wanted to force an index refresh.
I don't see the benefit.
Ayende,
I see one benefit in the high availability scenario: simpler administration. Since you need both machines identical to serve the same requests, if automatic replication of indices is not in place, you need to repeat all index creations and modifications in all servers manually.
I'm not saying that all replicated servers must have the same indexes, but I think having the option to replicate index definitions to other servers would be nice.
I'd have to agree there; I'd prefer (and expect) indexes to be replicated (their definitions at least; let each server calculate the data though).
Simpler administration, and it's the expected behavior.
Even on a backup server, I'd opt to have it replicated - surely it doesn't matter if it is or isn't.
Minor point. Should the DELETE scenario return a 200 (OK) instead of a 201 (Created)?
Question about RavenDb.
CouchDB takes the approach that views are not indexed until the view is requested (in order to save CPU cycles). Personally I disagree with this approach because I would prefer that the view be ready for me, so to speak.
Also, views in Couch do not return stale results by default unless you pass the stale:"ok" param. I would in the majority of cases prefer to have the stale results and then have the view index update async after the request.
I'd be really interested to hear what approach you are taking with Raven and your thought process behind it.
Dokieboy,
Raven actually returns 204 (No Content), that is a typo on my part.
Martin,
Indexes in Raven are built in the background, and queries will return stale results (with the appropriate notification).
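If a caller needs non-stale results, one option is simply to retry the query until the index has caught up. A minimal sketch, assuming a hypothetical query URL and an IsStale flag in the response (the real API may differ):

```python
# Hypothetical sketch: re-run a query while the index reports stale results.
# The query URL and the "IsStale" field are assumptions about the response shape.
import time
import requests

def query_until_not_stale(server, index, query, attempts=20, delay=0.25):
    result = None
    for _ in range(attempts):
        result = requests.get(f"{server}/indexes/{index}",
                              params={"query": query}).json()
        if not result.get("IsStale", False):
            break
        time.sleep(delay)
    return result  # may still be stale if the index never caught up

results = query_until_not_stale("http://raven1:8080", "DocsByName", "name:ayende")
```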