Metrics hunt in RavenDB

time to read 1 min | 103 words

You might have noticed that we are paying a lot of options for operational concerns in RavenDB 3.0. This is especially true because we moved away from performance counters to metrics.net, which means that is is much easier and light weight to add metrics to RavenDB.

As a result of that, we are adding a lot of stuff that will be very useful for ops team. From monitoring the duration of queries to the bandwidth available for replication to a host of other stuff.

What I wanted to ask is what kind of things do you want us to track?

Tweet Share Share 9 comments

Tags:

raven

Comments

29 Jul 2014
09:12 AM

Paul Stovell

One of the questions we're trying to answer at the moment is whether to make some big changes to our indexes. I'd like to answer this question: Over the last hour, how much CPU time, memory and I/O was spent on indexing (and ideally by which indexes)?

This way when I change my index and put it under some load, I can see whether my improved index is going to reduce memory/CPU/I/O or increase it.

(If there's a way to do this currently in Raven I'd love to know)

29 Jul 2014
09:49 AM

Afif

a report of which are the most frequent queries, which are the most expensive queries, and what operations are taking the most juice over a period of time would be great.

30 Jul 2014
00:29 AM

Mufasa

How about more info regarding long running background file operations, like the status when deleting a large index.

More importantly though, query/index efficiency (or inefficiencies). Like if the entire document is being loaded to satisfy a query when a projection could (or should) have been requested/used instead. Or metrics to know if/when transformers are loading related documents. Or how many CPU and IO ops it takes to satisfy an index map/reduce result (to find indexes that are doing more work than expected).

30 Jul 2014
07:43 AM

Ayende Rahien

Paul, You can see something pretty close to it in the indexing /stats. There is a performance option that list the input and duration for the indexing. Calculating memory / cpu is pretty hard, because we don't really have access to it.

30 Jul 2014
07:44 AM

Ayende Rahien

Afif, Queries in RavenDB tend to be pretty short, as far as actually querying the db. Most of the "expensive" queries we have seen are actually getting large amount of data, and thus take a lot of time to send over the network.

30 Jul 2014
07:52 AM

Ayende Rahien

Mufasa, We have added query details that will let you know how much time was spent executing the query on the index, how much loading the data from the database, etc. You can add this using ShowTimings() on the query.

30 Jul 2014
09:31 AM

Iulian Margarintescu

@Ayende Not sure if you are interested, but I've created another .NET port of the java Metrics library. (the reasons are in the readme - but the main one is that Daniel's port is not actively developed anymore).

The code is available here https://github.com/etishor/Metrics.NET The docs are in the wiki: https://github.com/etishor/Metrics.NET/wiki The NuGet package: Metrics.NET

My main focus was to provide the simplest api possible for the consumers of the library.

I would appreciate it if you could take a look and share your opinions.

30 Jul 2014
11:06 AM

Ayende Rahien

Lulian, Take a look at my original post about Metrics.NET A lot of the same issues apply to the code you have there.

05 Aug 2014
13:22 PM

Iulian Margarintescu

Ayende,

When doing the port I actually used parts of your post as inspiration :)

Code is targeting 4.5, 4.5.1 & mono, there is a separate branch with 4.0 support - so no thread sleep or thread waiting.

A metric does not depend on a type, it only has a string name. There is an overload method that can take a type parameter that is used to build the name based on the type, but that is completely optional. Since I've been using the lib I've mostly preferred explicit metric names that don't depend on the type name.

Also there is no assumption of a static, fixes set of metrics. New metrics can be added to a registry at any time ( take a look at how metrics for each request are added the first time the request is made in the NancyFx adapter ). It is true, metrics can't be removed from a registry - but since metrics are cheap I can't see why you would remove a metric.

For convenience a static class is used to configure metrics and one default metrics registry. This is because this is the most common use case.

I have not really targeted multi-registry scenarios (yet) but there should be nothing stopping you from using multiple registries that can be created or destroyed at any time. I've added a sample of how multi registry would be done here: https://github.com/etishor/Metrics.NET/blob/dev/Samples/Metrics.Samples/MultiRegistryMetrics.cs

It is on my todo list to improve the way you would manage multiple metric "sets" by providing utility classes or apis and also to improve the reporters/visualizers to account for multiple metric sets.

Don't get me wrong, i'm not saying you should use my lib, not trying to convince you of anything, but considering the scenario where you need metrics (RavenDb) and also considering your experience in building developer friendly stuff - i'm very interested in fixing or improving anything you would consider an issue.

Thanks, iulian

Comment preview

Comments have been closed on this topic.

Markdown turns plain text formatting into fancy HTML formatting.

Phrase Emphasis

*italic*   **bold**
_italic_   __bold__

Links

Inline:

An [example](http://url.com/ "Title")

Reference-style labels (titles are optional):

An [example][id]. Then, anywhere
else in the doc, define the link:
  [id]: http://example.com/  "Title"

Images

Inline (titles are optional):

![alt text](/path/img.jpg "Title")

Reference-style:

![alt text][id]
[id]: /url/to/img.jpg "Title"

Headers

Setext-style:

Header 1
========
Header 2
--------

atx-style (closing #'s are optional):

# Header 1 #
## Header 2 ##
###### Header 6

Lists

Ordered, without paragraphs:

1.  Foo
2.  Bar

Unordered, with paragraphs:

*   A list item.
    With multiple paragraphs.
*   Bar

You can nest them:

*   Abacus
    * answer
*   Bubbles
    1.  bunk
    2.  bupkis
        * BELITTLER
    3. burper
*   Cunning

Blockquotes

> Email-style angle brackets
> are used for blockquotes.
> > And, they can be nested.
> #### Headers in blockquotes
> 
> * You can quote a list.
> * Etc.

Horizontal Rules

Three or more dashes or asterisks:

---
* * *
- - - -

Manual Line Breaks

End a line with two or more spaces:

Roses are red,   
Violets are blue.

Fenced Code Blocks

Code blocks delimited by 3 or more backticks or tildas:

```
This is a preformatted
code block
```

Header IDs

Set the id of headings with {#<id>} at end of heading line:

## My Heading {#myheading}

Tables

Fruit    |Color
---------|----------
Apples   |Red
Pears	 |Green
Bananas  |Yellow

Definition Lists

Term 1
: Definition 1
Term 2
: Definition 2

Footnotes

Body text with a footnote [^1]
[^1]: Footnote text here

Abbreviations

MDD <- will have title
*[MDD]: MarkdownDeep

Oren Eini

Oren Eini

CEO of RavenDB