io.grpc.StatusRuntimeException: ABORTED: PARTICIPANT_BACKPRESSURE(2,0): The participant is overloaded: participant rate limit exceeded (maximum rate: 200 commands/s)

App Development6 posts381 views2 likesLast activity Oct 2022

Daniel_PorterOP

Oct 2022

Not sure if this is a Canton specific question or not, but -

I’m doing load testing, and running into the issue that when I

have auth turned on, and
send more than 20 commands per second

I get the error The participant is overloaded: participant rate limit exceeded (maximum rate: 200 commands/s)

This happens regardless of whether or not:

I’m using only one party and one gRPC stream to send the requests
I’m using multiple parties on the same gRPC stream to send the requests
I’m using multiple parties on gRPC streams initialized with different tokens to send the requests

I’m using the scala bindings

the code is

    val start_time = System.nanoTime
    for (case ((party, token), client) <- res) {
      val iou  = Iou(party, party, "timbucks", 100, Nil)
      val result = for {
        lc <- client
        _ <- lc.commandServiceClient.submitAndWaitForTransaction(
          command_service.SubmitAndWaitRequest(
            Some(
              Commands(
                ledgerId = lc.ledgerId.unwrap,
                workflowId = UUID.randomUUID().toString,
                commandId = UUID.randomUUID().toString,
                party = party.unwrap,
                commands = Seq(iou.create.command)
              )
            )
          ),
          token
        )
        end_time   = System.nanoTime
        difference = (end_time - start_time) / 1e6
      } yield {
        clq.add(difference)
        difference
      }

My questions are twofold:

why is this happening? and
how can I get past it? I haven’t successfully located the commands per second config option, which seems like it might help.

jonas

Oct 2022

I believe maximum rate refers to this setting:

participant1.resources.set_resource_limits(
  ResourceLimits(
    // Allow for submitting at most 200 commands per second
    maxRate = Some(200),

    // Limit the number of in-flight requests to 500.
    // A "request" includes every transaction that needs to be validated by participant1:
    // - transactions originating from commands submitted to participant1
    // - transaction originating from commands submitted to different participants.
    // The chosen configuration allows for processing up to 100 requests per second
    // with an average latency of 5 seconds.
    maxDirtyRequests = Some(500),
  )
)

See Scaling and Performance — Daml SDK 2.4.0 documentation

As for a solution, have you tried batching commands? I did some small case testing where that seemed to help, but YMMV.

Daniel_Porter

Oct 2022

Well - I’m doing perf testing for daml hub, so batching commands unfortunately defeats the point. I will try bumping up the max rate and seeing if that makes a difference.

This does raise the question tho: am I actually sending 200 commands without knowing it? It’s not apparent to me from the participant logs. How can I determine that?

Daniel_Porter

Oct 2022

I have now confirmed that if I increase the maxRate to 400, I reach limits at 40 requests per second.

Either

my code has an off by 10 error I can’t see,
the client is opaquely sending 9 additional commands for my one, or
there’s some very unintuitive behavior with this maxRate command.

Not knowing the answer to this question is blocking for the task I’m working on: it leaves me unable to trust the numbers I’m producing. I’m also pretty sure it’s not #1.

Daniel_Porter

Oct 2022

I’ve determined that it’s #3 - very unintuitive behavior. In addition to the maxRate, there’s a burst rate that’s calculated based off of that with a divide by 10, and I was exceeding the burst rate. Gonna file a GH issue about this.

MatthiasSchmalz

Oct 2022

Sorry that you’re having trouble by this.

The RateLimiter processes commands in time windows of at least 100ms. If the limit is 200 commands/s, it will accept up to 20 commands within every time slice of 100ms. Thus, if you continuously keep submitting commands, it will accept 200 commands/s.

In your test, it accepts only 20 commands, because you submit all 200 commands at once and then you give up.

← Back to Discussions