# spicedb
j
Hi, I just want to sanity check my experience with the performance of the LookupResources API:
- In a system with about 8k relationships;
- Using
LookupResources
to query a resource type with a response size of around 1.1K resources;
I am seeing response times of around 300-500ms. This seems very high to me for the order of magnitude I am dealing with. Is this expected or does something seem wrong? I am seeing the same order of magnitude when running SpiceDB on our prod environment (with Spanner) and locally (with Postgres).
v
It will depend on what your schema looks like - some parts would be more expensive than others. Would you be able to share it? What version of SpiceDB are you using? Are you using cursors?
j
I can share portions of my schema but would prefer not to do that publically here. This is using spicedb 1.26.0 and not using cursors
v
Please consider using 1.28. Cursors would allow you to get a response faster; the thing with non-cursored LookupResources is that the server has to wait for all the subproblems to be solved before it can proceed. By using cursors you only focus on a narrow subset of the subproblems and can start processing the batches in your client application while you fetch the subsequent batches. If cursors are somehow not a viable option for you, then the options are:
- optimizing the schema
- looking into metrics to understand what needs to be tuned (e.g. connection pools in Postgres, or gRPC connections in Spanner)
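For illustration, a cursored call through the HTTP gateway might look roughly like this - the gateway address and token are whatever your deployment uses, and the JSON field names are my assumption based on the usual protobuf JSON mapping, so double-check against the API reference:
Copy code
curl -s -X POST http://localhost:8443/v1/permissions/resources \
  -H "authorization: Bearer dev" \
  -d '{
    "consistency": {"minimizeLatency": true},
    "resourceObjectType": "location",
    "permission": "read",
    "subject": {"object": {"objectType": "user", "objectId": "<user-id>"}},
    "optionalLimit": 100
  }'
# each streamed result should include an afterResultCursor; send it back as
# optionalCursor in the next request to fetch the following batch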
1.28 comes with a bunch of improvements for Spanner, including better observability
How many PUs has your Spanner allocated?
j
Ok, I have some questions to understand this: To clarify, by cursors you are referring to pagination cursors? I don't see anything in the 1.28.0 release notes around cursors. Was there some change or improvement in 1.28.0 around pagination? I also don't fully understand why the pagination cursors would help - if I require the entire response in order to respond upstream, wouldn't the time taken be the same + longer because of the added network overhead? I don't think the schema is particularly complicated, I will share the relevant portions below in a moment. I will upgrade to 1.28.0 for the Spanner improvements and report back. Our Spanner instance is only provisioned with 200 PUs right now, but I did not observe any impact when increasing this during testing. I am noticing weird traces, like in the attached image, where SpiceDB seems to complete work but the gRPC response completes much later. Not sure how best to interpret it. https://cdn.discordapp.com/attachments/1181536779006918676/1181600574303645746/image.png?ex=6581a649&is=656f3149&hm=dfa4bf8d944f688c41d9c5ce7bc1c7fe3c85ccb86378631a6ffcb46a3b81d042&
> ... if I require the entire response in order to respond upstream ...
For example, because I need to include them in a DB query later.
> please consider using 1.28
I am deploying using the spicedb-operator, which currently supports up to 1.26.0 - is it safe to override its default max and go to 1.28.0?
v
> Was there some change or improvement in 1.28.0 around pagination?
There were none related to cursors.
> if I require the entire response in order to respond upstream wouldn't the time taken be the same + longer because of the added network overhead?
That is correct, but you are also putting more strain on the server, and for a large enough response it may run out of memory. Cursors set a boundary on how much memory must be held during LookupResources. If all you need is the full response, cursors will not make things faster. I had assumed your application could work with pages of results, but it sounds like you need the full response.
> Our Spanner instance is only provisioned with 200 PUs right now
Expensive LookupResources with a 200 PU Spanner will almost certainly get throttled by Spanner - I recommend looking into SpiceDB metrics to determine if Spanner is the bottleneck.
> I am noticing weird traces, like in the attached image, where SpiceDB seems to complete work but the gRPC response completes much later. Not sure how best to interpret it.
I think this was already reported as a potential issue: https://github.com/authzed/spicedb/issues/1662
> I am deploying using the spicedb-operator, which currently supports up to 1.26.0 - is it safe to override its default max and go to 1.28.0?
Yes, because there are no relevant Spanner migrations AFAIK, but let me double-check. We should do a new operator release with 1.28.0. 1.27.0 should be available with the latest version of the operator IIRC.
> Yes, because there are no relevant Spanner migrations AFAIK, but let me double-check.
Actually no, there are migrations - a new implementation of the schema cache.
Let me see if we can quickly cut a release
j
> That is correct, but you are also putting more strain on the server, and for a large enough response it may run out of memory. Cursors set a boundary on how much memory must be held during LookupResources.
Gotcha, so in this case I don't think it's a factor, but definitely something to keep in mind as we scale up.
> Expensive LookupResources with a 200 PU Spanner will almost certainly get throttled by Spanner - I recommend looking into SpiceDB metrics to determine if Spanner is the bottleneck.
I did do some testing with variations of PU configurations and I don't think we are hitting Spanner limitations - we are still working with relatively low numbers here. I've done a bunch of tweaking to our deployment configuration and I've improved the performance slightly, but what's interesting is that it is super inconsistent: https://cdn.discordapp.com/attachments/1181536779006918676/1181609253761450036/image.png?ex=6581ae5f&is=656f395f&hm=a03eb208a80a0f93ead856f90a7a6d73fae73452c6ec0ddce1738101bb4781e9&
And notice a lot of random outlier spikes of over 800ms too
Let me get a schema reference quick
v
Are you running requests at a snapshot, or with minimize_latency?
I'm going to close the operator PR for now, since 1.28.0 cannot be released until it hits stable channel
instead I recommend setting
image
in your
SpiceDBCluster
object
the operator will do the right thing and run
migrate
for you
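Something along these lines, assuming the cluster object is named dev and that the image override lives under spec.config.image - worth double-checking the operator docs for the exact field:
Copy code
# hypothetical cluster name and field path - verify against the operator docs
kubectl patch spicedbcluster dev --type merge \
  -p '{"spec":{"config":{"image":"ghcr.io/authzed/spicedb:v1.28.0"}}}'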
j
Using minimize_latency but I can test at a snapshot if you think that will make a difference
v
it will, since cache will be usable for longer
but depending on how ok you are with stale reads, you can increase the "quantization window"
so that the caches are usable for a longer time
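For example, something like this on the serve command (10s is just an illustrative value; the default quantization window is 5s IIRC, and the conn-uri is a placeholder):
Copy code
spicedb serve \
  --grpc-preshared-key=dev \
  --datastore-engine=spanner \
  --datastore-conn-uri="projects/project-id/instances/instance-id/databases/database-id" \
  --datastore-revision-quantization-interval=10s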
j
I sent you a schema privately, I am running the query
lookup-resources location read user:<id>
Ok so I guess it's reasonable to assume that the cache expiring is manifesting in the spikes?
v
most likely, but that's the normal behaviour. If you run the request with
full_consistency
you'll see the true cost of your request, as cache won't be used
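For example, with zed, using the same lookup as before (<id> is a placeholder):
Copy code
# full consistency bypasses the cache entirely, so this shows the uncached cost
zed permission lookup-resources location read user:<id> --consistency-full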
j
Out in the wild we are actually using zedtokens, but for my load testing I am not - so I guess this won't be needed right? That would affect the minimize_latency consistency setting
I went ahead and did this - hard to tell for sure but I didn't notice much impact on performance
v
Well, at least you know that all the latest perf improvements are in place, particularly the singleflight datastore and dispatcher, which can save up to 33% of datastore accesses.
using
at_least_as_fresh
is the recommended way to call SpiceDB if you are keeping zedtokens around
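For example, passing whatever zedtoken you stored from the last relevant write (<zedtoken> and <id> are placeholders):
Copy code
zed permission lookup-resources location read user:<id> --consistency-at-least <zedtoken>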
j
Yea I think it's just hard to tell because I have changed so many variables over the last couple hours haha
Yup this is the current state out in the wild
I'll run some tests using that too now
v
What resource requests are you setting on the SpiceDB pods?
is it running with burstable or best effort QoS?
how many cores does each replica have?
Especially for schemas that require a lot of fan-out, having spare cores around is important. You should be able to see this with the newly exposed Go scheduler metrics: if they are stable, it means you are doing well compute-wise.
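For example, you could scrape SpiceDB's Prometheus endpoint and watch the scheduler metrics directly - the :9090 port is the default metrics address IIRC, and this assumes the standard Go runtime collector is enabled:
Copy code
curl -s http://localhost:9090/metrics | grep '^go_sched_'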
I have a bunch of stuff going on today but keep me posted, I'll get back to you eventually
j
We are running on GCP c2-standard-4 instances and each SpiceDB pod is guaranteed at least 3.5 of the 4 cores. There are no limits on CPU; each pod is configured with 2G of memory but is only using between 200-300Mi while under load.
Interesting to know - let me go digging for the metrics and maybe try on higher core nodes. When I originally posted this here we were running on significantly reduced compute (n2-standard-4 and only 200m guaranteed). I didn't suspect this as a main cause as I thought I was reproducing the issue locally but that was actually just misinterpreting my results.
v
4 cores should probably be fine. Something SpiceDB does not do is set GOMAXPROCS. Have you set that to the number of cores allocated?
j
I haven't explicitly set it, but looking at the
go_sched_gomaxprocs_threads
metric it seems to be reported as 4. Can I trust this?
Running this locally I get very stable performance with responses in 20-30ms. Do you think the large variance in performance on our prod is largely attributable to available compute?
v
> I haven't explicitly set it, but looking at the go_sched_gomaxprocs_threads metric it seems to be reported as 4. Can I trust this?
Yes, that's the right metric.
> Running this locally I get very stable performance with responses in 20-30ms. Do you think the large variance in performance on our prod is largely attributable to available compute?
Well, you are using Postgres locally but Spanner in GCP -
as I mentioned, I'd suggest looking into the Spanner metrics
also if you are running 1.28, I suggest enabling the experimental schema cache flag, it will make things faster, but set the heartbeat to something like 2 seconds
the less roundtrips to DB the better
j
Ok, let me give this a try
v
I don't see anything in your schema that could be optimized, at first glance. It's mostly unions, and multiple levels of nesting. It's a matter of whether you have a very wide relation (e.g. something like the parent of a resource having thousands of elements, and a permission doing parent->permission_in_the_parent)
Please note this is only available on Spanner and CockroachDB, not on Postgres or MySQL.
j
Ok is this the
--enable-experimental-watchable-schema-cache
flag? And then also in conjunction with
--datastore-schema-watch-heartbeat
?
v
correct
j
Does this affect read performance even if there is no write load?
v
the watchable schema?
it's precisely for read workloads
basically it listens to a change feed from Spanner to determine if the schema has changed, so that read requests don't have to load the schema definitions each time
if you do change the schema with
WriteSchema
then the cache will back off until it has caught up with the new schema revisions
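Putting the two flags together, roughly (heartbeat per the 2s suggestion above; the conn-uri is a placeholder, and this only applies to the Spanner/CockroachDB datastores):
Copy code
spicedb serve \
  --grpc-preshared-key=dev \
  --datastore-engine=spanner \
  --datastore-conn-uri="projects/project-id/instances/instance-id/databases/database-id" \
  --enable-experimental-watchable-schema-cache \
  --datastore-schema-watch-heartbeat=2s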
j
Ok, I've done quite a lot of debugging and I am starting to run into a wall. Current status:
+ Running 10 SpiceDB (1.28.0) pods with 4 cores, 2GB RAM on c2-standard-4 nodes
+ Scaled Spanner to 2000 PUs
+ Enabled the schema cache
+ Running the lookup query for ~1000 resources (tiny volume)
+ Setting an at_least_as_fresh consistency token
Seeing random frequent spikes of 5s+ (sometimes 8s+) for queries against SpiceDB. Example, see attached image: https://cdn.discordapp.com/attachments/1181536779006918676/1181945533691084800/image.png?ex=6582e78e&is=6570728e&hm=c29284b5290132333bd66c81f6076762bc1fc23e2b68b4d9d0fc1a18caf86854&
I'm at a complete loss as to how to proceed with debugging this
I've waaay overprovisioned my resources to completely eliminate that as a cause. Querying over very small quantities of resources. I'm sure this isn't expected behaviour, but everything points at spicedb being the cause here. Can I provide any information/metrics to help identify the issue?
v
You could do
zed backup
and send it over to us.
What are you using to do the lookup resources?
zed
?
j
I doubt this is anything to do with the actual data, I'm sure this is some configuration issue or issue with the environment I am running on. I'm not even entirely sure this is spicedb's fault, but the slow trace goes all the way into the traces exposed by spice
v
Please note that LookupResources is a streaming API, so the latency of the call will be as long as the stream remains open. I assume this is not your case, but it's relevant because in some use cases the API is fast but the call looks "longer" because the client is doing processing as it receives the values and holds the stream open.
It could even be a load balancer issue
(load balancer keeping the stream open)
j
That is actually my current leading theory... but the thing that is throwing me off is that SpiceDB 'received' the request immediately, but the client only starts receiving results many seconds later, as indicated by the events on the receive span
the connection is happening directly to a Kubernetes service, so there's no load balancer in the way other than internal k8s networking
v
It could be something with the data. The computation time is proportional to both the schema and the data shape. So a specific data shape could be surfacing some unoptimized codepath
j
But if this was the case, wouldn't we expect to see this issue for all requests?
v
I don't see a reason why 1000 results should take that long, unless it's a pathological case that is really forcing to traverse the whole dataset
j
or at least all requests that miss the cache?
v
not necessarily, it really depends on the dataset. This is a graph traversal. For some subject/resources it may require the evaluation of a smaller dataset compared to other paths.
e.g. imagine an organization with 1,000,000 members, versus an organization with 1000 members
have you tried running
check --explain
?
j
In our case we have a total of like 3000 relationships across all resource types
v
it will tell you visually which bits are being slow
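For example, picking one of the resources returned by the lookup (<location-id> is a placeholder):
Copy code
zed permission check --explain location:<location-id> read user:acc_fR2nH4wdUkFmo4ZG1bdQGAjsjbDe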
j
And we are seeing this issue for the same request, in that the exact parameters of the request are the same - so the traversal path, I would imagine, would be the same
and in some cases it resolves fine (20ms) and in others it will spike to many seconds
I'm becoming more and more convinced this is just some networking bug
v
and if you do
full consistency
semantics, does it always reproduce the issue?
j
no
To be honest I am struggling to reproduce this issue in isolation
it's only appearing in the wild
Actually, let me double check this
v
Well, that would in theory discard the graph traversal computation on SpiceDB as the issue, but we cannot discard some synchronization issue among the different parts of the LookupResources code. EDIT: but then I would expect more of a deadlock kind of scenario, and then it would time out rather than take long to respond.
have you been able to reproduce this locally?
I'm trying to see what's going on with the OTel traces locally
I don't seem to be able to reproduce the spans issue locally, at least with CockroachDB
j
But the issue does happen when just pulling from the cache and not hitting spanner, I think? Because it occurs even when specifying a zookie
But definitely easier to reproduce in this configuration
Nope, only on our cluster
Also, this is the first example where I have seen more spans in the trace
v
right, I think you don't see spans because when there is a cache hit, you will miss any datastore access
but it seems to suggest that something is going on that does not involve I/O
or if it does it is not appearing in the trace
j
I can share our spice backup with you
v
have you tried using the spanner emulator locally?
j
nope didn't know that was a thing!
v
yeah, it's deliberately slow and does not support concurrency, but it exists
you can see it being used in SpiceDB integration tests
and you can spin it up locally with the gcp CLI
j
How likely is this to be a spanner issue do you think
I'll try get it setup and see
v
I don't think it's necessarily an issue with Spanner but I cannot rule out SpiceDB's implementation of Spanner
each datastore has its own bespoke implementation
did you get the chance to look into prometheus metrics on the SpiceDB-Spanner client?
I wonder if it's running out of connections. It's not something you'll necessarily see unless you scrape at a very fast interval, like 1s
and yeah if you can provide a backup I think that would help
j
--datastore-conn-uri="projects/project-id/instances/instance-id/databases/database-id"
- what do I use here for local spanner?
v
so like all of that you have to create with the GCP CLI
j
Let me get you that backup
v
it will create a project for you
and then inside of that project you'll create an instance
let me see if I have some commands handy
j
I have metrics collection enabled - what metrics are you interested in viewing?
Oh, I was using the Docker container. I can switch to the CLI though
v
this is what I have in my .zprofile
Copy code
spanner() {
    # start the Cloud Spanner emulator (gRPC on 9010, REST on 9020)
    docker run -d -p 9010:9010 -p 9020:9020 gcr.io/cloud-spanner-emulator/emulator
    # switch gcloud to the emulator configuration (it must already exist - see below)
    gcloud config configurations activate emulator
    # create an instance and a database for SpiceDB against the emulator
    gcloud spanner instances create test-instance --config=emulator-config --description="Test Instance" --nodes=1
    gcloud spanner databases create spicedb --instance=test-instance
    # switch back to the default gcloud configuration
    gcloud config configurations activate default
}
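With that in place, pointing SpiceDB at the emulator would look roughly like this - the project id matches whatever you set in the emulator gcloud configuration, and SPANNER_EMULATOR_HOST is the standard env var the Spanner client libraries honor:
Copy code
# run `spicedb migrate head` with the same datastore flags first
SPANNER_EMULATOR_HOST=localhost:9010 spicedb serve \
  --grpc-preshared-key=dev \
  --datastore-engine=spanner \
  --datastore-conn-uri="projects/your-project-id/instances/test-instance/databases/spicedb"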
j
sweet let me try that
Ah you have some custom gcloud config
ERROR: (gcloud.config.configurations.activate) Cannot activate configuration [emulator], it does not exist.
Can you share that too?
v
huh
check if the instructions there help
it has this
Copy code
gcloud config configurations create emulator
gcloud config set auth/disable_credentials true
gcloud config set project your-project-id
gcloud config set api_endpoint_overrides/spanner http://localhost:9020/
definitely can't seem to reproduce it with Cockroach locally. Always within < 150ms
j
I CAN REPRODUCE IT LOCALLY!
haha with the spanner emulator
I have never been so happy to see a bug
ok hold on let me just triple-check I'm not misinterpreting anything
v
hah amazing
j
Interestingly, my local spanner has all spans properly exporting
Issue seems to present the same
v
are you running the same spicedb version?
spanner traces were introduced in 1.27
j
yup, this is SpiceDB 1.28.0 with the Spanner datastore - same as what we are running in prod
Copy code
Summary:
  Total:    41.3156 secs
  Slowest:    1.9387 secs
  Fastest:    0.2805 secs
  Average:    0.4968 secs
  Requests/sec:    120.5356

  Total data:    174894300 bytes
  Size/request:    35190 bytes

Response time histogram:
  0.281 [1]    |
  0.446 [4410]    |■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■
  0.612 [19]    |
  0.778 [0]    |
  0.944 [0]    |
  1.110 [0]    |
  1.275 [0]    |
  1.441 [117]    |■
  1.607 [85]    |■
  1.773 [172]    |■■
  1.939 [166]    |■■


Latency distribution:
  10% in 0.3342 secs
  25% in 0.3428 secs
  50% in 0.3553 secs
  75% in 0.3738 secs
  90% in 1.3484 secs
  95% in 1.7319 secs
  99% in 1.8761 secs
Load test summary - the distribution matches what I am seeing in prod. Same rough proportion of random spikes
v
just note that the emulator does not support concurrency so it may be a red herring
j
Actually, certain paths present worse in prod
hmm
fair
No I think I can reproduce this with a concurrency of 1
v
Copy code
Notable limitations:


The emulator only allows one read-write transaction or schema change at a time. Any concurrent transaction will be aborted. Transactions should always be wrapped in a retry loop. This recommendation applies to the Cloud Spanner service as well.
it's actually only for writes
which I don't think is your case
j
yea, this is reproducible even with sequential requests
I can reproduce this with
zed
cli too
hyperfine 'parallel -j {threads} -n0 zed permission lookup-resources --skip-version-check location read user:acc_fR2nH4wdUkFmo4ZG1bdQGAjsjbDe ::: {1..{threads}}' -P threads 5 20 -D 5
Copy code
Benchmark 4: parallel -j 20 -n0 zed permission lookup-resources --skip-version-check location read user:acc_fR2nH4wdUkFmo4ZG1bdQGAjsjbDe ::: {1..20}
  Time (mean ± σ):     502.7 ms ± 657.9 ms    [User: 431.6 ms, System: 471.7 ms]
  Range (min … max):   288.6 ms … 2375.0 ms    10 runs
See the massive min->max range
v
yeah I'm just not sure how much of this is due to the Spanner emulator design
you say that it's even worse in your prod env?
j
yup it's much worse in prod
It happens much more frequently and often with 10s+ spikes
v
and are you doing full consistency, or would any request do it?
j
in these tests I am reproducing with any consistency setting
v
cool, then it sounds like we have a solid repro
and does it reproduce by just even issuing individual
zed
calls? Or does it have to be a load test?
Because I'm certainly not reproducing this locally with CRDB. It's always < 150ms.
j
It does, but interestingly the spikes are smaller
With a concurrency of 20 I see spikes of 1-2s, with a concurrency of 1 I see spikes of 500-700ms
v
yeah, as I mentioned, the streaming of responses is part of the trace
j
and this is just with zed
v
so the client is part of the equation. A "slow" client will make the request appear slower
j
Everything is fine and then a spike hits.
The command used:
Copy code
hyperfine 'parallel -j 20 -n0 zed permission lookup-resources --skip-version-check location read user:acc_fR2nH4wdUkFmo4ZG1bdQGAjsjbDe --consistency-at-least=GhUKEzE3MDE4NzI5MDAwMDAwMDAwMDA= ::: {1..20}' --runs 1000
hyperfine is just the load tester - then just executing
zed
cli with a configurable concurrency
and this is with a consistency token - still occurs
It also seems that my CPU spikes across all cores when a spike occurs. That's just a non-scientific eyeball observation
v
try with full consistency
I'm seeing something similar locally, but with
minimize_latency
I get a 500ms spike when the quantization window elapses and SpiceDB discards the caches
this may be an issue with
ReverseQueryRelationships
. I see the queries taking 60ms
there may be a missing index
have you looked into Spanner's profiler?
j
ah, yea, sorry you're right the
--consistency-full
triggers it every request. I guess that means it's just the emulator being slow and not actually what I am seeing in prod?
Nope - not super familiar with spanner in general
v
the spanner profiler would show you if certain queries are suboptimal. It would show what the query planner did and where time was spent
j
The spanner emulator is burning CPU while running this locally with --consistency-full
maxed out
v
well so far I don't see anything abnormal in the trace other than queries being slow. There is room for some optimization here and there, but some specific query access patterns seem slow, and it's not clear if that's a spanner emulator thing, or a problem with how it's currently implemented
even the checks are relatively slow. If I pick one of the subjectIDs returned by that
LookupResources
and invoke
CheckPermission
it leads to 200ms requests, where Spanner queries are taking somewhere between 20ms and 70ms
j
I'm going to setup a test on spanner cloud and compare with pgsql
If this reproduces on spanner but not pgsql then I'd say it's fair to assume it's an issue with the spanner data store
Ok @vroldanbet I'm able to reproduce the behaviour I am facing quite reliably in an isolated environment with both Postgres and Spanner as backing datastores. My configuration is as follows:
- SpiceDB running on one 8-core 16G VM
- Cloud Spanner with 1000 PUs, or
- a Postgres server with an 8-core 32G VM
- The same dataset I shared with you (querying a result set of around 1.1k)
Executing the following command:
Copy code
parallel -j 1000 -n0 'hyperfine --runs 50 "zed permission lookup-resources --skip-version-check location read user:acc_fR2nH4wdUkFmo4ZG1bdQGAjsjbDe --consistency-at-least GhUKEzE3MDE5MDM4OTAwMDAwMDAwMDA="' ::: {1..10}
And adjusting the concurrency from 1..x. Concurrency of 1
Copy code
Time (mean ± σ):      46.8 ms ±  32.9 ms    [User: 27.0 ms, System: 20.5 ms]
  Range (min … max):    34.5 ms … 280.5 ms    100 runs
Concurrency of 10
Copy code
Time (mean ± σ):     133.4 ms ±  76.0 ms    [User: 28.9 ms, System: 18.9 ms]
  Range (min … max):    68.5 ms … 511.9 ms    50 runs
Concurrency of 20
Copy code
Time (mean ± σ):     257.4 ms ± 124.4 ms    [User: 34.9 ms, System: 16.7 ms]
  Range (min … max):    70.4 ms … 760.2 ms    50 runs
Concurrency of 50
Copy code
Time (mean ± σ):     707.5 ms ± 481.3 ms    [User: 36.9 ms, System: 19.8 ms]
  Range (min … max):    81.2 ms … 2365.0 ms    50 runs
Concurrency of 100
Copy code
Time (mean ± σ):      1.307 s ±  0.987 s    [User: 0.038 s, System: 0.022 s]
  Range (min … max):    0.000 s …  4.543 s    50 runs
These results were all against Spanner, but using Postgres shows rather similar results. Do these results seem expected or surprising? I find the rapid increase in p90 latency a bit surprising. I also find the latency when using
--consistency-full
being around 200-300ms surprising, given the low volume of data. It seems SpiceDB doesn't deal with concurrency very well, even on relatively beefy VMs. Thoughts?
v
I'm out of office, perhaps the rest of the team can help: @Joey @ecordell
j
Some load testing results for both spanner and postgres. Would love if someone can take a look and confirm if they match expectations for spicedb performance. https://cdn.discordapp.com/attachments/1181536779006918676/1182262317073842236/results.tar.gz?ex=65840e95&is=65719995&hm=c1f9f61575ba3ffe3fc613276e3ed0bb526b9af94e80e254bf01377d2ae51663&
j
@julienvincent you're overwhelming the single SpiceDB pod
LR is a heavy call and if you're spawning 100+ of them, you're likely simply running into contention
j
@Joey In the above the 100x concurrency was just for completeness. We are seeing multi-second latencies when running at a concurrency of ~20-30 spread across 4+ Pods all with 4 cores each - for sample sizes in the region of 1000 resources. If this is simply contention then I don't understand how to realistically proceed with SpiceDB as a solution for ACL-aware filtering.
The performance being observed here doesn't make sense to me for the small scale we are operating at. Hence I feel something must be wrong.
For example:
Copy code
~$ time parallel -j 2000 -n0 'curl -X POST -d "@./data.json" -H "authorization: Bearer dev" http://localhost:8443/v1/permissions/resources --silent | wc -l' ::: {1..3}
3227
3227
3227

real    0m1.463s
user    0m0.150s
sys    0m0.110s
3 concurrent requests reduces this instance to a 1.5s latency
j
that doesn't match the times you posted above though
then again, the above could be using the cache, so I don't know what changed between your runs
j
In this particular example I tripled the number of resources being returned: 1k -> 3k
j
ah, that makes sense then
especially if you have any intersections, exclusions or caveats in your schema
j
So this is expected performance? What is the recommendation then for actually practically using this API in a real system? It's not really an option to accept multi-second latencies for simple list calls.
I'll share my schema with you in a mo so you can take a look
j
for one, use pagination
loading 3000+ elements is not going to be really "useful"
what, exactly, are you trying to do with the output?
j
ACL-aware filtering. For which in order to do we would need the entire set of resources a user has access to
I don't think pagination helps in this case - we still need to fetch that full set of ids. Pagination would just make it slower by adding network latency, unless I misunderstand what you are suggesting
j
how are you using the IDs?
because imagine it was 300K or 3M IDs instead of 3K
j
in an intersection query against the database with our resources
j
and is this used to display all resources the user has visibility into
or another part of a search?
j
we expose several APIs which allow querying resources, filtering on the properties of those resources, and returning the results in a paginated way.
in order to support pagination we need to know the full set of resources the person has access to up front
The alternative would be to load all resources from the DB and make check calls for each one against spice - which just seems worse
j
not if the working set is smaller
if the user searches for "foo", and that has 10 resources found, bulk checking will be faster
j
well, if the user searches foo and there are 3000 matches but they only have access to 10, and it's the last 10 sequentially, we would need to load all 3000 resources and bulk check them
j
correct, but that's likely still faster than filtering on 3K items if the user has access to that many outside of the match
j
but we have no way to determine up front if any particular user has access to all or only a subset without asking the question
j
sure
j
so the performance of each approach completely depends on each individual users level of access
j
correct
j
So.. I don't know how to solve this without something like https://github.com/authzed/spicedb/issues/207
j
that's why that issue exists 🙂
at some point, either pre-filter or post-filter reaches a point of diminishing returns
j
What do people currently do out in the wild
to work around this issue
j
post filter, mostly
using bulk check
j
damn
j
because most of the time, when issuing a search, the number of results to be checked will be lower than loading "everything this user can see"
imagine if you were at GitHub size and asked "which repos can a user view"
it could be 10s of millions
j
Yea no of course - but for our current scale I was assuming this would be good enough until #207 lands
j
3K results shouldn't take as long as you're seeing, mind you
j
But it seems not
j
but I don't have insight into your cluster to know where the slowdown is
your schema could just be very deep or very wide
or both
j
I shared the schema with you fyi in private chat
j
sure, but that doesn't show me how wide it is or deep
a cursory look suggests it could be quite on both fronts
j
Would the idea here be something like - load a page out of the db, bulk check it, use the subset to construct a response page, if the response page is still too small - load more and repeat?
j
yes
the risk, of course, is that you could have to check many "pages" to find the next result
j
Yea
j
do you see the same LR lag on both spanner and Postgres?
j
Yes
It's slightly better on postgres though
j
that implies that it is indeed deep or wide (or both)
j
Do you mean by way of the number of resources in the relationship 'path'? Other than the target resource, which is around 1.1k resources, we only have around 10-20 resources for all other types
j
sure, but how long is the path to get from the user to each of those resources?
j
The problem here, though, is that this only works for cursor pagination. Most of our APIs use offset-based pagination (because they are servicing frontend tables and such)
So I'm not sure there really is a way to do post-filtering for an offset-based paginated API?
at most 4
j
hrmph - something seems wrong then
are the SpiceDB pods running on the same node?
j
nope, each have their own node with 4 cores
But, my reproduce case was on an isolated VM and backing datastore
single spice process, single VM (4 cores) single spanner (with any PU config, it made no difference).
Even low concurrency would very quickly jump the latency
j
that's not surprising - 4 cores is likely being overwhelmed
j
Fair, well I'd be happy to try out the post-filtering approach - but would need a solution here ^
j
you'd bulk check the results from the pagination from your source DB
j
But that only works for cursor pagination
Or do you mean bulk check the entire resultset?
in which case there is no point in pagination
j
I'm not following
you have some API that is paginated that is matching the search filter(s)
you get back the first "page" of results, bulk check, send forward any matching, iterate
and construct a cursor to represent where you left off last time
j
Yes, so that's cursor pagination. Our API's are mostly offset based pagination
If I made a query like:
{filter: {}, paginate: {offset: 100, limit: 10}}
then I would need to re-iterate over all previous results in order to respond
So I don't think there is a practical way to have an offset based paginated API if we used post-filtering
j
you'd need to wrap the API in a cursor-based API, yes
but the underlying one can still be offset based, assuming it has a stable ordering
if not, all bets are off
but then, all bets are off anyway
j
But.. how?
Not if you had the set of ids you have access to up front 😛
j
like I said above
IF the underlying API is stable
then you just bulk check until you find the
limit
items and store the offset in the returned cursor
then when invoked again with the cursor, you resume calling the underlying API and continue bulk checking
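A rough shell sketch of that shape, just to make it concrete - the search endpoint, page size, and jq paths are made up for illustration, and it assumes zed permission check prints true/false:
Copy code
limit=10; offset=${RESUME_OFFSET:-0}; found=0    # offset comes from the caller's cursor
while [ "$found" -lt "$limit" ]; do
  # next page of candidate ids from the (stably ordered) upstream search API
  ids=$(curl -s "http://localhost:3000/search?q=foo&offset=$offset&limit=50" | jq -r '.ids[]')
  [ -z "$ids" ] && break                         # ran out of candidates
  for id in $ids; do
    # post-filter: keep only resources the user can read (a bulk-check API would batch these)
    if [ "$(zed permission check location:"$id" read user:acc_fR2nH4wdUkFmo4ZG1bdQGAjsjbDe)" = "true" ]; then
      echo "$id"; found=$((found + 1))
      [ "$found" -ge "$limit" ] && break
    fi
  done
  offset=$((offset + 50))
done
echo "next cursor encodes offset=$offset"        # plus position within the last page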
j
Well, it's more complicated than that. You can't support the same kinds of sorting in a cursor based paginated API. Everything you are sorting over has to be unique
So the main reason we have an offset based API is to support sorting over arbitrary fields
We wouldn't be able to support that if we built it on top of an underlying cursor based paginated api
j
sort of - cursors can support arbitrary fields, they just need to have a defined ordering between calls with the same cursor
j
Fair... is that true of most DBs?
that sounds like something that would be undefined behaviour
or maybe not
I guess if they didn't that would break offset based pagination too
j
yeah
I would like to see why your LR calls are so slow - if there are indeed only 3 hops
it should be fast
j
I've got to run now - hard stop coming up, but I can be available for debugging this whenever you like
This convo was super useful, I will be thinking about this a lot and will try to play with this post-filter approach
thanks
v
This is slightly tangential, but since we are discussing pagination: unless you are using some database that handles offset queries in a magical way, offset pagination does not scale, because each query forces the database to load all the rows before the offset. At a certain query rate you'll hit a wall with it. See https://use-the-index-luke.com/no-offset. SpiceDB itself does cursor-based pagination with different ordering criteria.
j
Yea I am aware of those performance issues - we were using offset due to the misconception around sorting with cursors.
Having played with and thought about the post-filtering approach, I don't think it's going to be a perfectly viable approach:
+ We have lots of cases where we have lots of data and users only have partial access
+ We need to know totals up front, which would always require a full table scan
We need some way to pre-filter data in the db query, but yea the performance of
LookupResources
just makes that completely unviable. If there was some way to filter the results from SpiceDB based on relationships or properties on the resources this would help a lot to mitigate this.
v
I'm afraid I don't have the bandwidth right now to look into what's making your LookupResources calls slow. And even if we made them faster, without setting some upper boundary on the size of your LR responses, there will always be an LR response large enough to make this overall slow. At some point even the
WHERE IN
clauses in your database will become very slow or cause load spikes that will affect every other workload relying on it.