Google-Scale Authorization: Getting to 1...
# spicedb
And can you also confirm that you are measuring the grpc_server_handled_total metric in these tests?
j
the rightmost, yes
but note that the size of the nodes matters quite a bit too
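(For context on the metric mentioned above: `grpc_server_handled_total` is the counter exposed by the standard go-grpc-prometheus server interceptor. A rate query over it, and a p95 query if the handling-time histogram is enabled, might look like this — label names per that interceptor, panel layout assumed:)

```promql
# handled RPS, split by gRPC status code
sum by (grpc_code) (rate(grpc_server_handled_total[1m]))

# p95 server-side handling latency (requires the handling-time histogram
# to be enabled on the interceptor)
histogram_quantile(0.95, sum by (le) (rate(grpc_server_handling_seconds_bucket[1m])))
```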
m
what is the cpu and memory req and limit that you defined for each pod
can you please share that
for my pods, I am doing request cpu: 500m and limit: 4. Does it matter to the Go runtime if I set the cpu request to 4 from the outset?
j
I don't recall offhand but it more or less should be the full resources of the node
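(A sketch of what "more or less the full resources of the node" could look like for the SpiceDB pod, assuming a 4-vCPU / 16 GiB node — all values here are illustrative, not the actual config. Setting requests equal to limits also gives the pod the Guaranteed QoS class, which helps it avoid eviction:)

```yaml
resources:
  requests:
    cpu: "3500m"      # leave a little headroom for kubelet/system daemons
    memory: "12Gi"
  limits:
    cpu: "3500m"      # requests == limits -> Guaranteed QoS class
    memory: "12Gi"
```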
m
do you dedicate the full VM node to a single pod?
j
yes
m
Aha
j
for SpiceDB ones, yes
for the control plane, no
m
and did you have to play around with the GOMAXPROCS setting ?
I would guess no, since a VM node == k8s pod in your case
j
I don't believe so
m
ok this is going to be pretty tricky now
j
yeah, you don't want SpiceDB preempted
this is why we offer our Dedicated product - we do all this for you 😄
m
$$$ = peace of mind 🙂
let's see how this goes
I did a simple test with 34 million relationships selected at random, and at about 3k RPS the p95 gets choppy and starts climbing. My test increased the request rate every 10 mins, so I am trying to figure out how to optimize this now https://cdn.discordapp.com/attachments/1179622178824667156/1179629593989230662/Screen_Shot_2023-11-29_at_9.45.00_PM.png?ex=657a7aab&is=656805ab&hm=863769aede9b02ff93f905ccbd89f602cae1e634d523909491b900162eeeb971&
our current RBAC system has a P95 of 3ms so this might become a problem
I think it might be related to crdb
as its latency jumps up after a certain RPS
j
you likely need to scale it too