# spicedb
w
👋 I've just made a `schema write` where I removed 3 permissions. I was expecting this to be instant, but it took ~3min, during which I observed a huge increase in load on my (Postgres) database due to this query:
```sql
SELECT namespace, object_id, relation, userset_namespace, userset_object_id, userset_relation
FROM relation_tuple
WHERE created_transaction <= $1
  AND (deleted_transaction = $2 OR deleted_transaction > $3)
  AND namespace = $4
  AND relation = $5
LIMIT ?
```
How come removing a permission requires a DB query? And how come it is so expensive? My DB CPU jumped from ~10% to 50-60% for 3 minutes. This was my staging environment; I'm concerned about deploying this to production :/
j
When you say 3 minutes -- was it 3 minutes until clients saw the change, or 3 minutes for the `schema write` to return?
w
The latter: `zed schema write` took 3 minutes to return
j
what version of spicedb? just to be sure
also have you configured `--datastore-revision-quantization-interval` to be something other than the default?
j
@williamdclt three permissions or three relations?
w
permissions
> what version of spicedb? just to be sure

1.7.1

> also have you configured `--datastore-revision-quantization-interval` to be something other than the default?

it's explicitly at 5s
j
The reason that query is run is to ensure that removing a relation doesn't leave dangling data on it
We probably need to make it skip for permissions, since they cannot have data now
As for why the query takes so long, that is a good question, especially since it is `LIMIT 1`
I imagine writes were occurring at the same time?
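
For context, that check boils down to an existence probe over the live tuples of the relation being removed (a single row coming back presumably means the delete is rejected). A rough sketch, with illustrative values matching the EXPLAIN further down:

```sql
-- Does any live tuple still use the relation being removed?
-- 9223372036854775807 (max int64) appears to be the "never deleted" sentinel;
-- 12666087 stands in for the write's snapshot transaction.
SELECT namespace, object_id, relation,
       userset_namespace, userset_object_id, userset_relation
FROM relation_tuple
WHERE created_transaction <= 12666087
  AND (deleted_transaction = 9223372036854775807   -- row is still live
       OR deleted_transaction > 12666087)          -- or deleted after the snapshot
  AND namespace = 'agency'
  AND relation = 'caregiver_or_manager'
LIMIT 1;  -- one row is enough to prove the relation still has data
```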
j
ah yeah it's definitely the sanity check
There have been some pretty big changes/optimizations to the pg datastore since 1.7.1 which makes the perf harder to reason about in retrospect
@Joey should I create an issue for skipping permissions in the sanity check?
j
yep
I'll address that later today, since it is a small fix
w
> I imagine writes were occurring at the same time?

Yes, although not many. Thanks for the fix guys 🙂 appreciate it!
j
It should be a simple one to backport, but I'd suggest upgrading when the fix goes out
There are lots of performance fixes between 1.7.1 and 1.10.0/HEAD
j
@williamdclt could you do me a favor if you get a moment and run an `EXPLAIN` on your Postgres for that query?
I'd be curious to see what its doing on your install
w
```sql
Limit  (cost=0.69..4677.70 rows=1 width=129) (actual time=56.761..56.762 rows=0 loops=1)
  ->  Index Scan using uq_relation_tuple_living on relation_tuple  (cost=0.69..135634.24 rows=29 width=129) (actual time=56.760..56.760 rows=0 loops=1)
        Index Cond: (((namespace)::text = 'agency'::text) AND ((relation)::text = 'caregiver_or_manager'::text))
        Filter: ((created_transaction <= 12666087) AND ((deleted_transaction = '9223372036854775807'::bigint) OR (deleted_transaction > 12666087)))
Planning Time: 0.410 ms
Execution Time: 56.790 ms
```
This is with fairly random params though:
```sql
EXPLAIN ANALYZE SELECT
    namespace,
    object_id,
    relation,
    userset_namespace,
    userset_object_id,
    userset_relation
FROM
    relation_tuple
WHERE
    created_transaction <= 52666087
    AND (
        deleted_transaction = 9223372036854775807
        OR deleted_transaction > 52666087
    )
    AND namespace = 'agency'
    AND relation = 'caregiver_or_manager'
LIMIT 1;
```
I'm also forever suspicious of having a different plan when it's a prepared statement too
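
One way to test that suspicion (a sketch, assuming PostgreSQL 12+ for `plan_cache_mode`) is to force the generic plan a prepared statement would eventually settle on and compare it against the custom plan above:

```sql
-- Force the generic plan instead of the custom plan that EXPLAIN
-- builds when it sees literal values (PostgreSQL 12+).
SET plan_cache_mode = force_generic_plan;

PREPARE probe (bigint, bigint, bigint, text, text, bigint) AS
  SELECT namespace, object_id, relation,
         userset_namespace, userset_object_id, userset_relation
  FROM relation_tuple
  WHERE created_transaction <= $1
    AND (deleted_transaction = $2 OR deleted_transaction > $3)
    AND namespace = $4
    AND relation = $5
  LIMIT $6;

EXPLAIN ANALYZE EXECUTE probe
  (52666087, 9223372036854775807, 52666087, 'agency', 'caregiver_or_manager', 1);
```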
j
hmmm
seems pretty quick
could indeed be the consistency level
> I'm also forever suspicious of having a different plan when it's a prepared statement too

yeah
j
56ms is actually pretty slow for that
a filter over 29 rows returned by the index takes 56ms? something seems wrong
that definitely should be a single digit millisecond query
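
If it reproduces, a sketch of the next diagnostic step: adding `BUFFERS` to the `EXPLAIN` shows whether the index scan is spending its time on shared-buffer hits or on actual disk reads, which would help explain tens of milliseconds for so few rows:

```sql
-- BUFFERS reports shared-buffer hits vs. reads for each plan node.
EXPLAIN (ANALYZE, BUFFERS)
SELECT namespace, object_id, relation,
       userset_namespace, userset_object_id, userset_relation
FROM relation_tuple
WHERE created_transaction <= 52666087
  AND (deleted_transaction = 9223372036854775807
       OR deleted_transaction > 52666087)
  AND namespace = 'agency'
  AND relation = 'caregiver_or_manager'
LIMIT 1;
```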
j
I meant compared to 60+s