It suspect there s a memory leak in SpiceDB My pods are regu SpiceDB #spicedb

It suspect there's a memory leak in SpiceDB. My po...

williamdclt

10/12/2022, 11:00 AM

It suspect there's a memory leak in SpiceDB. My pods are regularly restarting (36 times in 5d18h), with memory going very high pretty quickly. That only started happening since I upgraded to v1.13.0

williamdclt

10/12/2022, 11:00 AM

For info, my

resource

configuration:

Copy code

resources:
    requests:
      cpu: 500m
      memory: 1Gi
    limits:
      cpu: 1000m
      memory: 2Gi

williamdclt

10/12/2022, 11:01 AM

Extended graph. - First yellowish zone is v1.11.0 - Purpleish zone is v1.12.0 - Second yellowish zone is v1.13.0 (same zone as the purpleish zone of the first screenshot above, confusingly)

vroldanbet

10/12/2022, 11:11 AM

maybe potentially related to the new way to compute cache size using percentages? By default it uses a 75% of available memory to the container. A way to determine if that was the contributing factor would be to change from the default 75% to a static memory value

vroldanbet

10/12/2022, 11:12 AM

It could be that perhaps the newly introduced library is not respecting cgroup limits

vroldanbet

10/12/2022, 11:12 AM

cc @jzelinskie

vroldanbet

10/12/2022, 11:13 AM

(could be totally unrelated to the new percentage-based cache sizes, but it's worth ruling it out with the exercise outlined above)

williamdclt

10/12/2022, 11:25 AM

I'm happy to try it: could you suggest a reasonable static memory value here?

vroldanbet

10/12/2022, 11:27 AM

well I'd say set it to 75% of the resource request, that'd me 750mib?

vad8615

10/12/2022, 1:06 PM

@vroldanbet are you talking about

--dispatch-cluster-cache-max-cost

(which defaults to 70%) or

--dispatch-cache-max-cost

(default=30%) ?

vroldanbet

10/12/2022, 1:07 PM

actually both.

vad8615

10/12/2022, 1:08 PM

we're experiencing memory issues too, but I don't know if related to latest version, since we just enabled spice in production (for a small feature, we'd like to expand usage too)

vroldanbet

10/12/2022, 1:08 PM

did you experience memory issues after setting those 2 flags?

vroldanbet

10/12/2022, 1:08 PM

any chances you can get a pprof dump?

vad8615

10/12/2022, 1:09 PM

no, those flags are not set, but as I wrote earlier at every restart I get different values. Even with small values like:

vad8615

10/12/2022, 1:10 PM

it ends up OOMing

vad8615

10/12/2022, 1:10 PM

Copy code

go_memstats_mallocs_total{job="spicedb-production-metrics"} - go_memstats_frees_total{job="spicedb-production-metrics"}

vroldanbet

10/12/2022, 1:11 PM

it's expected you get different values, since the percentage of free memory in the machine would vary each time spicedb starts up

vad8615

10/12/2022, 1:11 PM

(memory request is 1GB)

vroldanbet

10/12/2022, 1:11 PM

👀

vad8615

10/12/2022, 1:12 PM

yes but should always be at least 1GB -> thus 70% ~ 700MB

vroldanbet

10/12/2022, 1:12 PM

is that running in your machine or in prod? I experienced something similar in my machine. Maybe the library we are using is not reliable

vad8615

10/12/2022, 1:13 PM

vad8615

10/12/2022, 1:13 PM

prod

vroldanbet

10/12/2022, 1:13 PM

so in that graph request and limit is 1GB, correct?

vad8615

10/12/2022, 1:14 PM

let me check the limit, I thought it was not set, it could be an error in the chart

vad8615

10/12/2022, 1:14 PM

but yes, request is 1GB

vroldanbet

10/12/2022, 1:15 PM

and the logs above, showing relatively low cache allocations, is also from prod right?

vroldanbet

10/12/2022, 1:15 PM

I certainly wouldn't expect those numbers. It's similar to what I observed, but in my local machine

vroldanbet

10/12/2022, 1:15 PM

the only alternative I can think of right now is to set those manually and make sure they fit within the request

vroldanbet

10/12/2022, 1:16 PM

manually as in with actual sizes instead of percentages, the flag should support both

vad8615

10/12/2022, 1:16 PM

yes, they are from production. I confirm limit is 1GB, my bad

vroldanbet

10/12/2022, 1:16 PM

@jzelinskie can follow up, Im actually OoO but just in front of the computer because I got paged 😅

vad8615

10/12/2022, 1:18 PM

I'll set them rn, but I don't expect it to improve, because even now we're going way above those values

jzelinskie

10/12/2022, 3:21 PM

the percentage actually checks free memory and does (free mem * 0.70) * $YOUR_PERCENTAGE

jzelinskie

10/12/2022, 3:22 PM

it's confusing, but this is so that other things can consume some amount of memory without the cache having eaten up everything

jzelinskie

10/12/2022, 3:28 PM

We're looking into to seeing if there could be another cause to the misaligned memory usage though, since that should put you safely below your target and not above it

vad8615

10/13/2022, 10:04 AM

@jzelinskie have you seen the chart about allocs? does it say something? 😄

williamdclt

10/13/2022, 10:58 AM

Just tried setting a static

--dispatch-cache-max-cost

750MiB

, doesn't seem to have changed anything really

williamdclt

10/13/2022, 10:59 AM

(I don't enable the dispatch cluster, so didn't set

--dispatch-cluster-cache-max-cost

)

williamdclt

10/13/2022, 11:07 AM

a heap dump for your eyes

conversation

williamdclt

10/13/2022, 11:08 AM

Interestingly the dump talks about "882.57MB total", not >3GB 🤔

williamdclt

10/13/2022, 12:08 PM

> (I don't enable the dispatch cluster, so didn't set --dispatch-cluster-cache-max-cost) I set it for peace of mind: didn't change anything either 🙂

vad8615

10/13/2022, 1:06 PM

i set them to 200MB/200MB, still growing (slowing, not many requests, as I mentioned before)

jzelinskie

10/13/2022, 2:36 PM

Ok I'm going to try and reproduce this today

williamdclt

10/13/2022, 3:26 PM

It's impacting response time and availability pretty negatively, I'll rollback to v1.12.0 in the meanwhile!

jzelinskie

10/14/2022, 3:36 PM

I suspect our cost estimates for item sizes going into the cache are substantially off such that the cache in reality is growing beyond what it's intended

jzelinskie

10/14/2022, 3:37 PM

I'm experimenting with how we store and estimate items in our cache so that they are no longer estimates.

vad8615

10/15/2022, 7:05 PM

@jzelinskie if you add metrics (estimates vs real, where real could be optional?) we could report what we see

jzelinskie

10/17/2022, 3:26 PM

Rather I have a PR that just gets rid of any guess-work we were doing.

jzelinskie

10/17/2022, 3:26 PM

https://github.com/authzed/spicedb/pull/900/files

jzelinskie

10/17/2022, 3:26 PM

You might be able to cherrypick it onto v1.13

vad8615

10/18/2022, 2:58 PM

@jzelinskie I'm running 1.13.0 patched with your PR. I'll wait till tomorrow to be sure but apparently it's 😍

jzelinskie

10/18/2022, 2:59 PM

nice! glad to hear that so far

vad8615

10/19/2022, 11:54 AM

vad8615

10/19/2022, 11:54 AM

annotation where PR was applied

vroldanbet

10/19/2022, 11:57 AM

do you have cache metrics enabled? Would be useful to see if evictions are happening based on the cache size selected on startup

vad8615

10/19/2022, 2:27 PM

@vroldanbet can you tell me which (prometheus) metric exactly?

vroldanbet

10/19/2022, 2:30 PM

sure! you need 2 flags enabled, because these are "ristretto" (the caching library we use) specific metrics, which are not enabled by default: -

dispatch-cache-metrics

dispatch-cluster-cache-metrics

once enabled, you shall see the following metrics:

Copy code

spicedb_dispatch_client_cost_added_bytes
spicedb_dispatch_client_cost_evicted_bytes
spicedb_dispatch_cost_added_bytes
spicedb_dispatch_cost_evicted_bytes

vad8615

10/19/2022, 2:43 PM

they're all 0, they're probably not enabled. Let me check

vad8615

10/19/2022, 2:50 PM

I've just deployed spice with these metrics enabled. Traffic is already going down for today (TZ is EU/Rome), we can have more information tomorrow

vroldanbet

10/19/2022, 2:51 PM

thanks for looking into this!

vad8615

10/19/2022, 2:57 PM

@vroldanbet thank you and @jzelinskie for the support!

vroldanbet

10/19/2022, 2:58 PM

no problem! it's jimmy who deserve all the credit here! ✨

jzelinskie

10/19/2022, 3:35 PM

@vad8615 would it be possible to get a similar graph for your cpu performance?

vad8615

10/20/2022, 7:44 AM

@jzelinskie

vroldanbet

10/20/2022, 7:46 AM

mind marking when did the cache-fix got deployed? I believe Jimmy wanted to check if extra CPU incurred was relevant, and it appears it is not. How many pods are y'all running?

vad8615

10/20/2022, 7:50 AM

sure. I can post p50 and p90 too, they look relevant

vroldanbet

10/20/2022, 7:50 AM

what is this snapshot, avg?

vroldanbet

10/20/2022, 7:52 AM

when I say relevant, I mean that it does not look like there is an appreciable increase in CPU use. And even if there was, SpiceDB is still fundamentally I/O bound and trading some CPU seemed like a reasonable choice here to better control memory management

vad8615

10/20/2022, 8:20 AM

(ATM only a couple of features with low usage have ACL managed with spice). Yes, I agree that CPU usage has not changed in any relevant way. Here CPU with annotation:

vad8615

10/20/2022, 8:22 AM

p50 and p90:

vad8615

10/20/2022, 8:22 AM

it looks like there's a modest increase in p90

vroldanbet

10/20/2022, 8:29 AM

it seems like your workload always shows that pattern around 8:00am in your TZ. At first glance it would seem so, but perhaps early to drag conclusions - workload could have changed. Maybe try to correlate with traffic?

vad8615

10/20/2022, 9:09 AM

no traffic on weekends, working days are quite similar:

vad8615

10/21/2022, 1:34 PM

vad8615

10/21/2022, 1:34 PM

evicted are still 0

vad8615

10/21/2022, 1:34 PM

vad8615

10/21/2022, 1:35 PM

max costs are 200MB/200MB

vroldanbet

10/21/2022, 3:13 PM

nice looking good, presumably there is not enough load to apply pressure to the cache

jzelinskie

10/21/2022, 4:11 PM

fantastic

vad8615

11/03/2022, 7:37 AM

vad8615

11/03/2022, 7:42 AM

even if the memory has improved a lot, I think there's still something not working properly. - In the last hours we went over the 200MB limit we set, and we still see no evitions. - Total memory usage is 675MB. It's growing faster than caches. It was ~400MB when caches were ~100MB each, thus ~200MB overhead. With caches ~200MB each it's ~675MB, thus 375MB overhead. Memory usage is basically double the amount of memory reserved for cache.

jzelinskie

11/03/2022, 3:12 PM

Can you run a heap profile on a SpiceDB instance? There could be something else outside of the cache consuming memory, so I'd like to confirm that.

vad8615

11/03/2022, 3:36 PM

(update: evictions started at 209MB 🙏 )

vad8615

11/03/2022, 3:47 PM

@jzelinskie can I send it privately somehow?

jzelinskie

11/03/2022, 3:47 PM

yeah if you DM me a https://pprof.me link that should be private

jzelinskie

11/03/2022, 4:03 PM

Looks like there could be a big LookupResources request in flight

jzelinskie

11/03/2022, 4:07 PM

going to dig a bit deeper if i can

vad8615

11/03/2022, 4:47 PM

I would't say there's a big one, of this resource type we have at most ~200 resources per user AFAIK. Could it be there's a leak of "LookupResources objects" ?

vad8615

11/03/2022, 4:47 PM

as you can see in the above chart, it takes days to reach this point

jzelinskie

11/03/2022, 5:16 PM

yeah, that's quite possible

vad8615

11/04/2022, 8:30 AM

and then suddenly it oomed

jzelinskie

11/04/2022, 3:07 PM

Can y'all try the latest v1.14 release? I just want to make sure you have all the latest changes to LookUpResources internals

vad8615

11/05/2022, 8:31 PM

I'll try next week ASAP, thank you

jzelinskie

11/05/2022, 8:32 PM

There have been lots of changes there for performance and to support caveats, so I'd like to make sure things are reproducible

jzelinskie

11/05/2022, 8:32 PM

Check out the GitHub release notes for how to upgrade with zero downtime

vad8615

11/08/2022, 4:34 PM

we're now running 1.14.1. Let's wait for memory to increase

jzelinskie

11/09/2022, 4:19 PM

How's it been going?

vad8615

11/10/2022, 8:23 AM

it's apparently the same, but let me collect data 1 more day

vad8615

11/11/2022, 7:27 AM

I need to decrease cache size even more in order to be able to make a statement

vad8615

11/15/2022, 2:40 PM

an update: with 1.14.1 memory seems to be quite stable 🎉 there could still be a small leak but way less important. I'm collecting evidence

vroldanbet

11/15/2022, 3:51 PM

did you guys reduce the memory request/limits to put some pressure on the memory?

vad8615

11/15/2022, 3:59 PM

we reduced cache size (in MB), do you think I should decrease even the pod req/limits?

vroldanbet

11/15/2022, 4:02 PM

well ideally you don't need to resource to reducing the cache size and the default establishes sensible values. I was referring to the kube requests/limits in order to accelerate the leak and make it evident waiting less time (it it still exists)

vad8615

11/16/2022, 10:46 AM

vad8615

11/16/2022, 10:48 AM

the current situation is much better than before. However, I would expect memory usage to go back to the same value every night (when there's no traffic), as soon as evictions starts kicking in. We observed an increase of ~20MB between the last two nights instead.

vroldanbet

11/16/2022, 10:50 AM

hrm, shouldn't eviction kick-in when there is memory pressure? If I understand correctly, y'all set a lower cache size compared to the actual memory available to the process, right?

vroldanbet

11/16/2022, 10:51 AM

so I guess that's why eviction is kicking in, even thought there is still memory left

vroldanbet

11/16/2022, 10:52 AM

while that works, it does not fully utilize the memory available to the process

vroldanbet

11/16/2022, 10:53 AM

If I may, I'd suggest adding a line denoting the cache size the first panel, so it's easier to map evictions to crossing that threshold

vad8615

11/16/2022, 1:32 PM

cache limit is 20MB BTW. I'm aware it's not an optimal setting, it's just for "debugging"

vroldanbet

11/16/2022, 2:02 PM

oh ok!

AlexB

12/22/2022, 3:31 PM

Hey everyone, we started to see quite a lot of out-of-memory kills of our spicedb pods recently (as the number of the relationships has grown from ~7K to ~11K, but maybe due to some other reason that we're missing) we tried setting --dispatch-cluster-cache-max-cost and --dispatch-cache-max-cost to specific values (700MiB and 300MiB respectively out of 1280Mi configured as the pods' resource request) as it was suggested here but didn't see any improvement we started to experience these problems with 1.14.1, then upgraded to 1.15.0 which made no difference is there anything else that we could try doing?

vroldanbet

12/22/2022, 5:54 PM

what version are you currently running?

AlexB

12/27/2022, 9:26 AM

we're running 1.15.0

vroldanbet

12/27/2022, 9:27 AM

are there any chances you guys can provide us with your schema to see if we can reproduce it? cc @jzelinskie who's been looking into the leak

AlexB

01/04/2023, 5:21 PM

hey @vroldanbet and @jzelinskie , we'll have to obfuscate our schema a little to be able to share but we can do that, I can send it to you in DM it's pretty complex though, so I'm not sure how feasible it would be to make any use of it I can also share some heap profiles and any other metrics if it helps

AlexB

01/04/2023, 6:18 PM

one observation - when our kubernetes metrics show that a spicedb pod uses ~1 GB of memory, the heap profile shows less than 500MB

Joey

01/04/2023, 6:27 PM

what does

ps

show for the memory usage of the process?

jzelinskie

01/04/2023, 6:32 PM

can you share a heap profile?

jzelinskie

01/04/2023, 6:53 PM

go tool pprof http://$SPICEDB_URL:$METRICS_PORT/debug/pprof/heap

should download a tarball that you can upload to https://pprof.me to share with us

AlexB

01/04/2023, 9:42 PM

https://pprof.me/45cc95d/

AlexB

01/04/2023, 9:43 PM

Copy code

/ # ps -o pid,user,vsz,rss,comm
PID   USER     VSZ  RSS  COMMAND
    1 65532    1.7g 1.1g spicedb

Joey

01/04/2023, 9:47 PM

thanks

AlexB

01/04/2023, 9:54 PM

memory usage in the heap profile (Memory In-Use Bytes, Cumulative), ps (the rss column) and in kubernetes metrics seem to match actually but the svg generated with

go tool pprof

web

displays

Showing nodes accounting for 443.62MB 92.88% of 477.64MB total

(perhaps my expectation that this would represent the total memory used is incorrect)

Joey

01/04/2023, 9:54 PM

[spicedb] github.com/authzed/spicedb/internal/namespace.(*ReachabilityGraph).collectEntrypoints

jumps out on the cumulative graph

Joey

01/04/2023, 9:54 PM

I'll investigate tomorrow to make sure we're not somehow keeping that graph around once its done being used

AlexB

01/10/2023, 10:56 PM

hey everybody, I was wondering if there are any updates regarding the suspected memory leak as far as I can see, our heap profile at the link above got overwritten by something else, but I can upload a new one if needed we keep seeing oom kills quite frequently - typically a couple of time a day

Joey

01/10/2023, 11:00 PM

we found a deadlock in the LookupResources code and just merged a fix for it: https://github.com/authzed/spicedb/pull/1086

Joey

01/10/2023, 11:00 PM

however, we aren't 100% certain that is the root cause of the memory issues

Joey

01/10/2023, 11:00 PM

if you can get us another profile too, it can't hurt

AlexB

01/19/2023, 10:17 PM

Hi everybody, we're still suffering memory leak issues here's some more observations:

Copy code

ps -o pid,user,vsz,rss,comm
PID   USER     VSZ  RSS  COMMAND
    1 65532    3.0g 1.4g spicedb

https://pprof.me/f56c390/

Copy code

ps -o pid,user,vsz,rss,comm
PID   USER     VSZ  RSS  COMMAND
    1 65532    3.0g 2.2g spicedb

https://pprof.me/567b9c8/

Joey

01/19/2023, 10:20 PM

@jzelinskie ^

Joey

01/19/2023, 10:20 PM

@AlexB on v1.16?

Joey

01/19/2023, 10:25 PM

v1.16 has a fix for what we believe was causing the reachability graph to not be GCed

Joey

01/19/2023, 10:25 PM

that fix linked above ^

jzelinskie

01/19/2023, 10:26 PM

from your graph -- the fix joey is mentioning removes errgroup and uses a different primitive for scheduling reachableresources dispatches

AlexB

01/19/2023, 10:29 PM

hmm... I checked what's included in v1.16 and I thought the fix linked above wasn't there and based on that we decided not to upgrade just yet

Joey

01/19/2023, 10:30 PM

Joey

01/19/2023, 10:30 PM

might be v1.16.1

jzelinskie

01/19/2023, 10:30 PM

https://github.com/authzed/spicedb/releases/tag/v1.16.1

Joey

01/19/2023, 10:31 PM

yeah, sorry

Joey

01/19/2023, 10:31 PM

I thought it merged earlier than it did

AlexB

01/19/2023, 10:34 PM

thanks a lot 👍, we'll give it a try

Joey

01/19/2023, 10:36 PM

thanks for the report

Joey

01/19/2023, 10:36 PM

it was incredibly helpful to narrow it down

pdow

01/25/2023, 2:28 PM

FYI @Joey - we are still seeing the same issues after upgrading. We have a relatively small DB and have been giving our pods more and more memory, but invariably spicedb eventually eats all of it after some time and use. The pattern seems to be that we do some writes to the db, and after we start doing reads on it it looks like spicedb starts populating caches but never stops growing them. In case it helps: we are not using ZedTokens and querying with the fully consistent setting (the response times on this are perfectly fine for our use case when spicedb is operating normally). Based on our observations, the latest release did not fix or mitigate the memory leak issues.

vroldanbet

01/25/2023, 2:31 PM

Is this with 1.16.1? would you mind getting another heap profile and sharing with us? 🙏🏻

pdow

01/25/2023, 2:43 PM

Yes, latest version - we'll try to get something to you soon. Since the behaviour does not seem to have changed, I expect it might look similar to the profiles from before

pdow

01/25/2023, 2:44 PM

The pod just got killed a little while ago so we might have to wait a little for the issue to re-occur (typically happens a couple of times per day)

AlexB

01/25/2023, 2:48 PM

here's a heap profile https://pprof.me/de52332/

AlexB

01/25/2023, 2:51 PM

and ps output for the same pod

Copy code

PID   USER     VSZ  RSS  COMMAND
    1 65532    6.3g 4.6g spicedb

pdow

01/25/2023, 2:54 PM

Note that the ps output shows 4.6g, while the heap memory-in-use bytes only show up as 2.8GB in the profile - we're not sure where the rest of that comes from

Joey

01/25/2023, 3:18 PM

the ristretto cache holds memory in a different way

Joey

01/25/2023, 3:18 PM

but that is expected

Joey

01/25/2023, 3:19 PM

it looks like gRPC is still holding references to the streams, for some reaswon

Joey

01/25/2023, 3:19 PM

reason*

Joey

01/25/2023, 3:19 PM

I'll investigate today

pdow

01/25/2023, 3:43 PM

Client-side we are using the asyncio python library. Roughly following the README page for that, we have a setup that just creates one client object for the app, which we keep re-using. I am quite sure we consume the streams from e.g. ReadRelationships fully, so there shouldn't be a reason for the client to keep such streams open, for instance. But there's no context management going on anywhere as of now, not sure if that's normal for this client. We're considering trying an experiment where we just destroy and re-create this client object every few hundred/thousand requests or so to see if that changes anything - do you have any reason to believe this could help?

Joey

01/25/2023, 3:47 PM

I don't believe a client change is necessary, no

Joey

01/25/2023, 3:47 PM

it may help in convincing gRPC to clear the streams

Joey

01/25/2023, 3:47 PM

but that's not something we'd want to encourage, ideally

Joey

01/25/2023, 3:47 PM

it appears others are having similar issues with gRPC streams

Joey

01/25/2023, 3:48 PM

https://github.com/grpc/grpc-go/issues/3728

Joey

01/25/2023, 6:47 PM

@pdow do you happen to have a QA or stage env that you can reproduce the problem in?

Joey

01/25/2023, 6:52 PM

if you do, and you want to try a fix: https://github.com/authzed/spicedb/pull/1119

pdow

01/26/2023, 12:40 PM

We're giving this a try, thanks @Joey

AlexB

01/26/2023, 2:56 PM

we built and deployed a version from main (i.e. without the PR linked above) and our problems are gone when we return to 1.16.1 the memory issues are back is this this PR https://github.com/authzed/spicedb/pull/1110 that solves the problem?

AlexB

01/26/2023, 3:03 PM

yep, we built from the previous commit in main and can immediately see the problem again

vroldanbet

01/26/2023, 3:04 PM

it does not necessarily mean there is no leak. What that PR is doing is eagerly evicting data from the cache that is 2x out of the quantization window and therefore unlikely to be reused

vroldanbet

01/26/2023, 3:05 PM

what could be happening is that the grow will be slower moving forward. Or it could be also that there was never any leak, but SpiceDB does not properly compute the size of the cache entries and more entries are being added when there is no capacity left

AlexB

01/26/2023, 3:13 PM

are there any cache configuration options that we can experiment with to check the latter hypothesis? the very first thing that we tried was tuning --dispatch-cluster-cache-max-cost and --dispatch-cache-max-cost but to no effect

vroldanbet

01/26/2023, 3:14 PM

you could disable the cache entirely

vroldanbet

01/26/2023, 3:19 PM

it may be worth just testing this PR for a bit but I'm not convinced this solves the problem. It's eagerly evicting data from the cache that is likely to be unused. - if the leak does not happen anymore, then we know it's related to the cache - if it happens but is happening slower than before, then it means the leak is elsewhere, and that PR didn't really fix it

Joey

01/26/2023, 3:34 PM

We know part of the issue is related to reachable resources, which the quantization PR does not address

Joey

01/26/2023, 3:48 PM

@pdow @AlexB let me know if you can try the timeouts PR itself, instead of just

main

Joey

01/26/2023, 3:48 PM

the overall memory usage should still be higher, but the memory profile should show the leak removed

AlexB

01/26/2023, 3:51 PM

@Joey , yes, we will try it

Joey

01/26/2023, 3:51 PM

thanks

Joey

01/30/2023, 7:50 PM

@AlexB any updates?

Joey

02/06/2023, 11:59 PM

@AlexB we've merged that PR into HEAD, alongside the cache improvements

AlexB

02/07/2023, 10:39 AM

hey @Joey , sorry, I was off last week, our team did some quick tests using that PR but we didn't collect enough data to come to any conclusion and then we had (due to some internal reasons) to switch back to 1.16.1

AlexB

02/07/2023, 10:39 AM

we'll try to do more tests this week

AlexB

02/09/2023, 9:47 AM

here's how things look for us in terms of memory usage after switching to a build from the main branch

AlexB

02/09/2023, 9:47 AM

and here's what we have in prod where we use 1.16.1

vroldanbet

02/09/2023, 10:05 AM

looks promising! The fix should have been released now as part of 1.16.2

5 Views

Previous Next