Hey all!
Right now we're running spicedb on version 1.15.0, we recently executed an RDS reboot with failover on a multiaz postgres RDS instance associated with spiceDB. It looks like the pods did not automatically reconnect to the failed over instance and we had to execute a manual restart.
Pods were shown as healthy and running so we were a bit confused about the issue. I'm assuming SpiceDB does not check the database status as part of some sort of continuous check?
The implications of this is that if an availability zone goes down, it would require manual intervention to restart the pods in the cluster. Is there an automated self-healing mechanism others have implemented to circumvent this? Would love to hear your suggestions and thoughts on this capability.
Thanks!