the avenue we're exploring right now is having a Network Load Balancer fronting ECS (because it seems to screw with gRPC connections less) and then having AppMesh as a service discovery mechanism for the horizontal dispatch stuff.
an NLB should work if you want to run a cluster; if you're happy with the performance you're getting right now that sounds like a sane topology, especially since getting the horizontal dispatch working in ECS is a pain.