Need Suggestions for Reducing Downtime During Deployments with DB Migrations

Hello everyone,

I’m facing a challenge with our deployment process and would appreciate any advice. We deploy to EKS, use MongoDB Atlas for the database, and store documents in S3. Whenever we deploy to production, we take the system offline to back up both S3 and the database. This can take a while, mainly because of the large number of files in S3.

The real issue comes when a release includes a database migration. I’m using rolling updates with Helm, but we can’t run the old and new versions side by side during a migration, because the old version crashes once the migration has been applied.
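
To make the failure mode concrete, here is a minimal sketch in Python. The field names and the rename are hypothetical, not our real schema; it only illustrates the kind of incompatibility that could make an old pod crash as soon as it reads a migrated document during a rolling update.

```python
# Hypothetical document shape; the field names are assumptions for illustration only.
OLD_DOC = {"_id": 1, "name": "Ada Lovelace"}


def migrate(doc):
    """Hypothetical migration: rename 'name' to 'full_name'."""
    migrated = dict(doc)
    migrated["full_name"] = migrated.pop("name")
    return migrated


def old_version_read(doc):
    """What a not-yet-replaced pod might do: assume the old field still exists."""
    return doc["name"]


def old_version_survives(doc):
    """Return True if the old code path can still read the document."""
    try:
        old_version_read(doc)
        return True
    except KeyError:
        return False


if __name__ == "__main__":
    print(old_version_survives(OLD_DOC))           # before the migration
    print(old_version_survives(migrate(OLD_DOC)))  # after the migration
```

In this sketch the old code path works before the migration and fails afterwards, which is roughly what we see while old pods are still running.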

Does anyone have suggestions on how to reduce or eliminate the downtime caused by these backups and migrations?

Thanks in advance!