One of my replicas has been down for a few hours. How can I get it back?

I have 3 replicas; the other 2 are fine.

Here are some of the logs. From this point on, all connections are rejected.
How can I get this replica back?

2020-12-14T17:37:59.453 +0000 I REPL_HB [replexec-1] Heartbeat to failed after 2 retries, response status: InterruptedAtShutdown: interrupted at shutdown

2020-12-14T17:37:59.453 +0000 I REPL [replexec-1] Member is now in state RS_DOWN - interrupted at shutdown

2020-12-14T17:38:01.455 +0000 I REPL_HB [replexec-1] Heartbeat to failed after 2 retries, response status: InterruptedAtShutdown: interrupted at shutdown

2020-12-14T17:38:03.457 +0000 I REPL_HB [replexec-2] Heartbeat to failed after 2 retries, response status: InterruptedAtShutdown: interrupted at shutdown

2020-12-14T17:38:05.459 +0000 I REPL_HB [replexec-3] Heartbeat to failed after 2 retries, response status: InterruptedAtShutdown: interrupted at shutdown

2020-12-14T17:38:07.461 +0000 I REPL_HB [replexec-3] Heartbeat to failed after 2 retries, response status: InterruptedAtShutdown: interrupted at shutdown

2020-12-14T17:38:10.572 +0000 I CONNPOOL [Replication] Ending connection to host due to bad connection status: HostUnreachable: Connection reset by peer; 0 connections to that host remain open

2020-12-14T17:38:10.572 +0000 I CONNPOOL [Replication] Connecting to

2020-12-14T17:38:10.574 +0000 I REPL_HB [replexec-3] Heartbeat to failed after 2 retries, response status: HostUnreachable: Error connecting to (192.168.248.8:27017) :: caused by :: Connection refused

Hi @CJ_Jiang,

If this is an Atlas cluster, I suggest you contact our support.

It seems that a member is down, but there is not enough detail here to indicate why.
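
For a self-managed deployment, one quick first check is whether the member's port accepts TCP connections at all, since the last log line reports "Connection refused". Below is a minimal sketch using only the Python standard library. The function name `can_connect` and the timeout value are illustrative assumptions, not part of any MongoDB tooling, and a successful connection only shows that something is listening on the port, not that the member is healthy.

```python
import socket


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout.

    Hypothetical helper for illustration: it only tests reachability of the
    port and says nothing about the replica set state of the member.
    """
    try:
        # create_connection raises OSError (e.g. ConnectionRefusedError or a
        # timeout) when nothing is listening or the host is unreachable.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

For example, `can_connect("192.168.248.8", 27017)` returning `False` would be consistent with the "Connection refused" entry above, in which case the next step is to check on that host whether the mongod process is running and what its own log says. If it returns `True`, running `rs.status()` from a shell connected to a healthy member shows the state each member is reported in.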

Thanks
Pavel

It is resolved. Thanks a lot.

