Oplog replay -- does it truncate data from storage

Aayushi_Mangal · June 24, 2020, 11:00am

Hello,

Recently I have faced data loss in MongoDB, and that is so rare:
– checked for rollback, nothing there.
found that, only node in replicaset is alive, but unable to connect, on restart that single replica node, it starts replaying oplog, not sure if it is because member was not identified as primary or secondary.

My questions here:

does oplog replay truncate data from primary?
insights_bigd
I STORAGE [initandlisten] Sampling from the oplog between Jun 3 23:05:13:2256 and Jun 5 16:25:37:1 to determine where to place markers for truncation

why it is placing markers for truncation?

Stennie_X · July 1, 2020, 11:57pm

Hi @Aayushi_Mangal,

No. Oplog replay does not remove any data from the primary.

The oplog is a special capped collection with a configured maximum size. When the maximum size of a capped collection is reached, some of the oldest documents are removed to maintain the target oplog size.

The oplog implementation in the WiredTiger storage engine has some optimisations specifically for replication use cases. A housekeeping process adds logical markers (aka oplog stones) for more efficient removal of batches of the oldest documents.

For a more technical summary, see SERVER-19551: Keep “milestones” against WT oplog to efficiently remove old records.

Regards,
Stennie

Aayushi_Mangal · July 2, 2020, 12:45pm

Hi @Stennie_X,

Ok, when does oplog replay comes to picture. Like we have primary node, but I can see oplog replay operations in primary log as well.

Also what if we drop “local” database from primary only node. does we get data loss, as the data is already written in disk, but I can see data loss after dropping “local” database

Stennie_X · July 2, 2020, 9:19pm

Hi @Aayushi_Mangal,

Can you clarify what you mean by “oplog replay”? Are you referring to a secondary pulling updates from another member of your replica set? Can you provide an example of the operations you are looking at?

For a description of the general processes of initial sync and ongoing replication, please review Replication Set Data Synchronisation.

Dropping the local database removes the collections and configuration it contains (startup_log, oplog.rs, system.replset, …). but does not affect data in other databases.

However, replica set members rely on the oplog to determine the provenance of data. If you drop the local database (or the local.oplog.rs collection) for a replica set member, it cannot rejoin the same replica set without resyncing all data. If a replica set member has an oplog but no longer has an entry in common with the oplog of another member of the replica set, a resync will also be required.

If there are data changes you want to preserve on a replica set member that you are about to resync, you should take a backup of the relevant data before starting the resync process. The resync process replaces all data and indexes with a fresh copy (and known state) from another replica set member.

Regards,
Stennie

system · August 18, 2020, 8:00am

This topic was automatically closed 5 days after the last reply. New replies are no longer allowed.