Spike in read latencies at high throughput (15k qps)

maneesh · March 11, 2021, 1:37pm

We run MongoDB 4.2 as a replica set with 1 primary and 2 secondaries. The application reads from the secondaries.

When I reach a high read rate (15k read qps with 2 secondaries, 9k with 1 secondary) alongside a small number of writes (~500 qps), read latencies shoot up.

Here are some other details:

CPU utilisation: 75% idle
Load: 0.4 per CPU
Spikes in CPU usage, interrupts, free memory, and processes created when the failure occurs
No major drop in available concurrency tickets
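For anyone trying to correlate these counters with the latency spike, here is a minimal sketch that samples them directly on a Linux host. It assumes the metrics come from `/proc/stat` (the `ctxt`, `intr`, and `processes` lines); the function names are hypothetical and not part of any MongoDB tooling. Per the kernel documentation, `processes` counts both processes and threads created.

```python
import time

# Cumulative counters of interest in /proc/stat (Linux only).
FIELDS = ("ctxt", "intr", "processes")


def parse_proc_stat(text):
    """Extract cumulative counters from the contents of /proc/stat."""
    counters = {}
    for line in text.splitlines():
        parts = line.split()
        if parts and parts[0] in FIELDS:
            # For "intr", the first number is the total over all interrupts.
            counters[parts[0]] = int(parts[1])
    return counters


def rates(before, after, seconds):
    """Per-second rate for every counter present in both samples."""
    return {k: (after[k] - before[k]) / seconds for k in before if k in after}


def sample(path="/proc/stat"):
    """Read and parse one snapshot of the counters."""
    with open(path) as f:
        return parse_proc_stat(f.read())


def watch(samples=5, interval=1.0):
    """Print per-second rates for a fixed number of intervals."""
    prev = sample()
    for _ in range(samples):
        time.sleep(interval)
        cur = sample()
        print(rates(prev, cur, interval))
        prev = cur
```

Running `watch(samples=60, interval=1.0)` on a secondary while the load test ramps up would show whether the context-switch and process-creation rates climb before or after the latencies do.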

I have uploaded the rest of the related metrics here.

The reads are not random, so I don’t think this is a memory pressure issue (read IOPS are also low, which supports this).

It mainly looks like a limitation in MongoDB itself, but I am not sure how to confirm that.

Can someone give pointers on how to debug this further?

When does MongoDB create new processes?
What explains the high number of interrupts and context switches?
Why would free memory spike when the box is obviously under high memory pressure?

This is definitely not an intermittent issue, since I was able to reproduce it consistently.

Eoin_Brazil · March 11, 2021, 3:52pm

Hi @maneesh

Welcome to our forums. Reading your post, I think these questions might be better asked in our Ops and Admin category of the MongoDB Developer Community Forums, as they don’t appear to be directly related to the M201 MongoDB Performance course.

If I’m mistaken, can you clarify which chapter and lesson in M201 you are having difficulties with?

Hope this helps,
Eoin

maneesh · March 11, 2021, 3:55pm

@Eoin_Brazil Apologies, I have updated the tag. Thanks for the correction.

maneesh · September 15, 2022, 7:42am

Hi Team,

Any update here? We are still facing this issue.

@kevinadi Can you please help here?

maneesh · September 15, 2022, 4:30pm

We are seeing a spike in the number of forked processes. When does MongoDB create more processes?
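On Linux, the kernel's process-creation counter also includes threads, so a "forked processes" metric can rise when a server spawns threads rather than child processes. A rough way to check this, assuming a Linux host and a known `mongod` PID, is to track the `Threads:` field of `/proc/<pid>/status` over time. The helper names below are hypothetical.

```python
def thread_count(status_text):
    """Return the thread count from /proc/<pid>/status content, or None."""
    for line in status_text.splitlines():
        if line.startswith("Threads:"):
            return int(line.split()[1])
    return None


def read_thread_count(pid):
    """Read the current thread count of a running process (Linux only)."""
    with open(f"/proc/{pid}/status") as f:
        return thread_count(f.read())
```

Calling `read_thread_count(pid)` every second during the load test and comparing it with the system-wide process-creation rate would show whether the spike comes from `mongod` threads or from something else on the box.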

maneesh · September 17, 2022, 1:40pm

I have found the issue. It was related to https://jira.mongodb.org/browse/SERVER-54805
