We have two clusters 800-802 and 9000-9002with PSS architecture that receive a heavy influx of writes (~25k messages/sec) into dated schemas. Applications are facing serious latency issues with writes - 800(200mil records behind) and 9000(100mil records behind).
· 800-802 - OS: Linux sles12sp3, SATA SSD, CPU: Ivy Bridge, RAM: 250GB, Disk: 4TB (2.5TB used), mongov4.0.3 , w:0 j:true
· 9000-9002 - OS: Linux sles12sp3, SATA SSD, CPU: Ivy Bridge, RAM: 250GB, Disk: 9TB (5TB used), mongov4.0.3 , w:0 j:false (although this setting is not safe/recommended app users are willing to take the chances as opposed to having millions of records drop due to insert latency)
My recommendation is to set inserts on both clusters to w:1 j:false . While 800 may experience better performance (since it no longer hits on disk journal) 9000 is going to take a further hit with this change.
Questions:
-
Will switching to RAID10 provide a significant performance improvement even for SSDs?
-
Will transferring the journal file to a different volume help ?
-
Given that this is a write heavy application are there any cache settings that can be adjusted?