When you write
Do you mean that each machine has 16 GB of RAM and 4 CPUs, and that you have a total of 7 machines: 2 PS per shard, 2 PS for the config servers, and 1 machine for mongos?
or
Do you mean that you are running 6 data-bearing instances and mongos on a single 16 GB machine?
Indicates that your bottleneck is disk I/O.