What is the most efficient way to get data from a collection of 10 million documents?

Please read "Formatting code and log snippets in posts" and update your code and sample documents so that we can easily experiment with them.