Alternative query

Nuur_zakki_Zamani · December 12, 2023, 9:22am

I want to ask a question, I use aggregation to fetch data from one document to reference document. this is my code, it is correct, just this code is not suitable for large data. here the comment “the way you followed might not be scalable, which might cause problems in the future, such as overkilled, eating up the storage CPU & RAM”, can you teach me another query?

NeNaD · December 12, 2023, 9:29am

Hi @Nuur_zakki_Zamani,

I would say you should do 2 things:

Index clinics property.
Replace the order of the $match and $lookup in your aggregate query. $match can leverage the index only if it’s the first stage of the aggregation pipeline.

By doing that, your query should be performant even on large datasets.

Nuur_zakki_Zamani · December 12, 2023, 9:38am

thank you so much @NeNaD i wll try it

steevej · December 12, 2023, 3:51pm

My 2 cents.

I do not think that the following is possible.

The $match query the field critics which is the result of the $lookup. So it cannot be done before.

You use case seems overly simplistic. It looks like you want to somehow process the clinics on an ordered way using the category. I would implement a use-case like that by aggregation on the clinics and $lookup on categories.

pipeline = [
    { "$sort" : {
        "category" : 1
    } } ,
    { "$lookup" : {
        "from" : "categories" ,
        "localField" : "category" ,
        "foreignField" : "_id" ,
        "as" : "category"
    } }
] ;
db.clinics.aggregate( pipe ) ;

It is kind of a $group but without the risking the 16MB limit due to a massive array.

One caveat is that the category data is duplicated in the output. A $project could be used in the $lookup pipeline to reduce this.

Please read Formatting code and log snippets in posts before posting new code or documents. We cannot cut-n-paste code or documents from an image so it makes experimenting and replying slower than it could.

Nuur_zakki_Zamani · December 14, 2023, 7:14am

thanks a lot @steevej