I am trying to build a system that requires storing an actual document file in the database along with other details. For example, assume I need a table that stores information about candidates applying for a job I posted online: name, email, phone, etc., plus a resume file (doc, docx, pdf).
I don’t want to store the document in Amazon S3 and keep only a link in the database. I’m building an application that requires low latency, so having the resume content alongside the candidate data helps.
To be frank, in my use case I don’t actually need the resume file itself; I need the text inside the resume as a string. So which is the more efficient approach: extracting the text from the resume file and storing it in MongoDB as text, or storing the resume file in binary format? Which approach is more scalable?
The only reason I’m leaning toward storing text is that if I store the resume file itself, I have to fetch it and extract the text with a document parser on every read, which further increases my latency. On the other hand, I’m not sure whether storing such large texts from resumes is scalable.
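For concreteness, here is a minimal Python sketch of the text-only option I’m considering. The field names (`resume_text`, `created_at`) and both helper functions are hypothetical, made up for illustration. The size check is only a rough estimate based on the UTF-8 length of the text, compared against MongoDB’s 16 MiB maximum BSON document size; the sketch does not connect to a database.

```python
from datetime import datetime, timezone

# MongoDB's documented maximum BSON document size (16 MiB).
MAX_BSON_DOCUMENT_BYTES = 16 * 1024 * 1024


def build_candidate_document(name, email, phone, resume_text):
    """Build a candidate record with the extracted resume text stored inline.

    Hypothetical schema: the field names are assumptions, not a fixed layout.
    """
    return {
        "name": name,
        "email": email,
        "phone": phone,
        "resume_text": resume_text,  # plain string extracted once, at upload time
        "created_at": datetime.now(timezone.utc),
    }


def resume_text_bytes(document):
    """Approximate the stored size of the resume text as its UTF-8 byte length."""
    return len(document["resume_text"].encode("utf-8"))


# Placeholder data standing in for text extracted from a real resume.
candidate = build_candidate_document(
    "Jane Doe",
    "jane@example.com",
    "+1-555-0100",
    "Experienced backend engineer. " * 1000,
)
size = resume_text_bytes(candidate)
print(size, size < MAX_BSON_DOCUMENT_BYTES)
```

As far as I understand, a dict like this could be passed as-is to a driver call such as PyMongo’s `insert_one`; I left that out so the sketch runs without a server.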
I’m new to MongoDB. I’d appreciate any help or suggestions. Note: the resume files are less than 1 MB each.