How to encode images for vector search?

michael_hoeller · January 2, 2024, 9:22pm

thanks for the update, I am currently testing a slightly different way. Unfortunately no MongoDB (yet). The use case changed, or if you want, iterated to a proof of concept to identify pairs of socks. One option I follow is to encode texture driven and run a vector search. The other currently preferred is to ask “the AI” only one question. Let me reword this: focus on simplicity. Asking for one “parmerter” only reduces complexity tremendously. I am currently working on training a model only to answer the question: are theses socks one pair? It is still quite a bit of training but surprisingly less than expected. The approach to work with multiple nodes in my net explored the number of options and so the amount of training. One result, though not the initial goal is the learning to think carefully about the question to answer and so to be able to build small but still very well working models.
Unfortunately on other project eats up some time the next two weeks. After that I’ll continue to work on the actual textile / image encoding.
Best Michael