I work on the team that built this and many of our other ranking tech. I can answer any questions people have.
Also shameless plug: using this tech we released some artificial search sessions as an exploratory dataset. https://github.com/dfcf93/MSMARCO/tree/master/ConversationlS...
Cool project! One of the challenging use cases mentioned in the description is people taking a picture and asking the search engine, 'What is this?'. Has this been solved? (it is a very hard problem if taken beyond simple object classification)
LOL - that's exactly the use case I was thinking of. But wouldn't an AI have to be trained with lots of examples of the item in question to get a high quality detection? If so, it might not be able to find that one image that antique shop in Whereverston has on their web site.
I don't work on the image side but from what I understand the entire index is vectorized so its not categorizing them like a imagenet system would as much as finding a nearest neighbor that can be categorized.
Yes, exactly! This seems like totally overblown title. It's akin to saying Google open sourced their key search tech Kubernetes, which is an open source rendition of Borg, where all the Google workloads run on top of.
no it turns out it's "A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search scenario."