input embeddings, output similarity. Can this be done through an inference endpoint?
#38
by
click37
- opened
Hi there
I am computing the embeddings of sentences using the model. I store them so that I don't run this computation in real time. I then get a new sentence in real-time, compute the embedding and want to run a sentence similarity between this new embedding and all others I did prior.
Can this be done here?
You can use any open source vector db(chroma, weaviate) to retrieve best matches of semantic similar sentences. I hope that's what you're looking for.
I'm also looking to do something similar with the inference API. However, the API seems to only support getting sentence similarity and not the actual vectors.
Supported
- Inputting a
source_sentence
. - Inputting several
other_sentences
. - Getting the similarity of
other_sentences
withsource_sentence
.
Not Supported (Or maybe not well documented)
- Inputting
sentences
orsentence
. - Getting the 384 dimensional vectors.