CLIP encoder

**Is your feature request related to a problem? Please describe.**
Besides the typical encoders used here, such as `VLAD` or `Fisher Vector`, I would like to implement a one-shot encoder that uses the `CLIP` backbone. It needs no extra processing step: one forward pass and the embedding is there :)

**Describe the solution you'd like**
Implement a `CLIPEncoder(ImageEncoderBase)` that extracts the CLIP image feature as a vector. The model backbone would probably come from the `transformers` library.
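
A rough sketch of what this could look like, to make the proposal concrete. The `ImageEncoderBase` below is only a hypothetical stand-in, since the project's real base class may expose a different interface. The `encode` method name, the `normalize` option, and the `openai/clip-vit-base-patch32` default checkpoint are also my assumptions, not something already decided.

```python
import math
from abc import ABC, abstractmethod
from typing import Any, List


class ImageEncoderBase(ABC):
    """Hypothetical stand-in for the project's base class; the real one may differ."""

    @abstractmethod
    def encode(self, image: Any) -> List[float]:
        ...


class CLIPEncoder(ImageEncoderBase):
    """One-shot encoder: a single CLIP forward pass yields the image embedding."""

    def __init__(self, model: Any, processor: Any, normalize: bool = True) -> None:
        self.model = model
        self.processor = processor
        self.normalize = normalize

    @classmethod
    def from_pretrained(cls, name: str = "openai/clip-vit-base-patch32") -> "CLIPEncoder":
        # Imported lazily so the class can be defined without transformers installed.
        from transformers import CLIPModel, CLIPProcessor

        model = CLIPModel.from_pretrained(name)
        model.eval()  # inference only
        return cls(model, CLIPProcessor.from_pretrained(name))

    def encode(self, image: Any) -> List[float]:
        # The processor resizes and normalizes the image into `pixel_values`.
        inputs = self.processor(images=image, return_tensors="pt")
        # Assumption: get_image_features returns a (batch, dim) tensor.
        features = self.model.get_image_features(**inputs)
        vector = features[0].detach().tolist()
        if self.normalize:
            # L2-normalize so that dot products behave like cosine similarity.
            norm = math.sqrt(sum(v * v for v in vector)) or 1.0
            vector = [v / norm for v in vector]
        return vector
```

With `transformers`, `torch`, and `Pillow` installed, usage would be along the lines of `CLIPEncoder.from_pretrained().encode(Image.open(path))`. Returning a plain list keeps the sketch dependency-light; the real implementation would likely return whatever array type the other encoders use.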



CLIP encoder #69
