Skip to content

CLIP encoder #69

@MechaCritter

Description

@MechaCritter

Is your feature request related to a problem? Please describe.
Besides the typical encoder used here like VLAD or Fisher Vector, I would like to implement a one-shot encoder which uses the CLIP backbone. It needs no processing, just one-shot and the embedding is there :)

Describe the solution you'd like
Implement a CLIPEncoder(ImageEncoderBase) that extracts the CLIP feature as a vector. Probably the model backbone would come from transformers library.

Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.

Additional context
Add any other context or screenshots about the feature request here.

Metadata

Metadata

Assignees

No one assigned

    Labels

    P1Priority 1contributor neededNeeds a contributor to take over the issue/featuredependenciesPull requests that update a dependency file

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions