Length of captions

#14

by Gandharv - opened Apr 9, 2023

Apr 9, 2023

How long a caption can this model generate? Is there a way to increase the length of the captions so that they are more detailed?

ybelkada

Apr 9, 2023

Hi @Gandharv
You can probably use sampling methods when calling generate; please have a look at https://huggingface.co/docs/transformers/generation_strategies for further details.
You can also control the maximum length of the generated text by setting max_new_tokens.
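
For anyone who wants to try this suggestion, here is a minimal sketch of sampling with a length limit. It assumes the BLIP checkpoint named elsewhere in this thread (Salesforce/blip-image-captioning-base) and that Pillow and PyTorch are installed. The helper names (sampling_kwargs, caption) and the specific values for top_p, temperature and min_new_tokens are illustrative assumptions, not settings recommended by the model authors.

```python
def sampling_kwargs(max_new_tokens=60, min_new_tokens=20):
    """Generation settings to experiment with (illustrative values, not tuned)."""
    return {
        "do_sample": True,         # sample instead of greedy/beam search
        "top_p": 0.9,              # nucleus sampling threshold (assumed value)
        "temperature": 0.8,        # assumed value
        "max_new_tokens": max_new_tokens,  # upper bound on generated tokens
        "min_new_tokens": min_new_tokens,  # lower bound on generated tokens
    }


def caption(image_path, **overrides):
    """Caption one image; `image_path` is a hypothetical local file path."""
    # Heavy imports stay inside the function so the settings above can be
    # inspected without loading the model.
    from PIL import Image
    from transformers import BlipForConditionalGeneration, BlipProcessor

    name = "Salesforce/blip-image-captioning-base"  # assumed checkpoint
    processor = BlipProcessor.from_pretrained(name)
    model = BlipForConditionalGeneration.from_pretrained(name)

    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")

    kwargs = {**sampling_kwargs(), **overrides}
    out = model.generate(**inputs, **kwargs)
    return processor.decode(out[0], skip_special_tokens=True)
```

Calling caption("photo.jpg", top_p=0.95) would override a single setting. Because sampling is random, repeated calls can return different captions.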

ppujari

Apr 25, 2023

Hi @Gandharv and @ybelkada
I tried max_new_tokens but it did not change the length.
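
A likely explanation, offered as an assumption rather than a diagnosis of this exact run: max_new_tokens is only an upper bound, so if the model emits its end-of-sequence token after a short caption, raising the limit changes nothing. A minimum length works differently, by not allowing the end-of-sequence token until enough tokens have been produced. The toy decoder below is pure Python and is not the transformers implementation; toy_model, SHORT_CAPTION and FILLER are made up for illustration.

```python
EOS = "<eos>"

# Hypothetical toy "model": it wants to emit a short caption and then stop.
SHORT_CAPTION = ["a", "dog", "on", "grass"]
FILLER = ["in", "a", "park", "on", "a", "sunny", "day"]


def toy_model(tokens, allow_eos):
    """Return the next token given the tokens generated so far."""
    if len(tokens) < len(SHORT_CAPTION):
        return SHORT_CAPTION[len(tokens)]
    if allow_eos:
        return EOS
    # EOS is suppressed, so fall back to the next-best (filler) token.
    return FILLER[(len(tokens) - len(SHORT_CAPTION)) % len(FILLER)]


def toy_generate(next_token, max_new_tokens, min_new_tokens=0):
    """Greedy loop: stop at EOS or when max_new_tokens is reached."""
    tokens = []
    while len(tokens) < max_new_tokens:
        # EOS only becomes available once the minimum length is met.
        allow_eos = len(tokens) >= min_new_tokens
        tok = next_token(tokens, allow_eos)
        if tok == EOS:
            break
        tokens.append(tok)
    return tokens


print(len(toy_generate(toy_model, max_new_tokens=20)))   # 4
print(len(toy_generate(toy_model, max_new_tokens=200)))  # still 4
print(len(toy_generate(toy_model, max_new_tokens=200, min_new_tokens=8)))  # 8
```

In this toy, the extra tokens forced by the minimum length are filler the model would not have chosen on its own, which hints at why forcing longer output does not necessarily make a caption more informative.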

ybelkada

Apr 25, 2023

Hi @ppujari
Can you try using sampling methods?

ppujari

Apr 26, 2023

Hello @ybelkada, thank you for your quick reply. I am trying the sampling approach now and will update you soon.

larsdjursner

Nov 29, 2023

@ppujari did you have any success with the sampling approach?

talrejanikhil

Mar 14, 2024

You can do this:

from transformers import pipeline
captioner = pipeline("image-to-text", model="Salesforce/blip-image-captioning-base")
# image can be a PIL image, a local file path, or a URL
captioner(image, max_new_tokens=200, generate_kwargs={"min_length": 40})

matthew-at-qamcom

May 29, 2024

You can also do this:

out = model.generate(**inputs, max_new_tokens=200, min_length=40)

But I found it only made the results worse.

bumchuck

Jun 19, 2024

@talrejanikhil It worked for me! But I want to dig deeper. Where can I find more of these details and documentation?
