Duplicated from nlpconnect/vit-gpt2-image-captioning
How to use baseplate/vit-gpt2-image-captioning with Transformers:
# Use a pipeline as a high-level helper # Warning: Pipeline type "image-to-text" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("image-to-text", model="baseplate/vit-gpt2-image-captioning")
# Load model directly from transformers import AutoTokenizer, AutoModelForImageTextToText tokenizer = AutoTokenizer.from_pretrained("baseplate/vit-gpt2-image-captioning") model = AutoModelForImageTextToText.from_pretrained("baseplate/vit-gpt2-image-captioning")