OpenVision
Collection
27 items • Updated • 33
How to use UCSC-VLAA/openvision-vit-base-patch8-384 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("image-feature-extraction", model="UCSC-VLAA/openvision-vit-base-patch8-384") # Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("UCSC-VLAA/openvision-vit-base-patch8-384", dtype="auto")This repository contains the OpenVision model, a fully-open and cost-effective family of advanced vision encoders for multimodal learning, as described in the paper OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning.
Project Page: https://ucsc-vlaa.github.io/OpenVision/