tshiamor/mcx-card-openvla
Viewer • Updated • 215 • 1.19k
How to use tshiamor/openvla-mcx-card with Transformers:
# Load model directly
from transformers import AutoModelForVision2Seq
model = AutoModelForVision2Seq.from_pretrained("tshiamor/openvla-mcx-card", trust_remote_code=True, dtype="auto")Fine-tuned OpenVLA (7B) for the MCX card pick-and-place task in Isaac Sim using a Franka Panda robot.
| Parameter | Value |
|---|---|
| Base model | openvla/openvla-7b |
| Learning rate | 2e-5 |
| Batch size | 8 |
| Epochs | 8 (checkpoint at epoch 6) |
| Optimizer | AdamW (weight_decay=0.01) |
| Scheduler | Cosine with warmup (5% of total steps) |
| Gradient clipping | 1.0 |
| Precision | bfloat16 |
| Gradient checkpointing | Enabled |
| Hardware | 1x NVIDIA A100 80GB |
from transformers import AutoModelForVision2Seq, AutoProcessor
import torch
model = AutoModelForVision2Seq.from_pretrained(
"tshiamor/openvla-mcx-card",
torch_dtype=torch.bfloat16,
trust_remote_code=True
)
processor = AutoProcessor.from_pretrained("tshiamor/openvla-mcx-card", trust_remote_code=True)
action = model.predict_action(
image,
instruction="Pick up the blue block and place it on the target",
processor=processor
)
Base model
openvla/openvla-7b