Image Classification
Transformers
clip_multitask_classifier
clip
fashion
computer-vision
multi-task-learning
product-attribute-extraction
autocatalogai
Instructions to use mohsin416/autocatalogai-clip-multitask with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use mohsin416/autocatalogai-clip-multitask with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-classification", model="mohsin416/autocatalogai-clip-multitask") pipe("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hub/parrots.png")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("mohsin416/autocatalogai-clip-multitask", dtype="auto") - Notebooks
- Google Colab
- Kaggle
AutoCatalogAI CLIP Multi-Task Classifier
AutoCatalogAI is a fashion product attribute extraction model.
It predicts:
['gender', 'masterCategory', 'subCategory', 'articleType', 'baseColour', 'season', 'usage']
Dataset
Dataset: ashraq/fashion-product-images-small
Split:
- Train: 70%
- Validation: 15%
- Test: 15%
Base Model
openai/clip-vit-base-patch32
Architecture
CLIP image encoder + multiple classification heads.
Overall Test Metrics
{
"average_accuracy": 0.8335026038852995,
"average_macro_f1": 0.6568447666058456,
"average_weighted_f1": 0.8421526081109854,
"average_top3_accuracy": 0.9711087581303888,
"exact_match_accuracy": 0.27938284677053393,
"avg_inference_time_ms_per_image": 1.5960319117829298,
"test_samples": 6611
}
Important
This model should be loaded with the AutoCatalogAI project code because it contains custom multi-task classifier heads.
Expected files:
model.ptconfig.jsonlabel_maps.jsonmetrics.jsonREADME.md
- Downloads last month
- -