Instructions to use King-8/help-classifier-v2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use King-8/help-classifier-v2 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="King-8/help-classifier-v2")# Load model directly from transformers import AutoTokenizer, AutoModelForSequenceClassification tokenizer = AutoTokenizer.from_pretrained("King-8/help-classifier-v2") model = AutoModelForSequenceClassification.from_pretrained("King-8/help-classifier-v2") - Notebooks
- Google Colab
- Kaggle
# Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("King-8/help-classifier-v2")
model = AutoModelForSequenceClassification.from_pretrained("King-8/help-classifier-v2")- ๐ค Help Classifier Model (v2)
- ๐ง Overview
- ๐ Version Update (v1 โ v2)
- ๐งฉ Task Definition
- ๐ท๏ธ Labels
- ๐๏ธ Model Architecture
- โ๏ธ Training Configuration
- ๐ Training Results
- ๐ Performance Summary
- ๐งช Example Predictions
- ๐ System Integration
- ๐ฏ Use Cases
- โ ๏ธ Limitations
- ๐ฎ Future Improvements
- ๐ค Author
- ๐ง Overview
- help-classifier-v2
๐ค Help Classifier Model (v2)
๐ง Overview
The Help Classifier Model (v2) is a fine-tuned NLP model designed to classify student help requests into meaningful categories within a collaborative learning environment.
This model is part of a larger AI system built for the Coding in Color (CIC) ecosystem, supporting students working across domains such as AI development, game development, 2D/3D art, and robotics.
Its primary purpose is to:
- Interpret real student messages
- Identify intent behind help requests
- Route inputs to appropriate downstream systems (e.g., generators, agents)
๐ Version Update (v1 โ v2)
๐น v1
- Trained on ~100 examples
- Limited generalization
- Struggled with messy or informal input
๐น v2 (Current)
Trained on 1,000 examples
Balanced dataset across all categories
Strong performance on:
- informal/slang input
- mixed tone messages
- ambiguous phrasing
- real CIC-style check-ins
๐ v2 significantly improves accuracy, stability, and real-world usability
๐งฉ Task Definition
Task Type: Text Classification
Input: Student message Output: One of 5 help categories
๐ท๏ธ Labels
| Label | Description |
|---|---|
learning_help |
User is trying to understand a concept or skill |
project_help |
User needs direction or next steps in a project |
technical_issue |
Something is broken or not working |
attendance_issue |
User missed a meeting or needs to catch up |
general_guidance |
User expresses uncertainty, stress, or needs advice |
๐๏ธ Model Architecture
- Base Model: distilbert-base-uncased
- Fine-tuned for sequence classification
- Number of labels: 5
โ๏ธ Training Configuration
- Epochs: 4
- Learning Rate: 2e-5
- Batch Size: 8
- Weight Decay: 0.01
- Train/Validation Split: 80/10/10
๐ Training Results
| Epoch | Training Loss | Validation Loss |
|---|---|---|
| 1 | 0.552 | 0.512 |
| 2 | 0.111 | 0.122 |
| 3 | 0.032 | 0.077 |
| 4 | 0.025 | 0.064 |
๐ Performance Summary
Low validation loss (~0.06)
Strong generalization across unseen inputs
Stable convergence during training
Handles:
- messy/slang text
- indirect requests
- multi-layered inputs
๐งช Example Predictions
Input:
i missed the meeting and now idk what weโre doing
Output:
attendance_issue
Input:
my model works but the predictions are weird and I donโt know why
Output:
technical_issue
Input:
I feel like Iโm behind and donโt know what to focus on
Output:
general_guidance
๐ System Integration
This model is integrated into an MCP (Model Context Protocol) system where it acts as:
Entry-point classifier for routing student inputs
Pipeline example:
User Input โ Help Classifier โ (Future: Generator / Summarizer)
๐ฏ Use Cases
- Help request classification
- Slack/Discord message routing
- Educational AI assistants
- CIC ecosystem tools
- AI agent pipelines
โ ๏ธ Limitations
- Single-label classification (some messages may contain multiple intents)
- Edge cases may still overlap between categories
- Domain-specific (focused on student tech environments)
๐ฎ Future Improvements
- Multi-label classification
- Larger dataset (2,000+ examples)
- Confidence scoring
- Integration with response generation models
- Continuous retraining with real user data
๐ค Author
Created by Kingston Lewis as part of the Coding in Color program for the AI Dev team.
help-classifier-v2
This model is a fine-tuned version of distilbert-base-uncased on the King-8/help-request-messages-v2 dataset. It achieves the following results on the evaluation set:
- Loss: 0.0643
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- lr_scheduler_type: linear
- num_epochs: 4
- mixed_precision_training: Native AMP
Training results
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.5524 | 1.0 | 88 | 0.5124 |
| 0.1114 | 2.0 | 176 | 0.1221 |
| 0.0324 | 3.0 | 264 | 0.0771 |
| 0.0249 | 4.0 | 352 | 0.0643 |
Framework versions
- Transformers 5.0.0
- Pytorch 2.10.0+cpu
- Datasets 4.0.0
- Tokenizers 0.22.2
- Downloads last month
- 5
Model tree for King-8/help-classifier-v2
Base model
distilbert/distilbert-base-uncased
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="King-8/help-classifier-v2")