Instructions to use gijl/ai with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use gijl/ai with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="gijl/ai", trust_remote_code=True)# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("gijl/ai", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use gijl/ai with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "gijl/ai" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "gijl/ai", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/gijl/ai
- SGLang
How to use gijl/ai with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "gijl/ai" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "gijl/ai", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "gijl/ai" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "gijl/ai", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use gijl/ai with Docker Model Runner:
docker model run hf.co/gijl/ai
Brain Map AI v3.0 โ Clinical Intelligence Model
Brain Map AI is a specialized medical reasoning model optimized for clinical decision support, USMLE-style reasoning, and bilingual (English/Arabic) medical tasks.
๐ Deployment Status
The model is officially live on the gijl/ai repository. It features a technical stack including PyTorch, Hugging Face Transformers, and a specialized GGUF compatibility layer.
Key Features
- Architecture: Specialized Transformer with Grouped Query Attention (GQA) and Sparse Mixture of Experts (MoE) thinking blocks.
- Memory: Adaptive clinical memory using secure JSON/Safetensors storage (migrated from pickle for maximum security).
- Safety: Built-in Sovereign Critic and Drug Dosage Validation layers.
- Compatibility: Cross-compatible with Hugging Face Transformers and GGUF (via llama.cpp/LM Studio).
Technical Specifications
- Core: PyTorch 2.x / Transformers 5.x
- Precision: F16 (4-bit/8-bit quantization recommended for consumer VRAM optimization).
- Tokenizer: Custom BPE (8k-32k vocab) supporting medical nomenclature in EN/AR.
Usage
1. Using the Universal Loader (Recommended)
We provide a loader.py script for seamless integration across formats:
from loader import load_brain_map
model = load_brain_map('gijl/ai')
2. Standard Transformers Loading
from transformers import AutoModelForCausalLM, AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained('gijl/ai')
model = AutoModelForCausalLM.from_pretrained('gijl/ai', trust_remote_code=True)
Unresolved Tasks & Next Steps
- VRAM Optimization: Further quantization (GGUF Q4_K_M) for mobile deployments.
- Extended Evaluation: Continued benchmarking against MedHALT and USMLE Step 2 vignettes.
Disclaimer: This model is for clinical research and decision-support assistance only. It is not a replacement for professional medical advice.
- Downloads last month
- 96