Instructions to use Cyanex/DrDTI-Reasoner with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use Cyanex/DrDTI-Reasoner with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Cyanex/DrDTI-Reasoner")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("Cyanex/DrDTI-Reasoner", dtype="auto")

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use Cyanex/DrDTI-Reasoner with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Cyanex/DrDTI-Reasoner"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Cyanex/DrDTI-Reasoner",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/Cyanex/DrDTI-Reasoner

SGLang

How to use Cyanex/DrDTI-Reasoner with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "Cyanex/DrDTI-Reasoner" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Cyanex/DrDTI-Reasoner",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "Cyanex/DrDTI-Reasoner" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Cyanex/DrDTI-Reasoner",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Unsloth Studio new

How to use Cyanex/DrDTI-Reasoner with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for Cyanex/DrDTI-Reasoner to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for Cyanex/DrDTI-Reasoner to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for Cyanex/DrDTI-Reasoner to start chatting

Load model with FastModel

pip install unsloth
from unsloth import FastModel
model, tokenizer = FastModel.from_pretrained(
    model_name="Cyanex/DrDTI-Reasoner",
    max_seq_length=2048,
)

Docker Model Runner
How to use Cyanex/DrDTI-Reasoner with Docker Model Runner:
```
docker model run hf.co/Cyanex/DrDTI-Reasoner
```

🧬 DrDTI-Reasoner

A biomedical LLM fine-tuned for Drug–Target Interaction (DTI) prediction and reasoning.

Built on Meta LLaMA-3 8B (4-bit quantized) and adapted via LoRA (Low-Rank Adaptation) on structured bioactivity datasets, DrDTI-Reasoner is designed to support computational drug discovery, bioactivity prediction, and molecular reasoning research.

⚙️ Model Details

Property	Value
Base Model	meta-llama/Meta-Llama-3-8B (bnb-4bit quantized)
Fine-tuning Method	LoRA (Parameter-Efficient Fine-Tuning)
Framework	Unsloth + Hugging Face Transformers
Quantization	4-bit (bitsandbytes)
Domain	Drug Discovery / Computational Biology
Task	Drug–Target Interaction (DTI) reasoning
License	Apache 2.0

🔬 Capabilities

DrDTI-Reasoner is trained to handle the following tasks:

DTI Classification — Predict whether a drug interacts with a given protein target
Bioactivity Estimation — Classify compounds as Active or Inactive
Potency Approximation — Estimate pXC50 values where applicable
Molecular Interpretation — Parse and reason over SMILES molecular representations
Target Analysis — Interpret protein targets from UniProt IDs or gene names
Assay-Aware Reasoning — Incorporate assay metadata (mechanism, technology, mode) into predictions
Biological Explanation — Generate short, human-readable justifications for predictions

🧪 Input Format

The model expects structured biomedical input in the following format:

Drug (SMILES):       <SMILES string>
Target protein:      <UniProt ID or protein name>
Assay mechanism:     <optional>
Assay technology:    <optional>

Note: Input formatting quality directly affects prediction accuracy. Always validate SMILES strings and use standard UniProt identifiers when possible.

Example Input

Drug (SMILES):       CC1=CC=C(C=C1)S(=O)(=O)N
Target protein:      P00533 (EGFR)
Assay mechanism:     Inhibition
Assay technology:    Biochemical

📤 Output Format

The model returns structured predictions:

Active:  true | false
pXC50:   <numeric value> | null
Reason:  <short biological explanation of the predicted interaction>

Example Output

Active:  true
pXC50:   7.4
Reason:  The compound's sulfonamide group forms key hydrogen bonds with the EGFR
         ATP-binding pocket, suggesting moderate inhibitory activity consistent
         with the predicted pXC50.

📊 Training Data

DrDTI-Reasoner was trained on curated drug–target interaction datasets containing:

Molecular structures in SMILES format
Protein target identifiers (UniProt IDs and gene names)
Bioassay metadata (mechanism, technology, and mode)
Binary activity labels (active / inactive)
Potency values (pXC50) where available

Key sources include ChEMBL and BindingDB bioactivity databases.

📈 Evaluation Results

Evaluated on a held-out test split from the ChEMBL bioactivity dataset:

Metric	Value
Accuracy	0.84
F1 Score	0.82
ROC-AUC	0.89

⚠️ These are preliminary research metrics. Results may vary across target classes, assay types, and chemical scaffolds not well-represented in the training data. Scaffold-split benchmarking is planned for a future release.

⚠️ Limitations

Please read carefully before use:

Research only — This is not a clinical or regulatory-grade system
Probabilistic outputs — Predictions are model-generated estimates, not experimentally verified results
Statistical SMILES understanding — Molecular interpretation is learned from data, not from physical simulation or quantum chemistry
Format-sensitive — Prediction quality degrades with poorly formatted or non-standard inputs
Not medically validated — This model has not been assessed for safety or efficacy in any clinical context

🚀 Intended Use

DrDTI-Reasoner is designed for:

✅ Computational drug discovery research
✅ Bioactivity prediction experiments
✅ Machine learning benchmarking in cheminformatics
✅ Educational use in bioinformatics and AI
✅ Early-stage hit identification and prioritization workflows

Not intended for:

❌ Clinical diagnosis or treatment decisions
❌ Regulatory or pharmaceutical approval processes
❌ Any application directly impacting human health

🗺️ Roadmap

Improved molecular encoding — SMILES-aware tokenization and embedding strategies
Protein sequence integration — Embedding protein context via ESM models
Multi-task learning — Joint classification (active/inactive) and regression (pXC50) heads
Scaffold-based evaluation — Benchmarking on scaffold-split datasets to assess generalization
Calibrated confidence scores — Better-calibrated uncertainty estimates for pXC50 predictions

📦 Base Model & Framework

Component	Details
Base Model	meta-llama/Meta-Llama-3-8B (4-bit quantized)
PEFT Method	LoRA via Unsloth
Inference Framework	Hugging Face Transformers

Disclaimer: DrDTI-Reasoner is a research tool. All predictions should be interpreted in the context of existing literature and validated experimentally before any downstream use.

Downloads last month: -; Downloads are not tracked for this model. How to track

Model tree for Cyanex/DrDTI-Reasoner

Base model

meta-llama/Meta-Llama-3-8B

Adapter

(726)

this model

Dataset used to train Cyanex/DrDTI-Reasoner

Paper for Cyanex/DrDTI-Reasoner

LoRA: Low-Rank Adaptation of Large Language Models

Paper • 2106.09685 • Published Jun 17, 2021 • 61

Evaluation results

Accuracy on ChEMBL Bioactivity (held-out test set)
self-reported

0.840
F1 Score on ChEMBL Bioactivity (held-out test set)
self-reported

0.820
ROC-AUC on ChEMBL Bioactivity (held-out test set)
self-reported

0.890