28 15 76

Nikita Kezins

entfane

AI & ML interests

LLM post-training, adversarial training, safety, knowledge transfer

Recent Activity

updated a model 17 minutes ago

entfane/Toxic_Llama8B

published a model 20 minutes ago

entfane/Toxic_Llama8B

published a model about 14 hours ago

entfane/Toxic-Llama8B

View all activity

Organizations

Collections 2

spaces 3

Gpt2 Harmful Classifier

🚀

Gpt2 Harmful Classifier

🚀

Visualize token scores from a GPT-2 classifier

Math Virtuoso

🧮

Ask math questions and get detailed answers

models 24

datasets 12

entfane/violent_eval

Viewer • Updated 10 days ago • 22.4k • 76

entfane/harmful_subsets

Viewer • Updated 12 days ago • 571k • 33

entfane/preprocessed_toxigen

Viewer • Updated 16 days ago • 10.1k • 114

entfane/toxic_classification

Viewer • Updated 16 days ago • 38.9k • 25

entfane/toxic_chat

Viewer • Updated Mar 1 • 1.25M • 17

entfane/EmotionAtlas-chat

Viewer • Updated Jun 1, 2025 • 3.3k • 9

entfane/EmotionAtlas

Viewer • Updated Jun 1, 2025 • 3.3k • 11

entfane/professor-mathematics

Viewer • Updated Apr 17, 2025 • 64.2k • 9 • 1

entfane/psychotherapy-dpo

Viewer • Updated Mar 30, 2025 • 168 • 12 • 4

entfane/psychotherapy_prompts

Viewer • Updated Mar 30, 2025 • 168 • 5

View 12 datasets

Nikita Kezins

AI & ML interests

Recent Activity

Organizations

Collections 2

spaces 3 Sort: Recently updated

Gpt2 Harmful Classifier

Gpt2 Harmful Classifier

Math Virtuoso

models 24 Sort: Recently updated

datasets 12 Sort: Recently updated

spaces 3

models 24

datasets 12