3 3

Peter PRO

pworth1971

pworth1971

AI & ML interests

Language Models

Recent Activity

updated a Space 1 day ago

asg-ai/README

published a Space 4 days ago

asg-ai/README

updated a dataset 8 days ago

pworth1971/athena-ift

View all activity

Organizations

updated a Space 1 day ago

README

🚀

published a Space 4 days ago

README

🚀

updated a dataset 8 days ago

pworth1971/athena-ift

Viewer • Updated 8 days ago • 106k • 24

published a dataset 8 days ago

pworth1971/athena-ift

Viewer • Updated 8 days ago • 106k • 24

upvoted a paper 6 months ago

AthenaBench: A Dynamic Benchmark for Evaluating LLMs in Cyber Threat Intelligence

Paper • 2511.01144 • Published Nov 3, 2025 • 4

upvoted an article 6 months ago

Article

CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models

May 24, 2024

•

liked 2 models 9 months ago

trend-cybertron/Llama-Primus-Nemotron-70B-Instruct

Text Generation • 71B • Updated Aug 9, 2025 • 652 • 14

trendmicro-ailab/Llama-Primus-Merged

Text Generation • 8B • Updated Mar 4, 2025 • 282 • 14

upvoted a collection 10 months ago

REAL-MM-RAG-Bench

Collection

REAL-MM-RAG-Bench is a benchmark designed to evaluate multi-modal retrieval models under realistic and challenging conditions. • 4 items • Updated Mar 13, 2025 • 11

liked a model 12 months ago

fdtn-ai/Foundation-Sec-8B

Text Generation • 8B • Updated Aug 26, 2025 • 14.7k • • 302

Peter PRO

AI & ML interests

Recent Activity

Organizations

pworth1971's activity

README

README

CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models