An Empirical Study of DPO Configuration Choices for LLM Alignment
Jan Majkutewicz
jmajkutewicz
AI & ML interests
None yet
Recent Activity
updated a model 4 days ago: jmajkutewicz/Qwen3.5-9B-medadapt
updated a model 11 days ago: jmajkutewicz/Bielik-11B-v3.0-medadapt
updated a model 11 days ago: jmajkutewicz/gemma-4-E2B-medadapt
Organizations
None yet