KuKu
dragonkue
AI & ML interests
anything.
Recent Activity
upvoted an article 9 days ago
Beyond LoRA: Can you beat the most popular fine-tuning technique? upvoted a paper 11 days ago
Training Sparse Mixture Of Experts Text Embedding Models liked a model 12 days ago
BCCard/MoAI-Embedding-4BOrganizations
Reranker Models
A collection of high-performance Korean reranker models, including those I have trained myself as well as other strong baselines
-
dragonkue/bge-reranker-v2-m3-ko
Text Ranking • 0.6B • Updated • 23.2k • 23 -
telepix/PIXIE-Spell-Reranker-Preview-0.6B
Text Ranking • 0.6B • Updated • 293 • 5 -
BAAI/bge-reranker-v2-m3
Text Classification • 0.6B • Updated • 16.2M • • 1.06k -
Qwen/Qwen3-Reranker-0.6B
Text Ranking • 0.6B • Updated • 2.15M • 368
Multi-modal Retrieval Models
Korean Sparse Retriever
Korean Embedding Models
A collection of high-performance Korean embedding models, including both models I trained myself and other publicly available strong baselines.
-
dragonkue/snowflake-arctic-embed-l-v2.0-ko
Sentence Similarity • 0.6B • Updated • 74.4k • • 48 -
dragonkue/BGE-m3-ko
Sentence Similarity • 0.6B • Updated • 800k • • 76 -
dragonkue/multilingual-e5-small-ko
Sentence Similarity • 0.1B • Updated • 3.88k • • 11 -
dragonkue/multilingual-e5-small-ko-v2
Sentence Similarity • 0.1B • Updated • 15.9k • • 4
Multilingual Embedding Models
A collection of multilingual embedding models suitable for use as training backbones
-
BAAI/bge-m3
Sentence Similarity • Updated • 32M • • 3.18k -
Snowflake/snowflake-arctic-embed-l-v2.0
Sentence Similarity • 0.6B • Updated • 915k • • 248 -
google/embeddinggemma-300m
Sentence Similarity • 0.3B • Updated • 1.53M • • 1.76k -
intfloat/multilingual-e5-large-instruct
Feature Extraction • 0.6B • Updated • 1.54M • • 629
Colbert (multi-vec)
-
dragonkue/colbert-ko-0.1b
Sentence Similarity • 0.1B • Updated • 249 • 4 -
LiquidAI/LFM2-ColBERT-350M
Sentence Similarity • 0.4B • Updated • 63.5k • 144 -
yjoonjang/colbert-ko-v1
Sentence Similarity • 0.1B • Updated • 17 • 16 -
mixedbread-ai/mxbai-edge-colbert-v0-32m
Sentence Similarity • 31.9M • Updated • 66.7k • • 45
Korean BERT
A collection of backbone models suitable for building Korean embedding or reranker models.
papers
Korean Embedding Models
A collection of high-performance Korean embedding models, including both models I trained myself and other publicly available strong baselines.
-
dragonkue/snowflake-arctic-embed-l-v2.0-ko
Sentence Similarity • 0.6B • Updated • 74.4k • • 48 -
dragonkue/BGE-m3-ko
Sentence Similarity • 0.6B • Updated • 800k • • 76 -
dragonkue/multilingual-e5-small-ko
Sentence Similarity • 0.1B • Updated • 3.88k • • 11 -
dragonkue/multilingual-e5-small-ko-v2
Sentence Similarity • 0.1B • Updated • 15.9k • • 4
Reranker Models
A collection of high-performance Korean reranker models, including those I have trained myself as well as other strong baselines
-
dragonkue/bge-reranker-v2-m3-ko
Text Ranking • 0.6B • Updated • 23.2k • 23 -
telepix/PIXIE-Spell-Reranker-Preview-0.6B
Text Ranking • 0.6B • Updated • 293 • 5 -
BAAI/bge-reranker-v2-m3
Text Classification • 0.6B • Updated • 16.2M • • 1.06k -
Qwen/Qwen3-Reranker-0.6B
Text Ranking • 0.6B • Updated • 2.15M • 368
Multilingual Embedding Models
A collection of multilingual embedding models suitable for use as training backbones
-
BAAI/bge-m3
Sentence Similarity • Updated • 32M • • 3.18k -
Snowflake/snowflake-arctic-embed-l-v2.0
Sentence Similarity • 0.6B • Updated • 915k • • 248 -
google/embeddinggemma-300m
Sentence Similarity • 0.3B • Updated • 1.53M • • 1.76k -
intfloat/multilingual-e5-large-instruct
Feature Extraction • 0.6B • Updated • 1.54M • • 629
Multi-modal Retrieval Models
Colbert (multi-vec)
-
dragonkue/colbert-ko-0.1b
Sentence Similarity • 0.1B • Updated • 249 • 4 -
LiquidAI/LFM2-ColBERT-350M
Sentence Similarity • 0.4B • Updated • 63.5k • 144 -
yjoonjang/colbert-ko-v1
Sentence Similarity • 0.1B • Updated • 17 • 16 -
mixedbread-ai/mxbai-edge-colbert-v0-32m
Sentence Similarity • 31.9M • Updated • 66.7k • • 45
Korean Sparse Retriever
Korean BERT
A collection of backbone models suitable for building Korean embedding or reranker models.