KuKu

dragonkue

12 94 186

Ash-Hun's profile picture

regisss's profile picture

dongbin777's profile picture

AI & ML interests

anything.

Recent Activity

upvoted an article 9 days ago

Beyond LoRA: Can you beat the most popular fine-tuning technique?

upvoted a paper 11 days ago

Training Sparse Mixture Of Experts Text Embedding Models

liked a model 12 days ago

BCCard/MoAI-Embedding-4B

View all activity

Organizations

dragonkue 's collections 8

papers

Is Position Bias in Dense Retrievers Built In-or Learned from Data?

Paper • 2605.26578 • Published May 26 • 20

Reranker Models

A collection of high-performance Korean reranker models, including those I have trained myself as well as other strong baselines

dragonkue/bge-reranker-v2-m3-ko

Text Ranking • 0.6B • Updated Apr 3, 2025 • 23.2k • 23
telepix/PIXIE-Spell-Reranker-Preview-0.6B

Text Ranking • 0.6B • Updated Apr 2 • 293 • 5
BAAI/bge-reranker-v2-m3

Text Classification • 0.6B • Updated Jun 24, 2024 • 16.2M • • 1.06k
Qwen/Qwen3-Reranker-0.6B

Text Ranking • 0.6B • Updated Apr 16 • 2.15M • 368

Multi-modal Retrieval Models

Qwen/Qwen3-VL-Embedding-8B

Sentence Similarity • 8B • Updated Apr 16 • 1.13M • 451
Qwen/Qwen3-VL-Embedding-2B

Sentence Similarity • 2B • Updated Apr 16 • 1M • • 425
Qwen/Qwen3-VL-Reranker-8B

Text Ranking • 9B • Updated Apr 16 • 466k • 154
Qwen/Qwen3-VL-Reranker-2B

Text Ranking • 2B • Updated Apr 16 • 296k • 201

Korean Sparse Retriever

telepix/PIXIE-Splade-Preview

Feature Extraction • 0.1B • Updated Sep 19, 2025 • 33 • 13
yjoonjang/splade-ko-v1

Feature Extraction • 0.1B • Updated Jan 17 • 2.07k • 16

Korean Embedding Models

A collection of high-performance Korean embedding models, including both models I trained myself and other publicly available strong baselines.

dragonkue/snowflake-arctic-embed-l-v2.0-ko

Sentence Similarity • 0.6B • Updated Oct 16, 2025 • 74.4k • • 48
dragonkue/BGE-m3-ko

Sentence Similarity • 0.6B • Updated Oct 16, 2025 • 800k • • 76
dragonkue/multilingual-e5-small-ko

Sentence Similarity • 0.1B • Updated Oct 16, 2025 • 3.88k • • 11
dragonkue/multilingual-e5-small-ko-v2

Sentence Similarity • 0.1B • Updated Oct 16, 2025 • 15.9k • • 4

Multilingual Embedding Models

A collection of multilingual embedding models suitable for use as training backbones

BAAI/bge-m3

Sentence Similarity • Updated Jul 3, 2024 • 32M • • 3.18k
Snowflake/snowflake-arctic-embed-l-v2.0

Sentence Similarity • 0.6B • Updated Jul 28, 2025 • 915k • • 248
google/embeddinggemma-300m

Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 1.53M • • 1.76k
intfloat/multilingual-e5-large-instruct

Feature Extraction • 0.6B • Updated Jul 10, 2025 • 1.54M • • 629

Colbert (multi-vec)

dragonkue/colbert-ko-0.1b

Sentence Similarity • 0.1B • Updated May 26 • 249 • 4
LiquidAI/LFM2-ColBERT-350M

Sentence Similarity • 0.4B • Updated 3 days ago • 63.5k • 144
yjoonjang/colbert-ko-v1

Sentence Similarity • 0.1B • Updated Nov 28, 2025 • 17 • 16
mixedbread-ai/mxbai-edge-colbert-v0-32m

Sentence Similarity • 31.9M • Updated Apr 15 • 66.7k • • 45

Korean BERT

A collection of backbone models suitable for building Korean embedding or reranker models.

skt/A.X-Encoder-base

Text Classification • 0.1B • Updated Jan 20 • 605 • • 29

papers

Is Position Bias in Dense Retrievers Built In-or Learned from Data?

Paper • 2605.26578 • Published May 26 • 20

Korean Embedding Models

A collection of high-performance Korean embedding models, including both models I trained myself and other publicly available strong baselines.

dragonkue/snowflake-arctic-embed-l-v2.0-ko

Sentence Similarity • 0.6B • Updated Oct 16, 2025 • 74.4k • • 48
dragonkue/BGE-m3-ko

Sentence Similarity • 0.6B • Updated Oct 16, 2025 • 800k • • 76
dragonkue/multilingual-e5-small-ko

Sentence Similarity • 0.1B • Updated Oct 16, 2025 • 3.88k • • 11
dragonkue/multilingual-e5-small-ko-v2

Sentence Similarity • 0.1B • Updated Oct 16, 2025 • 15.9k • • 4

Reranker Models

A collection of high-performance Korean reranker models, including those I have trained myself as well as other strong baselines

dragonkue/bge-reranker-v2-m3-ko

Text Ranking • 0.6B • Updated Apr 3, 2025 • 23.2k • 23
telepix/PIXIE-Spell-Reranker-Preview-0.6B

Text Ranking • 0.6B • Updated Apr 2 • 293 • 5
BAAI/bge-reranker-v2-m3

Text Classification • 0.6B • Updated Jun 24, 2024 • 16.2M • • 1.06k
Qwen/Qwen3-Reranker-0.6B

Text Ranking • 0.6B • Updated Apr 16 • 2.15M • 368

Multilingual Embedding Models

A collection of multilingual embedding models suitable for use as training backbones

BAAI/bge-m3

Sentence Similarity • Updated Jul 3, 2024 • 32M • • 3.18k
Snowflake/snowflake-arctic-embed-l-v2.0

Sentence Similarity • 0.6B • Updated Jul 28, 2025 • 915k • • 248
google/embeddinggemma-300m

Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 1.53M • • 1.76k
intfloat/multilingual-e5-large-instruct

Feature Extraction • 0.6B • Updated Jul 10, 2025 • 1.54M • • 629

Multi-modal Retrieval Models

Qwen/Qwen3-VL-Embedding-8B

Sentence Similarity • 8B • Updated Apr 16 • 1.13M • 451
Qwen/Qwen3-VL-Embedding-2B

Sentence Similarity • 2B • Updated Apr 16 • 1M • • 425
Qwen/Qwen3-VL-Reranker-8B

Text Ranking • 9B • Updated Apr 16 • 466k • 154
Qwen/Qwen3-VL-Reranker-2B

Text Ranking • 2B • Updated Apr 16 • 296k • 201

Colbert (multi-vec)

dragonkue/colbert-ko-0.1b

Sentence Similarity • 0.1B • Updated May 26 • 249 • 4
LiquidAI/LFM2-ColBERT-350M

Sentence Similarity • 0.4B • Updated 3 days ago • 63.5k • 144
yjoonjang/colbert-ko-v1

Sentence Similarity • 0.1B • Updated Nov 28, 2025 • 17 • 16
mixedbread-ai/mxbai-edge-colbert-v0-32m

Sentence Similarity • 31.9M • Updated Apr 15 • 66.7k • • 45

Korean Sparse Retriever

telepix/PIXIE-Splade-Preview

Feature Extraction • 0.1B • Updated Sep 19, 2025 • 33 • 13
yjoonjang/splade-ko-v1

Feature Extraction • 0.1B • Updated Jan 17 • 2.07k • 16

Korean BERT

A collection of backbone models suitable for building Korean embedding or reranker models.

skt/A.X-Encoder-base

Text Classification • 0.1B • Updated Jan 20 • 605 • • 29