佐藤魁星's picture

佐藤魁星

itounagi0116

·

https://t.co/9b6MU5U71K

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

kizuna-intelligence/Irodori-TTS-500M-v2-duration-control

liked a model 8 days ago

sbintuitions/sarashina2.2-tts

liked a model 27 days ago

tencent/HY-World-2.0

View all activity

Organizations

None yet

upvoted an article 5 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

burtenshaw, evalstate

•

Dec 4, 2025

• 624

upvoted 2 collections 8 months ago

EmbeddingGemma

3 items • Updated Mar 12 • 118

japanese-reranker

日本語rerankerシリーズ • 10 items • Updated 3 days ago • 3

upvoted a collection about 1 year ago

Perception Encoder

16 items • Updated Mar 2 • 81

upvoted an article about 1 year ago

Article

Mixture of Experts Explained

+4

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 1.13k

upvoted 4 collections about 1 year ago

Phi-4 (All Versions)

Microsoft's Phi-4 models including Reasoning + Reasoning Plus & mini. Includes Dynamic 2.0 GGUF, 4-bit & 16-bit versions. Includes Unsloth's bug fixes • 20 items • Updated 22 days ago • 82

Canary-TTS

10 items • Updated Mar 2 • 3

Reasoning Vector

Reasoningモデルとベースモデルの重み差分 • 4 items • Updated Feb 18, 2025 • 3

Dart v2 (Danbooru Tags Transformer v2)

LLMs for generating danbooru tags. • 9 items • Updated Mar 2 • 2

upvoted 3 collections over 1 year ago

DeepSeek-R1

10 items • Updated Nov 27, 2025 • 845

TinySwallow

Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models" • 5 items • Updated Jan 30, 2025 • 18

qwen2.5-bakeneko

The bakeneko model series are based on the qwen2.5 series and have been continually pre-trained on Japanese-specific corpora. • 21 items • Updated Aug 26, 2025 • 11

upvoted 2 articles over 1 year ago

Article

Open R1: Update #2

open-r1

•

Feb 10, 2025

• 218

Article

Open-source DeepResearch – Freeing our search agents

+3

m-ric, albertvillanova, merve, thomwolf, clefourrier

•

Feb 4, 2025

• 1.32k

upvoted a collection over 1 year ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 159

upvoted an article over 1 year ago

Article

Use Models from the Hugging Face Hub in LM Studio

yagilb

•

Nov 28, 2024

• 144

upvoted a collection almost 2 years ago

LLM Compiler

Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27, 2024 • 157

upvoted a collection about 2 years ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 25 items • Updated Mar 2 • 580