Djuunaa's picture

Djuunaa

djuna

·

AI & ML interests

None yet

Recent Activity

new activity 1 day ago

prithivMLmods/PiD-Image-Upscaler:Download button for png output

upvoted a collection 1 day ago

reacted to vincentg64's post with 🔥 3 days ago

96% Correct Next Token Prediction, with No DNN, no Training, auto-distilled model - https://mltblog.com/4urfvTB Over the last 12 months, I’ve built a model to predict the next token and to suggest synonyms or related queries to a user prompt, with 100% correct predictions on the training set in one shot, without training or deep neural networks (DNNs). The same model is now integrated in some of the most recent LLM architectures, albeit with costly training via DNNs. My version does not need DNNs or training. The purpose of this article is to provide validation to my deep neural network alternative in the context of LLMs. The new model is as a substitute to standard DNNs, with increased explainability and higher accuracy. It is designed for corporate corpuses. The end goal is to provide better accuracy at a much lower cost, while providing full control over all the components. An interesting feature is auto-distillation, whereas the model self-identifies weights that do not contribute over time in 99.9% of user-generated prompts, and drop them, based on prompts from a large, specialized user base. The gain is most spectacular in open-weight LLMs applied to specialized contexts, whether based on DNNs or not. Read article and download the free technical paper with NVIDIA case study, at https://mltblog.com/4urfvTB

View all activity

Organizations

djuna 's models 100

djuna/Q3-IIJAN-3B-Q8_0-GGUF

4B • Updated Aug 12, 2025 • 4

djuna/DeepSeek-R1-0528-Qwen3-8B-remap

Text Generation • 8B • Updated May 30, 2025 • 1

djuna/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2-remap

Text Generation • 15B • Updated Mar 11, 2025 • 3 • 2

djuna/DeepSeek-R1-Distill-Qwen-14B-abliterated-remap

Text Generation • 15B • Updated Mar 11, 2025 • 1

djuna/MN-Chinofun-12B-4-4bit

Text Generation • 2B • Updated Mar 11, 2025 • 7

djuna/TEST3-Q2.5-Lenned-14B-Q5_K_M-GGUF

15B • Updated Feb 17, 2025 • 5

djuna/TEST3-Q2.5-Lenned-14B

Text Generation • 15B • Updated Feb 17, 2025 • 1 • 1

djuna/TEST2-Q2.5-Lenned-14B-Q5_K_M-GGUF

15B • Updated Feb 17, 2025 • 6 • 1

djuna/TEST2-Q2.5-Lenned-14B

Text Generation • 15B • Updated Feb 17, 2025 • 8 • 4

djuna/TEST-Q2.5-Lenned-14B

Text Generation • 15B • Updated Jan 28, 2025 • 1 • 1

djuna/MN-Chinofun-12B-4

Text Generation • 12B • Updated Jan 27, 2025 • 5 • 4

djuna/MN-Chinofun-12B-4.1-Q6_K-GGUF

12B • Updated Jan 26, 2025 • 84 • 1

djuna/MN-Chinofun-12B-4.1

Text Generation • 12B • Updated Jan 26, 2025 • 9 • • 7

djuna/MN-Chinofun-12B-4-Q6_K-GGUF

12B • Updated Jan 26, 2025 • 1 • 1

djuna/Q2.5-Veltha-14B-0.5-AWQ-4bit

15B • Updated Jan 19, 2025 • 1

djuna/TEST-Q2.5-AA-Q8_0-GGUF

9B • Updated Jan 8, 2025 • 3

djuna/Q2.5-Veltha-14B

Text Generation • 15B • Updated Dec 23, 2024 • 47 • • 11

djuna/Q2.5-Veltha-14B-0.5

Text Generation • 15B • Updated Dec 22, 2024 • 11 • • 11

djuna/Q2.5-Veltha-14B-0.5-Q5_K_M-GGUF

15B • Updated Dec 22, 2024 • 29 • 1

djuna/Q2.5-Veltha-14B-Q5_K_M-GGUF

15B • Updated Dec 22, 2024 • 1 • 1

djuna/MT-Gen3-gemma-2-9B-Flip-Q5_K_M-GGUF

9B • Updated Dec 19, 2024 • 7

djuna/G2-Nowing-9B-32K-YS

Text Generation • 10B • Updated Dec 12, 2024 • 4 • 1

djuna/G2-Nowing-9B

Text Generation • 10B • Updated Dec 12, 2024 • 4 • 1

djuna/MS-Nudion-22B

Text Generation • 22B • Updated Dec 9, 2024 • 2 • 1

djuna/G2-GSHT-32K-Q6_K-GGUF

9B • Updated Dec 6, 2024

djuna/MN-Chinofun-12B-3

Text Generation • 12B • Updated Dec 6, 2024 • 4 • 2

djuna/G2-Noranum-27B-Q3_K_S-GGUF

27B • Updated Dec 2, 2024 • 1

djuna/G2-Noranum-27B

Text Generation • 28B • Updated Dec 2, 2024 • 3

djuna/TEST-Ocerus-7B-Q5_K_M-GGUF

7B • Updated Dec 1, 2024 • 4 • 1

djuna/TEST-OcerusBeam-7B-Q5_K_M-GGUF

7B • Updated Dec 1, 2024 • 1 • 1