·
AI & ML interests
None yet
Recent Activity
reacted to vincentg64's post with 🔥 3 days ago 96% Correct Next Token Prediction, with No DNN, no Training, auto-distilled model - https://mltblog.com/4urfvTB
Over the last 12 months, I’ve built a model to predict the next token and to suggest synonyms or related queries to a user prompt, with 100% correct predictions on the training set in one shot, without training or deep neural networks (DNNs). The same model is now integrated in some of the most recent LLM architectures, albeit with costly training via DNNs. My version does not need DNNs or training.
The purpose of this article is to provide validation to my deep neural network alternative in the context of LLMs. The new model is as a substitute to standard DNNs, with increased explainability and higher accuracy. It is designed for corporate corpuses. The end goal is to provide better accuracy at a much lower cost, while providing full control over all the components.
An interesting feature is auto-distillation, whereas the model self-identifies weights that do not contribute over time in 99.9% of user-generated prompts, and drop them, based on prompts from a large, specialized user base. The gain is most spectacular in open-weight LLMs applied to specialized contexts, whether based on DNNs or not.
Read article and download the free technical paper with NVIDIA case study, at https://mltblog.com/4urfvTB
View all activity Organizations
djuna/Q3-IIJAN-3B-Q8_0-GGUF
4B • Updated • 4
djuna/DeepSeek-R1-0528-Qwen3-8B-remap
Text Generation
• 8B • Updated • 1
djuna/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2-remap
Text Generation
• 15B • Updated • 3
• 2
djuna/DeepSeek-R1-Distill-Qwen-14B-abliterated-remap
Text Generation
• 15B • Updated • 1
djuna/MN-Chinofun-12B-4-4bit
Text Generation
• 2B • Updated • 7
djuna/TEST3-Q2.5-Lenned-14B-Q5_K_M-GGUF
15B • Updated • 5
djuna/TEST3-Q2.5-Lenned-14B
Text Generation
• 15B • Updated • 1
• 1
djuna/TEST2-Q2.5-Lenned-14B-Q5_K_M-GGUF
15B • Updated • 6
• 1
djuna/TEST2-Q2.5-Lenned-14B
Text Generation
• 15B • Updated • 8
• 4
djuna/TEST-Q2.5-Lenned-14B
Text Generation
• 15B • Updated • 1
• 1
Text Generation
• 12B • Updated • 5
• 4
djuna/MN-Chinofun-12B-4.1-Q6_K-GGUF
12B • Updated • 84
• 1
djuna/MN-Chinofun-12B-4.1
Text Generation
• 12B • Updated • 9
• • 7
djuna/MN-Chinofun-12B-4-Q6_K-GGUF
12B • Updated • 1
• 1
djuna/Q2.5-Veltha-14B-0.5-AWQ-4bit
15B • Updated • 1
djuna/TEST-Q2.5-AA-Q8_0-GGUF
9B • Updated • 3
Text Generation
• 15B • Updated • 47
• • 11
djuna/Q2.5-Veltha-14B-0.5
Text Generation
• 15B • Updated • 11
• • 11
djuna/Q2.5-Veltha-14B-0.5-Q5_K_M-GGUF
15B • Updated • 29
• 1
djuna/Q2.5-Veltha-14B-Q5_K_M-GGUF
15B • Updated • 1
• 1
djuna/MT-Gen3-gemma-2-9B-Flip-Q5_K_M-GGUF
9B • Updated • 7
djuna/G2-Nowing-9B-32K-YS
Text Generation
• 10B • Updated • 4
• 1
Text Generation
• 10B • Updated • 4
• 1
Text Generation
• 22B • Updated • 2
• 1
djuna/G2-GSHT-32K-Q6_K-GGUF
9B • Updated Text Generation
• 12B • Updated • 4
• 2
djuna/G2-Noranum-27B-Q3_K_S-GGUF
27B • Updated • 1
Text Generation
• 28B • Updated • 3
djuna/TEST-Ocerus-7B-Q5_K_M-GGUF
7B • Updated • 4
• 1
djuna/TEST-OcerusBeam-7B-Q5_K_M-GGUF
7B • Updated • 1
• 1