Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
Banaxi-Tech 
posted an update 2 days ago
Post
10490
A new model is coming!
Its going to take a long time on my 5070 Ti so expect a release in ~1 month.
We think this model is going to be SOTA For its size.
Our Mini Version will be 25M Parameters and Pro with 140M.
The Pro version has a 3072 Context Window (Extensible to up to 6K with RoPE) And the Mini version has a context window of 4096 (Up to 8K with RoPE)
Meanwhile we are currently working on a Instruct Version of our BananaMind 1.5 Base.

The training will start this weekend

We are very exited to release it when its done!

Impressive! Although, I am curious how powerful the model will be! I have followed so I will see when you release it!

This will be very interesting

oh! i saw this for first time. what's it? a model like NanoBanana? what's it goal?

·

oh! i saw this for first time. what's it? a model like NanoBanana? what's it goal?

LLM

Interesting. Will it be a classic autoregressive architecture, or something new?

·

classic autoregressive architecture