Activity Feed

AI & ML interests

None defined yet.

Recent Activity

FlameF0X updated a model about 5 hours ago
IndexLM/Index-pt
FlameF0X updated a dataset 1 day ago
IndexLM/Index-SFT
FlameF0X updated a Space 1 day ago
IndexLM/README

FlameF0X
updated a Space 1 day ago
FlameF0X
published a Space 1 day ago
FlameF0X
posted an update 9 months ago
Post
4333
I am very sad to say that the budget for creating the SnowflakeCore-G1 1B and 7B MoE models has run out, so I can't pre-train them anymore.
  • 7 replies
FlameF0X
posted an update 9 months ago
Post
786
Training for SnowflakeCore-G1-1B and 7B will be restarted, because I have now implemented DeepSpeed and managed to use two GPUs.
FlameF0X
posted an update 9 months ago
Post
277
The development of SnowflakeCore-G1-7B-MoE is getting delayed. In the meantime I am working on SnowflakeCore-G1-1B-MoE, which will be a pre-trained chatbot.
  • 1 reply
FlameF0X
posted an update 9 months ago
Post
2958
On the development of SnowflakeCore-G1-7B-MoE: I can't say yet when it will be published, because it's big and requires a lot of computational power.
  • 1 reply
FlameF0X
posted an update 10 months ago
FlameF0X
posted an update 10 months ago
Post
315
Hello! Important announcement: I will rename SnowflakeCore-G1-Medium to SnowflakeCore-G1-Tiny2, because it will have the same number of parameters as the Tiny version but is trained on more data.
  • 1 reply
FlameF0X
posted an update 10 months ago
Post
747
Currently working on SnowflakeCore-G1-Medium. [Updated loss curve]
  • 3 replies
FlameF0X
posted an update 10 months ago
FlameF0X
posted an update 10 months ago
FlameF0X
posted an update 10 months ago
FlameF0X
posted an update 10 months ago
Post
258
SnowflakeCore-G1 Update:
Got it running and training! Context window is currently set to 2048 tokens.
Training is active and stable. Will share results once I have some metrics to report.
  • 2 replies
FlameF0X
posted an update 10 months ago
Post
1940
SnowflakeCore-G1 development update: We're building a 24-layer transformer with 32K context and 1024 embedding dimensions - pretty ambitious! Even running at batch_size=1 with heavy gradient accumulation, we're hitting memory walls at 300GB RAM. Scaling up to ~1TB will take some time, but the architecture is looking promising. Thanks for following along with the journey! 😅
  • 1 reply
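
A rough sketch of why a 32K context can exhaust memory even at batch_size=1, assuming attention is computed naively and the full score matrices are kept for the backward pass. The layer count and context length come from the post; the head count (16) and 32-bit floats are assumptions made only for illustration, since the post does not state them. Weights, optimizer state, and other activations are ignored, so this is an order-of-magnitude check and not a measurement of the actual run.

```python
# Back-of-the-envelope estimate of attention-score memory for one sequence.
# LAYERS and SEQ_LEN come from the post; NUM_HEADS and BYTES_FP32 are
# assumptions for illustration only.

def attention_score_bytes(seq_len: int, num_heads: int, bytes_per_value: int) -> int:
    """Bytes needed to hold one layer's seq_len x seq_len score matrices."""
    return seq_len * seq_len * num_heads * bytes_per_value

LAYERS = 24            # from the post
SEQ_LEN = 32 * 1024    # "32K context", read here as 32,768 tokens
NUM_HEADS = 16         # assumption: not stated in the post
BYTES_FP32 = 4         # assumption: scores stored as 32-bit floats

GIB = 1024 ** 3
per_layer = attention_score_bytes(SEQ_LEN, NUM_HEADS, BYTES_FP32)
all_layers = per_layer * LAYERS

print(f"per layer:  {per_layer / GIB:.0f} GiB")   # 64 GiB
print(f"all layers: {all_layers / GIB:.0f} GiB")  # 1536 GiB
```

Because the score matrices grow with the square of the sequence length, gradient accumulation does not reduce this term: it shrinks the batch, not the per-sequence cost. Under these assumptions, shortening the context or using an attention kernel that never materializes the full score matrix would be the levers that matter.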
FlameF0X
posted an update 11 months ago
Post
1154
Hello there!
I just found out that all the SnowflakeCore-G0 series models are masked language models instead of LLMs.
The development of SnowflakeCore-G0-Release-3 will be delayed even more.

Edit: I am officially ending the development of SnowflakeCore-G0 and starting the development of SnowflakeCore-G1, which SHOULD be a text generator.

Edit-2: After some evaluation of the code, the models are actually text generators, so the development of G0 will continue.