tuxsentience
Collection
ACCURACY IS PRIORITY
•
2 items
•
Updated
Our second open-weight model, in progress. For now this documents progress and details.
It has been decided that this will be based off Qwen3 8B.
It will like the last one most likely be 4-bit, but due to our new training methods (detailed below) we may release larger sizes.
We are attempting to train this model via distributed computing, this is how our current setup looks so far:
Amounting to around 98.47 TFLOPS.

In the future we are trying to aquire better hardware and a RX 9070 XT is planned for future models. Currently we are attempting unsloth + ray for distributed computing.
Coming soon to an accuracy near you