SamuZai
Nguyen Le Minh


Sugoi LLM 14B Ultra - Download links

The magic of the original FP16 14B model is now fully unleashed in Sugoi 14B Ultra! I have pushed the limits to retain almost every ounce of quality from the source model – and the result? Translation performance more than 1.5 times that of the previous quantized Sugoi 14B version! (BLEU score of 21.38 vs 13.67)
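To put the two BLEU scores quoted above side by side, here is a minimal sketch of the arithmetic. Only the two scores come from the post; the function and variable names are made up for illustration.

```python
# BLEU scores quoted in the post.
BLEU_ULTRA = 21.38     # Sugoi 14B Ultra
BLEU_PREVIOUS = 13.67  # previous quantized Sugoi 14B


def relative_gain(new: float, old: float) -> tuple[float, float]:
    """Return (ratio, percent_increase) of `new` relative to `old`."""
    ratio = new / old
    return ratio, (ratio - 1.0) * 100.0


ratio, percent = relative_gain(BLEU_ULTRA, BLEU_PREVIOUS)
print(f"ratio: {ratio:.2f}x, increase: {percent:.1f}%")
```

With these numbers the ratio comes out at about 1.56, i.e. a BLEU increase of roughly 56%.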

But that’s not all – Sugoi 14B Ultra isn’t just accurate, it’s smart. With prompt-following skills rivaling the Qwen 2.5 base model, it’s ready to translate even text with lots of brackets (commonly seen in RPGM games) with unmatched precision.

Instructions:

https://blog.sugoitoolkit.com/sugoi-llm-14b-ultra/

Download links:

https://sugoi-file.sfo3.cdn.digitaloceanspaces.com/Sugoi-14B-Ultra-Q4_K_M.gguf
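If you prefer to fetch the file from a script rather than the browser, a rough sketch using only the Python standard library could look like this. The URL is the one above; the helper names and the `models` folder are assumptions for illustration, not part of Sugoi Toolkit.

```python
import os
import urllib.request
from urllib.parse import urlparse

MODEL_URL = "https://sugoi-file.sfo3.cdn.digitaloceanspaces.com/Sugoi-14B-Ultra-Q4_K_M.gguf"


def filename_from_url(url: str) -> str:
    """Use the last path component of the URL as the local file name."""
    return os.path.basename(urlparse(url).path)


def download_model(url: str, dest_dir: str = ".") -> str:
    """Download the file into dest_dir (hypothetical helper).

    Skips the download if a file with the same name already exists,
    and returns the local path either way.
    """
    os.makedirs(dest_dir, exist_ok=True)
    dest = os.path.join(dest_dir, filename_from_url(url))
    if not os.path.exists(dest):
        urllib.request.urlretrieve(url, dest)
    return dest


# Usage (commented out because the model file is a large download):
# path = download_model(MODEL_URL, dest_dir="models")
```

The existence check is only a convenience so that re-running the script does not download the file again; it does not verify that an earlier download completed.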

Comments

If there is more demand from users, I'll consider it. Right now there are fewer than 3 requests like yours :)

Nguyen Le Minh

Are we able to get the non-quantized version? GGUF does not play nicely with certain hosts (particularly vLLM, which is blazing fast)

Pilaxiv724

Can you post this in the Sugoi Toolkit tech support channel along with some cmd screenshots? I'll have a look.

Nguyen Le Minh

Great model. I got this working in LM Studio but am having trouble linking it with Sugoi Toolkit v12.5. Any chance of an updated/dedicated guide?

Arthur Kord

You can quantize from the FP16 versions yourself with the llama.cpp tools. I made a Q6 14B version about a month ago.

Amazing Flapples

Is there any chance we can get a less quantized model? Q4 kinda hurts.

Gerald Gantos

Hype

Amazing Flapples

