This is the KVLink5 model of the paper "KVLink: Accelerating LLMs via Efficient KV Cache Reuse."

Downloads last month
13
Safetensors
Model size
1B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Shiyu-Lab/Llama1B-KVLink5

Quantizations
1 model