Zhi Zheng
neoz
AI & ML interests
LLM, NLP
Organizations
Update config.json
#27 opened about 1 year ago
by
kansalsayamvw
vLLM Error: Model architectures ['MiniCPM3ForCausalLM'] are not supported for now.
👀
1
5
#7 opened about 1 year ago
by
HaoyuHuang
TypeError: PreTrainedTokenizerFast._batch_encode_plus() got an unexpected keyword argument 'tools'
5
#22 opened about 1 year ago
by
sankexin
How to use infinity long context with LLMxMapReduce?
2
#14 opened about 1 year ago
by
lixiangtian
安装vllm出现bug
1
#15 opened about 1 year ago
by
sanwuge
Adding `safetensors` variant of this model
#11 opened about 1 year ago
by
SFconvertbot
用llama.cpp部署时无法启用flashattn
1
#5 opened about 1 year ago
by
gimling
function calling dataset
1
#6 opened about 1 year ago
by
lucyknada
Base model release
1
#10 opened about 1 year ago
by
siberiamark
Call for GGUF , If possible~~
3
#12 opened about 1 year ago
by
Jason233
不支持sglang
2
#18 opened about 1 year ago
by
zhangdahaodaddy
如何进行batch inference?
1
#19 opened about 1 year ago
by
Rebelliousgang
add library tag
#8 opened about 1 year ago
by
davanstrien
generating extremely slow, compared to 4k length model
2
#4 opened over 1 year ago
by
CHNtentes
Slow inference speed and high VRAM using huggingface transformers
2
#2 opened over 1 year ago
by
Starlento