tool calls sometimes fail on vllm
#19 opened about 1 month ago by viktara
flash_attn_2 not supported!
#17 opened about 1 month ago by bdsqlsz
chatllm.cpp adds support for this model
👍 7 · 5 comments
#16 opened about 1 month ago by J22
possible bug in processing_step3.py?
2 comments
#15 opened about 1 month ago by J22
Quantisation support for this model
❤️ 4 · 5 comments
#14 opened about 1 month ago by dineshananthi
Update processing_step3.py
#13 opened about 1 month ago by jakubstrawa
Installation Video and Testing - Step by Step
👍 6
#12 opened about 1 month ago by fahdmirzac
is it possible to provide an FP8 version in addition to BF16
🔥 8 · 1 comment
#10 opened about 1 month ago by water258
Incorrect eos_token_id in config causing infinite generation
1 comment
#9 opened about 1 month ago by WenyaLi
perfect
🤗 15 · 2 comments
#8 opened about 1 month ago by ares2324
Doesn't work in OpenAI Streaming Interface
2 comments
#7 opened about 1 month ago by pytokusu
Add pipeline tag and library name to metadata
#6 opened about 1 month ago by nielsr
vLLM is not working
11 comments
#4 opened about 1 month ago by wei01
Typo in paper
1 comment
#3 opened about 1 month ago by hgsg
Llama.cpp support
👍 🔥 13 · 2 comments
#2 opened about 1 month ago by tcpmux
Are there recommended hyperparameters for reproduction?
1 comment
#1 opened about 1 month ago by JjjjjZzz