Are there any current methods to speed up inference?
1
#5 opened about 2 months ago
by
zhouchongqin
Could you please provide a example of the full retrieval pipeline?
1
#4 opened 2 months ago
by
AaronWho
Provide examples of other modalities to embedding
3
#3 opened 2 months ago
by
uukoala
support vllm?
1
#2 opened 3 months ago
by
shuowang
Errors when using transformers 4.57.1, Load model by AutoModel
2
#1 opened 3 months ago
by
heyanzhuo