Having issues with repetition.
#2
by
rdsm - opened
Any one else having issues with the model repeating it self?
After some time deployed the model started repeating itself "The !!!!!!!!!!!!!!!!!(... continues indefinitely...)"
$ curl -s -X POST "http://[my internal url]/v1/chat/completions" \
-H "Content-Type: application/json" \
-d '{
"model": "nvidia/Kimi-K2.5-NVFP4",
"messages": [{"role": "user", "content": "Hello! What can you do?"}],
"max_tokens": 100,
"temperature": 0.7
}' | python -m json.tool
{
"id": "chatcmpl-960b0d2b9bd89f72",
"object": "chat.completion",
"created": 1771856932,
"model": "nvidia/Kimi-K2.5-NVFP4",
"choices": [
{
"index": 0,
"message": {
"role": "assistant",
"content": " !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!",
"refusal": null,
"annotations": null,
"audio": null,
"function_call": null,
"tool_calls": [],
"reasoning": " !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!"
},
"logprobs": null,
"finish_reason": "length",
"stop_reason": null,
"token_ids": null
}
],
"service_tier": null,
"system_fingerprint": null,
"usage": {
"prompt_tokens": 33,
"total_tokens": 133,
"completion_tokens": 100,
"prompt_tokens_details": null
},
"prompt_logprobs": null,
"prompt_token_ids": null,
"kv_transfer_params": null
}
Hardware: B200s
The same issue~~~
Any update?
No luck, reverted the deploy back to the moonshot version.