Having issues with repetition.

#2
by rdsm - opened

Any one else having issues with the model repeating it self?
After some time deployed the model started repeating itself "The !!!!!!!!!!!!!!!!!(... continues indefinitely...)"

$ curl -s -X POST "http://[my internal url]/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "nvidia/Kimi-K2.5-NVFP4",
    "messages": [{"role": "user", "content": "Hello! What can you do?"}],
    "max_tokens": 100,
    "temperature": 0.7
  }' | python -m json.tool
{
    "id": "chatcmpl-960b0d2b9bd89f72",
    "object": "chat.completion",
    "created": 1771856932,
    "model": "nvidia/Kimi-K2.5-NVFP4",
    "choices": [
        {
            "index": 0,
            "message": {
                "role": "assistant",
                "content": " !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!",
                "refusal": null,
                "annotations": null,
                "audio": null,
                "function_call": null,
                "tool_calls": [],
                "reasoning": " !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!"
            },
            "logprobs": null,
            "finish_reason": "length",
            "stop_reason": null,
            "token_ids": null
        }
    ],
    "service_tier": null,
    "system_fingerprint": null,
    "usage": {
        "prompt_tokens": 33,
        "total_tokens": 133,
        "completion_tokens": 100,
        "prompt_tokens_details": null
    },
    "prompt_logprobs": null,
    "prompt_token_ids": null,
    "kv_transfer_params": null
}

Hardware: B200s

The same issue~~~
Any update?

No luck, reverted the deploy back to the moonshot version.

Sign up or log in to comment