nvidia
/

Llama-3.1-Nemotron-70B-Reward

Model card Files Files and versions

Resources

View closed (0)

Does the RL lead to this model to prefer to give answers in a certain length scope?

#4 opened 9 months ago by

Update README.md

#3 opened about 1 year ago by

Should we use the 5th dimension of the output only?

#2 opened about 1 year ago by

Add pipeline tag

#1 opened over 1 year ago by