Top P controls the randomness of the generated text using nucleus sampling.
The model selects words randomly from the smallest possible set whose cumulative probability equals P%.
For example, with Top P = 0.9, the model only considers the words whose cumulative probability reaches 90%.
A higher Top P (closer to 1) results in more diverse and creative outputs.