Issues
Search results
- Status: Open.#5975 In huggingface/trl;
- Status: Open.#5974 In huggingface/trl;
- Status: Open.#5973 In huggingface/trl;
- Status: Open.#5912 In huggingface/trl;
- Status: Open.#5903 In huggingface/trl;
- Status: Open.#5889 In huggingface/trl;
- Status: Open.#5886 In huggingface/trl;
- Status: Open.#5865 In huggingface/trl;
- Status: Open.#5863 In huggingface/trl;
- Status: Open.#5847 In huggingface/trl;
- Status: Open.#5831 In huggingface/trl;
GRPOTrainer silently uses near-greedy decoding when temperature=1.0 (transformers >= 4.50 + Qwen2.5)
Status: Open.#5783 In huggingface/trl;