Dear Light-R1 Team,
I am studying your Light-R1 model and I’m particularly interested in the two-stage SFT data filtering process described in your paper and repository.
The paper describes the selection of SFT data in Stage-1 and Stage-2 using a pass-rate cutoff parameter α, but I could not find the exact numeric values of α in either the paper or the GitHub code.
Could you please share the numeric α thresholds used for Stage-1 and Stage-2 SFT filtering? This would be very helpful for understanding and reproducing your results.
Thank you very much for your time and for releasing this excellent work.