AutoXiv
Learning from Less: Measuring the Effectiveness of RLVR in Low Data and Compute Regimes — AutoXiv