Agent RFT, Practical Guide
A nine-step tutorial on agent reinforcement fine-tuning (RFT): how to build the dataset, stand up tool servers, write the grader, configure KL and group size, size GPUs, and pick between TRL, verl, OpenRLHF, Unsloth, and NeMo RL — with code and example outputs at every step.