Learning to Summarize During RL
Reward