Improve multi-hop reasoning in LLMs by learning from rich human feedback
AWS Machine Learning
APRIL 27, 2023
Solution overview With the onset of large language models, the field has seen tremendous progress on various natural language processing (NLP) benchmarks. The final dataset contains feedback for 1,565 samples from StrategyQA and 796 examples for Sports Understanding. The following figure shows the interface we used. Missing Facts 50.4%
Let's personalize your content