We educated “critique-writing” fashions to explain flaws in summaries. Human evaluators discover flaws in summaries way more usually when proven our mannequin’s critiques. Bigger fashions are higher at self-critiquing, with scale enhancing critique-writing greater than summary-writing. This exhibits promise for utilizing AI methods to help human supervision of AI methods on troublesome duties.
![](https://aipressroom.com/wp-content/uploads/2023/06/evolution-through-large-models-150x150.png)
Evolution by giant fashions
![](https://aipressroom.com/wp-content/uploads/2023/06/techniques-for-training-large-neural-networks-150x150.png)