Archives | Qubixity.net

Improving mathematical reasoning with process supervision

by jsendak | May 31, 2023 | Cosmology & Computing

We’ve trained a model to achieve a new state of the art in mathematical problem solving by rewarding each correct step of reasoning (“process supervision”) instead of simply rewarding the correct final answer (“outcome supervision”). In addition to boosting performance relative to outcome supervision, process supervision also has an important alignment benefit: it directly trains the model to produce a chain of thought that is endorsed by humans.

Language models can explain neurons in language models

by jsendak | May 9, 2023 | Cosmology & Computing

We use GPT-4 to automatically write explanations for the behavior of neurons in large language models and to score those explanations. We release a dataset of these (imperfect) explanations and scores for every neuron in GPT-2.
