techniques_to_improve_reliability.md



Highlights

  • Techniques covered:
    • Give clearer instructions
    • Split complex tasks into simpler subtasks
    • Structure the instruction to keep the model on task
    • Prompt the model to explain before answering
    • Ask for justifications of many possible answers, and then synthesize
    • Generate many outputs, and then use the model to pick the best one
    • Fine-tune custom models to maximize performance
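
As a minimal sketch of the last idea in the list above (generate many outputs, then use the model to pick the best one), the snippet below samples several candidate answers at a nonzero temperature and then asks the model to judge which candidate is best. It assumes the `openai` Python package and the model name `gpt-4o-mini`; both are placeholders for whatever client and model you actually use, and the prompt wording is my own.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment
MODEL = "gpt-4o-mini"  # assumption: substitute your preferred model


def complete(prompt: str, temperature: float = 0.0) -> str:
    """Small helper around the chat completions endpoint."""
    response = client.chat.completions.create(
        model=MODEL,
        messages=[{"role": "user", "content": prompt}],
        temperature=temperature,
    )
    return response.choices[0].message.content.strip()


def best_of_n(question: str, n: int = 5) -> str:
    # Generate n candidate answers with some sampling diversity.
    candidates = [complete(question, temperature=0.8) for _ in range(n)]

    # Ask the model to pick the best candidate.
    numbered = "\n".join(f"{i + 1}. {c}" for i, c in enumerate(candidates))
    judge_prompt = (
        f"Question: {question}\n\n"
        f"Candidate answers:\n{numbered}\n\n"
        "Reply with the number of the best answer only."
    )
    choice = complete(judge_prompt)

    # Parse the judged index, clamping to a valid range if the reply is messy.
    index = int("".join(ch for ch in choice if ch.isdigit()) or 1) - 1
    return candidates[max(0, min(index, n - 1))]
```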
  • Another powerful technique for improving the reliability of answers is to prompt the model to gradually reason out the answer rather than jumping immediately to the final answer. By 'thinking aloud', the model is far more likely to arrive at the correct answer.
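
As a hedged illustration of that "thinking aloud" idea, here is a minimal zero-shot chain-of-thought prompt, reusing the `complete` helper defined in the sketch above. The instruction wording and the "Answer:" convention are assumptions for illustration, not quotes from the source.

```python
def answer_with_reasoning(question: str) -> str:
    # Ask the model to reason step by step before committing to an answer.
    prompt = (
        f"{question}\n\n"
        "Let's think step by step, then state the final answer "
        "on a new line beginning with 'Answer:'."
    )
    output = complete(prompt)

    # Pull out just the final answer line, if the model followed the format.
    for line in reversed(output.splitlines()):
        if line.strip().lower().startswith("answer:"):
            return line.split(":", 1)[1].strip()
    return output
```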
  • Eric Zelikman and Yuhuai Wu et al. published a clever procedure for using a few-shot prompt to generate a dataset of explanations that could be used to fine-tune a model. The idea is to use a few-shot prompt to generate candidate explanations, and to keep only the explanations that produce the correct answer.
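
The sketch below, again reusing the `complete` helper from the first example, filters generated explanations by whether they end in the known correct answer, roughly in the spirit of the procedure described. The prompt wording and the `question`/`known_answer` record format are assumptions for illustration; the kept pairs would then serve as raw material for a fine-tuning dataset.

```python
def build_explanation_dataset(examples: list[dict]) -> list[dict]:
    """examples: [{'question': ..., 'known_answer': ...}, ...] (assumed format)."""
    kept = []
    for ex in examples:
        prompt = (
            f"Question: {ex['question']}\n"
            "Explain your reasoning step by step, then give the final answer "
            "on a line starting with 'Answer:'."
        )
        output = complete(prompt, temperature=0.7)

        # Extract the final answer line from the generated explanation.
        final = ""
        for line in reversed(output.splitlines()):
            if line.strip().lower().startswith("answer:"):
                final = line.split(":", 1)[1].strip()
                break

        # Keep only explanations whose final answer matches the known answer.
        if final.lower() == str(ex["known_answer"]).strip().lower():
            kept.append({"prompt": ex["question"], "completion": output})
    return kept
```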
  • One extension of the chain-of-thought technique is to split the single prompt for generating explanations and answers into smaller parts. First, a prompt selects a relevant subset of facts from the text (the 'selection prompt'). Then, a second prompt infers a conclusion from the selected facts (the 'inference prompt'). These prompts are alternated in a loop to generate multiple steps of reasoning and eventually land on a final answer.
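
A rough sketch of that alternating selection/inference loop follows, reusing `complete` from above. The two prompt templates are my own wording (not the paper's), and the fixed number of steps is a simplifying assumption.

```python
def selection_inference(question: str, facts: list[str], steps: int = 3) -> str:
    reasoning: list[str] = []
    for _ in range(steps):
        # Selection prompt: pick the facts relevant to the next reasoning step.
        selection = complete(
            "Facts:\n" + "\n".join(f"- {f}" for f in facts) + "\n\n"
            f"Question: {question}\n"
            "Reasoning so far:\n" + "\n".join(reasoning) + "\n"
            "Select the one or two facts most relevant to the next reasoning step."
        )

        # Inference prompt: draw a new conclusion from the selected facts only.
        conclusion = complete(
            f"Selected facts:\n{selection}\n\n"
            f"Question: {question}\n"
            "State one new conclusion that follows from these facts."
        )
        reasoning.append(conclusion)
        facts.append(conclusion)  # conclusions become usable in later steps

    # Final answer from the accumulated reasoning chain.
    return complete(
        f"Question: {question}\nReasoning:\n" + "\n".join(reasoning) +
        "\nGive the final answer."
    )
```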
  • The results highlight a couple of general lessons for working with large language models. One, splitting up complex tasks into smaller tasks is a great way to improve reliability and performance; the more atomic the task, the less room there is for the model to err. Two, getting maximum performance often means combining fine-tuning with whatever approach you've chosen.