Constraining an LLM to a specific output format

Things to figure out
 
 
  • Microsoft one, very useful
 
LangChain is meant to be able to do this through its implementation of ReAct or chain-of-thought prompting.
 
 
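Regex + CFGs (context-free grammars): restrict which tokens the model may emit at each decoding step, so the finished output always matches a pattern or grammar.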
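A rough sketch of the regex half of this idea, assuming a toy setup: the vocabulary, the scores, and the target pattern `-?[0-9]+` are all made up for illustration, and the regex is compiled by hand into a three-state automaton. A real system would derive the automaton from the pattern and mask the model's actual logits.

```python
import re

PATTERN = r"-?[0-9]+"  # hypothetical target format: a signed integer
VOCAB = ["maybe", "yes", "-", "42", "1", "2", " ", "<eos>"]  # toy vocabulary
DIGITS = "0123456789"

# Hand-written automaton for PATTERN.
# States: 0 = start, 1 = just saw '-', 2 = saw at least one digit (accepting).
def step(state, ch):
    """Return the next state, or None if ch is not allowed in this state."""
    if ch in DIGITS:
        return 2
    if ch == "-" and state == 0:
        return 1
    return None

def advance(state, token):
    """Feed a whole token through the automaton, one character at a time."""
    for ch in token:
        state = step(state, ch)
        if state is None:
            return None
    return state

def toy_scores(prefix):
    """Stand-in for model logits; it deliberately prefers invalid tokens."""
    scores = {"maybe": 5.0, "yes": 4.0, "-": 3.0, "42": 2.5,
              "1": 1.0, "2": 0.5, " ": 0.1}
    scores["<eos>"] = 2.0 + len(prefix)  # stopping gets likelier over time
    return scores

def constrained_decode(max_steps=10):
    """Greedy decoding where tokens that would break PATTERN are masked out."""
    state, out = 0, ""
    for _ in range(max_steps):
        scores = toy_scores(out)
        allowed = {}  # token -> automaton state after emitting it
        for tok in VOCAB:
            if tok == "<eos>":
                if state == 2:  # may only stop in an accepting state
                    allowed[tok] = state
            else:
                nxt = advance(state, tok)
                if nxt is not None:
                    allowed[tok] = nxt
        best = max(allowed, key=lambda t: scores[t])
        if best == "<eos>":
            break
        out += best
        state = allowed[best]
    return out

result = constrained_decode()
print(result, bool(re.fullmatch(PATTERN, result)))
```

Unconstrained, the toy model would pick "maybe" first. With the mask it can only pick tokens that keep the automaton alive, so the result matches the pattern. A CFG version would swap the automaton for a parser that tracks which tokens can legally come next.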
 
 

How to fine-tune a model to follow instructions (instruct-style)?

  • Train the LLM on examples of instructions paired with their corresponding outputs.
  • Consider the quality and quantity of the data, the learning rate and regularization, which layers to freeze or unfreeze, and how to evaluate and validate the result.
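A minimal sketch of the data-preparation side of the points above, under stated assumptions: the example pairs, the prompt template, and the helper names (`to_training_text`, `split`, `to_jsonl`) are hypothetical and not tied to any particular fine-tuning library. It covers only formatting instruction/output pairs and holding some out for validation; the training loop itself is left out.

```python
import json

# Hypothetical instruction/output pairs; a real dataset would be far larger.
EXAMPLES = [
    {"instruction": "Translate to French: cat", "output": "chat"},
    {"instruction": "Add 2 and 3.", "output": "5"},
]

# Hypothetical prompt template; any format works if it is applied consistently.
TEMPLATE = "### Instruction:\n{instruction}\n\n### Response:\n{output}"

def to_training_text(example):
    """Render one instruction/output pair as a single training string."""
    return TEMPLATE.format(**example)

def split(examples, val_fraction=0.5):
    """Hold out the last part of the data for validation (at least one item)."""
    n_val = max(1, int(len(examples) * val_fraction))
    return examples[:-n_val], examples[-n_val:]

def to_jsonl(examples):
    """Serialize examples as JSON Lines, one {"text": ...} object per line."""
    return "\n".join(json.dumps({"text": to_training_text(e)}) for e in examples)

train, val = split(EXAMPLES)
print(to_jsonl(train))
```

The held-out `val` split is what the evaluation and validation step would measure against. Learning rate, regularization, and layer freezing are settings of whatever trainer consumes this file.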