Evaluating and Debugging Generative Al

Disclaimer

This note contains only the slides, no personal notes.

Outline

|800

Comparing model outputs

Model Registry

It is a central system of records for the models.

Training & Finetuning LLMs

Training from scratch

  1. Long & expensive training runs
  2. Expensive & difficult evaluations
  3. Monitoring is critical
  4. Ability to restore training from a checkpoint

Fine-tuning

  1. Efficient methods being developed
  2. Expensive & difficult evaluations

Source

Also Read

Thoughts 🤔 by Soumendra Kumar Sahoo is licensed under CC BY 4.0