This page runs subjective metrics over a list of documents using a list of prompts.
Bulk analysis has several uses, such as comparing how different prompts perform
or building evaluation metrics for a dataset.
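
As a rough illustration of the idea, the sketch below applies every prompt to every document and collects one row of scores per document. It is not this page's actual implementation: `bulk_analyze`, `toy_score`, and the sample data are hypothetical names invented for the example, and `toy_score` is a trivial keyword check standing in for whatever model call produces the real subjective score.

```python
from typing import Callable, Dict, List


def bulk_analyze(
    documents: List[str],
    prompts: Dict[str, str],
    score: Callable[[str, str], float],
) -> List[Dict[str, float]]:
    """Score every document against every prompt (hypothetical helper)."""
    results = []
    for doc_index, document in enumerate(documents):
        # One row per document, one column per named prompt.
        row = {"document": doc_index}
        for name, prompt in prompts.items():
            row[name] = score(prompt, document)
        results.append(row)
    return results


def toy_score(prompt: str, document: str) -> float:
    """Placeholder scorer (assumption): 1.0 if the prompt's last word
    appears in the document, else 0.0. A real scorer would call a model."""
    keyword = prompt.split()[-1].lower()
    return 1.0 if keyword in document.lower() else 0.0


documents = [
    "The service was friendly and fast.",
    "The package arrived late.",
]
prompts = {
    "friendly": "Rate how much the text mentions: friendly",
    "late": "Rate how much the text mentions: late",
}

results = bulk_analyze(documents, prompts, toy_score)
for row in results:
    print(row)
```

Keeping one row per document and one column per prompt makes it easy to compare prompts side by side, or to treat each column as a metric over the whole document list.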