Generate Test Cases for Evaluation
Building off of the Generate seed indices for evaluation data and Expand evaluation dataset from seed indices prompts, this prompt considers the generated indices, then generates a set of test questions and expected results. The result can then be used to evaluate a tool that tries to pick the best index to query based on the indices it has available to choose from.