Talk on a dynamic, automated approach to search evaluation using user-modeling inspired by the 'LLM as a judge' paradigm to generate realistic query-result pairs and evaluate multi-modality search.
Topic
1
tagged
Talk on a dynamic, automated approach to search evaluation using user-modeling inspired by the 'LLM as a judge' paradigm to generate realistic query-result pairs and evaluate multi-modality search.