Our work on dialogue evalution with large language models was presented at IWSDS ‘23.