Knowledge Quiz
Test your understanding of this article
1. What is the primary limitation of existing legal benchmarks, according to the article?
2. What is the main purpose of CALRK-Bench?
3. From what sources is the CALRK-Bench dataset constructed?
4. What do experimental results with CALRK-Bench indicate about recent large language models?
