If you're looking to broaden your search, I've found a few more resources to share. Hope this helps!
LLM Leaderboards
LLM Search Tools
LLM Eval & Benchmark Resources
- Holistic Evaluation of Language Models (HELM)
- TextSynth
- The Curious Case of LLM Evaluations
- Mosaic Benchmarks
I'm going to add a few of these to the sidebar for quick access. Let me know if I've missed any!