Skip to content

Pull requests: open-compass/opencompass

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add BGPT REFUTE benchmark
#2471 opened Jun 5, 2026 by connerlambden Loading…
[Dataset] Add ZebraLogic benchmark
#2464 opened May 31, 2026 by amanyara Loading…
3 tasks done
[Dataset] Add ArxivRollBench
#2458 opened May 24, 2026 by liangzid Loading…
[Feature] Add BuySideFinBench dataset for buy-side financial analysis…
#2446 opened May 13, 2026 by cindy90 Loading…
5 tasks done
feat: add LiteLLM as AI gateway model backend
#2441 opened Apr 22, 2026 by RheagalFire Loading…
4 of 6 tasks
[BugFix] Fix gsm8k postprocess
#2426 opened Mar 30, 2026 by Hibbert133 Loading…
6 tasks done
feat: upgrade MiniMax default model to M3
#2418 opened Mar 20, 2026 by octo-patch Loading…
3 tasks done
[Fix] CEval ModelScope load and HF generate for causal LMs
#2416 opened Mar 19, 2026 by DeliWang Loading…
6 tasks
Add support for Azure OpenAI models and managed identity auth
#2415 opened Mar 18, 2026 by jgbradley1 Loading…
2 of 6 tasks
[Fix] Fix eval stage is extremely slow
#2409 opened Mar 5, 2026 by xming521 Loading…
6 tasks
fix: correct typo 'seperated' to 'separated'
#2397 opened Feb 9, 2026 by thecaptain789 Loading…
Update README.md for mmlu
#2374 opened Dec 29, 2025 by freemedom Loading…
[Dataset] Add dataset arena-hard-v2
#2362 opened Dec 16, 2025 by Myhs-phz Collaborator Loading…
ProTip! Follow long discussions with comments:>50.