- livecodebench.github.io

LiveCodeBench: Holistic and Contamination-Free Evaluation of Large Language Models for Code



- r2e.dev

R2E: Turning any GitHub Repository into a Programming Agent Environment
