GitHub Daily TrendGitHub - laude-institute/terminal-bench: A benchmark for LLMs on complicated tasks in the terminal
·00:04:33
https://github.com/laude-institute/terminal-bench A benchmark for LLMs on complicated tasks in the terminal - laude-institute/terminal-bench