![]() BIG-bench Lite leaderboardīIG-bench Lite (BBL) is a small subset of 24 diverse JSON tasks from BIG-bench. The benchmark organizers can be contacted at of contentsįor more details about the benchmark, see our detailed instructions. ![]() However, they will be included in future BIG-bench releases, and the task authors will be included in the author list of future publications. New tasks are no longer eligible for inclusion in the initial BIG-bench release and paper. Tasks will be reviewed and merged into the BIG-bench repository on a rolling basis. A paper introducing the benchmark, including evaluation results on large language models, is currently under review, and is available as a preprint. ![]() The more than 200 tasks included in BIG-bench are summarized by keyword here, and by task name here. ![]() The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborativeīenchmark intended to probe large language models and extrapolate their future ![]()
0 Comments
Leave a Reply. |