read_file: Read file contents with flexible line range control edit_file: Make precise edits to files with clear instructions Supports complete file replacement ...
Abstract: Large Language Models (LLMs) are increasingly used by software engineers for code generation. However, limitations of LLMs such as irrelevant or incorrect code have highlighted the need for ...
We introduce the Berkeley Function Leaderboard (BFCL), the first comprehensive and executable function call evaluation dedicated to assessing Large Language Models' (LLMs) ability to invoke functions.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results