Abstract: Tools based on the use of Large Language Models (LLMs) have improved the computer programming teaching process, automated feedback processes, facilitated program repair, and enabled ...
We introduce OfficeBench, one of the first office automation benchmarks for evaluating current LLM agents' capability to address office tasks in realistic office workflows. OfficeBench requires LLM ...
Abstract: The appearance of large language models (LLMs) and related products has generated widespread attention and lively discussions in both industry and academia. Given their extensive ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results