We introduce OfficeBench, one of the first office automation benchmarks for evaluating current LLM agents' capability to address office tasks in realistic office workflows. OfficeBench requires LLM ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results