The example of how to get retrieval metrics along with answer inference based on the context. "ctx" refers to 'context' "ans" refers to 'answer' "gt" refers to 'ground truth answer' "ctx_ans_inference ...
This repository is a part of our ongoing effort to build large scale execution based evaluation benchmark published as xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results