Combine LLMs and RL: The LLM reasons about the agent's behavior to solve subtasks and generates higher-level actions, improving RL's sample efficiency. Create python ...
The dataset used for fine-tuning the model. Code for generating the dataset. Scripts for fine-tuning the model on high-performance GPUs. Inference scripts for real-time task execution. SG_VLM utilizes ...