extract_sft_dataset

extract_sft_dataset(root: autogen.agentchat.contrib.reasoning_agent.ThinkNode) -> list[dict]

Extract the best trajectory or multiple equally good trajectories for SFT training.

Parameters:
NameDescription
rootThe root node of the tree.

Type: autogen.agentchat.contrib.reasoning_agent.ThinkNode
Returns:
TypeDescription
list[dict]List[Dict]: List of best trajectories, each one is a pair of instruction and response.