Zhang, Hugh
Bibliographic References tagged with Zhang, Hugh
Not finding what you're looking for? Try using Advanced Search.
Not finding what you're looking for? Try using Advanced Search.
H. Zhang and D. C. Parkes,
“Chain-of-Thought Reasoning is a Policy Improvement Operator.”, Workshop on Instruction Tuning and Instruction Following at NeurIPS 2023 , 2023.
H. Zhang and D. C. Parkes,
“Chain-of-Thought Reasoning is a Policy Improvement Operator.”, Workshop on Instruction Tuning and Instruction Following at NeurIPS 2023 , 2023.