Zhang, Hugh

Bibliographic References tagged with Zhang, Hugh

H. Zhang and D. C. Parkes,
“Chain-of-Thought Reasoning is a Policy Improvement Operator,” Workshop on Instruction Tuning and Instruction Following at NeurIPS 2023, 2023.