News | Yanjun Chen

Jun 24, 20262026年6月24日2026年6月24日	Passed the confirmation of candidature for my PhD at The Hong Kong Polytechnic University, with the thesis Towards Efficient Reinforcement Learning via Environment Measurement and Shaping.通过了香港理工大学的博士候选人资格确认，论文题目为 Towards Efficient Reinforcement Learning via Environment Measurement and Shaping。香港理工大学の博士候補者資格審査に合格しました。論文題目は Towards Efficient Reinforcement Learning via Environment Measurement and Shaping です。
May 22, 20262026年5月22日2026年5月22日	Shortlisted for PolyU Micro Fund 2025/26 Cohort 2 (HK$20,000 cash prize), with a conditional offer to the HKSTP Ideation Programme.入围 PolyU Micro Fund 2025/26 第二轮（HK$20,000 现金奖励），并获得 HKSTP Ideation Programme 的有条件录取。PolyU Micro Fund 2025/26 Cohort 2（賞金 HK$20,000）にショートリスト入り、HKSTP Ideation Programme に条件付きで内定。
May 08, 20262026年5月8日2026年5月8日	Released v2 of Exact Is Easier: Credit Assignment for Cooperative LLM Agents on arXiv:2603.06859.Exact Is Easier: Credit Assignment for Cooperative LLM Agents v2 已发布至 arXiv:2603.06859。Exact Is Easier: Credit Assignment for Cooperative LLM Agents v2 を arXiv:2603.06859 で公開。
Mar 06, 20262026年3月6日2026年3月6日	First arXiv release of Exact Is Easier: Credit Assignment for Cooperative LLM Agents (in submission).Exact Is Easier: Credit Assignment for Cooperative LLM Agents 在 arXiv 首次发布（投稿中）。Exact Is Easier: Credit Assignment for Cooperative LLM Agents を arXiv に初回公開（投稿中）。
May 22, 20252025年5月22日2025年5月22日	Co-authored a comprehensive survey on latent chain-of-thought reasoning (arXiv:2505.16782).与人合作完成了一篇关于隐式思维链推理（latent chain-of-thought reasoning）的综述论文（arXiv:2505.16782）。latent chain-of-thought reasoning に関する包括的なサーベイ論文を共著として発表（arXiv:2505.16782）。
May 15, 20252025年5月15日2025年5月15日	Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning accepted at ACL 2025 Findings (co-author).Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning 被 ACL 2025 Findings 接收（共同作者）。Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning が ACL 2025 Findings に採択（共著）。
Jan 15, 20252025年1月15日2025年1月15日	Fine-Grained and Multi-Dimensional Metrics for Document-Level MT accepted at NAACL 2025 (co-author).Fine-Grained and Multi-Dimensional Metrics for Document-Level MT 被 NAACL 2025 接收（共同作者）。Fine-Grained and Multi-Dimensional Metrics for Document-Level MT が NAACL 2025 に採択（共著）。
Oct 09, 20242024年10月9日2024年10月9日	The Accuracy Paradox in RLHF: When Better Reward Models Don’t Yield Better Language Models accepted at EMNLP 2024.The Accuracy Paradox in RLHF: When Better Reward Models Don’t Yield Better Language Models 被 EMNLP 2024 接收。The Accuracy Paradox in RLHF: When Better Reward Models Don’t Yield Better Language Models が EMNLP 2024 に採択。