Publications
publications by categories in reversed chronological order.
2025
- Memory
MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon AgentsArxiv preprint - InteractionToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term InteractionsEMNLP 2025 findings
- InteractionLLM Meets Scene Graph: Can Large Language Models Understand and Generate Scene Graphs? A Benchmark and Empirical Study
- Memory
Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized AssistanceArxiv preprint - Reward Model
- Reward Model
Rethinking Reward Model Evaluation Through the Lens of Reward OveroptimizationACL 2025 (Oral)