From Deep Reinforcement Learning to LLM-based Agents: Perspectives on Current Research

2024-06-19 15:17

Speaker: Dr. Stefano V. Albrecht

Title: From Deep Reinforcement Learning to LLM-based Agents: Perspectives on Current Research

Date: 20 June 2024, Time: 9:00 ~ 10:30 a.m.

Venue: Meeting Room 1417, Information Building (信息楼)

Abstract:

Since the recent successes of large language models (LLMs), we are beginning to see a shift of attention from deep reinforcement learning to LLM-based agents. While deep RL policies are typically learned from scratch to maximise some defined return objective, LLM-agents use an existing LLM at their core and focus on clever prompt engineering and downstream specialisation of the LLM via supervised and reinforcement learning techniques. In this talk, I will first provide a broad overview of my group’s research in deep RL, which focuses, among other topics, on developing sample-efficient and robust RL algorithms for both single- and multi-agent control tasks, including industry applications in autonomous driving and multi-robot warehouses. I will then present our recent research into LLM-agents, where we propose an approach for household robotics that takes into account user preferences to achieve more robust and effective planning.

I will conclude with some personal observations about the state of LLM-agent research: (a) many papers in this field follow essentially the same recipe by focussing on prompt engineering and downstream specialisation; (b) this recipe makes their scientific claims brittle as they depend crucially on the specific LLM engine; and (c) LLMs are not natively designed to maximise objectives for optimal control and decision making. Based on these observations, I believe some fruitful research avenues can be identified.

About the speaker:

Dr. Stefano V. Albrecht is Associate Professor in Artificial Intelligence in the School of Informatics, University of Edinburgh. He leads the Autonomous Agents Research Group (https://agents.inf.ed.ac.uk), which specialises in developing machine learning algorithms for autonomous systems control and decision making, with a particular focus on reinforcement learning and multi-agent interaction. Dr. Albrecht is affiliated with the Alan Turing Institute, where he leads the Multi-Agent Systems theme. In 2022, he was nominated for the IJCAI Computers and Thought Award for his research introducing Stochastic Bayesian Games and optimal solution algorithms, which have since been applied in a range of domains.