Name: The Unsolved Challenges of LLMs in Open-Ended Web Tasks: A Case Study | GTC 24 2024 | NVIDIA On-Demand
Uploaded: 2024-03-19T16:00:00Z
Duration: 2843 s
Description: We investigate the challenges associated with developing goal-driven AI agents capable of performing open-ended tasks in a web environment using zero-shot

Video Player is loading.

Current Time 0:00

Duration 0:00

Loaded: 0%

Stream Type LIVE

Remaining Time 0:00

详情

字幕

We investigate the challenges associated with developing goal-driven AI agents capable of performing open-ended tasks in a web environment using zero-shot learning. Our primary focus is on harnessing the capabilities of large language models (LLMs) in the context of web navigation through HTML-based user interfaces. We evaluate the MiniWoB benchmark and show that it's a suitable yet challenging platform for assessing an agent's ability to comprehend and solve tasks without prior human demonstrations. Our main contribution encompasses a set of extensive experiments where we compare and contrast various agent design considerations, such as action space, observation space, and the choice of LLM, with the aim of shedding light on the bottlenecks and limitations of LLM-based zero-shot learning in this domain, in order to foster research in this area.

活动: GTC 24

日期: March 2024

行业: All Industries

NVIDIA technology: Cloud / Data Center GPU

级别: General

话题: Generative AI Platform

语言: English

所在地: