Video Player is loading.
Current Time 0:00
Duration 0:00
Loaded: 0%
Stream Type LIVE
Remaining Time 0:00
 
1x
    • Chapters
    • descriptions off, selected
    • subtitles off, selected
      • Quality

      Optimizing AI-powered NPCs Cost-Efficiency Using TRT-LLM Without Sacrificing Quality

      , Vice President of AI, Inworld AI
      , Data Scientist, NVIDIA
      The future of gaming will be powered by AI. Large language models allowed to enable a new era of immersive gaming experiences and to make sure that thousands of players can enjoy it one needs to guarantee that LLMs can be served as efficiently as possible. This session will shed more light on how to approach this problem on a practical surface and how the TRT-LLM framework allows to achieve SOTA performance.
      活动: GTC 24
      日期: March 2024
      行业: Gaming
      话题: Generative AI Platform
      级别: Intermediate Technical
      NVIDIA technology: Triton
      语言: English
      所在地: