Video Player is loading.
Current Time 0:00
Duration 0:00
Loaded: 0%
Stream Type LIVE
Remaining Time 0:00
 
1x
    • Chapters
    • descriptions off, selected
    • subtitles off, selected

      Scaling Generative AI Features to Millions of Users Thanks to Inference Pipeline Optimizations

      , Chief Technology Officer, PhotoRoom
      , Machine Learning Scientist, PhotoRoom
      We'll cover how PhotoRoom uses generative AI models for real-time inference. We'll start by highlighting the techniques used for architecture optimization, followed by a deep dive into TensorRT. We'll also cover how we serve models for millions of users, leveraging Triton Inference Server.
      活动: GTC 24
      日期: March 2024
      话题: AI Inference
      行业: All Industries
      NVIDIA technology: Cloud / Data Center GPU,Grace CPU
      级别: Intermediate Technical
      语言: English
      所在地: