Scaling Generative AI Features to Millions of Users Thanks to Inference Pipeline Optimizations
Chief Technology Officer, PhotoRoom
Machine Learning Scientist, PhotoRoom
We'll cover how PhotoRoom uses generative AI models for real-time inference. We'll start by highlighting the techniques used for architecture optimization, followed by a deep dive into TensorRT. We'll also cover how we serve models for millions of users, leveraging Triton Inference Server.
Event: GTC 24
Date: March 2024
Topic: AI Inference
Industry: All Industries
NVIDIA technology: Cloud / Data Center GPU, Grace CPU