Define the Next Consumer Internet Experience

Stream. Search. Shop. Ship.

Today’s consumers have the world at their fingertips. The boom in online users has made the consumer internet industry one of the largest users of machine learning, deep learning, and data science. From recommenders to smart chatbots to AI-enhanced video conferencing, these capabilities rely on fast, intelligent tools and infrastructure. That’s why leading companies are using NVIDIA solutions to harness their wealth of data to pioneer products that shape the online experience.

The Developer Conference for the Era of AI and the Metaverse

Conference & Training September 19 - 22 | Keynote September 20

AI is powering change in every industry across the globe. Get the latest innovations across consumer internet at GTC22.

  • Maximizing GPU utilization in Large Scale Machine Learning Infrastructure

    • Yobo Zhu, Director of Machine Learning Systems, ByteDance

    The products of ByteDance rely heavily on machine learning (ML) and deep learning (DL). Large-scale clusters are built to support these workloads, including both model training and real-time online inference. In this talk, I will share how we make the best use of our complex infrastructure to run ML training and inference workloads simultaneously in our extensive GPU clusters. Our objectives are to maximize GPU resource utilization while providing our users with service-level guarantees.

  • Serving 100x Bigger Recommender Models

    • Zhiyuan Zhang, Engineering Manager, ML Serving Platforms, Pinterest, Inc.
    • Pong Eksombatchai, Staff Software Engineer, Advanced Technology Group, Pinterest, Inc.

    The central piece of Pinterest’s technical stack is our recommendation system, which brings responsive and personalized content from a corpus of over 300 billion pins to more than 400 million users. Join us for a deep dive into this system and learn how we are able to serve billions of recommendations per day at millisecond latencies.

  • From Ingestion to Deployment for Large Language Models

    • Vartika Singh, Dev Rel DL Frameworks and Compilers, NVIDIA
    • Julien Demouth, NVIDIA
    • Sudip Roy, Systems and Infra Lead, Cohere
    • Niki Parmar, Product Lead, Adept.AI

Image: Headshots of Stephen Jones, Anima Anandkumar, and Ian Buck of NVIDIA.

Streamline AI Models

In response to the explosion of different AI models and the resulting network complexity, NVIDIA’s GPU-based SDKs centralize deep learning and machine learning modeling, training, and inference.

Tackle Massive Datasets

Datasets aren’t just increasing at a massive pace. They’re also evolving into different formats with varying quality, making GPU-accelerated data science platforms critical to tackling modern processing workloads.

Faster Deployments

Scaling infrastructure monitoring, management, and software deployments can be challenging. With NVIDIA GPUs, engineers can build technical infrastructure and pipelines to support data.

Power Smarter User Experiences for Your Business

  • Media Streaming
  • Social Media
  • eCommerce
  • Delivery Services
  • Video Conferencing

Media Streaming

From customized user interfaces to expertly recommended content and automated video production, NVIDIA GPU platforms help elevate the audience experience. Data scientists and engineers can better understand viewer behavior and predict success through GPU-powered, real-time data analysis.

Social Media

Social media platforms allow for personalization and sharing at a large scale. Advanced machine learning serves targeted advertising along with job and product recommendations, suggests people you might like to connect with, and curates specific posts in your feed.

eCommerce

From AI chatbots to smart assistants, NVIDIA GPU platforms power natural language processing and conversational AI for a seamless and tailor-made shopping experience. And cybersecurity analysts can use GPU technology to predict and prevent fraudulent online transactions.

Delivery Services

With the help of GPU-powered AI and robotics, delivery services can use algorithms to collect and analyze mapping and geolocation data, making transport more efficient and automated.

Video Conferencing

Video conference providers can vastly improve streaming quality and offer enhanced AI features, such as super resolution, gaze correction, and live captions, using a suite of GPU-accelerated, AI-powered tools.

NVIDIA Solutions for Tailored Consumer Applications

Conversational AI with NVIDIA Riva

NVIDIA Riva is a GPU-accelerated application framework for building multimodal conversational AI services that deliver real-time performance. Riva includes pre-trained conversational AI models, tools in the NVIDIA AI Toolkit, and optimized end-to-end services like messaging apps, speech-based assistants, and chatbots for automating communication and creating personalized customer experiences at scale.

It combines vision, audio, and other sensory capabilities to power call center assistants and other virtual assistants.

Recommender Systems with NVIDIA Merlin

NVIDIA Merlin is a framework for building high-performance, deep learning-based recommender systems. From personalized media thumbnails to tailored movie and TV show recommendations, Merlin includes tools that provide better predictions of user preference and behavior than traditional methods to increase engagement rates.

Accelerated Data Science with RAPIDS

Data science can deliver faster time to business insight, including in areas like consumer behavior and predictive analytics. NVIDIA-accelerated data science, built on NVIDIA® CUDA-X AI™ and featuring NVIDIA RAPIDS data processing and machine learning libraries, provides GPU-accelerated software for data science workflows that maximize productivity, performance, and ROI.

AI-Powered Video Conferencing with NVIDIA Maxine

The NVIDIA Maxine™ AI platform SDK enables video-conference providers to vastly improve streaming quality in the cloud with super resolution, gaze correction, live captions, and more. In addition to reducing video bandwidth, Maxine’s fully accelerated platform includes innovative capabilities such as face alignment, noise removal, and virtual assistants.

NGC Catalog: GPU-Optimized Software

The NGC catalog is a registry of GPU-optimized software for AI and HPC applications, pre-trained models, AI application frameworks, and Helm charts. The enterprise-ready software from the NGC catalog helps data engineers, data scientists, researchers, developers, DevOps, and system admins shorten time to solution and bring solutions to market faster.

Universal AI Workloads with NVIDIA DGX

The growth of AI-assisted services within the consumer internet industry is impeded by an explosion of complex AI models, growing datasets, and cumbersome deployment and management workflows. These challenges result in slow computing architectures and high costs. NVIDIA DGX™ A100 provides IT directors, data scientists, and data engineers with a platform that can unify all AI workloads, simplify infrastructure, and accelerate ROI.

GPU Cloud Computing

Consumer internet services are used across the world and on multiple platforms. NVIDIA’s GPU-accelerated solutions are available through all top cloud platforms, empowering companies to scale and access massive computing power on demand and with ease. The NVIDIA T4 Tensor Core GPU speeds up cloud workloads, including high-performance computing, deep learning training and inference, machine learning, data analytics, and graphics. It enables businesses to create new customer experiences that help make services more accessible and scalable.

Sign up for the latest consumer internet industry news from NVIDIA.