ViewTube

ViewTube
Sign inSign upSubscriptions
Filters

Upload date

Type

Duration

Sort by

Features

Reset

7,814,788 results

Related queries

fsdp

deepspeed

cuda programming

Lightning AI
Unit 9.3 | Deep Dive into Data Parallelism | Part 1 | Understanding Data Parallelism

Follow along with Unit 9 in a Lightning AI Studio, an online reproducible environment created by Sebastian Raschka, that ...

2:24
Unit 9.3 | Deep Dive into Data Parallelism | Part 1 | Understanding Data Parallelism

2,409 views

2 years ago

Developers Hutt
How DDP works || Distributed Data Parallel || Quick explained

Discover how DDP harnesses multiple GPUs across machines to handle larger models and datasets, accelerating the training ...

3:21
How DDP works || Distributed Data Parallel || Quick explained

4,371 views

1 year ago

Stanford Online
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 7: Parallelism 1

For more information about Stanford's online Artificial Intelligence programs visit: https://stanford.io/ai To learn more about ...

1:24:42
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 7: Parallelism 1

30,490 views

7 months ago

ByteByteGo
Concurrency Vs Parallelism!

Get a Free System Design PDF with 158 pages by subscribing to our weekly newsletter: https://bit.ly/bytebytegoytTopic Animation ...

4:13
Concurrency Vs Parallelism!

172,421 views

1 year ago

Mark Saroufim
Model vs Data Parallelism in Machine Learning

... deal with this is called model parallelism and with lots of data the way we deal with this is called data parallelism so what do you ...

9:32
Model vs Data Parallelism in Machine Learning

7,940 views

5 years ago

iTech
Task vs. Data Parallelism
4:48
Task vs. Data Parallelism

75 views

1 month ago

Next LVL Programming
What Is Data Parallelism? - Next LVL Programming

What Is Data Parallelism? In this informative video, we'll clarify the concept of data parallelism and its importance in the realm of ...

3:16
What Is Data Parallelism? - Next LVL Programming

130 views

5 months ago

Big Data Analysis with Scala and Spark
Data-Parallel to Distributed Data-Parallel

In this session we're going to try and bridge the gap between data parallelism in the shared memory case which is what we ...

10:18
Data-Parallel to Distributed Data-Parallel

5,806 views

8 years ago

Lightning AI
Unit 9.3 | Deep Dive into Data Parallelism | Part 2 | Distributed Data Parallelism

Follow along with Unit 9 in a Lightning AI Studio, an online reproducible environment created by Sebastian Raschka, that ...

5:43
Unit 9.3 | Deep Dive into Data Parallelism | Part 2 | Distributed Data Parallelism

1,173 views

2 years ago

Programming Massively Parallel Processors
Lecture 02 - Data Parallel Programming

GPU Computing, Spring 2021, Izzat El Hajj Department of Computer Science American University of Beirut.

1:19:18
Lecture 02 - Data Parallel Programming

5,124 views

3 years ago

Guy Steele
Data Parallel Algorithms

A talk about Data Parallel Algorithms given at MIT in 1990.

53:15
Data Parallel Algorithms

1,624 views

9 years ago

Rust Belt Rust Conference
Rayon: Data Parallelism for Fun and Profit — Nicholas Matsakis

Materials for this talk are available at https://speakerdeck.com/nikomatsakis/rayon-rust-belt-rust Rayon is a convenient library for ...

24:43
Rayon: Data Parallelism for Fun and Profit — Nicholas Matsakis

32,660 views

8 years ago

PyTorch
Part 2: What is Distributed Data Parallel (DDP)

In the second video of this series, Suraj Subramanian gently introduces you to what is happening under the hood when you train a ...

3:16
Part 2: What is Distributed Data Parallel (DDP)

49,377 views

3 years ago

NVIDIA Developer
Data Parallelism Using PyTorch DDP | NVAITC Webinar

Learn how to do Distributed Data Parallelism using PyTorch DDP Distributed Data Parallel (DDP) is a technique that enables data ...

27:11
Data Parallelism Using PyTorch DDP | NVAITC Webinar

7,117 views

2 years ago

Faradawn Yang
LLM Inference Optimization #2: Tensor, Data & Expert Parallelism (TP, DP, EP, MoE)

Part 2 of 5 in the “5 Essential LLM Optimization Techiniques” series. Link to the 5 techiniques roadmap: ...

20:18
LLM Inference Optimization #2: Tensor, Data & Expert Parallelism (TP, DP, EP, MoE)

1,464 views

2 months ago

Emerging Tech Insider
What Is Data Parallelism? - Emerging Tech Insider

What Is Data Parallelism? In this informative video, we will clarify the concept of data parallelism and its significance in modern ...

3:14
What Is Data Parallelism? - Emerging Tech Insider

18 views

3 months ago

Umar Jamil
Distributed Training with PyTorch: complete tutorial with cloud infrastructure and code

00:04:44 - Data Parallelism vs Model Parallelism 00:06:25 - Gradient accumulation 00:19:38 - Distributed Data Parallel 00:26:24 ...

1:12:53
Distributed Training with PyTorch: complete tutorial with cloud infrastructure and code

34,741 views

2 years ago

DLVU
Lecture 12.4 Scaling up (Mixed precision, Data-parallelism, FSDP)

How to train big models. slides: https://dlvu.github.io/sa course website: https://dlvu.github.io lecturer: Peter Bloem.

34:27
Lecture 12.4 Scaling up (Mixed precision, Data-parallelism, FSDP)

3,000 views

2 years ago

Lazy Analyst
Model Parallelism vs Data Parallelism vs Tensor Parallelism | #deeplearning #llms

Model Parallelism vs Data Parallelism vs Tensor Parallelism #deeplearning #llms #gpus #gpu In this video, we will learn about ...

6:59
Model Parallelism vs Data Parallelism vs Tensor Parallelism | #deeplearning #llms

3,146 views

1 year ago

Ahmed Taha
How Fully Sharded Data Parallel (FSDP) works?

This video explains how Distributed Data Parallel (DDP) and Fully Sharded Data Parallel (FSDP) works. The slides are available ...

32:31
How Fully Sharded Data Parallel (FSDP) works?

30,698 views

2 years ago

MIT OpenCourseWare
21.2.2 Data-level Parallelism

MIT 6.004 Computation Structures, Spring 2017 Instructor: Chris Terman View the complete course: https://ocw.mit.edu/6-004S17 ...

6:45
21.2.2 Data-level Parallelism

8,993 views

6 years ago

Database Systems Research Group at U Tübingen
DB2 — Chapter #07 — Video #28 — Data parallelism in MonetDB, SIMD-based vector processing

Video lecture, part of the "DB2" course, U Tübingen, summer semester 2020. Read by Torsten Grust.

21:52
DB2 — Chapter #07 — Video #28 — Data parallelism in MonetDB, SIMD-based vector processing

375 views

5 years ago

Little ML book club
Ultra-scale playbook, ch.2.1 - "Data Parallelism [:ZERO]"

"Little ML book club" is reading "Ultra-scale playbook". Together! Oh, and it is free. Details: ...

27:15
Ultra-scale playbook, ch.2.1 - "Data Parallelism [:ZERO]"

68 views

1 month ago

CppCon
std::simd: How to Express Inherent Parallelism Efficiently Via Data-parallel Types - Matthias Kretz

https://cppcon.org/ --- std::simd: How to Express Inherent Parallelism Efficiently Via Data-parallel Types - Matthias Kretz ...

1:04:57
std::simd: How to Express Inherent Parallelism Efficiently Via Data-parallel Types - Matthias Kretz

20,473 views

2 years ago