

[pytorch] The difference between DistributedDataParallel and DataParallel


The difference between DistributedDataParallel and DataParallel is: DistributedDataParallel uses multiprocessing, where a process is created for each GPU, while DataParallel uses multithreading. By using multiprocessing, each GPU gets its own dedicated process, which avoids the performance overhead caused by the GIL of the Python interpreter.

If you use DistributedDataParallel, you can use the torch.distributed.launch utility to launch your program; see Third-party backends.

DistributedDataParallel: uses multiprocessing

DataParallel: uses multithreading
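For contrast, DataParallel is a single-process, one-line wrap. The tiny sketch below uses a throwaway nn.Linear model purely for illustration; it is not from the original post.

import torch
import torch.nn as nn

model = nn.Linear(10, 2).cuda()            # any nn.Module
model = nn.DataParallel(model)             # single process; one thread per GPU
out = model(torch.randn(32, 10).cuda())    # batch is split across GPUs, outputs gathered on GPU 0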

As far as I know, Python's GIL makes it hard to get real speedups from multithreading. So use DistributedDataParallel.
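Below is a minimal single-node DistributedDataParallel sketch, assuming the nccl backend and launching via torch.distributed.launch; the model, dataset, and the script name train.py are made up for illustration and are not from the original post.

import argparse
import torch
import torch.distributed as dist
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP
from torch.utils.data import DataLoader, TensorDataset
from torch.utils.data.distributed import DistributedSampler

def main():
    # torch.distributed.launch passes --local_rank to every process it spawns
    parser = argparse.ArgumentParser()
    parser.add_argument("--local_rank", type=int, default=0)
    args = parser.parse_args()

    # one process per GPU; RANK/WORLD_SIZE/MASTER_ADDR env vars are set by the launcher
    dist.init_process_group(backend="nccl")
    torch.cuda.set_device(args.local_rank)

    model = nn.Linear(10, 2).cuda(args.local_rank)
    model = DDP(model, device_ids=[args.local_rank])

    dataset = TensorDataset(torch.randn(1024, 10), torch.randn(1024, 2))
    sampler = DistributedSampler(dataset)             # each process sees its own shard of the data
    loader = DataLoader(dataset, batch_size=32, sampler=sampler)

    optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
    loss_fn = nn.MSELoss()

    for epoch in range(2):
        sampler.set_epoch(epoch)                       # reshuffle shards every epoch
        for x, y in loader:
            x = x.cuda(args.local_rank)
            y = y.cuda(args.local_rank)
            optimizer.zero_grad()
            loss = loss_fn(model(x), y)
            loss.backward()                            # gradients are all-reduced across processes
            optimizer.step()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()

Run it with one process per GPU, e.g. python -m torch.distributed.launch --nproc_per_node=4 train.py (newer PyTorch releases recommend torchrun, which passes the local rank via the LOCAL_RANK environment variable instead of the --local_rank argument).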

pytorch.org/docs/stable/notes/cuda.html#cuda-nn-ddp-instead

 

CUDA semantics — PyTorch 1.7.0 documentation

 
