Yosuke Oyama

Our paper has been accepted for IEEE Transactions on Parallel & Distributed Systems

Our paper “The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs with Hybrid Parallelism” has been accepted for the Special Section on Parallel and Distributed Computing Techniques for AI, ML, and DL in IEEE Transactions on Parallel and Distributed Systems (TPDS).

Our 3D CNNs + hybrid-parallelism paper is released on arXiv

Our paper “The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs with Hybrid Parallelism” is released on arXiv. This paper presents scalable hybrid-parallel algorithms for training two large-scale 3D convolutional neural networks, the CosmoFlow network, and the 3D U-Net. For the ComsoFlow network, we successfully scale the training to 2k V100 GPUs with 64x larger spatial input size, by partitioning each data sample across multiple GPUs.

Yosuke Oyama

Our paper has been accepted for IEEE Transactions on Parallel & Distributed Systems

Our 3D CNNs + hybrid-parallelism paper is released on arXiv

SWoPP2019に参加します

Our poster has been accepted for ICPP 2019

深層学習の高速化に関する研究成果が日経Robotics 2019年7月号に掲載されました

Our poster has been accepted for GTC 2019

onnx2chainer

公開シンポジウム「Co-Designによる深層学習基盤」でポスター発表を行います

情報処理学会 2018年度山下記念研究賞を受賞しました

Our paper has been accepted for IEEE Cluster2018

μ-cuDNN v1.1.0

SWoPP2018に参加します

第10回JHPCNシンポジウムでポスター発表を行います

Tokyo Tech-flavored Metropolis Themeを公開しました

2018年以前にHPC研究会で発表したスライド

Initial commit