Distributed ML/DL on KSL systems#
- Accelerating Machine Learning with Scikit Learn
- PyTorch Distributed Data Parallel (DDP)
- Microsoft DeepSpeed
- Accelerate API by Hugginface
- Cray Machine Learning Development Environment
- Pytorch Lightning
- Horovod for Distributed Data Parallel training
- Distributed Deep Learning with Tensorflow 2.x
- MATLAB Deep Learning Toolbox
- Ray Tune for Hyperparameter Optimization experiments