Skip to content

sandeepkumar-skb/pytorch_multistreaming

Repository files navigation

PyTorch Multistreaming Experiments

PyTorch CUDA multistreaming in Python and CPP(TorchScript)

  1. Pytorch_py_multistream - This has examples for launching models on multiple CUDA streams using pytorch context managers using the Python API.
  2. PyTorch_cpp_multistream - This has examples for launching models on multiple CUDA streams using C++ TorchScript and converting it to a CPP extension which can be imported into Python model. This has the following examples:
    • Launching models on CUDA streams using TorchScript.
    • Converting them into CPP extensions which can be imported into the python model.
    • Experiments for launching on multiple CUDA Streams using
      • Threads
      • async_launch
      • pthreads with priority
      • OpenMP thread pools
      • TBB thread pools
  3. seq_block_custom_op- This demonstrates how to launch models on different CUDA streams using CPP custom op and compiling it into a library which can be loaded into Python or CPP modules.

About

PyTorch CUDA multistreaming in Python and CPP(TorchScript)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published