PyCUDA+Threading = Invalid Handles on kernel invocations

Posted by 风吹稻花香 on 2021/06/04 23:36:46

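The reason is context affinity. Every CUDA function handle is tied to the context in which it was loaded and is not portable to any other context (the same applies to memory allocations and texture references). A handle obtained under one thread's context is therefore invalid when a kernel is launched from a thread whose current context is a different one. So each context must load the function separately, and then use the function handle returned by that load operation.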
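
One way to respect context affinity from Python threads is sketched below. It is only an illustration, not code from the original answer: the class name `GPUWorker`, its constructor arguments, and the `launch` hook are hypothetical, while `driver.init`, `Device.make_context`, `module_from_file`, `get_function`, and `Context.pop` are existing PyCUDA calls. The idea is that the thread creates its own context inside `run()` and loads its own function handle there, so no CUDA object ever crosses a thread boundary.

```python
import threading


class GPUWorker(threading.Thread):
    """Hypothetical worker thread that owns its context and function handle."""

    def __init__(self, device_id, cubin_path, kernel_name):
        super().__init__()
        # Only plain Python values are stored here; nothing CUDA-related is
        # created in the parent thread.
        self.device_id = device_id
        self.cubin_path = cubin_path
        self.kernel_name = kernel_name
        self.error = None

    def run(self):
        ctx = None
        try:
            # Imported inside the thread so that constructing a GPUWorker
            # never touches CUDA.
            import pycuda.driver as driver

            driver.init()
            # The context is created in, and made current for, this thread.
            ctx = driver.Device(self.device_id).make_context()
            # Load the module and fetch the handle under *this* context.
            module = driver.module_from_file(self.cubin_path)
            kernel = module.get_function(self.kernel_name)
            self.launch(kernel)
        except Exception as exc:
            # Record the failure instead of letting the thread die silently.
            self.error = exc
        finally:
            if ctx is not None:
                ctx.pop()

    def launch(self, kernel):
        """Hypothetical hook: subclasses launch `kernel` with their own data."""
```

A caller would create one `GPUWorker` per device or task, call `start()` and `join()`, and then inspect `error`. Device memory for the launch would also need to be allocated inside `run()` (or `launch`), for the same reason the function handle is.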

If you are not using metaprogramming at all, you might find it simpler to compile your CUDA code to a cubin file, and then load the functions you need from the cubin into each context with driver.module_from_file. The following is cut and pasted directly from some production code of mine:

    # Excerpt from a class method. The names it uses suggest these
    # module-level imports (an assumption; the original omits them):
    import numpy as np
    from warnings import warn
    from pycuda import driver, tools

    # Context establishment
    try:
        if autoinit:
            # Let pycuda.autoinit create and manage the context
            import pycuda.autoinit
            self.context = None
            self.device = pycuda.autoinit.device
            self.computecc = self.device.compute_capability()
        else:
            # Create the context explicitly
            driver.init()
            self.context = tools.make_default_context()
            self.device = self.context.get_device()
            self.computecc = self.device.compute_capability()

        # GPU code initialization
        # load pre compiled CUDA code from cubin file
        # Select the cubin based on the supplied dtype
        # cubin names contain C++ mangling because of
        # templating. Ugly but no easy way around it
        if self.computecc == (1, 3):
            self.fimcubin = "fim_sm13.cubin"
        elif self.computecc[0] == 2:
            self.fimcubin = "fim_sm20.cubin"
        else:
            raise NotImplementedError("GPU architecture not supported")

        fimmod = driver.module_from_file(self.fimcubin)

        IterateName32 = "_Z10fimIterateIfLj8EEvPKT_PKiPS0_PiS0_S0_S0_jjji"
        IterateName64 = "_Z10fimIterateIdLj8EEvPKT_PKiPS0_PiS0_S0_S0_jjji"

        if self.dtype == np.float32:
            IterateName = IterateName32
        elif self.dtype == np.float64:
            IterateName = IterateName64
        else:
            raise TypeError

        # The handle is only valid in the context that is current here
        self.fimIterate = fimmod.get_function(IterateName)

    except ImportError:
        warn("Could not initialise CUDA context")
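
The choice of cubin file and mangled kernel name in the excerpt does not itself touch the GPU, so it can be pulled out into plain functions and checked without a device. The sketch below is one possible arrangement, not part of the original code: the function names `select_cubin` and `select_iterate_name` and the table `ITERATE_NAMES` are hypothetical, and the dtype is passed as a string (for example `np.dtype(self.dtype).name`) so the helpers need no NumPy import. The file names and mangled symbols are copied from the excerpt.

```python
# Mangled kernel names copied from the excerpt, keyed by dtype name.
ITERATE_NAMES = {
    "float32": "_Z10fimIterateIfLj8EEvPKT_PKiPS0_PiS0_S0_S0_jjji",
    "float64": "_Z10fimIterateIdLj8EEvPKT_PKiPS0_PiS0_S0_S0_jjji",
}


def select_cubin(compute_capability):
    """Return the cubin file name for a (major, minor) compute capability."""
    major, minor = compute_capability
    if (major, minor) == (1, 3):
        return "fim_sm13.cubin"
    if major == 2:
        return "fim_sm20.cubin"
    raise NotImplementedError("GPU architecture not supported")


def select_iterate_name(dtype_name):
    """Return the mangled kernel name for a dtype name such as 'float32'."""
    try:
        return ITERATE_NAMES[dtype_name]
    except KeyError:
        raise TypeError("unsupported dtype: %s" % dtype_name) from None
```

Each context would still call `driver.module_from_file(select_cubin(...))` and `get_function(select_iterate_name(...))` itself; only the name lookup is shared between threads, never the resulting handles.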

Source: blog.csdn.net. Author: 网奇. Copyright belongs to the original author; please contact the author before reposting.

Original link: blog.csdn.net/jacke121/article/details/79705500
