Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Recent questions tagged CUDA

0 votes, 896 views, 1 answer
cuda - Reading from an unaligned uint8_t recast as a uint32_t array - not getting all values
I am trying to cast a uint8_t array to a uint32_t array. However, when I try to do this, I can't seem to be able ... any way that I can do this?
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda
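
For readers of the question above, here is a minimal host-side C++ sketch of one common way to read a 32-bit value from a byte buffer at an offset that is not a multiple of 4. It is not taken from the linked answer. The helper name `load_u32_unaligned` is hypothetical, and the sketch assumes the goal is simply to reassemble four consecutive bytes in the machine's native byte order. Casting the `uint8_t` pointer to `uint32_t*` and dereferencing it is undefined behaviour in C++ when the address is misaligned, and CUDA device code requires naturally aligned addresses for 32-bit loads, so the sketch copies the bytes instead.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical helper (not from the original question): read four bytes
// starting at bytes[offset] into a uint32_t without requiring 4-byte alignment.
// The caller must ensure that offset + 4 does not exceed the buffer size.
inline std::uint32_t load_u32_unaligned(const std::uint8_t* bytes, std::size_t offset) {
    assert(bytes != nullptr);
    std::uint32_t value;
    // memcpy makes no alignment assumption about the source address.
    std::memcpy(&value, bytes + offset, sizeof(value));
    return value;
}
```

The result depends on the byte order of the machine, exactly as a direct cast would on hardware that tolerates it. Compilers typically turn the fixed-size `memcpy` into plain load instructions, so the copy is usually not a performance concern.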

0 votes, 1.2k views, 1 answer
cuda - nvcc.exe linking error: Microsoft Visual Studio configuration file 'vcvars64.bat' could not be found
I want to use nvcc -ptx from the Windows command line, but I always get this error message: nvcc : fatal error ... . What can be the solution?
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 691 views, 1 answer
cuda - CUDA_ERROR_INVALID_IMAGE during cuModuleLoad
I've created a very simple kernel (can be found here) which I successfully compile using "C:Program ... valid and compiles without issues.
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 687 views, 1 answer
cuda - How to start the debug version of a project in Nsight with the optirun command?
I've been writing a simple CUDA program (I'm a student, so I need to practice), and the thing is I can ... for helping in advance folks. :)
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 660 views, 1 answer
cuda - Do I have to use the MPS (Multi-Process Service) when using CUDA 6.5 + MPI?
The following link states: https://docs.nvidia.com/deploy/pdf/CUDA_Multi_Process_Service_Overview.pdf 1.1. AT A GLANCE ... will stay the same?
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 787 views, 1 answer
cuda - How are 2D thread blocks padded for warp scheduling?
I understand that a 1D thread block with 31 threads will be padded to 32 threads for warp execution. What ... (31*31=961; 961%32=1)?
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda
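
The arithmetic in the question above (31*31=961; 961%32=1) can be checked with a small host-side C++ sketch. It relies on the thread numbering in the CUDA programming model: a thread's linear ID within a block is x + y*Dx + z*Dx*Dy, and warps are formed from consecutive linear IDs, so the rows of a 2D block are packed back to back and only the final warp is partially filled. The names `WarpLayout`, `warp_layout` and `linear_thread_id` are hypothetical helpers for illustration, and the warp size of 32 is assumed as a constant here (device code would read `warpSize`).

```cpp
#include <cassert>

// Assumed warp size; 32 on current NVIDIA GPUs.
constexpr int kWarpSize = 32;

// Hypothetical summary of how a block's threads are split into warps.
struct WarpLayout {
    int total_threads;         // Dx * Dy * Dz
    int num_warps;             // ceil(total_threads / kWarpSize)
    int threads_in_last_warp;  // active threads in the final warp
};

// Linear thread ID within a block: x varies fastest, then y, then z.
inline int linear_thread_id(int tx, int ty, int tz, int block_x, int block_y) {
    return tx + ty * block_x + tz * block_x * block_y;
}

// Compute the warp layout for a block of the given dimensions.
inline WarpLayout warp_layout(int block_x, int block_y, int block_z = 1) {
    assert(block_x > 0 && block_y > 0 && block_z > 0);
    const int total = block_x * block_y * block_z;
    const int warps = (total + kWarpSize - 1) / kWarpSize;  // round up
    const int rem = total % kWarpSize;
    return {total, warps, rem == 0 ? kWarpSize : rem};
}
```

For a 31x31 block, `warp_layout(31, 31)` reports 961 threads in 31 warps, with a single active thread in the last warp. The thread at (x=0, y=1) has linear ID 31 and therefore still belongs to warp 0, which shows that the padding is applied once per block rather than once per row.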

0 votes, 745 views, 1 answer
cuda - thrust::sequence - how to increase the step after each N elements
I am using thrust::sequence(myvector.begin(), myvector.end(), 0, 1) and get a nicely ordered list like: 0, 1, ... or am I missing a simple way?
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 492 views, 1 answer
cuda - CUDA kernels not executing concurrently
I'm trying to explore the concurrent kernel execution property of my Nvidia Quadro 4000, which has 2.0 ... CHK_ERR(cudaDeviceReset()); }
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 675 views, 1 answer
cuda - Should I check the number of threads in kernel code?
I am a beginner with CUDA, and my coworkers always design kernels with the following wrapping: __global__ ... specified block/grid dimensions?
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 577 views, 1 answer
cuda - JIT in JCuda, loading multiple PTX modules
I said in this question that I had some problems loading PTX modules in JCuda, and after @talonmies's idea, I ... variable by reference in JCuda?
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 659 views, 1 answer
cuda - Caffe compilation fails due to unsupported gcc compiler version
I am struggling with Caffe compilation and have failed to compile it. Steps I followed: git clone https://github.com/ ... .9 - what to do?
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 916 views, 1 answer
cuda - CURAND Library - Compiling Error - Undefined reference to functions
I have the following code which I am trying to compile using nvcc. Code: #include <stdio.h> #include <stdlib.h ... to solve my problem. Thanks!
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 757 views, 1 answer
cuda - Performance of atomic operations on shared memory
How do atomic operations perform when the address they are given resides in block shared memory? During ... atomic operation is done?
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 909 views, 1 answer
cuda - thrust reduction result on device memory
Is it possible to leave the return value of a thrust::reduce operation in device-allocated memory? In case it is ... I use a thrust::device_ptr?
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 675 views, 1 answer
cuda - What are "Other" Issue Stall Reasons displayed by the Nsight profiler?
I have a kernel that is performing poorly on CC 3.0 (Kepler) as opposed to CC 2.0 (Fermi). In the Nsight profiler, ... Nsight 3.0. RC / CC 3.0.
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 569 views, 1 answer
cuda - Does 'code=sm_X' embed only binary (cubin) code, or also PTX code, or both?
I am a little bit confused about the 'code=sm_X' option within the '-gencode' statement. An example: What does ... is conflicting in my opinion.
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 687 views, 1 answer
cuda - Amdahl's law and GPU
I have a couple of doubts regarding the application of Amdahl's law with respect to GPUs. For instance, I ... for the parallel code? Thanks
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 941 views, 1 answer
cuda - How to use GPUDirect RDMA with InfiniBand
I have two machines. There are multiple Tesla cards on each machine. There is also an InfiniBand card on each ... dealing with this in OpenMPI.
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 434 views, 1 answer
cuda - __activemask() vs __ballot_sync()
After reading this post on the CUDA Developer Blog, I am struggling to understand when it is safe/correct to use __activemask ... the function interface.
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 920 views, 1 answer
cuda - How do we use cuPrintf()?
What do we have to do to use cuPrintf()? (device compute capability 1.2, Ubuntu 12) I couldn't find " ... "hello_kernel") is not allowed Thanks!
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 976 views, 1 answer
cuda - Set default host compiler for nvcc
I have just installed Debian Stretch (9) and CUDA 8 on a new GPU server. Stretch does not come with ... cuda config or an environment variable?
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 891 views, 1 answer
cuda - What is the difference between cudaMemcpy() and cudaMemcpyPeer() for P2P-copy?
I want to copy data from GPU0-DDR to GPU1-DDR directly without CPU-RAM. As stated here on page 15: http: ... any advantage, why it is needed?
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 920 views, 1 answer
cuda - Equivalent of cudaGetErrorString for cuBLAS?
CUDA runtime has a convenience function cudaGetErrorString(cudaError_t error) that translates an error enum into a ... function like this?
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 1.1k views, 1 answer
cuda - Branch and predicated instructions
Section 5.4.2 of the CUDA C Programming Guide states that branch divergence is handled either by "branch ... set the predicate". Why?
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda

0 votes, 506 views, 1 answer
cuda - How to compile PTX code
I need to modify the PTX code and compile it directly. The reason is that I want to have some specific instructions ... cubin) to "X.o" file.
asked Oct 24, 2021 in Technique by 深蓝 (71.8m points), tagged cuda