StackRating

An Elo-based rating system for Stack Overflow
Home   |   About   |   Stats and Analysis   |   Get a Badge
Rating Stats for

talonmies

Rating
1649.65 (569th)
Reputation
60,369 (1,526th)
Page: 1 2 3 4 ... 33
Title Δ
How to run numba.jit decorated function on GPU? 0.00
CUDA can't use all available constant memory 0.00
Efficient CUDA FFT with a spike 0.00
Does clang support CUDA __global__ function recursion? 0.00
Is it appropriate to store an array of size 64 in register for Nvid... 0.00
Compiling CUDA with clang on math functions 0.00
Is it possible to manually set the SMs used for one CUDA stream? 0.00
Is it possible to call a cuda.jit function which indirectly calls a... 0.00
How to program NVIDIA's tensor cores in RTX GPU with python and... 0.00
How to do the modulus of complex number more efficiently in CUDA? +0.30
Dynamic allocation in device makes the memory copy fails 0.00
How to compute large matrix-vector multiplication by splitting data... 0.00
How to check which CUDA error arises in which asynchronous CUDA call? 0.00
Thread index as an memory location index in CUDA 0.00
Using atomic arithmetic operations in CUDA Unified Memory multi-GPU... 0.00
Is there a way to multiply all the elements of an array using thrust? 0.00
Is there a way to update cuda driver using conda on windows 10 0.00
CUDA Unified Memory and use of std::vector in device code 0.00
PyCUDA Kernel returns inconsistent division result for specific cal... 0.00
CUDA Driver API vs Runtime API in 2020 0.00
Why is there a warp-level synchronization primitive in CUDA? 0.00
Struct of multidimensional arrays in CUDA 0.00
Why does cuda_free have such a high overhead? (is it due to zeroing?) 0.00
CUDA array addition and block size 0.00
How to increase FLOPS achieved using CUDA 0.00
CUDA GPU processing: TypeError: compile_kernel() got an unexpected... 0.00
cublas address out of bounds for particular matrix size 0.00
How to execute this same function on CPU and GPU with JCuda 0.00
Cuda cuLaunchHostFunc equivalent in python libs 0.00
Nvcc fatal : The version (‘40001’) of the host compiler (‘clang’) i... 0.00
Are conversions from float* to float3* in CUDA safe? 0.00
How to effectively copy data from 2D host array (with padding) to 1... 0.00
Declaration of cuda variables in header which is included in cpp pr... 0.00
CUDA coalescing and global memory 0.00
Is it still necessary to install CUDA before using the conda tensor... 0.00
When is a primary CUDA context destroyed by the Runtime API? 0.00
Use thrust reduce_by_key on a struct 0.00
Wrong values of thrust::transform output 0.00
CUDA Device Vector 0.00
NVCC: is it possible to target an earlier driver while compiling wi... 0.00
convert hexdecimal array to char array in CUDA 0.00
CUDA: Differences in precision when summing doubles on host and dev... +1.99
Why is this NVIDIA CUDA PTX not working as intended? 0.00
Does order of memory allocation matter in PyCUDA's curandom? 0.00
Why dummy cudaMalloc speeds up interval_gather in moderngpu? 0.00
__host__ __device__ functions calling overloaded functions 0.00
Pass statically declared __constant__ variable as kernel parameter... 0.00
What is the proper way to use stride in cuda to do multiblock reduc... 0.00
Cuda memory allocation 0.00
Created Shared Memory Code with Python Cuda +0.30