StackRating

An Elo-based rating system for Stack Overflow
Home   |   About   |   Stats and Analysis   |   Get a Badge
Rating Stats for

Robert Crovella

Rating
1642.70 (671st)
Reputation
101,824 (716th)
Page: 1 2 3 4 5 6 7 ... 55
Title Δ
What's the alternative for __match_any_sync on compute capabili... 0.00
finding the first non-zero element in CUDA +0.30
Parallel Dynamic Programming with CUDA +0.31
CUDA Cores and Streaming Multiprocessors Count for Inference Speed 0.00
Thrust -- sort two vectors by key 0.00
CUDA Dynamically allocated constant or texture memory for array of... 0.00
Bank Conflicts From Non-Sequential Access in Shared Memory on CUDA 0.00
Different timing indicated from two kind of timers 0.00
SASS code and its corresponding asm code in kernel 0.00
How do I compile a CUDA shared library that depends on c++ object f... 0.00
cuda.jit matrices multiplication crashes 0.00
CUDA operations that work differently than on CPU 0.00
cuFFT in column direction 0.00
kernels accessing host memory 0.00
the performance of CUDA depending on declaring variable 0.00
can the gpu access memory allocated by malloc? 0.00
nvidia cuda access gpu shared memory 0.00
Passing structure to raw kernel in cupy -0.20
PyCUDA fills np.array too slow 0.00
CUDA - dynamically reallocate more global memory in Kernel 0.00
CUDA not executing code after nested loop in kernel function 0.00
How to copy char pointer from device to host 0.00
Deadlocks with cuda cooperative groups 0.00
Memory allocation and indexing tied to SM/core in CUDA 0.00
Understanding in details the algorithm for inversion of a high numb... 0.00
graphic card driver fails after installing cuda 0.00
identifier "__shfl_down" is undefined for cuda-7.5 0.00
CUDA Ray-Sphere intersection random walk spooky values 0.00
Efficient zero padding using cudaMemcpy3D 0.00
Why do I have 'insufficient buffer space' when I put alloca... 0.00
Is mask adaptive in __shfl_up_sync call? 0.00
Binary Matrix Reduction in CUDA 0.00
Insight into the first argument mask in __shfl__sync() 0.00
In place real to complex FFT with cufft 0.00
LU factorization receives different results between LAPACK and cuBL... 0.00
Arguments mismatch for instruction 'ld' and 'add' 0.00
Don't understand why column addition faster than row in CUDA 0.00
Static __device__ variable and kernels in separate file 0.00
cudaMallocManaged and cudaDeviceSynchronize() 0.00
Python Numba Cuda slower than JIT 0.00
cuda kernel seems not to be called 0.00
Cuda C threads synchronization with printf or other functions 0.00
cublas matrix matrix multiplication gives INTERNAL ERROR when apply... 0.00
Search Minimum/Maximum from n Arrays parallel in CUDA (Reduction Pr... 0.00
What's the differences between the kernel fusion and persistent... 0.00
cudaMemcpy to host for device-allocated memory still not possible? 0.00
How to copy the pointer variables of array of structures from host... 0.00
How to copy the pointer variable of a structure from host to device... 0.00
Is the memory allocated using malloc inside kernel accessible by th... 0.00
Cuda global memory load and store 0.00