StackRating

An Elo-based rating system for Stack Overflow
Home   |   About   |   Stats and Analysis   |   Get a Badge
Rating Stats for

JackOLantern

Rating
1516.79 (39,374th)
Reputation
14,627 (9,695th)
Page: 1 2 3 4 5 6
Title Δ
CUDA Dynamic memory allocation in kernel 0.00
CUDA dim3 grid bypasses initilization 0.00
Matrix-matrix multiplication between A and B, with A a random matrix 0.00
Purpose and usage of counting_iterators in CUDA Thrust library 0.00
Simulatenous computation and data load to shared memory: the case o... 0.00
Double precision CUDA code being faster than single precision count... -2.95
CUDA code optimization; Number of registers +3.78
Code running on two GPUs does not reach concurrent execution and ha... -3.48
Concurrently running two for loops with same number of loop cycles... 0.00
Do complex thread index calculations have an impact on the performa... +0.40
high performance prefix sum / scan function in CUDA, looking for th... -4.47
Calculating (A - B(D^-1)B^T )^-1 with CUDA 0.00
Latency of shuffle instructions in CUDA 0.00
MATLAB MEX causes memory loss on GPU -4.23
Same code, mex is much slower and pure C, why? 0.00
Best approach for convolution of multiple small matrices using CUDA +3.78
Choosing the initial simplex in the Nelder-Mead optimization algori... 0.00
How does the GK110's Hyper-Q enable concurrency of multiple streams? 0.00
CUDA code for sum of rows of a matrix too slow -1.87
CUDA Reduction - atomic vs single thread summation +4.51
Performance of CUDA __threadfence() synchronization versus synchron... 0.00
Build failed with error MSB3721 when building a CUDA SDK example un... 0.00
Using libraries like boost in cuda device code -0.09
Correct way to use __constant__ memory on CUDA? 0.00
Speeding up Matlab optimizations by CUDA +4.09
Automatic loop unrolling by the NVIDIA CUDA compiler 0.00
Parallel implementation for multiple SVDs using CUDA 0.00
Porting a C code to CUDA without modifying the original code line b... +3.46
CUDA streams and concurrent kernel execution 0.00
CUDA reduction: atomic operations not impacting performance? 0.00
CUDA external class linkage and unresolved extern function in ptxas... 0.00
Maximum number of resident threads per multiprocessor VS. Maximum n... +0.03
Benefit of splitting a big CUDA kernel and using dynamic parallelism 0.00
GPU coalesced global memory access vs using shared memory 0.00
Cuda Project Not Compiling 0.00
Creating mex files from CUDA code +2.03
cuFFT and streams 0.00
2D median filtering in CUDA: how to efficiently copy global memory... 0.00
Finding the local minima of a sampled function by CUDA Thrust 0.00
Emulating std::bitset in CUDA 0.00
Implementing an Exponential Moving Average Filter described by a di... 0.00
plotting from a loop in matlab 0.00
Numerical integration by the Romberg method using CUDA 0.00
CUDA Thread Addressing ((threadIdx.x, threadIdx.y, threadIdx.z) and... -1.71
Solving a non-polynomial equation numerically 0.00
how to measure gpu vs cpu performance , with which time measuring f... 0.00
Matlab Mex file compile 0.00
Simpson's method to integrate real valued functions with CUDA 0.00
Copying from cuda 3D memory to linear memory: copied data is not wh... 0.00
FIR filter in CUDA 0.00