StackRating

An Elo-based rating system for Stack Overflow

Home | About | Stats and Analysis | Get a Badge

Rating Stats for

JackOLantern

Rating
1516.79 (39,374th)

Reputation
14,627 (9,695th)

Page: 1 2 3 4 5 6

Title	Δ
CUDA Dynamic memory allocation in kernel	0.00
CUDA dim3 grid bypasses initilization	0.00
Matrix-matrix multiplication between A and B, with A a random matrix	0.00
Purpose and usage of counting_iterators in CUDA Thrust library	0.00
Simulatenous computation and data load to shared memory: the case o...	0.00
Double precision CUDA code being faster than single precision count...	-2.95
CUDA code optimization; Number of registers	+3.78
Code running on two GPUs does not reach concurrent execution and ha...	-3.48
Concurrently running two for loops with same number of loop cycles...	0.00
Do complex thread index calculations have an impact on the performa...	+0.40
high performance prefix sum / scan function in CUDA, looking for th...	-4.47
Calculating (A - B(D^-1)B^T )^-1 with CUDA	0.00
Latency of shuffle instructions in CUDA	0.00
MATLAB MEX causes memory loss on GPU	-4.23
Same code, mex is much slower and pure C, why?	0.00
Best approach for convolution of multiple small matrices using CUDA	+3.78
Choosing the initial simplex in the Nelder-Mead optimization algori...	0.00
How does the GK110's Hyper-Q enable concurrency of multiple streams?	0.00
CUDA code for sum of rows of a matrix too slow	-1.87
CUDA Reduction - atomic vs single thread summation	+4.51
Performance of CUDA __threadfence() synchronization versus synchron...	0.00
Build failed with error MSB3721 when building a CUDA SDK example un...	0.00
Using libraries like boost in cuda device code	-0.09
Correct way to use __constant__ memory on CUDA?	0.00
Speeding up Matlab optimizations by CUDA	+4.09
Automatic loop unrolling by the NVIDIA CUDA compiler	0.00
Parallel implementation for multiple SVDs using CUDA	0.00
Porting a C code to CUDA without modifying the original code line b...	+3.46
CUDA streams and concurrent kernel execution	0.00
CUDA reduction: atomic operations not impacting performance?	0.00
CUDA external class linkage and unresolved extern function in ptxas...	0.00
Maximum number of resident threads per multiprocessor VS. Maximum n...	+0.03
Benefit of splitting a big CUDA kernel and using dynamic parallelism	0.00
GPU coalesced global memory access vs using shared memory	0.00
Cuda Project Not Compiling	0.00
Creating mex files from CUDA code	+2.03
cuFFT and streams	0.00
2D median filtering in CUDA: how to efficiently copy global memory...	0.00
Finding the local minima of a sampled function by CUDA Thrust	0.00
Emulating std::bitset in CUDA	0.00
Implementing an Exponential Moving Average Filter described by a di...	0.00
plotting from a loop in matlab	0.00
Numerical integration by the Romberg method using CUDA	0.00
CUDA Thread Addressing ((threadIdx.x, threadIdx.y, threadIdx.z) and...	-1.71
Solving a non-polynomial equation numerically	0.00
how to measure gpu vs cpu performance , with which time measuring f...	0.00
Matlab Mex file compile	0.00
Simpson's method to integrate real valued functions with CUDA	0.00
Copying from cuda 3D memory to linear memory: copied data is not wh...	0.00
FIR filter in CUDA	0.00