StackRating

An Elo-based rating system for Stack Overflow
Rating Stats for kangshiyin

Rating: 1527.12 (21,449th)
Reputation: 8,671 (18,020th)
Title Δ
Efficient image 2d sliding window max algorithm with hop > 1 0.00
Lower triangular solve using dtrsv() BLAS level 2 0.00
#pragma omp parallel for schedule crashes my program 0.00
OpenMP not starting threads in one machine but works OK in another... 0.00
Sub-Matrix computations -0.02
openmp with nested loops and function call 0.00
why does this openmp give SIGSEGV? -1.69
Parsing a complicated function using MKL/VML -0.04
CUBLAS universal matrix dot product 0.00
Why does calculation with OpenMP take 100x more time than with a si... -0.28
Matlab Convolution using gpu 0.00
OpenMP false sharing and cache hits exploitation +0.31
icc is not performing loop invariant code motion 0.00
C++ OpenMP and gcc 4.8.1 - performance issue when parallelising loops +0.31
CUDA: Why accessing the same device array is not coalesced? 0.00
How to process subarrays in each routine OpenMP -0.03
Why can't I use reduction with default(shared)? -1.71
How to chose value of Block and thread in Cuda? 0.00
CUDA - convert RGB image to Grayscale -0.50
calling templated CUDA kernels from a .cpp file 0.00
Issues with number of threads with openMP in C -0.54
Induction with OpenMP: getting range values for a parallized for lo... -1.78
C++ Element in array first is read correctly, then gives a NaN with... +0.47
divergence statement for redundant threads 0.00
LU decomposition using openmp -0.06
How to use atomicCAS for multiple variables with conditionals in CUDA +0.23
Trying to get CUDA working, sample can't find helper_cuda.h 0.00
CUDA vs Intel AVX / SSE vector sum performance questions 0.00
How would you avoid False Sharing in a scenario like this? 0.00
cuda addvectors memory intuitive explanation 0.00
how can I install the intel composer xe (icc) under open suse 12.3? 0.00
Understanding basic concepts of CUDA through vector addition 0.00
openmp: barrier synchronization not working within loop with if con... 0.00
CUDA kernel reduction to calculate the Euclidean distance between c... 0.00
Multiple scans by key 0.00
use DGEMM BLAS in windows eclipse 0.00
Compiling code containing dynamic parallelism fails +0.70
Random no generation Vs Hashing inside a kernel 0.00
CUDA kernel launch parameters explained right? +0.60
How does GPU help in improving iterative questions? -1.78
How to compile a CUDA program with an specific toolkit version? 0.00
moving elements between arrays in a CUDA kernel 0.00
CUB (CUDA UnBound) equivalent of thrust::gather 0.00
Cuda thrust - xutility: name followed by "::" must be a c... 0.00
Intel MKL cblas_dgemm documention error? 0.00
Still the link errors about Intel-MKL 0.00
Are there any eigenvalue decomposition methods in C++ faster than M... 0.00
basic operand >> not recognized in c++ code after having chan... -0.00
GPU-based inclusive scan on an unbalanced tree +0.32
Discovering my GPU capabilities 0.00