StackRating

An Elo-based rating system for Stack Overflow
Rating Stats for kangshiyin

Rating: 1527.12 (21,449th)
Reputation: 8,671 (18,020th)
Title Δ
Parallel multiplication vector-matrix 0.00
sparse BLAS solver used in a shared library doesn't work (retur... 0.00
keras predict is very slow 0.00
Thrust: How to intentionally avoid passing a parameter into algorit... 0.00
openmp parallel for - How does it handle previously private index? -2.06
How can I solve the sparse linear system with the coefficients in Z... 0.00
Loop sequence in OpenMP Collapse performance advise 0.00
thrust::min_element Access violation while reading from location 0.00
How can I skip the fourth element in a float4 when using cublas sge... 0.00
How to optimize matlab code for gpu 0.00
How to work with struct inside struct in Cuda -0.02
How many simultaneous read instructions per thread on modern GPU? 0.00
How to fit a bounding ellipse around a set of 2D points 0.00
How to override python's distutils gcc linker with icc? -0.53
Comparing the time requirements of addition and division operation... 0.00
Thrust: reduce_by_key passing zip_iterator(tuple) into custom funct... 0.00
SciPy compatibility issue with MKL libraries 0.00
Is there analogy of boost compute function in Thrust? 0.00
Does the implementation of pow() function in C/C++ vary with platfo... +0.48
Eigen use of diagonal matrix -1.36
problems getting OpenMP 4.0 to run in eclipse (Linux Mint) 0.00
how to parallelize this for-loop using reduction? +0.45
Darknet framework fails to start with GPU acceleration using CUDA -0.03
OpenMP outer loop private or shared +0.08
Solving ill-conditioned system of linear equations with Lapack&co 0.00
CUDA Thrust functor GMEM access: ctor data copy vs ctor dev ptr arg 0.00
Use more than one thread when calling intel's mkl directly from... 0.00
CUDA Thrust Functor with Flexibility to Run in CPU or GPU -1.54
How to perform relational join on two data containers on GPU (prefe... 0.00
Matrix multiplication when one dimension is much larger than the ot... 0.00
cublasSgetrsBatched error in kernel 0.00
Sparse BLAS on OSX -0.02
Error while getting Identity matrix after performing matrix multipl... 0.00
How to process a task of arbitrary size using CUDA? 0.00
Search value in OpenCV vector 0.00
Parallelizing Boruvka with openMP 0.00
Why is this CUDA kernel slow? +0.47
Torch linear model forward pass 4 times slower on GPU then CPU 0.00
SIGSEGV in CUDA allocation 0.00
How to write "target data map" for std::vector in OpenMP 4? 0.00
Is there a way to determine the size of BLAS IDAMAX function? 0.00
What is the advantage of doing a Multi-GPU training in TensorFlow? 0.00
Sampling from a boolean matrix in Eigen 0.00
Can a program designed to use mpi be run with a gpu? 0.00
illegal memory access in CUDA 0.00
cuBLAS matrix inverse much slower than MATLAB 0.00
cublasStbsv in cuda kernel 0.00
GPU Memory bandwidth theoretical vs practical 0.00
cuBLAS dsyrk slower than dgemm 0.00
Invalid device pointer error +0.46