StackRating

An Elo-based rating system for Stack Overflow
Home   |   About   |   Stats and Analysis   |   Get a Badge
Rating Stats for

tera

Rating
1541.28 (11,178th)
Reputation
5,627 (29,172nd)
Page: 1 2 3 4
Title Δ
Can I check whether an address is in shared memory? +1.49
Why repeating a kernel inside a for-loop makes CUDA code so slower? +0.45
What is the most efficient way of partitioning my buffer which favo... 0.00
CUDA - GB/s for PCI-E vs Gbps for memory clock speed for GPUs 0.00
NVCC - host compiler targets unsupported OS 0.00
Cuda register compiler optimization 0.00
Cuda, unified memory, data transfers 0.00
How to remove all PTX from compiled CUDA to prevent Intellectual Pr... 0.00
Initialize big images using cuda +0.20
Heat equation matrix in CUDA - illegal address error +0.46
CUDA C using single precision flop on doubles 0.00
how exactly does CUDA handle a memory access? 0.00
Strange under-performance of Titan X in memory-bound kernels (e.g.... +0.44
Unrecognized token error when declaring a cudaError_t variable -1.60
Use shared memory for neighboring array elements? +0.45
How to store bool result of a CUDA kernel function 0.00
CUDA Can't allocate more than 6.8Gb memory on a 8Gb card 0.00
Is the warmup code necessary when measuring CUDA kernel running time? 0.00
CUDA Kernel is crashing without any reason with 20k+ threads 0.00
Is it possible to make a gpu cluster (accessible via ssh) accessibl... 0.00
why increasing the number of blocks in cuda increase the time? 0.00
In my cuda program of runtime ,the cpu and gpu can compute Asynchro... -0.00
several cuda_stream for one cuda kernel 0.00
Define atomicAdd function doesn't work in CUDA 0.00
CUDA streams performance 0.00
Can i run Cuda or opencl on intel iris? +1.44
cast uint8_t array to uint32_t array, alignment is off +1.90
Managing Occupancy in CUDA 0.00
LLVM with CUDA inline assembly 0.00
CUDA kernel only launches and runs at some grid sizes 0.00
How to pack bits (efficiently) in CUDA? +4.00
How to pack bits (efficiently) in CUDA? -4.00
Smart bit encoding of floating point values (float, double) +4.58
How can I make sure the compiler parallelizes my loads from global... 0.00
expected speedup Numba/CUDA versus Numpy 0.00
What are (empirically) sufficient conditions for NVCC to use ldg in... +3.17
In cuda do threads of a block act on consecutive array elements or... -0.38
How to get N greatest elements out of M elements using CUDA, where... 0.00
Is there any way to process the huge bunch of float data as keeping... -3.80
Preventing Out-Of-Bounds in kepler: branches, textures or bigger bu... 0.00
How to avoid TLB miss (and high Global Memory Replay Overhead) in C... 0.00
Arithmetic Intensity in Nvidia Architectures 0.00
Cuda AtomicAdd not increment 0.00
64 bit number support in CUDA 0.00
CUDA profiling inside kernel 0.00
CUDA __syncthreads(); not working; inverse in breakpoint hit order 0.00
Special Case of Matrix multiplication Using CUDA 0.00
CUDA Independent Instruction optimization 0.00
nvcc runs from command line but not from shell 0.00
Improving asynchronous execution in CUDA 0.00