StackRating

An Elo-based rating system for Stack Overflow
Rating Stats for Robert Crovella

Rating: 1642.70 (671st)
Reputation: 101,824 (716th)
Title Δ
How to run pytorch with NVIDIA "cuda toolkit" version ins... 0.00
Thrust: Stream compaction copying only first N valid elements 0.00
OpenACC - Complex loop carried dependence of a->,c->,b-> p... 0.00
How to link a shared library (.so) to a CUDA source file (.cu) usin... 0.00
NUMBA CUDA slower than parallel CPU even for giant matrices 0.00
nvcc under linux complains: Contains a vector, which is not support... +1.36
How to enable C++17 code generation in VS2019 CUDA project 0.00
How to improve computational time for sorting with thrust? 0.00
When I `make` darknet with CUDA=1, `/usr/bin/ld: cannot find -lcuda... 0.00
Cuda Error (209): cudaLaunchKernel returned cudaErrorNoKernelImageF... 0.00
What does cudaSetDevice() do to a CUDA device's context stack? 0.00
Why is linking to cuda runtime still necessary for library user eve... 0.00
Why it is so slow to transfer data from GPU when use numba.cuda and... 0.00
Debugging CUDA MMU Fault 0.00
When running Numba code on a CUDA GPU, I notice one of my CPU cores... 0.00
Running texture objects with streams 0.00
How to use WMMA functions in Cupy kernels? 0.00
CUDA error identifier "__stcs" is undefined 0.00
Static linking with cublas 0.00
Pass 2D thrust::device_vector Complex Matrix to CUDA kernel function 0.00
How to use CUDA IPC? 0.00
Thrust::sort slow for array of structs of size 300k in GTX960M 0.00
Sorting multiple arrays using CUDA/Thrust 0.00
How many CUDA cores is used to process a CUDA warp? 0.00
Copying a 2D malloc pitched device memory to a 3D array in the device 0.00
How to do tuple reduction in cuda (also return a tuple)? 0.00
CUDA Speed Slower than expected - Image Processing 0.00
How to accelerate list of saxpy's in CUDA 0.00
cupy indexing is slow 0.00
Problem of converting bgr to yuv420p with cuda 0.00
CUDA: Differences in precision when summing doubles on host and dev... -1.99
question about modifing flag array in cuda 0.00
How to compile C++ with CUB library? 0.00
Pointer to member of device object 0.00
Cuda C++: Malloc class on Device and fill it with data from the host 0.00
How to initialize an object at pointer location with cudaMallocMana... 0.00
Why cannot a kernel be launched with the reason of too many registe... 0.00
CUDA: Unified Memory and change of pointer address? 0.00
Is `__shfl_sync` broken for 64-bit? 0.00
CUDA unified memory and Windows 10 0.00
How to use something like Try-Catch inside CUDA device code 0.00
Desired Compute-To-Memory-Ratio (OP/B) on GPU 0.00
How to know how many elements are in the result of thrust::partitio... 0.00
Does PTX actually have a 64-bit warp shuffle instruction? 0.00
Nvidia GPU simultaneous access to a single location in global memory 0.00
Pass a __device__ lambda as argument to a __global__ function 0.00
Access an matrix as its tranpose in tiled matrix mutliplication in... 0.00
memset cuArray for surface memory 0.00
Cuda: XOR single bitset with array of bitsets 0.00
Smart design for large kernel with different inputs that only chang... 0.00