StackRating

An Elo-based rating system for Stack Overflow
Home   |   About   |   Stats and Analysis   |   Get a Badge
Rating Stats for

ArchaeaSoftware

Rating
1524.20 (24,931st)
Reputation
3,746 (44,822nd)
Page: 1 2 3
Title Δ
Are there advantages to using the CUDA vector types? -2.40
OpenCL for GPU vs. FPGA -3.67
parallel removal of elements from an array 0.00
Multiple texture references in CUDA - use in kernel-time 0.00
How can I read from the pinned (lock-page) RAM, and not from the CP... 0.00
cudaMemcpy() calls to streams 0.00
cuMemHostAlloc out of memory errors 0.00
How to reduce CUDA synchronize latency / delay 0.00
Is it possible to run cuMemset on a CUarray? 0.00
Using a GPU card with more Streaming Mulitprocessors +0.09
Can thrust deal with half precision floats (float16 / IEEE 754-2008)? 0.00
do concurrent kernel execution require use of pinned memory 0.00
Optimizing a Bit-Wise Manipulation Kernel 0.00
Which is faster in CUDA: Constant Memory or Texture Memory? +1.12
Is Thrust synchronous or asynchronous? +3.97
Sobel filter in x direction in CUDA +0.14
Accessing cuda device memory when the cuda kernel is running +4.22
Use CUDA driver API in host code which contains CUDA runtime API 0.00
Should CUDA events and streams always be destroyed? -3.82
Is cudaHostRegister equivalent to mlock() system call? -4.09
The memory consistency model CUDA 4.0 and global memory? +0.64
Copy 1D Array to CUDA Memory - pointer to memory 0.00
Debugging inline PTX in Parallel Nsight 0.00
How can I deal with very large image processing in CUDA? Images >... 0.00
Why are overlapping data transfers in CUDA slower than expected? 0.00
Is zero a valid value for a CUDA device pointer? +3.30
does cuda event time cudaMalloc and cudaMemcpy 0.00
How to properly coalesce reads from global memory into shared memor... 0.00
What is the maximum size of the texture memory on a modern GPU? +3.78
When using 3D cuda Memory is it better to pass the associated cudaP... +0.04
Efficient implementation of quick select algorithm on GPU? 0.00
Kernels that run fast on Multicores but relatively slow on GPU 0.00
Build a heap using scan in parallel 0.00
Can we really get performance gain with CUDA streams? +3.92
Calls to GPU kernel from a multithreaded C++ application? +0.28
How would you implement this function in CUDA? (offsets in sorted i... -1.13
Cuda Bayer/CFA demosaicing example +3.73
Any tips to avoid laggy display during long kernels? 0.00
Segmentation fault with cudaMemcpy2D +1.99
multiple textures with same reference type 0.00
Is there a way to initialize block_size in CUDA? 0.00
cuda context creation and resource association in runtime API appli... 0.00
Why CUDA block size of 256 or 512 gives better performance as compa... 0.00
CUDA 4.0 - cudaHostUnregister is slow 0.00
cudaMemcpy too slow -3.79
Does replacing int with short help the performance in CUDA -4.27
Issues with CUDA streams +3.48
Why does replacing if-else by bit-operation turn out to be slower i... +0.13
Passing cuda context to worker pthreads 0.00
Dividing up CUDA cudaMemcpy into chunks +3.99