StackRating

An Elo-based rating system for Stack Overflow
Home   |   About   |   Stats and Analysis   |   Get a Badge
Rating Stats for

Greg Smith

Rating
1585.55 (2,598th)
Reputation
8,623 (18,136th)
Page: 1 2 3 ... 4
Title Δ
What are the "long" and "short" scoreboards w.r... 0.00
Cache behaviour in Compute Capability 7.5 0.00
Terminology used in Nsight Compute 0.00
dram_write_bytes result on P100 0.00
Achieved Occupancy column is not shown is Nsight Profiling result 0.00
Interpreting compute workload analysis in Nsight Compute 0.00
Instruction execution order by cuda driver 0.00
Can the name of a running CUDA kernel be obtained by its threads? 0.00
Issued load/store instructions for replay +1.98
How to get malloc to show up in nvprof's statistical profiler? 0.00
Why my GPU program can execute, although the number of blocks excee... -0.72
CUDA coalesced memory access speed depending on word size 0.00
L1 cache in GPU 0.00
local cache hit metric in cuda profiler 0.00
Control flow efficiency 0.00
FLOP efficiency in CUDA +0.40
Which is faster for CUDA shared-mem atomics - warp locality or anti... +0.95
How do SM(streaming multiprocessors), active blocks and active warp... 0.00
Understanding the IPC metric from Nvprof and GPGPUsim 0.00
How to a warp cause another warp be in the Idle state? 0.00
task scheduling of NVIDIA GPU +0.40
CUDA: Will the same thread accessing the same bank twice cause bank... 0.00
Thread ID rotation in NVIDIA GPU assembly code (SASS) 0.00
Instruction replay in CUDA 0.00
Understanding Warp Parallelism (Fermi) 0.00
CUDA __constant__ deference to global memory. Which cache? 0.00
Understanding Dynamic Parallelism in CUDA +0.40
CUDA Nsight Debug Focus Block not active 0.00
Reading event counters with concurrent exection 0.00
*Modified* Nvidia Maxwell, increased global memory instruction count 0.00
CUDA Profiler: Calculate memory and compute utilization 0.00
What does <overflow> mean during CUDA profiling? 0.00
What info we get from CUDA Information Tool Window 0.00
cuda nsight: generating debug symbolics: -G0 0.00
Global Memory Load/Store Efficiency and Global Memory Coalescence 0.00
Miscellaneous and Inter-Thread Communication Instructions in CUDA 0.00
gpgpu: Why dont we need branch prediction in fine grain multi-threa... -0.37
GPGPU: Consequence of having a common PC in a warp +0.75
New issue stall reasons in NVIDIA Nsight Visual Studio Edition 4.1... 0.00
Throughput drops after saturation with more threads 0.00
What is the "warp allocation granularity", and what purpo... 0.00
NSight (NVIDIA) does not work correctly using 'Pause and Captur... -0.12
"while"/"for" loop in kernel causing CUDA out o... 0.00
How are warps partitioned across cores in NVIDIA GPUs? +1.60
Running CUDA and OpenGL in parallel without using interoperability 0.00
CUDA Kernel running repeatedly for each launch 0.00
Is there a limit size of malloc? -0.13
NVIDIA cuda memory trace generator 0.00
How to explain the super-linear speedup observed in GPU device with... 0.00
How to code the profilling of multiple metrics from Nvidia CUDA gpu... 0.00