General CUDA GPU Computing Discussion
Sticky:
Vertical applications and links based on CUDA
(9 replies)
Sticky:
August GPU Computing Webinars now open for registration
(6 replies)
Sticky:
GPU Technology Conference
(9 replies)
Sticky:
Getting started with parallel programming
(5 replies)
Sticky:
Welcome!
(1 reply)
Sticky:
CUDA 2.1 FAQ
(0 replies)
CUDA on LINUX
(1 reply)
ATI isn't interested in GPGPU
(7 replies)
GPU vs CPU theoretical single-precision peak performance
(2 replies)
CUDA Toolkit 3.0 beta released
(49 replies)
Server 2008 R2 and S1070
(1 reply)
Pointer to CudaMem
(9 replies)
Anyone have a Tesla S/D/C870 that can help with...
(5 replies)
submatrix computation
(3 replies)
Active aerospace research?
(2 replies)
multi dimension array
(14 replies)
Texture References and Asynchronous Kernel Calls
(0 replies)
Taking apart global atomics performance
(12 replies)
AtomicAdd faster than coalesced add. What is going on?
(2 replies)
Article on bit manipulation speed comparisons - CUDA/SSE2/GPU
(0 replies)
G210, GT220 deviceQuery?
(30 replies)
CUDA - MathLink incompatibility?
(1 reply)
Used C1060s?
(10 replies)
Mandelbulbs
(12 replies)
GPGPU with a hard real time OS ?
(2 replies)
More details on new Tesla w/ Fermi GPU posted
(21 replies)
CUDA C programming, output correct in emulation mode but not when run on GPU
(0 replies)
Where can I get NVIDIA NPP?
(2 replies)
How is the number of required registers per thread counted?
(2 replies)
How to do Parallel Reduction of many unequally sized arrays in CUDA?
(0 replies)
benchmark vs a cluster
(0 replies)
texture interpolation
(2 replies)
__constant__ qualifier questions, please help
(8 replies)
Quick Question on cudaSetDevice()? It does not work in my case.
(5 replies)
Fermi and zero-copy
(0 replies)
GMem coalescing bandwidth for double data
(0 replies)
ERROR: too many resources requested for launch.
(3 replies)
does Tesla support SLI?
(6 replies)
So how much shared mem do we really have ?
(0 replies)
Regarding Fermi SM organization change
(9 replies)
new cufftPlanMany() in CUFFT
(2 replies)
cuda stream
(2 replies)
Complete cuBLAS anytime soon?
(9 replies)
How can I use Cuda Visual Profiler
(1 reply)
struct pointer element
(0 replies)
TLB_miss, TLB_hit and time in profiler
(2 replies)
Licensing question
(0 replies)
Water sounds
(8 replies)
Sorting bottleneck in rendering fractals
(19 replies)
Moved:
Engineering Sample Quadro 3700M
(-- replies)
Is real time ray tracing feasible?
(11 replies)
DVI Port in Tesla C2050 & C2070
(0 replies)
Number of variables limit?
(6 replies)
cudaMallocPitch() and cudaMemcpy2D()
(0 replies)
cudaMemcpy(..., cudaMemcpyDeviceToHost) not working?
(1 reply)
Strange behavior when overlapping transfer and kernel execution!
(2 replies)
How to use the type "short" for coalesced reading and writing?
(6 replies)
Can't get copyDeviceToHost to work with cudaMemcpy2D
(0 replies)
"Genetic" image compression using transparent polygons
(55 replies)
Coalesced Memory Access ?
(2 replies)
questions about coalescing access
(8 replies)
Strange bandwidthTest results with new hardware
(11 replies)
Phenomenal Speed-up!
(13 replies)
S1070 and Windows compatibility
(0 replies)
Strange cuda compilation error
(2 replies)
Sending the results by email
(1 reply)
Mini PCI form factor
(10 replies)
CUDA with Microcontroller
(7 replies)
Can a __global__ function be included in a C++ class?
(0 replies)
Asymmetric pinned memory bandwidth on Dell precision 7400 with GTX 285 card
(5 replies)
Question of NVIDIA CUDA Visual Profiler Version 2.2
(1 reply)
passing pointers by reference
(0 replies)
Server Motherboards for multi-GPU systems (& Fermi)
(26 replies)
branch predication
(0 replies)
CUDA Support on GTS 8800 with TMPGENC
(4 replies)
Manipulate a single element in cublas matrix
(8 replies)
CUDA SDK Examples <badptr> double precision
(2 replies)
cudaArray invalid argument
(1 reply)
future-proof binaries -- nvcc -code and -arch options
(7 replies)
Ability to run PTX directly
(2 replies)
GeForce 8400 GS, 9400 GT, 9500GT and Programming CUDA
(2 replies)
Bug in the POW function?
(4 replies)
pairwise parallel comparison
(2 replies)
Serious hardware bug?
(2 replies)
What can't you do in CUDA that you'd like?
(300 replies)
compiler bug?
(0 replies)
Small compiler bug.
(0 replies)
Debugging CUDA from Matlab
(13 replies)
Questions about Performance
(3 replies)
8 streams is the magic number?
(0 replies)
Take Garbage Value
(6 replies)
FFT GFLOPS results with nice graph!
(5 replies)
cublasScal
(2 replies)
CUDA 2.2 pinned memory white paper
(5 replies)
cudaMalloc and Structs and Pointers problem
(3 replies)
GTX280 versus 8800GTX running CUFFT
(3 replies)
Kernel Execution issues related to Shared Memory
(5 replies)
Transforming Tridiagonal Eigenvectors to Original Symmetric Eigenvectors
(2 replies)
Memory-intensive benchmarks ?
(5 replies)
Profiler speeding up my kernels? Nvidia employees please read
(6 replies)
Windows has triggered a breakpoint in, corruption of heap
(2 replies)
Retrieving data from device to host memory while computer is rendering OpenGL graphics through the same video card
(2 replies)
having problem with simple CUDA code
(4 replies)
Newb
(3 replies)
Possible Issue with Cuda 2.3 and __syncthreads() Emulation
(1 reply)
Fractron 9000
(6 replies)
compiling nvopencc on win32?
(1 reply)
GPU vs. CPU Comparison over the last years
(1 reply)
cudaMalloc & cudaMemcpy from different host threads
(0 replies)
Pinned Memory Usage
(0 replies)
Memory Copy Issue
(4 replies)
how to make multiple gpu work?
(4 replies)
NVIDIA SDK reduction
(1 reply)
ceil(1) == -1
(0 replies)
measuring temperature
(3 replies)
Windows 7 and CUDA
(4 replies)
Help please!
(5 replies)
kernel launch overhead for GTX 280
(17 replies)
Using Cuda1.1 for GTX280, anything bad will happen ?
(2 replies)
CUDA and SLI
(4 replies)
how to reduce registers in each kernel
(2 replies)
Logical OR pattern
(9 replies)
Help, Cuda Debug Question
(3 replies)
how to select the device manually
(2 replies)
New CUDA developer, buying GTX 2xx, which maker to buy?
(5 replies)
memory size
(6 replies)
Is there any findCuda.cmake step by step tutorial?
(8 replies)
Is it possible to reset GPU w/o rebooting?
(2 replies)
profiler instruction count
(0 replies)
Stream serialization with CUDA Visual Profiler v2.3.11
(4 replies)
CUDA Lapack
(2 replies)
dual cards - x16 cuda GTS 250 with PCI geforce 9400
(1 reply)
The best algorithm of Gaussian filter in CUDA
(11 replies)
texture memory cache size
(3 replies)
dynamic array issue?
(5 replies)
singly linked list
(4 replies)
Garbage Value in Square
(2 replies)
Memory Read and Write to device gives different timing
(3 replies)
Which instructions get executed by SFU units ?
(1 reply)
CUDA Toolkit and SDK 2.3 released
(127 replies)
64-bit versus 32-bit CUDA code
(5 replies)
cudaSafeCall
(0 replies)
Canny edge detection in NPP
(1 reply)
Weird Kernel behavior
(0 replies)
Indexing in 3D data
(14 replies)
Texture and cudaMalloc3D?
(0 replies)
what exactly does gld_128b mean in cuda profiler
(0 replies)
Confused about 2D and 3D Memory
(0 replies)
Complete Novice Question
(6 replies)
__syncthreads
(7 replies)