Guest_dhoff_* (Guests)
Jan 18 2008, 11:56 PM
Post #1
Here's an initial download of the CUDA Visual Profiler.
See the full README.

For linux:
    tar xvfz CudaVisualProfiler_0.1_beta_linux.tar.gz
Windows:
    extract the zip contents from CudaVisualProfiler_0.1_beta_windows.zip

------------------------------------------------------------------------------
NVIDIA CUDA Visual Profiler Version 0.1 Beta
------------------------------------------------------------------------------

Published by
NVIDIA Corporation
2701 San Tomas Expressway
Santa Clara, CA 95050

Notice

BY DOWNLOADING THIS FILE, USER AGREES TO THE FOLLOWING:

ALL NVIDIA SOFTWARE, DESIGN SPECIFICATIONS, REFERENCE BOARDS, FILES, DRAWINGS, DIAGNOSTICS, LISTS, AND OTHER DOCUMENTS (TOGETHER AND SEPARATELY, "MATERIALS") ARE BEING PROVIDED "AS IS". NVIDIA MAKES NO WARRANTIES, EXPRESSED, IMPLIED, STATUTORY, OR OTHERWISE WITH RESPECT TO THE MATERIALS, AND EXPRESSLY DISCLAIMS ALL IMPLIED WARRANTIES OF NONINFRINGEMENT, MERCHANTABILITY, AND FITNESS FOR A PARTICULAR PURPOSE.

Information furnished is believed to be accurate and reliable. However, NVIDIA Corporation assumes no responsibility for the consequences of use of such information or for any infringement of patents or other rights of third parties that may result from its use. No license is granted by implication or otherwise under any patent or patent rights of NVIDIA Corporation. Specifications mentioned in this publication are subject to change without notice. These materials supersedes and replaces all information previously supplied. NVIDIA Corporation products are not authorized for use as critical components in life support devices or systems without express written approval of NVIDIA Corporation.

Trademarks

NVIDIA, CUDA, and the NVIDIA logo are trademarks or registered trademarks of NVIDIA Corporation in the United States and other countries. Other company and product names may be trademarks of the respective companies with which they are associated.

Copyright © 2007-2008 by NVIDIA Corporation. All rights reserved.
LIST OF SUPPORTED FEATURES:
--------------------------

- Execute a CUDA program with profiling enabled and view the profiler output
  as a table. The table has the following columns for each GPU method:

    timestamp : Start time stamp
    method    : GPU method name. This is either "memcopy" for memory copies
                or the name of a GPU kernel.
    GPU Time
    CPU Time
    Occupancy
    Profiler counters:
      gld_incoherent   : Number of non-coalesced global memory loads
      gld_coherent     : Number of coalesced global memory loads
      gst_incoherent   : Number of non-coalesced global memory stores
      gst_coherent     : Number of coalesced global memory stores
      local_load       : Number of local memory loads
      local_store      : Number of local memory stores
      branch           : Number of branch events (instruction and/or sync stack)
      divergent_branch : Number of divergent branches within a warp
      instructions     : Number of dynamic instructions (in fetch)
      warp_serialize   : Number of threads in a warp serialize based on
                         address (GRF or constant)
      cta_launched     : Number of CTAs launched on the PM TPC

  Please refer to the "Interpreting Profiler Counters" section below for more
  information on profiler counters. Note that profiler counters are also
  referred to as profiler signals.

- Display the summary profiler table. It has the following columns for each
  GPU method:

    method name
    number of calls
    total GPU time
    percentage of GPU time
    total counts for each profiler counter

- Display various kinds of plots:
    - Summary profiling data bar plot
    - GPU Time Height plot
    - GPU Time Width plot
    - Profiler counter bar plot
    - Profiler output table column bar plot

- Analysis of profiler output: lists methods with a high number of:
    . incoherent stores
    . incoherent loads
    . warp serializations

- Compare profiler output for multiple program runs of the same program or
  for different programs. Each program run is referred to as a session.

- Save profiling data for multiple sessions. A group of sessions is referred
  to as a project.

- Import/Export CUDA Profiler CSV format data

DESCRIPTION OF DIFFERENT PLOTS:
------------------------------

- Summary profiling data bar plot
    . One bar for each method
    . Bars sorted in decreasing GPU time
    . Bar length is proportional to the cumulative GPU time for a method

- GPU Time Height Plot:
  A bar diagram in which the height of each bar is proportional to the GPU
  time for a method, and a different bar color is assigned to each method. A
  legend shows the color assignment for the different methods. The width of
  each bar is fixed, and the bars are displayed in the order in which the
  methods are executed.
  When the "fit in window" option is enabled, the display is adjusted to fit
  all the bars in the displayed window width. In this case bars for multiple
  methods can overlap. The overlapped bars are displayed in decreasing order
  of height so that all the different bars are visible.
  When the "Show CPU Time" option is enabled, the CPU time is shown as a bar
  in a different color on top of the GPU time bar. The height of this bar is
  proportional to the difference between the CPU time and the GPU time for
  the method.

- GPU Time Width Plot:
  A bar diagram in which the width of each bar is proportional to the GPU
  time for a method, and a different bar color is assigned to each method. A
  legend shows the color assignment for the different methods. The bars are
  displayed in the order in which the methods are executed. When time stamps
  are enabled, the bars are positioned based on the time stamp. The height of
  each bar is based on the option chosen:
    a) fixed height: the height is fixed.
    b) height proportional to instruction issue rate: the instruction issue
       rate for a method is equal to the profiler "instructions" counter
       value divided by the GPU time for the method.
    c) height proportional to incoherent load + store rate: the incoherent
       load + store rate for a method is equal to the sum of the profiler
       "gld_incoherent" and "gst_incoherent" counter values divided by the
       GPU time for the method.

- Profiler counter bar plot
  A bar plot of the profiler counter values for a method from the profiler
  output table or the summary table.
    . One bar for each profiler counter
    . Bars sorted in decreasing profiler counter value
    . Bar length is proportional to profiler counter value

- Profiler output table column bar plot
  A bar plot of any column of values from the profiler output table or the
  summary table.
    . One bar for each row in the table
    . Bars sorted in decreasing column value
    . Bar length is proportional to column value
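
For anyone who wants to reproduce the derived numbers outside the GUI, here is a minimal Python sketch of the summary table columns (number of calls, total GPU time, percentage of GPU time) and the two rates used for the Width plot heights. It is only an illustration of the arithmetic described in the README, not the profiler's own code. The list-of-dicts row layout, the "gputime" key, and the function names are assumptions made for this example; only the counter names come from the README.

```python
from collections import defaultdict

# Hypothetical profiler rows. The counter names come from the README; the
# dict layout and the "gputime" key are assumptions for this sketch and are
# not the profiler's actual export format.
rows = [
    {"method": "memcopy", "gputime": 120.0, "instructions": 0,
     "gld_incoherent": 0, "gst_incoherent": 0},
    {"method": "kernelA", "gputime": 300.0, "instructions": 9000,
     "gld_incoherent": 600, "gst_incoherent": 300},
    {"method": "kernelA", "gputime": 100.0, "instructions": 2000,
     "gld_incoherent": 100, "gst_incoherent": 0},
    {"method": "kernelB", "gputime": 80.0, "instructions": 4000,
     "gld_incoherent": 0, "gst_incoherent": 0},
]


def summarize(rows):
    """Per-method call count, total GPU time and percentage of GPU time,
    sorted by decreasing total GPU time (as in the summary bar plot)."""
    calls = defaultdict(int)
    total = defaultdict(float)
    for row in rows:
        calls[row["method"]] += 1
        total[row["method"]] += row["gputime"]
    grand_total = sum(total.values())
    summary = [
        {
            "method": method,
            "calls": calls[method],
            "gputime": total[method],
            "percent": 100.0 * total[method] / grand_total if grand_total else 0.0,
        }
        for method in total
    ]
    summary.sort(key=lambda entry: entry["gputime"], reverse=True)
    return summary


def instruction_issue_rate(row):
    """"instructions" counter divided by the GPU time (option b)."""
    return row["instructions"] / row["gputime"] if row["gputime"] else 0.0


def incoherent_rate(row):
    """(gld_incoherent + gst_incoherent) divided by the GPU time (option c)."""
    if not row["gputime"]:
        return 0.0
    return (row["gld_incoherent"] + row["gst_incoherent"]) / row["gputime"]


if __name__ == "__main__":
    for entry in summarize(rows):
        print(entry)
    print(instruction_issue_rate(rows[1]), incoherent_rate(rows[1]))
```

The rates are in counter events per unit of GPU time, so they are only comparable between rows that use the same time unit.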
Attached File(s)
CudaVisualProfiler_0.1_beta_linux.tar.gz ( 4.53MB )
Number of downloads: 1724
CudaVisualProfiler_0.1_beta_windows.zip ( 3.99MB )
Number of downloads: 4205
CudaVisualProfiler_README_0.1_beta.txt ( 20.74K )
Number of downloads: 1577
CudaVisualProfiler_linux_releases_notes_0.1_beta.txt ( 3.15K )
Number of downloads: 714
CudaVisualProfiler_windows_releases_notes_0.1_beta.txt ( 2.07K )
Number of downloads: 696
dhoff preview of NVIDIA Visual Profiler Jan 18 2008, 11:56 PM
DenisR Is it possible to profile CUDA code that is being ... Jan 19 2008, 10:52 PM
Mark Harris Unfortunately not right now, unless you can genera... Jan 22 2008, 01:16 PM
DenisR QUOTE(Mark Harris @ Jan 22 2008, 03:16 PM)Unf... Jan 23 2008, 06:48 PM
sandeepan1986 QUOTE(Mark Harris @ Jan 22 2008, 05:16 AM)Unf... Jul 3 2008, 09:51 AM
Eri Rubin Just my guess but it seems that would be hard to d... Jul 3 2008, 10:09 AM
sandeepan1986 QUOTE(Eri Rubin @ Jul 3 2008, 02:09 AM)Just m... Jul 3 2008, 10:40 AM
MisterAnderson42 Thanks for the preview version guys. It's alre... Jan 22 2008, 03:46 PM
Mark Harris QUOTE(MisterAnderson42 @ Jan 22 2008, 04:46 P... Jan 23 2008, 01:56 PM
mfatica You can profile a mex file with the CUDA profiler... Jan 23 2008, 07:40 PM
DenisR QUOTE(mfatica @ Jan 23 2008, 09:40 PM)You can... Jan 23 2008, 09:00 PM
hannes_4 QUOTE(mfatica @ Jan 23 2008, 08:40 PM)You can... May 14 2008, 11:36 PM
shsr QUOTE(hannes_4 @ May 14 2008, 04:36 PM)this d... Jun 13 2008, 09:01 PM
mfatica You can have plots, the session will run from comm... Jan 23 2008, 09:06 PM
Morph208 Thanks for this release guys. It's pretty cool... Jan 24 2008, 01:19 AM
Mark Harris QUOTE(Morph208 @ Jan 24 2008, 02:19 AM)Thanks... Jan 24 2008, 04:13 PM
Morph208 QUOTE(Mark Harris @ Jan 25 2008, 02:13 AM)RE:... Jan 25 2008, 12:00 AM
Nielske Very nice tool. Is there any webpage or pdf where... Jan 30 2008, 02:48 PM
hufo First thanks for this tool, it is very useful ... Jan 30 2008, 01:51 PM
ssatoor QUOTE(hufo @ Jan 30 2008, 05:51 AM)First than... Feb 4 2008, 04:52 AM

hufo QUOTE(ssatoor @ Feb 4 2008, 12:52 AM)Regardin... Feb 4 2008, 10:09 AM
Mark Harris QUOTE(hufo @ Jan 30 2008, 02:51 PM)2- H2D, D2... Feb 4 2008, 10:37 AM
DenisR Look for the supercomputing '07 slides. Coales... Jan 30 2008, 07:04 PM
bog Very nice tool indeed. Yet, I got stuck at some p... Jan 31 2008, 01:15 PM
ssatoor QUOTE(bog @ Jan 31 2008, 05:15 AM)Very nice t... Jan 31 2008, 01:35 PM
bog QUOTE(ssatoor @ Jan 31 2008, 03:35 PM)a) Do y... Jan 31 2008, 02:32 PM
DenisR Do your programs stop in time when running from co... Jan 31 2008, 03:11 PM
bog QUOTE(DenisR @ Jan 31 2008, 05:11 PM)Do your ... Jan 31 2008, 03:30 PM
dhoff Similarly, if you run any of the SDK examples sinc... Jan 31 2008, 07:26 PM
jordyvaneijk I need to say thank you for this tool... Am I righ... Feb 1 2008, 01:24 PM
eelsen Nice work - this is a very promising tool. A feat... Feb 5 2008, 06:11 PM
kristleifur Thank you for releasing this. Here's one comm... Feb 12 2008, 02:04 PM
jordyvaneijk QUOTE(kristleifur @ Feb 12 2008, 04:04 PM)Tha... Feb 12 2008, 03:01 PM

seibert QUOTE(jordyvaneijk @ Feb 12 2008, 10:01 AM)I ... Feb 12 2008, 03:07 PM

jordyvaneijk QUOTE(seibert @ Feb 12 2008, 05:07 PM)What do... Feb 12 2008, 04:09 PM
seibert QUOTE(kristleifur @ Feb 12 2008, 09:04 AM)Tha... Feb 12 2008, 03:06 PM
JHHPC Hi everybody, thanks for providing this tool. I h... Mar 6 2008, 12:20 PM
jordyvaneijk QUOTE(JHHPC @ Mar 6 2008, 02:20 PM)Hi everybo... Mar 6 2008, 01:46 PM
houzet dominique Hi everybody, thanks for providing this tool. Whe... Mar 14 2008, 05:41 PM
wumpus Is there any chance of a 64-bit Linux build? I can... Apr 7 2008, 07:41 AM
JHHPC QUOTE(wumpus @ Apr 7 2008, 09:41 AM)Is there ... Apr 7 2008, 08:01 AM
wumpus Well yes I don't have a full set of 32 bit lib... Apr 7 2008, 08:20 AM
DenisR I remember I also had some trouble getting it to w... Apr 7 2008, 09:34 AM
Morph208 For those who don't know, Visual profiler v0.2... Apr 28 2008, 05:11 AM
Eri Rubin Hi i dont know if this is a silly question ... i k... May 5 2008, 08:19 AM
hufo Hello, I had the same problem if the program chang... May 5 2008, 12:01 PM
ssatoor QUOTE(hufo @ May 5 2008, 04:01 AM)Hello, I ha... May 5 2008, 01:04 PM
Eri Rubin QUOTE(ssatoor @ May 5 2008, 06:04 AM)Yes this... May 5 2008, 01:12 PM
riclas QUOTE(Eri Rubin @ May 5 2008, 01:12 PM)Thanks... May 20 2008, 01:32 PM
Eri Rubin QUOTE(riclas @ May 20 2008, 06:32 AM)how have... May 20 2008, 05:03 PM
gpugpu QUOTE(Eri Rubin @ May 20 2008, 09:03 AM)First... May 22 2008, 04:47 AM
patrick16 QUOTE(gpugpu @ May 21 2008, 09:47 PM)under wh... May 22 2008, 11:55 PM
Eri Rubin QUOTE(patrick16 @ May 22 2008, 04:55 PM)... May 23 2008, 06:30 AM
kyzhao so cool tool May 6 2008, 02:27 PM
Nielske Perhaps stupid question, but what can we understan... May 6 2008, 07:25 PM
DenisR occupancy=1 : all of the maximum 768 threads are r... May 6 2008, 08:22 PM
Nielske Hey, question again or bug i don't know. No... May 11 2008, 05:48 PM
seibert QUOTE(Nielske @ May 11 2008, 12:48 PM)Normall... May 11 2008, 06:49 PM
Geka I have seen a couple of issues with the CUDA profi... May 16 2008, 09:27 PM
riclas there is a .conf file that is created and deleted ... May 20 2008, 06:28 PM
bdietrich Strange behaviour and data misinterpretation in ... May 21 2008, 01:46 PM
gpugpu Hi all, I get the following error when I try to u... May 22 2008, 01:06 AM
inducer Does profiling work on programs using the driver A... May 23 2008, 07:50 PM
ryta1203 Is there a non-visual version of this that offers ... Jun 12 2008, 04:31 PM
Simon Green The visual profiler uses the same performance coun... Jun 12 2008, 04:55 PM
ryta1203 QUOTE(Simon Green @ Jun 12 2008, 12:55 PM) T... Jun 12 2008, 05:48 PM
E.D. Riedijk I think that is ignored by the compiler. Functions... Jul 3 2008, 11:29 AM
gpugpu Is there any command-line profiler available for c... Jul 14 2008, 01:41 AM
espe_ Is there any tutorial on how to use this tool? Jul 31 2008, 10:13 AM
defy Hi! I'm using CUDA 2.0 Beta2 on Vista 32. ... Aug 14 2008, 11:47 AM
ssatoor QUOTE(defy @ Aug 14 2008, 03:47 AM)Hi! I... Aug 15 2008, 06:37 AM
santyhyammer QUOTE(ssatoor @ Aug 15 2008, 07:37 AM)Profile... Aug 16 2008, 05:23 AM
chrismc I'm having trouble with the profiler. See htt... Sep 9 2008, 01:15 PM
chrismc QUOTE(chrismc @ Sep 9 2008, 01:15 PM)I'm ... Sep 9 2008, 01:24 PM
not8 QUOTE(santyhyammer @ Aug 15 2008, 09:23 PM)We... Oct 8 2008, 04:47 AM
davem_de Vista support would be great! Need timestamps... Nov 14 2008, 01:07 PM