> preview of NVIDIA Visual Profiler
Guest_dhoff_*
post Jan 18 2008, 11:56 PM

Here's an initial download of the CUDA Visual Profiler.
See the full README for details.
Linux: tar xvfz CudaVisualProfiler_0.1_beta_linux.tar.gz
Windows: extract the contents of CudaVisualProfiler_0.1_beta_windows.zip

------------------------------------------------------------------------------
NVIDIA CUDA Visual Profiler
Version 0.1 Beta
------------------------------------------------------------------------------

Published by
NVIDIA Corporation
2701 San Tomas Expressway
Santa Clara, CA 95050

Notice

BY DOWNLOADING THIS FILE, USER AGREES TO THE FOLLOWING:

ALL NVIDIA SOFTWARE, DESIGN SPECIFICATIONS, REFERENCE BOARDS, FILES,
DRAWINGS, DIAGNOSTICS, LISTS, AND OTHER DOCUMENTS (TOGETHER AND SEPARATELY,
"MATERIALS") ARE BEING PROVIDED "AS IS". NVIDIA MAKES NO WARRANTIES,
EXPRESSED, IMPLIED, STATUTORY, OR OTHERWISE WITH RESPECT TO THE MATERIALS,
AND EXPRESSLY DISCLAIMS ALL IMPLIED WARRANTIES OF NONINFRINGEMENT,
MERCHANTABILITY, AND FITNESS FOR A PARTICULAR PURPOSE.

Information furnished is believed to be accurate and reliable. However,
NVIDIA Corporation assumes no responsibility for the consequences of use
of such information or for any infringement of patents or other rights
of third parties that may result from its use. No license is granted by
implication or otherwise under any patent or patent rights of NVIDIA
Corporation. Specifications mentioned in this publication are subject
to change without notice. These materials supersede and replace all
information previously supplied. NVIDIA Corporation products are not
authorized for use as critical components in life support devices or
systems without express written approval of NVIDIA Corporation.

Trademarks

NVIDIA, CUDA, and the NVIDIA logo are trademarks or registered trademarks
of NVIDIA Corporation in the United States and other countries. Other
company and product names may be trademarks of the respective companies
with which they are associated.

Copyright

© 2007-2008 by NVIDIA Corporation. All rights reserved.



LIST OF SUPPORTED FEATURES:
--------------------------
- Execute a CUDA program with profiling enabled and view the profiler output
  as a table. The table has the following columns for each GPU method:
    timestamp : Start time stamp
    method    : GPU method name. This is either "memcopy" for memory copies
                or the name of a GPU kernel.
    GPU Time
    CPU Time
    Occupancy
    Profiler counters:
      gld_incoherent   : Number of non-coalesced global memory loads
      gld_coherent     : Number of coalesced global memory loads
      gst_incoherent   : Number of non-coalesced global memory stores
      gst_coherent     : Number of coalesced global memory stores
      local_load       : Number of local memory loads
      local_store      : Number of local memory stores
      branch           : Number of branch events (instruction and/or sync stack)
      divergent_branch : Number of divergent branches within a warp
      instructions     : Number of dynamic instructions (in fetch)
      warp_serialize   : Number of threads in a warp that serialize based
                         on address (GRF or constant)
      cta_launched     : Number of CTAs launched on the PM TPC

  Please refer to the "Interpreting Profiler Counters" section below for
  more information on profiler counters.
  Note that profiler counters are also referred to as profiler signals.


- Display the summary profiler table. It has the following columns for each
  GPU method:
    method name
    number of calls
    total GPU time
    percentage of GPU time
    total counts for each profiler counter

- Display various kinds of plots:
  - Summary profiling data bar plot
  - GPU Time Height plot
  - GPU Time Width plot
  - Profiler counter bar plot
  - Profiler output table column bar plot

- Analysis of profiler output: lists the methods with a high number of:
  . incoherent stores
  . incoherent loads
  . warp serializations

- Compare profiler output for multiple program runs of the same program
  or for different programs. Each program run is referred to as a session.

- Save profiling data for multiple sessions. A group of sessions is referred
  to as a project.

- Import/Export CUDA Profiler CSV format data
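
The summary table and the analysis step listed above can be sketched in a few lines of Python. This is only an illustration, not the profiler's own code: the row layout (dicts with "method", "gputime", and counter keys), the function names, and the threshold used to decide what counts as a "high" number are all assumptions made for this example.

```python
from collections import defaultdict

def summarize(rows, counters=("gld_incoherent", "gst_incoherent", "warp_serialize")):
    """Aggregate per-call profiler rows into one summary row per GPU method.

    `rows` is an assumed layout: one dict per call with "method", "gputime",
    and optional counter keys. Missing counters are treated as 0.
    """
    total_time = sum(r["gputime"] for r in rows)
    acc = defaultdict(lambda: {"calls": 0, "gputime": 0.0, **{c: 0 for c in counters}})
    for r in rows:
        s = acc[r["method"]]
        s["calls"] += 1
        s["gputime"] += r["gputime"]
        for c in counters:
            s[c] += r.get(c, 0)

    summary = []
    for method, s in acc.items():
        pct = 100.0 * s["gputime"] / total_time if total_time else 0.0
        summary.append({"method": method, "pct_gputime": pct, **s})
    # Largest total GPU time first, matching the summary bar plot ordering.
    summary.sort(key=lambda s: s["gputime"], reverse=True)
    return summary

def flag_methods(summary, counter, threshold):
    """Names of methods whose total for `counter` exceeds a hypothetical threshold."""
    return [s["method"] for s in summary if s[counter] > threshold]
```

For example, flag_methods(summary, "gld_incoherent", 10) would list every method whose calls together produced more than 10 non-coalesced loads. The cutoff is arbitrary here; the README does not say what the tool treats as "high".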


DESCRIPTION OF DIFFERENT PLOTS:
------------------------------

- Summary profiling data bar plot
. One bar for each method
. Bars sorted in decreasing GPU time
. Bar length is proportional to cumulative GPU time for a method

- GPU Time Height Plot:
It is a bar diagram in which the height of each bar is proportional
to the GPU time for a method and a different bar color is assigned for
each method. A legend is displayed which shows the color assignment for
different methods. The width of each bar is fixed and the bars are displayed
in the order in which the methods are executed.

When the "fit in window" option is enabled the display is adjusted so as
to fit all the bars in the displayed window width. In this case bars for
multiple methods can overlap. The overlapped bars are displayed in decreasing
order of height so that all the different bars are visible.

When the "Show CPU Time" option is enabled the CPU time is shown as a
bar in a different color on top of the GPU time bar. The height of this
bar is proportional to the difference of CPU time and GPU time for the method.
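
As a rough illustration of the height plot geometry described above, the Python sketch below computes bar heights and a paint order for overlapping bars. It is one reading of the description, not the tool's implementation: the tuple layout, the function names, the max_height parameter, and the choice to paint taller bars first (so that shorter ones end up on top) are assumptions.

```python
def height_plot_bars(calls, max_height=100.0):
    """Compute bar heights for calls given in execution order.

    `calls` is an assumed layout: a list of (method, gpu_time, cpu_time).
    Heights are scaled so the largest time maps to `max_height`.
    """
    peak = max((max(gpu, cpu) for _, gpu, cpu in calls), default=0.0)
    scale = max_height / peak if peak else 0.0
    bars = []
    for order, (method, gpu, cpu) in enumerate(calls):
        bars.append({
            "order": order,            # position along the x axis
            "method": method,          # used to pick the bar color
            "gpu_height": gpu * scale,
            # "Show CPU Time": extra segment on top of the GPU bar,
            # proportional to (CPU time - GPU time).
            "cpu_extra_height": (cpu - gpu) * scale,
        })
    return bars

def draw_order(bars):
    """Paint taller bars first so shorter overlapping bars remain visible."""
    return sorted(bars, key=lambda b: b["gpu_height"], reverse=True)
```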

- GPU Time Width Plot:
It is a bar diagram in which the width of each bar is proportional
to the GPU time for a method and a different bar color is assigned for
each method. A legend is displayed which shows the color assignment for
different methods. The bars are displayed in the order in which the
methods are executed. When time stamps are enabled the bars are positioned
based on the time stamp.
The height of each bar is based on the option chosen:
  a) fixed height: height is fixed.
  b) height proportional to instruction issue rate: the instruction issue
     rate for a method is equal to the profiler "instructions" counter
     value divided by the GPU time for the method.
  c) height proportional to incoherent load + store rate: the incoherent load
     + store rate for a method is equal to the sum of the profiler
     "gld_incoherent" and "gst_incoherent" counter values divided by the
     GPU time for the method.
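
The two rates used for the bar height can be written out directly. The Python sketch below follows the definitions above; the function names, the dict layout of a row, the mode strings, and the handling of a zero GPU time are assumptions for illustration, and the returned value is the unscaled rate rather than a pixel height.

```python
def instruction_issue_rate(instructions, gputime):
    """Profiler "instructions" counter divided by the method's GPU time."""
    return instructions / gputime if gputime else 0.0

def incoherent_rate(gld_incoherent, gst_incoherent, gputime):
    """(gld_incoherent + gst_incoherent) divided by the method's GPU time."""
    return (gld_incoherent + gst_incoherent) / gputime if gputime else 0.0

def bar_height(row, mode="fixed", fixed_height=1.0):
    """Unscaled bar height for one row of the profiler table.

    `mode` mirrors options a), b), c); the mode strings are hypothetical.
    """
    if mode == "fixed":
        return fixed_height
    if mode == "issue_rate":
        return instruction_issue_rate(row["instructions"], row["gputime"])
    if mode == "incoherent_rate":
        return incoherent_rate(row["gld_incoherent"], row["gst_incoherent"],
                               row["gputime"])
    raise ValueError(f"unknown mode: {mode}")
```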

- Profiler counter bar plot
It is a bar plot for profiler counter values for a method from the profiler
output table or the summary table.
. One bar for each profiler counter
. Bars sorted in decreasing profiler counter value
. Bar length is proportional to profiler counter value

- Profiler output table column bar plot
It is a bar plot for any column of values from the profiler output
table or summary table
. One bar for each row in the table
. Bars sorted in decreasing column value
. Bar length is proportional to column value
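
The bar plots above share the same recipe: take one value per row, sort in decreasing order, and draw each bar with a length proportional to its value. A minimal text-mode sketch in Python is shown below; the function name, the row layout, and the "#" rendering are assumptions for illustration and say nothing about how the tool itself draws.

```python
def column_bar_plot(rows, column, label="method", width=40):
    """Render one text bar per row, sorted by decreasing value of `column`.

    `rows` is an assumed layout: a list of dicts holding at least `label`
    and `column`. The largest value gets a bar of `width` characters.
    """
    values = sorted(((r[label], r[column]) for r in rows),
                    key=lambda pair: pair[1], reverse=True)
    peak = values[0][1] if values else 0
    lines = []
    for name, value in values:
        length = round(width * value / peak) if peak else 0
        lines.append(f"{name:<12} {'#' * length} {value}")
    return lines

if __name__ == "__main__":
    demo = [{"method": "memcopy", "gputime": 10},
            {"method": "kernelA", "gputime": 40},
            {"method": "kernelB", "gputime": 20}]
    print("\n".join(column_bar_plot(demo, "gputime", width=20)))
```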


Attached File(s)
Attached File  CudaVisualProfiler_0.1_beta_linux.tar.gz ( 4.53MB ) Number of downloads: 1724
Attached File  CudaVisualProfiler_0.1_beta_windows.zip ( 3.99MB ) Number of downloads: 4205
Attached File  CudaVisualProfiler_README_0.1_beta.txt ( 20.74K ) Number of downloads: 1577
Attached File  CudaVisualProfiler_linux_releases_notes_0.1_beta.txt ( 3.15K ) Number of downloads: 714
Attached File  CudaVisualProfiler_windows_releases_notes_0.1_beta.txt ( 2.07K ) Number of downloads: 696
 

