Sophie

Sophie

distrib > Mandriva > 2010.1 > x86_64 > media > non-free-release > by-pkgid > cc383a08dffc6e4b44f979ed22783c8f > files > 3

nvidia-opencl-profiler-3.0-1mdv2010.1.x86_64.rpm

Key changes in version openclprof v1.1 with respect to v1.0:
1) Following new counters are supported:
  a) tex_cache_hit, tex_cache_miss : Number of texture cache sector hits, misses
  b) "NOP Triggers" : Can be used to get execution counts for certain code paths in the kernel

2) New memory copy option "host mem transfer type" is added for memory transfers.
   This specifies whether a memory transfers uses "Pageable" or "Page-locked"
   
3) Device level summary plot :
   One bar for each method is there. Bars are sorted in decreasing GPU time. Bar length 
   is proportional to cumulative GPU time for a method across all contexts for a device.

4) Session level summary plot :
   One bar for each device is there. Bar length is proportional to GPU Utilization. 
   GPU Utilization is the proportion of time when the GPU was actually executing some method 
   to total time interval from GPU start to end. The values are presented in percentage.
   
5) User interface changes:
   "Session Settings" Dialog : 
   a) Added a new device selection option on "Session" tab.
      Based on this option the available counters can be selected on "Profiler Counter" tab. 
      In case of "multi-device" only counters supported by all devices can be selected.
   b) All the counters on "Profiler Counter" tab and options on "Other Options" tab are shown 
      in tree view under different groups.

6) Support for the global memory load & store request counters (gld_request and gst_request) in dropped. 
   Global memory load and store efficiency which uses these counters is also dropped.

7) Added support for devices of compute capability 2.0.