Quickly introducing the App Profiler and Kernel Analyzer, couldn’t catch all the details but here you go what I could write down.
AMD app profiler
What is AMD app profiler?
- A performance analysis too that gathers data from the OpenCL run=time and AMD APUs and GPUs during the execution of an OpenCL app
- Integration into MS Visual Studio 2008 and 2010
- Command line utility program for windows and Linux platforms
- Support OpenCL and direct compute
- No code pro project modification to target application necessary.
- API trace View: view API input arguments and output results
- Summary pages: find API hotspots, top ten data transfer and kernel execution operations
- API trace analyzer: identify failed APU calls, resource leaks and best practices
What can app profiler do for you?
Timeline visualization: visualize open cl execution in a timeline chart
- View number of OpenCL contexts and command queues created and the relationships between these items
- View host and device execution
- Determine proper synch
Session profile view: analyze OpenCL kernel execution for AMD Radeon GPUs
Collect GPU performance counters
- The number of ALU, global and local memory instructions executed
- GPU utilization and memory access characteristics
- Shader Compiler VLIW packing efficiency
- Show the kernel resource usages
AMD App Kernel Analyzer
- Compiler, analyze and disassemble an OpenCL kernel for multiple Catalyst driver versions
Best Performance Optimization Practices
- Collect API trace first
- Remove all error and warnings reported by API trace analyzer if possible
- Use timeline visualization to determine possible synchronization issues
- Find the top bottleneck of the application using the summary pages
- Drill down the most expensive kernel using session profile view