SlotMaximizer extension to LLVM can help you to quickly optimize OpenCL™ applications
I would like to introduce you to SlotMaximizer. SlotMaximizer is a transformation tool that automatically tunes OpenCL™ kernels, helping to increase developer productivity. It aids developers to obtain increased performance, higher throughput, and better hardware utilization from their kernels with minimal effort while maintaining a small, readable and maintainable code base. SlotMaximizer enables developers to focus on their original problems and algorithm strategies and leave the details of optimizing the code to the compiler.
In Auto-Tuning mode, SlotMaximizer attempts many potential optimizations on kernels directed by the developer. Each of the many optimizations of a given kernel are executed using the actual data being processed by the application, and performance statistics of the optimized kernels processing this data are provided to the developer to select the appropriate version to leverage.
In production code the original kernel is used, and the selected configuration for the preferred optimizations is indicated to the compiler using an attribute that is provided to the developer as part of the Auto-Tuning output data.
SlotMaximizer is already incorporated into the AMD Catalyst™ drivers as a preview and can be used by anyone developing applications using the APP SDK. It will be turned on by default to support application execution on end-user systems later in the year.
Improvements continue to be made to SlotMaximizer, download the latest beta version from here. You will need to register on the Multicoreware web-site, but this will also give you access to report issues, ask questions, etc. using their forums. The download will install over that included with your AMD Catalyst™ drivers, and includes documentation on use of SlotMaximizer.
Mark Ireton is the Product Manager for Compute Solutions at AMD. His postings are his own opinions and may not represent AMD’s positions, strategies or opinions. Links to third party sites, and references to third party trademarks, are provided for convenience and illustrative purposes only. Unless explicitly stated, AMD is not responsible for the contents of such links, and no third party endorsement of AMD or any of its products is implied.





Can we use that for MacOs X with AMD GPUs?
Also I think OpenCL on Amd GPUs on Apple MacOS is not a first class citizen as even BLAS and FFT AMD OpenCL libs aren’t available for this platform.. Hope AMD exposes some of this extra features on Apple platforms..
I had a good read. Thanks for sharing this as it answers my questions. More posts like this and I will be your loyal visitor. table saw