|
| 1 | +# ######################################################################## |
| 2 | +# Copyright 2013 Advanced Micro Devices, Inc. |
| 3 | +# |
| 4 | +# Licensed under the Apache License, Version 2.0 (the "License"); |
| 5 | +# you may not use this file except in compliance with the License. |
| 6 | +# You may obtain a copy of the License at |
| 7 | +# |
| 8 | +# http://www.apache.org/licenses/LICENSE-2.0 |
| 9 | +# |
| 10 | +# Unless required by applicable law or agreed to in writing, software |
| 11 | +# distributed under the License is distributed on an "AS IS" BASIS, |
| 12 | +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. |
| 13 | +# See the License for the specific language governing permissions and |
| 14 | +# limitations under the License. |
| 15 | +# ######################################################################## |
| 16 | + |
| 17 | +clFFT Readme |
| 18 | + |
| 19 | +Version: 1.10 |
| 20 | +Release Date: April 2013 |
| 21 | + |
| 22 | +ChangeLog: |
| 23 | + |
| 24 | +____________ |
| 25 | +Current Version: |
| 26 | + * This release tested using the 9.012 runtime driver and the 2.8 APPSDK |
| 27 | + |
| 28 | +____________ |
| 29 | +Version 1.8.291: |
| 30 | +Fixed: |
| 31 | + * Memory leaks affecting use cases where 'clfftEnqueueTransform' is used in a loop |
| 32 | + |
| 33 | +____________ |
| 34 | +Version 1.8.269 (beta): |
| 35 | +New: |
| 36 | + * clFFT now supports real-to-complex and complex-to-real transforms; |
| 37 | + refer to documentation for details |
| 38 | + * This release tested using the 12.4 Catalyst software suite |
| 39 | + |
| 40 | +Known Issues: |
| 41 | + * Some degradation in performance of real transforms due to known |
| 42 | + runtime/driver issues |
| 43 | + * Failures in real transforms have been seen on 7xxx series GPUs with certain |
| 44 | + problem sizes involving powers of 3 and 5 |
| 45 | + |
| 46 | +____________ |
| 47 | +Version 1.6.244: |
| 48 | +Fixed: |
| 49 | + * Failures observed in v1.6.236 in backward transforms of certain power of 2 |
| 50 | + (involving radix 4 and radix 8) problem sizes. |
| 51 | + |
| 52 | +____________ |
| 53 | +Version 1.6.236: |
| 54 | +New: |
| 55 | + * Performance of the FFT library has been improved for Radix-2 1D and 2D transforms |
| 56 | + * Support for R4XXX GPUs is deprecated and no longer tested |
| 57 | + * Preview: Support for AMD Radeon� HD7000 series GPUs |
| 58 | + * This release tested using the 8.92 runtime driver and the 2.6 APP SDK |
| 59 | +____________ |
| 60 | +Version 1.4: |
| 61 | +New: |
| 62 | + * clFFT now supports transform lengths whose factors consist exclusively |
| 63 | + of powers of 2, 3, and 5 |
| 64 | + * clFFT supports double precision data types |
| 65 | + * clFFT executes on OpenCL 1.0 compliant devices |
| 66 | + * This release tested using the 8.872 runtime driver and the 2.5 APP SDK |
| 67 | + * A helper bash script appmlEnv.sh has been added to the root installation |
| 68 | + directory to assist in properly setting up a terminal environment to |
| 69 | + execute clFFT samples |
| 70 | + |
| 71 | +Fixed: |
| 72 | + * If the library is required to allocate a temporary buffer, and the user does |
| 73 | + not specify a temporary buffer on the Enqueue call, the library will |
| 74 | + allocate a temporary buffer internally and the lifetime of that temporary |
| 75 | + buffer is managed by the lifetime of the FFT plan; deleting the plan will |
| 76 | + release the buffer. |
| 77 | + * Test failures on CPU device for 32-bit systems (Windows/Linux) |
| 78 | + |
| 79 | +Known Issues: |
| 80 | + * Failures have been seen on graphics cards using R4550 (RV710) GPUs. |
| 81 | + |
| 82 | +____________ |
| 83 | +Version 1.2: |
| 84 | +New: |
| 85 | + * Reduced the number of internal LDS bank conflicts for our 1D FFT transforms, |
| 86 | + increasing performance. |
| 87 | + * Padded reads/writes to global memory, decreasing bank conflicts and |
| 88 | + increasing performance on 2D transforms. |
| 89 | + * This release tested using the 8.841 runtime driver and the 2.4 APP SDK |
| 90 | + |
| 91 | +Fixed: |
| 92 | + * Failures have been seen attempting to queue work on the second GPU device on |
| 93 | + a multi GPU 5970 card on Linux. |
| 94 | + |
| 95 | +Known Issues: |
| 96 | + * It is recommended that users query for and explicitely create an |
| 97 | + intermediate buffer if clFFT requires one. If the library creates the |
| 98 | + intermediate buffer internally, a race condition may occur on freeing the |
| 99 | + buffer on lower end hardware. |
| 100 | + * Failures have been seen on graphics cards using R4550 (RV710) GPUs. |
| 101 | + * Test failures on CPU device for 32-bit systems (Windows/Linux) |
| 102 | + * It is recommended that windows users uninstall previous version of clFFT |
| 103 | + before installing newer versions. Otherwise, Add/Remove programs only |
| 104 | + removes the latest version. Linux users can delete the install directory. |
| 105 | + |
| 106 | +____________ |
| 107 | +Version 1.0: |
| 108 | + * Initial release, available on all platforms |
| 109 | + |
| 110 | +Known Issues: |
| 111 | + * Failures have been seen attempting to queue work on the second GPU device on |
| 112 | + a multi GPU 5970 card on Linux. |
| 113 | +_____________________ |
| 114 | +Building the Samples: |
| 115 | + |
| 116 | +To install the Linux versions of clFFT, uncompress the initial download and |
| 117 | + then execute the install script. |
| 118 | + |
| 119 | +For example: |
| 120 | + tar -xf clFFT-${version}.tar.gz |
| 121 | + - This installs three files into the local directory, one being an |
| 122 | + executable bash script. |
| 123 | + |
| 124 | + sudo mkdir /opt/clFFT-${version} |
| 125 | + - This pre-creates the install directory with proper permissions in /opt |
| 126 | + if it is to be installed there (This is the default). |
| 127 | + |
| 128 | + ./install-clFFT-${version}.sh |
| 129 | + - This prints an EULA and uncompresses files into the chosen install |
| 130 | + directory. |
| 131 | + |
| 132 | + cd ${installDir}/bin64 |
| 133 | + export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:${OpenCLLibDir}:${clfftLibDir} |
| 134 | + - Export library dependencies to resolve all external linkages to the |
| 135 | + client program. The user can create a bash script to help automate this |
| 136 | + procedure. |
| 137 | + |
| 138 | + ./Client -h |
| 139 | + - Understand the command line options that are available to the user |
| 140 | + through the sample client. |
| 141 | + |
| 142 | + ./Client -iv |
| 143 | + - Watch for the version strings to print out; watch for |
| 144 | + 'Client Test *****PASS*****' to print out. |
| 145 | + |
| 146 | +The sample program does not ship with native build files. Instead, a CMake |
| 147 | +file is shipped, and users generate a native build file for their system. |
| 148 | + |
| 149 | +For example: |
| 150 | + cd ${installDir} |
| 151 | + mkdir samplesBin/ |
| 152 | + - This creates a sister directory to the samples directory that will house |
| 153 | + the native makefiles and the generated files from the build. |
| 154 | + |
| 155 | + cd samplesBin/ |
| 156 | + ccmake ../samples/ |
| 157 | + - ccmake is a curses-based cmake program. It takes a parameter that |
| 158 | + specifies the location of the source code to compile. |
| 159 | + - Hit 'c' to configure for the platform; ensure that the dependencies to |
| 160 | + external libraries are satisfied, including paths to 'ATI Stream SDK' |
| 161 | + and 'Boost'. |
| 162 | + - After dependencies are satisfied, hit 'c' again to finalize configure |
| 163 | + step, then hit 'g' to generate makefile and exit ccmake. |
| 164 | + |
| 165 | + make help |
| 166 | + - Look at the available options for make. |
| 167 | + |
| 168 | + make |
| 169 | + - Build the sample client program. |
| 170 | + |
| 171 | + ./clfft.Sample -iv |
| 172 | + - Watch for the version strings to print out; watch for |
| 173 | + 'Client Test *****PASS*****' to print out. |
| 174 | +_______________________________________________________________________________ |
| 175 | +(C) 2010-2013 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD |
| 176 | +Arrow logo, ATI, the ATI logo, Radeon, FireStream, FireGL, Catalyst, and |
| 177 | +combinations thereof are trademarks of Advanced Micro Devices, Inc. Microsoft |
| 178 | +(R), Windows, and Windows Vista (R) are registered trademarks of Microsoft |
| 179 | +Corporation in the U.S. and/or other jurisdictions. OpenCL and the OpenCL logo |
| 180 | +are trademarks of Apple Inc. used by permission by Khronos. Other names are for |
| 181 | +informational purposes only and may be trademarks of their respective owners. |
| 182 | + |
| 183 | +The contents of this document are provided in connection with Advanced Micro |
| 184 | +Devices, Inc. ("AMD") products. AMD makes no representations or warranties with |
| 185 | +respect to the accuracy or completeness of the contents of this publication and |
| 186 | +reserves the right to make changes to specifications and product descriptions |
| 187 | +at any time without notice. The information contained herein may be of a |
| 188 | +preliminary or advance nature and is subject to change without notice. No |
| 189 | +license, whether express, implied, arising by estoppel or otherwise, to any |
| 190 | +intellectual property rights is granted by this publication. Except as set forth |
| 191 | +in AMD's Standard Terms and Conditions of Sale, AMD assumes no liability |
| 192 | +whatsoever, and disclaims any express or implied warranty, relating to its |
| 193 | +products including, but not limited to, the implied warranty of |
| 194 | +merchantability, fitness for a particular purpose, or infringement of any |
| 195 | +intellectual property right. |
| 196 | + |
| 197 | +AMD's products are not designed, intended, authorized or warranted for use as |
| 198 | +components in systems intended for surgical implant into the body, or in other |
| 199 | +applications intended to support or sustain life, or in any other application |
| 200 | +in which the failure of AMD's product could create a situation where personal |
| 201 | +injury, death, or severe property or environmental damage may occur. AMD |
| 202 | +reserves the right to discontinue or make changes to its products at any time |
| 203 | +without notice. |
| 204 | +_______________________________________________________________________________ |
0 commit comments