The kernels in the video are WEC2013. WEC7 obviously runs a bit slower, since it uses 32-bit ARM code instead of the mixed 16/32-bit Thumb-2 code. The difference in GPU performance is hardly noticeable though!