}
+\frame{
+
+\frametitle{Running on CX1}
+
+\begin{itemize}
+
+\item We benchmarked on CX1 cluster at Imperial using the Intel Math
+ Kernel Libraries instead of FFTW.
+
+\item Performance was terrible, significantly worse than vanilla ONETEP
+ ($\approx$ 20\%
+ slowdowns).
+\item It looks like MKL really dislikes working with split layout data.
+
+\item If so, this means that ONETEP's data marshalling that we worked to remove
+actually helps with IMKL.
+
+\item I updated the interpolate library to support staging data so that the FFT
+ routines could work on entirely \emph{contiguous}, \emph{interleaved} data.
+
+\item The other implementation would gather and scatter directly from and to the input
+ and output arrays using the FFT routines.
+
+\item When planning the interpolate, we benchmark and select the best technique.
+ When using IMKL, we almost always do the scatter/gather ourselves.
+
+\end{itemize}
+
+}
+
+\frame{
+
+\frametitle{Running on CX1}
+
+\large
+\centering
+
+We don't have performance results for this new implementation\ldots yet.
+
+}
\end{document}