Intel VTune Amplifier XE allows users to tune applications for scalable multicore performance. Engineers can quickly locate code using an immense amount of CPU time with hotspot analysis. Because the software tunes threading with locks and waits analysis, Intel VTune Amplifier XE detects the common cause of slow performance in parallel programs. It determines which programs are waiting too long on a lock while the cores are underutilized during the wait. Intel VTune Amplifier XE also allows users to see the cause of OpenMP inefficiencies sorted by potential impact, and optimize multi-rank hybrid MPI/ OpenMP by profiling MPI ranks selected in the Intel Trace Analyzer and Collector.