Abstract
For Run 2 of the LHC, LHCb is replacing a significant part of its event filter farm with new compute nodes. For the evaluation of the best performing solution, we have developed a method to convert our high level trigger application into a stand-alone, bootable benchmark image. With additional instrumentation we turned it into a self-optimising benchmark which explores techniques such as late forking, NUMA balancing and optimal number of threads, i.e. it automatically optimises box-level performance. We have run this procedure on a wide range of Haswell-E CPUs and numerous other architectures from both Intel and AMD, including also the latest Intel micro-blade servers. We present results in terms of performance, power consumption, overheads and relative cost.
Original language | English |
---|---|
Article number | 092022 |
Number of pages | 8 |
Journal | Journal of Physics: Conference Series |
Volume | 664 |
Issue number | 9 |
DOIs | |
Publication status | Published - 1 Dec 2015 |
Externally published | Yes |