OPTIMIZATION OF GENERAL-FORM MATRIX OPERATIONS FOR THE PENTIUM PRO PROCESSOR
Bruce Greer C., S.V. Kazakov, S.V. Sivolgin. VANT. Ser.: Mat. Mod. Fiz. Proc. 1997. Issue 4. pp. 3-10.
Multilevel optimization of floating-point matrix operations for the Pentium Pro processor is described, using one BLAS library routine as an example. The upper optimization levels make more efficient use of cache memory. The lower optimization level takes into account specific features of the microprocessor architecture. Performance plots for the different algorithms are given.
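The cache-oriented upper levels mentioned in the abstract can be illustrated with a minimal sketch of loop blocking (tiling). This is not the authors' code. It assumes, for illustration only, that the operation is a square, row-major matrix multiply-accumulate C += A*B. The function names `matmul_naive` and `matmul_blocked` and the block-size parameter `bs` are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Reference triple loop: C += A * B, all n x n, row-major. */
void matmul_naive(size_t n, const double *a, const double *b, double *c)
{
    for (size_t i = 0; i < n; i++) {
        for (size_t j = 0; j < n; j++) {
            double sum = c[i * n + j];
            for (size_t k = 0; k < n; k++)
                sum += a[i * n + k] * b[k * n + j];
            c[i * n + j] = sum;
        }
    }
}

static size_t min_sz(size_t x, size_t y) { return x < y ? x : y; }

/*
 * Blocked variant: the same arithmetic, but performed tile by tile so that
 * a bs x bs tile of each matrix is reused while it is likely still cached.
 * bs is a hypothetical tuning parameter; a suitable value depends on the
 * cache size and is not taken from the paper.
 */
void matmul_blocked(size_t n, size_t bs,
                    const double *a, const double *b, double *c)
{
    assert(bs > 0);
    for (size_t ii = 0; ii < n; ii += bs) {
        size_t i_end = min_sz(ii + bs, n);
        for (size_t kk = 0; kk < n; kk += bs) {
            size_t k_end = min_sz(kk + bs, n);
            for (size_t jj = 0; jj < n; jj += bs) {
                size_t j_end = min_sz(jj + bs, n);
                /* Multiply one tile of A by one tile of B into a tile of C. */
                for (size_t i = ii; i < i_end; i++) {
                    for (size_t k = kk; k < k_end; k++) {
                        double aik = a[i * n + k];
                        for (size_t j = jj; j < j_end; j++)
                            c[i * n + j] += aik * b[k * n + j];
                    }
                }
            }
        }
    }
}
```

Both functions compute the same result. The blocked version only reorders the work so that data already loaded into cache is reused before it is evicted. The lower, processor-specific level of optimization described in the abstract would apply to the innermost tile loops and is not shown here.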