nmpp

part of nmblas library, sgemm function implementation Подробнее...

См. исходные тексты.

Подробное описание

part of nmblas library, sgemm function implementation

single precision matrix multiplication implementation

C = alpha*A*B + beta*C

see blas\cblas docs for detailed description

limitations: M, N, K, ld* must be all even, all matrix rows must be even aligned

limitations: row major (C order, not Fortran) only

limitations: TransA is not supported yet

Автор
leshabirukov
Версия
1.0
Дата
2018-07-26
Аргументы
[in]TransAtranspose first multiplicand (A) flag
[in]TransBtranspose second multiplicand (B) flag
[in]Mnumber of rows of the matrix A and C
[in]Nnumber of columns of the matrix B and C
[in]Knumber of rows of the matrix B and of columns of the matrix C
[in]alphaspecifies the scalar alpha
[in]Apointer to the first multiplicand
[in]ldaA stride, first dimension of A holder
[in]Bpointer to the second multiplicand
[in]ldbB stride, first dimension of B holder
[in]betaspecifies the scalar beta
[in,out]Cpointer to accumulator matrix
[in]ldcC stride, first dimension of C holder
Предупреждения