Fork me on GitHub

src/arraymancer/laser/primitives/matrix_multiplication/gemm

Theme:

Index

Search:

Group by:

Imports
Procs

Source Edit

Imports

compiler_optim_hints, openmp, gemm_tiling, gemm_utils, gemm_packing, gemm_ukernel_dispatch, datatypes, cpuinfo_x86

Procs

proc gebp_mkernel[T; ukernel: static MicroKernel](mc, nc, kc: int; alpha: T; packA, packB: ptr UncheckedArray[T]; beta: T; mcncC: MatrixView[T]): Macro kernel, multiply:
a block Amc, kc * panel Bkc, N

Source Edit
proc gemm_strided[T: SomeNumber and not (uint32 | uint64 | uint | int)]( M, N, K: int; alpha: T; A: ptr T; rowStrideA, colStrideA: int; B: ptr T; rowStrideB, colStrideB: int; beta: T; C: ptr T; rowStrideC, colStrideC: int): Source Edit
proc gemm_strided[T: uint32 | uint64 | uint | int](M, N, K: int; alpha: T; A: ptr T; rowStrideA, colStrideA: int; B: ptr T; rowStrideB, colStrideB: int; beta: T; C: ptr T; rowStrideC, colStrideC: int): Overload to avoid bloating the code size with generics monomorphization Source Edit

Arraymancer Technical reference

Core tensor API

Neural network API

Linear algebra, stats, ML

IO & Datasets

Autograd

Neuralnet primitives

Other docs

Tutorial

Spellbook (How-To's)

Under the hood