Optimized ops in Kelvin TFLM
The following tables list the ops currently optimized in Kelvin TFLM. The relevant source code can be found here.
Non-Convolutional Ops
Op | Supported Data Type | Comments |
---|---|---|
Elementwise Add | s8, s16, s32 | Rescaling with offset and shift, clamping |
Leaky ReLU | s8, s16 | |
Max Pooling | s8 | |
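
To make the "rescaling with offset and shift, clamping" comment on Elementwise Add concrete, here is a minimal sketch of one common way an integer add is requantized. It is not taken from the Kelvin source: the function names, the parameter names, and the use of a single rounding right shift are all assumptions made for illustration, and the real kernel may apply per-input shifts and fixed-point multipliers.

```python
def rounding_right_shift(value: int, shift: int) -> int:
    """Arithmetic right shift that rounds to nearest (ties toward +infinity)."""
    if shift == 0:
        return value
    return (value + (1 << (shift - 1))) >> shift


def quantized_add_s8(a: int, b: int, *, a_offset: int, b_offset: int,
                     a_mult: int, b_mult: int, shift: int, out_offset: int,
                     qmin: int = -128, qmax: int = 127) -> int:
    """Hypothetical s8 add: offset, rescale, shift, offset again, clamp."""
    # Remove each input's zero point, then bring both inputs to a common scale.
    scaled_a = (a + a_offset) * a_mult
    scaled_b = (b + b_offset) * b_mult
    # Add in the wider integer domain and shift back down to the output scale.
    total = rounding_right_shift(scaled_a + scaled_b, shift)
    # Apply the output zero point and clamp to the s8 range.
    return max(qmin, min(qmax, total + out_offset))
```

The same structure would apply to s16 and s32 by changing `qmin` and `qmax`; only the clamping range differs in this sketch.
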
Convolutional Ops
Op | Weights | Activation | Bias | Comments |
---|---|---|---|---|
Depthwise Conv2d | s8 | s16 | s64 | filter size 3x1 |
Depthwise Conv2d | s8 | s8 | s64 | output depth % 32 == 0 |
Conv2d | s8 | s16 | s32 | |
Conv2d | s8 | s16 | s64 | filter size 1x1, filter depth % 32 == 0 |
Conv2d | s8 | s16 | s64 | filter size 1xn, grouped or ungrouped |
Conv2d | s8 | s8 | s32 | filter size 1x1, output depth % 8 == 0 |
Conv2d | s8 | s8 | s32 | filter depth % 32 == 0 |
Conv2d | s8 | s8 | s32 | filter shape == (48x3x1x48) |
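
The convolutional table can be read as a set of matching rules. The sketch below encodes those rows as a Python predicate so a model's layers can be checked against them offline. `ConvConfig` and `matches_optimized_row` are hypothetical names, not part of Kelvin TFLM. The sketch also assumes that filter sizes are written height x width and that the shape `(48x3x1x48)` is ordered (output depth, height, width, filter depth); neither is stated in the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConvConfig:
    """Hypothetical description of one convolution layer."""
    op: str            # "conv2d" or "depthwise_conv2d"
    weights: str       # e.g. "s8"
    activation: str    # e.g. "s8" or "s16"
    bias: str          # e.g. "s32" or "s64"
    filter_height: int
    filter_width: int
    filter_depth: int
    output_depth: int


def matches_optimized_row(cfg: ConvConfig) -> bool:
    """Return True if cfg matches at least one row of the table above."""
    types = (cfg.weights, cfg.activation, cfg.bias)
    size = (cfg.filter_height, cfg.filter_width)  # assumed height x width

    if cfg.op == "depthwise_conv2d":
        if types == ("s8", "s16", "s64"):
            return size == (3, 1)
        if types == ("s8", "s8", "s64"):
            return cfg.output_depth % 32 == 0
        return False

    if cfg.op == "conv2d":
        if types == ("s8", "s16", "s32"):
            return True  # this row lists no extra constraint
        if types == ("s8", "s16", "s64"):
            one_by_one = size == (1, 1) and cfg.filter_depth % 32 == 0
            one_by_n = cfg.filter_height == 1  # grouped or ungrouped
            return one_by_one or one_by_n
        if types == ("s8", "s8", "s32"):
            one_by_one = size == (1, 1) and cfg.output_depth % 8 == 0
            deep_filter = cfg.filter_depth % 32 == 0
            fixed_shape = (cfg.output_depth, cfg.filter_height,
                           cfg.filter_width, cfg.filter_depth) == (48, 3, 1, 48)
            return one_by_one or deep_filter or fixed_shape
        return False

    return False
```

For example, `matches_optimized_row(ConvConfig("conv2d", "s8", "s8", "s32", 3, 1, 48, 48))` matches the fixed-shape row. A match only means the configuration appears in the table; which kernel actually runs is decided by the Kelvin TFLM source.
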