AE_MULAAFD16SS.33_22_S2 — 16x16-bit signed fraction dual MAC with Addition/Addition and 32-bit (1.31) result, with saturation after every step.: slot 2 version.

Instruction Word

Slot
ae_slot2_0
6
3
6
2
6
1
6
0
5
9
5
8
5
7
5
6
5
5
5
4
5
3
5
2
5
1
5
0
4
9
4
8
4
7
4
6
4
5
4
4
4
3
4
2
4
1
4
0
3
9
3
8
3
7
3
6
3
5
3
4
3
3
3
2
3
1
3
0
2
9
2
8
2
7
2
6
2
5
2
4
2
3
2
2
2
1
2
0
1
9
1
8
1
7
1
6
1
5
1
4
1
3
1
2
1
1
1
0
9876543210
Format ae_format - 64 bit(s) 1111
AE_MULAAFD16SS.33_22_S201000000
ae_fld_mul_x2_S2_q0 3210
ae_fld_mul_x2_S2_d0 3210
ae_fld_mul_x2_S2_d1 3210

Assembler Syntax

AE_MULAAFD16SS.33_22_S2 aed0..15(ae_mul_S2_q0), aed0..15(ae_mul_S2_d0), aed0..15(ae_mul_S2_d1)

C Syntax

#include <xtensa/tie/xt_hifi2.h>

extern void AE_MULAAFD16SS_33_22_S2(ae_f32x2 d /*inout*/, ae_f16x4 d0, ae_f16x4 d1);

Description

AE_MULAAFD16SS.33_22 is a 16x16-bit signed fraction dual MAC with 32-bit (1.31) result, with saturation after each multiply and each accumulation. The extension 33_22 indicates that the two multiplication results are (1) the 3 element of the first AE_DR operand times the 3 element of the second AE_DR operand; and (2) the 2 element of the first AE_DR operand times the 2 element of the second AE_DR operand. AA indicates that both multiply results are added into the accumulator.

This operation is bit exact with two successive instantiations of the ITU-T L_mac basic operation. This is equivalent to AE_MULAAFD16SS.33_22 and will be automatically used by the compiler as needed.

Implementation Pipeline

In Out
AE_OVERFLOW Wstage, ae_mul_S2_q0 Wstage, ae_mul_S2_d0 Mstage, ae_mul_S2_d1 Mstage AE_OVERFLOW Wstage, ae_mul_S2_q0 Wstage

Protos that use AE_MULAAFD16SS.33_22_S2

proto AE_MULAAFD16SS.33_22_S2 { inout ae_f32x2 d, in ae_f16x4 d0, in ae_f16x4 d1 }{}{
AE_MULAAFD16SS.33_22_S2 d, d0, d1;
}