AE_MULA16X4 — Four-way SIMD 16x16-bit signed integer MAC with 32-bit result.

Instruction Word

Slot: ae2_slot1
Bit positions: 63 .. 0
Format ae_format2 - 64 bit(s): 0000 1110
AE_MULA16X4: 1100 1000
ae_fld_mul_x4_q1: bits 3210
ae_fld_mul_q0: bits 3210
ae_fld_mul_d1: bits 3210
ae_fld_mul_d0: bits 3210

Assembler Syntax

AE_MULA16X4 aed0..15(ae_mul_q1), aed0..15(ae_mul_q0), aed0..15(ae_mul_d1), aed0..15(ae_mul_d0)

C Syntax

#include <xtensa/tie/xt_hifi2.h>

extern void AE_MULA16X4(ae_int32x2 d0 /*inout*/, ae_int32x2 d1 /*inout*/, ae_int16x4 d2, ae_int16x4 d3);

Description

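AE_MULA16X4 is a four-way SIMD 16x16-bit signed integer multiply-accumulate (MAC). Each 16-bit lane of one input operand is multiplied by the corresponding lane of the other input operand, and the four products are added to the four 32-bit accumulator lanes held in the two in/out operands. Results are 32 bits wide and are not saturated.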
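The arithmetic described above can be sketched as a portable scalar model in plain C. This is only an illustration, not the hardware definition: the types `model_int32x2` and `model_int16x4`, the helpers `mac_wrap` and `model_mula16x4`, and in particular the pairing of 16-bit lanes with accumulator lanes are assumptions, since this page does not specify the lane order. "No saturation" is modeled here as two's-complement wraparound, which is also an assumption.

```c
#include <stdint.h>

/* Hypothetical stand-ins for ae_int32x2 and ae_int16x4 (not the real types). */
typedef struct { int32_t lane[2]; } model_int32x2;
typedef struct { int16_t lane[4]; } model_int16x4;

/* One signed 16x16 multiply-accumulate into a 32-bit lane.
 * The product always fits in 32 bits (magnitude at most 2^30).
 * The addition is done in unsigned arithmetic so that overflow wraps
 * instead of being undefined behavior; no saturation is applied. */
static int32_t mac_wrap(int32_t acc, int16_t a, int16_t b)
{
    int32_t  prod = (int32_t)a * (int32_t)b;
    uint32_t sum  = (uint32_t)acc + (uint32_t)prod;

    /* Convert back to signed without relying on implementation-defined casts. */
    if (sum & 0x80000000u)
        return (int32_t)(sum - 0x80000000u) + INT32_MIN;
    return (int32_t)sum;
}

/* Scalar model of the four-way MAC.
 * ASSUMPTION: lanes 0 and 1 of x and y feed q0, lanes 2 and 3 feed q1.
 * The real instruction's lane mapping may differ. */
static void model_mula16x4(model_int32x2 *q0, model_int32x2 *q1,
                           const model_int16x4 *x, const model_int16x4 *y)
{
    q0->lane[0] = mac_wrap(q0->lane[0], x->lane[0], y->lane[0]);
    q0->lane[1] = mac_wrap(q0->lane[1], x->lane[1], y->lane[1]);
    q1->lane[0] = mac_wrap(q1->lane[0], x->lane[2], y->lane[2]);
    q1->lane[1] = mac_wrap(q1->lane[1], x->lane[3], y->lane[3]);
}
```

A model like this can be useful for checking results off-target, provided the lane mapping is first confirmed against the actual instruction.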

Implementation Pipeline

In: ae_mul_q1 Wstage, ae_mul_q0 Wstage, ae_mul_d1 Mstage, ae_mul_d0 Mstage
Out: ae_mul_q1 Wstage, ae_mul_q0 Wstage

Protos that use AE_MULA16X4

proto AE_MULA16X4 { inout ae_int32x2 d0, inout ae_int32x2 d1, in ae_int16x4 d2, in ae_int16x4 d3 }{}{
    AE_MULA16X4 d0, d1, d2, d3;
}
proto AE_MULA16X4_vector { inout ae_int32x4 d, in ae_int16x4 d0, in ae_int16x4 d1 }{}{
    AE_MULA16X4 d->d1, d->d0, d0, d1;
}
proto AE_MULAAR16P16X4S_vector { inout ae_int16x4 d, in ae_int16x4 d0, in ae_int16x4 d1 }{ae_int32x4 t}{
    AE_SEXT32X2D16.32 t->d0, d;
    AE_SEXT32X2D16.10 t->d1, d;
    AE_MULA16X4 t->d0, t->d1, d0, d1;
    AE_SEL16I d, t->d0, t->d1, 8;
}