0% found this document useful (0 votes)

256 views

An4841 Digital Signal Processing For Stm32 Microcontrollers Using Cmsis Stmicroelectronics

Uploaded by

Pablo Salazar

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

256 views

An4841 Digital Signal Processing For Stm32 Microcontrollers Using Cmsis Stmicroelectronics

Uploaded by

Pablo Salazar

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 25

AN4841

Application note
Digital signal processing for STM32 microcontrollers using CMSIS

Introduction
This application note describes the development of digital filters for analog signals, and the
transformations between time and frequency domains. The examples discussed in this
document include a low-pass and a high-pass FIR filter, as well as Fourier fast transforms
with floating and fixed point at different frequencies.
The associated firmware (X-CUBE-DSPDEMO), applicable to STM32F429xx and
STM32F746xx MCUs, can be adapted to any STM32 microcontroller.
Digital Signal Processing (DSP) is the mathematical manipulation and processing of
signals. Signals to be processed come in various physical formats that include audio, video
or any analog signal that carries information, such as the output signal of a microphone.
Both Cortex®-M4-based STM32F4 Series and Cortex®-M7-based STM32F7 Series provide
instructions for signal processing, and support advanced SIMD (Single Instruction Multi
Data) and Single cycle MAC (Multiply and Accumulate) instructions.
The use of STM32 MCUs in a real-time DSP application not only reduces cost, but also
reduces the overall power consumption.
The following documents are considered as references:
• PM0214, “STM32F3 and STM32F4 Series Cortex®-M4 programming manual”, available
on www.st.com
• PM0253, “STM32F7 Series Cortex®-M7 programming manual”, available on www.st.com
• CMSIS - Cortex® Microcontroller Software Interface Standard, available on
www.arm.com
• Arm® compiler toolchain Compiler reference, available on http://infocenter.arm.com
• “Developing Optimized Signal Processing Software on the Cortex®-M4 Processor”,
technical paper by Shyam Sadasivan, available on www.techonline.com.

February 2018 AN4841 Rev 2 1/25

www.st.com 1
Contents AN4841

Contents

1 Basic DSP notions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

1.1 Data types . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.1.1 Floating point . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.1.2 Fixed point . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.1.3 Fixed-point vs. floating-point . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

2 Cortex® DSP instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

2.1 Saturation instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.2 MAC instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.3 SIMD instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

3 Algorithms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.1 Filters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.2 Transforms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

4 DSP application development . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

4.1 CMSIS library . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .11
4.2 DSP demonstration overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .11
4.2.1 FFT demonstration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
4.2.2 FFT performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
4.2.3 FIR filter demonstration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
4.2.4 FIR filter design specification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
4.2.5 FIR performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
4.2.6 FIR example software overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
4.3 Overview of STM32 product lines performance . . . . . . . . . . . . . . . . . . . . 22

5 Revision history . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

2/25 AN4841 Rev 2

AN4841 List of tables

List of tables

Table 1. Pros and cons of number formats in DSP applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

Table 2. Saturating instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
Table 3. SIMD instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
Table 4. FIR filter specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
Table 5. FFT performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
Table 6. Revision history . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

AN4841 Rev 2 3/25

3
List of figures AN4841

List of figures

Figure 1. Single precision number format . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

Figure 2. Double precision number format. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Figure 3. 32 bits fixed point number format . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Figure 4. FFT size calculation performance on STM32F429 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
Figure 5. FFT size calculation performance on STM32F746 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
Figure 6. Running FFT 1024 points with input data in Float-32 on STM32F429I-DISCO . . . . . . . . . 14
Figure 7. Running FFT 1024 points with input data in Float-32 on STM32F746-DISCO. . . . . . . . . . 15
Figure 8. Block diagram of the FIR example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Figure 9. Generated input (sum of two sine waves) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
Figure 10. Magnitude spectrum of the input signal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
Figure 11. FIR filter verification using MATLAB® FVT tool . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
Figure 12. FIR filter computation performance for STM32F429. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
Figure 13. FIR filter computation performance for STM32F746. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
Figure 14. FIR demonstration results on STM32F429I-DISCO . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Figure 15. FIR demonstration results on STM32F746-DISCO . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

4/25 AN4841 Rev 2

AN4841 Basic DSP notions

1 Basic DSP notions

1.1 Data types

DSP operations can use either floating-point or fixed-point formats.

1.1.1 Floating point

Floating point is a method to represent real numbers.
The floating point unit in the Cortex®-M4 is only single precision, as it includes an 8-bit
exponent field and a 23-bit fraction, for a total of 32 bits (see Figure 1). The floating point
unit in the Cortex®-M7 supports both single and double precision, as indicated in Figure 2.
The representation of single/double precision floating-point number is, respectively
Value = (-1)s x M x 2(E-127), or Value = (-1)s x M x 2(E-1023)
where S is the value of the sign bit, M is the value of the mantissa, and E is the value of the
exponent.

Figure 1. Single precision number format

ELWV

6LJQ
([SRQHQWELWV 0DQWLVVDELWV
ELW
069

Figure 2. Double precision number format

ELWV

6LJQ
([SRQHQWELWV 0DQWLVVDELWV
ELW
069

AN4841 Rev 2 5/25

24
Basic DSP notions AN4841

1.1.2 Fixed point

Fixed point representation expresses numbers with an integer part and a fractional part, in a
2-complement format. As an example, a 32-bit fixed point representation, shown in Figure 3,
allocates 24 bits for the integer part and 8 bits for the fractional part.

Figure 3. 32 bits fixed point number format

ELWV

,QWHJHUSDUWELWV )UDFWLRQELWV

069

Available fixed-point data sizes in Cortex®-Mx cores are 8-, 16- and 32-bit.
The most common format used for DSP operations are Q7, Q15 and Q31, with only
fractional bits to represent numbers between -1.0 and + 1.0.
The representation of a Q15 number is:
bs –1 –2 – 14 – 15
Value = ( – 1 ) × ( b 14 × 2 + b 13 × 2 + …+ b 1 × 2 + b0 × 2 )

where bs is the sign bit (the 15th bit), and bn is the digit for bit n.
The range of numbers supported in a Q15 number is comprised between -1.0 and 1.0,
corresponding to the smallest and largest integers that can be represented, respectively
-32768 and 32767.
For example, the number 0.25 will be encoded in Q15 as 0x2000(8192).
When performing operations on fixed-point the equation is as follows:
c = a <operand> b
where a, b and c are all fixed-point numbers, and <operand> refers to addition, subtraction,
multiplication, or division. This equation remains true for floating-point numbers as well.
Note: Care must be taken when doing operations on fixed-point numbers.
For example, if c = a x b with a and b in Q31 format, this will lead to a wrong result since the
compiler will treat it as an integer operation, consequently it will generate “muls a, b” and will
keep only the least significant 32 bits of the result.

6/25 AN4841 Rev 2

AN4841 Basic DSP notions

1.1.3 Fixed-point vs. floating-point

Table 1 highlights the main advantages and disadvantages of fixed-point vs. floating-point in
DSP applications.

Table 1. Pros and cons of number formats in DSP applications

Number format Fixed point Floating point

Advantages Fast implementation Supports a much wider range of values

Limited number range
Disadvantages Needs more memory space
Can easily go in overflow

AN4841 Rev 2 7/25

24
Cortex® DSP instructions AN4841

2 Cortex® DSP instructions

The Cortex®-Mx cores feature several instructions that result in efficient implementation of
DSP algorithms.

2.1 Saturation instructions

Saturating, addition and subtraction instructions are available for 8-, 16- and 32-bit values,
some of these instructions are listed in Table 2.

Table 2. Saturating instructions

Code Function

QADD8 Saturating four 8-bit integer additions

QSUB8 Saturating four 8-bit integer subtraction
QADD16 Saturating two 16-bit integer additions
QSUB16 Saturating two 16-bit integer subtraction
QADD Saturating 32-bit add
QSUB Saturating 32-bit subtraction

The SSAT (Signed SATurate) instruction is used to scale and saturate a signed value to any
bit position, with optional shift before saturating.

2.2 MAC instructions

Multiply ACcumulate (MAC) instructions are widely used in DSP algorithms, as in the case
of the Finite Impulse Response (FIR) and Infinite Impulse Response (IIR).
Executing multiplication and accumulation in single cycle instruction is a key requirement for
achieving high performance.
The following example explains how the SMMLA (Signed Most significant word MuLtiply
Accumulate) instruction works.

2.3 SIMD instructions

In addition to MAC instructions that execute a multiplication and an accumulation in a single
cycle, there are the SIMD (Single Instruction Multiple Data) instructions, performing multiple
identical operations in a single cycle instruction.

8/25 AN4841 Rev 2

AN4841 Cortex® DSP instructions

Table 3 lists some SIMD instructions.

Table 3. SIMD instructions

Code Function

Performs two 16-bit integer arithmetic additions in parallel, saturating the results to the
__qadd16
16-bit signed integer range -215 ≤ x ≤ 215 - 1.
__uhadd16 Performs two unsigned 16-bit integer additions, halving the results.
__shadd18 Performs four signed 8-bit integer additions, halving the results.
Performs two 16-bit signed multiplications, takes the difference of the products,
__smlsd subtracting the high half-word product from the low half-word product, and adds the
difference to a 32-bit accumulate operand.

The following example explains how the __shadd8 instruction works.

The __shadd8 intrinsic returns:

• The halved addition of the first bytes from each operand, in the first byte of the return
value
• The halved addition of the second bytes from each operand, in the second byte of the
return value
• The halved addition of the third bytes from each operand, in the third byte of the return
value
• The halved addition of the fourth bytes from each operand, in the fourth byte of the
return value

AN4841 Rev 2 9/25

24
Algorithms AN4841

3 Algorithms

3.1 Filters
The most common digital filters are:
• FIR (Finite Impulse Response): used, among others, in motor control and audio
equalization
• IIR (Infinite Impulse Response): used in smoothing data
The IIR filter can be used to implement filters such as Butterworth, Chebyshev, and Bessel.

3.2 Transforms
A transform is a function that converts data from a domain into another.
The FFT (Fast Fourier Transform) is a typical example: it is an efficient algorithm used to
convert a discrete time-domain signal into an equivalent frequency-domain signal based on
the Discrete Fourier Transform (DFT).

10/25 AN4841 Rev 2

AN4841 DSP application development

4 DSP application development

4.1 CMSIS library

The Arm® Cortex® Microcontroller Software Interface Standard (CMSIS) is a
vendor-independent hardware abstraction layer for all Cortex® processor based devices.
CMSIS has been developed by Arm® in conjunction with silicon, tools and middleware
partners.
The idea behind CMSIS is to provide a consistent and simple software interface to the
processor for interface peripherals, real-time operating systems, and middleware,
simplifying software re -use, reducing the learning curve for new microcontroller
developments and reducing the time to market for new devices.
CMSIS library comes with ST firmware under \Drivers\CMSIS\.
The CMSIS-DSP library includes:
• Basic mathematical functions with vector operations
• Fast mathematical functions, like sine and cosine
• Complex mathematical functions like calculating magnitude
• Filtering functions like FIR or IIR
• Matrix computing functions
• Transform functions like FFT
• Controller functions like PID controller
• Statistical functions like calculating minimum or maximum
• Support functions like converting from one format to another
• Interpolation functions
Most algorithms uses floating-point and fixed-point in various formats. For example, in FIR
case, the available Arm® functions are:
• arm_fir_init_f32
• arm_fir_f32
• arm_fir_init_q31
• arm_fir_q31
• arm_fir_fast_q31
• arm_fir_init_q15
• arm_fir_q15
• arm_fir_fast_q15
• arm_fir_init_q7
• arm_fir_q7

4.2 DSP demonstration overview

The goal of this demonstration is to show a full integration with STM32F429 using ADC,
DAC, DMA and timers, and also calling CMSIS routines, all with the use of graphics, taking
advantage of the 2.4" QVGA TFT LCD included in the discovery board.

AN4841 Rev 2 11/25

24
DSP application development AN4841

This demonstration also shows how easy it is to migrate an application from an STM32F4
microcontroller to one of the STM32F7 Series.
A graphical user interface is designed using STemWin, to simplify access to different
features of the demonstration.

4.2.1 FFT demonstration

The main features of this FFT example are
• For the STM32F429
– Generate data signal and transfer it through DMA1 Stream6 Channel7 to DAC
output Channel2
– Acquire data signal with ADC Channel0 and transfer it for elaboration through
DMA2 Stream0 Channel0
– Vary the frequency of the input signal using Timer 2
– Initialize FFT processing with various data: Float-32, Q15 and Q31
– Perform FFT processing and calculate the magnitude values
– Draw input and output data on LCD screen
• For the STM32F746
– Generate data signal and transfer it through DMA1 Stream5 Channel7 to DAC
output Channel1
– Acquire data signal with ADC Channel4 and transfer it for elaboration through
DMA2 Stream0 Channel0
– Vary the frequency of the input signal using Timer 2
– Initialize FFT processing with various data: Float-32, Q15 and Q31
– Perform FFT processing and calculate the magnitude values
– Draw input and output data on LCD screen
The code below shows how to initialize the CFFT function to compute a 1024, 256 or 64
points FFT and transform the input signals (aFFT_Input_f32) from the time domain to the
frequency domain, then calculate the magnitude at each bin, and finally calculate and return
the maximum magnitude value.

FFT_Length depends on the user choice, it can be 1024, 256 or 64. The user can find FFT
initialization and processing for other formats in the fft_processing.c source file.

12/25 AN4841 Rev 2

AN4841 DSP application development

4.2.2 FFT performance

Figure 4 shows the absolute execution time and the number of cycles taken to perform an
FFT on STM32F429 device running at 180 MHz, while Figure 5 refers to the same
parameters measured on an STM32F746 device running at 216 MHz, in both cases using
MDK-Arm™ (5.14.0.0) toolchain supporting C Compiler V5.05 with Level 3 (-O3) for time
optimization.

Figure 4. FFT size calculation performance on STM32F429

Figure 5. FFT size calculation performance on STM32F746

AN4841 Rev 2 13/25

24
DSP application development AN4841

Results on STM32F429I-DISCO
To run one of the FFT examples select FFT, then connect PA5 to PA0.
Signal shape and spectrum are displayed on the LCD.
By varying the slider position the user can see the new input signal shape and the FFT
spectrum of the input signal updated in real time, as illustrated in Figure 6.

Figure 6. Running FFT 1024 points with input data in Float-32 on STM32F429I-DISCO

14/25 AN4841 Rev 2

AN4841 DSP application development

Results on STM32F746-DISCO
In this case it is possible to take advantage of the existing connection between PA4 and
DCMI_HSYNC. No other connections are needed since PA4 is configured as an output for
DAC1 and an input for ADC1.
Signal shape and spectrum are displayed on the LCD.
By varying the slider position the user can see the new input signal shape and the FFT
spectrum of the input signal updated in real time, as illustrated in Figure 7.

Figure 7. Running FFT 1024 points with input data in Float-32 on STM32F746-DISCO

4.2.3 FIR filter demonstration

The goal of this demonstration is to remove the spurious signal (a sine wave at 15 kHz) from
the desired signal (a sine wave at 1 kHz), applying a low-pass FIR filter in different format.
When choosing the Q15 format, it is possible to isolate the spurious signal applying a
high-pass FIR filter.
The block diagram of the FIR example is shown in Figure 8.

Figure 8. Block diagram of the FIR example

AN4841 Rev 2 15/25

24
DSP application development AN4841

The code below shows the initialization and the processing function for the floating-point
FIR filter.

The user can find FIR initialization and processing for other formats in the fir_processing.c
source file.
Input data to the FIR filter is the sum of the 1 kHz and 15 kHz sine waves (see Figure 9),
generated by MATLAB® in floating point format using the following script:

Figure 9. Generated input (sum of two sine waves)

16/25 AN4841 Rev 2

AN4841 DSP application development

The magnitude spectrum of the input signal (Figure 10) shows that there are two
frequencies, 1 kHz and 15 kHz.

Figure 10. Magnitude spectrum of the input signal

As the noise is positioned around 15 kHz, the cutoff point must be set at a lower frequency,
namely at 6 kHz.

4.2.4 FIR filter design specification

The main features are listed in Table 4.

Table 4. FIR filter specifications

Feature / Parameter Value

Type Low-pass
Order 28
Sampling frequency 48 kHz
Cut-off frequency 6 kHz

AN4841 Rev 2 17/25

24
DSP application development AN4841

The low-pass filter is designed with MATLAB®, using the commands shown below

Note: FIR filter order is equal to the number of coefficients -1.

In order to verify the designed filter, it’s possible to use the Filter Visualization Tool in
MATLAB® using the following command:

The Filter Visualization Tool (FVT) is a practical tool allowing the user to verify the details
and the parameters of the built filter.
In Figure 11 are reported (left to right, top to bottom):
• magnitude response
• filter gain (in dB) vs. frequency (in Hz)
• impulse response
• step response

18/25 AN4841 Rev 2

Figure 11. FIR filter verification using MATLAB® FVT tool

AN4841
AN4841 Rev 2

DSP application development

19/25
DSP application development AN4841

4.2.5 FIR performance

Figure 12 shows the absolute execution time and the number of cycles taken to run the
previously designed FIR filter on STM32F429I device running at 180 MHz, while Figure 13
refers to the STM32F746 device running at 216 MHz, in both cases using MDK-Arm™
(5.14.0.0) toolchain supporting C Compiler V5.05 with Level 3 (-O3) for time optimization.

Figure 12. FIR filter computation performance for STM32F429

Figure 13. FIR filter computation performance for STM32F746

4.2.6 FIR example software overview

The main features of this FIR example are
• Generate the input data signal and stock in the RAM
• Initialize FFT processing with various data: F32, Q15 and Q31
• Apply the low-pass FIR filter for Float-32, Q15 and Q31
• Apply the high-pass FIR filter for Q15
• Draw input and output data on LCD screen

20/25 AN4841 Rev 2

AN4841 DSP application development

Results on STM32F429I-DISCO
This example considers two scenarios:
1. a FIR low-pass filter that includes Float-32, Q31 and Q15 data format
2. a FIR high-pass filter that includes only Q15 data format.
The oscilloscope screen captures for three different configurations are reported in
Figure 14. Left to right are shown
1. a low-pass FIR filter when the input data is floating point
2. a low-pass FIR filter with Q15 input data
3. a high-pass FIR filter with Q15 input data

Figure 14. FIR demonstration results on STM32F429I-DISCO

Results on STM32F746-DISCO
The same example has been run on the STM32F746, the waveforms are visible in
Figure 15. Left to right are shown:
1. a low-pass FIR filter when the input data is floating point.
2. a low-pass FIR filter with Q15 input data.
3. a high-pass FIR filter with Q15 input data.

Figure 15. FIR demonstration results on STM32F746-DISCO

AN4841 Rev 2 21/25

24
DSP application development AN4841

4.3 Overview of STM32 product lines performance

One of the purposes of this application note is to provide benchmarking results for different
STM32 Series. In the case in discussion, the DSP algorithm to use are:
• complex FFT using 64 and 1024 points (radix-4)
• use of fixed point format (Q15 and Q31)
The comparison is based on execution time (i.e. the time required for the FFT processing).
The input vector is generated with MATLAB®, using the commands below:

22/25 AN4841 Rev 2

AN4841 DSP application development

Table 5 summarizes the results, achieved using MDK-Arm™ (5.14.0.0) toolchain supporting
C Compiler V5.05 with Level 3 (-O3) for time optimization.

Table 5. FFT performance

System Cortex® Fixed point No. of
MCU Cycles Duration (µs)
frequency core format points

1024 783106 16314

Q31
64 26576 553
STM32F091 48 MHz M0
1024 938278 19547
Q15
64 37522 781
1024 214098 2973
Q31
64 7983 110
STM32F103 72 MHz M3
1024 248936 3457
Q15
64 9696 134
1024 193189 1609
Q31
64 6992 58
STM32F217 120 MHz M3
1024 200608 1671
Q15
64 7828 65
1024 178005 2472
Q31
64 7129 99
STM32F303 72 MHz M4
1024 101316 1407
Q15
64 4304 59
1024 153307 855
Q31
64 6025 33
STM32F429 180 MHz M4
1024 82299 457
Q15
64 3655 20
1024 93725 468
Q31
64 4537 22
STM32F746 216 MHz M7
1024 56989 284
Q15
64 2994 14
Q31 64 33493 1046
STM32L073 32 MHz M0+
Q15 64 44506 1390
1024 144214 1802
Q31
64 6007 75
STM32L476 80 MHz M4
1024 77371 967
Q15
64 3509 43

AN4841 Rev 2 23/25

24
Revision history AN4841

5 Revision history

Table 6. Revision history

Date Revision Description of changes

23-Mar-2016 1 Initial release

Updated Table 5: FFT performance.
23-Feb-2018 2
Minor text edits across the whole document.

24/25 AN4841 Rev 2

AN4841

IMPORTANT NOTICE – PLEASE READ CAREFULLY

STMicroelectronics NV and its subsidiaries (“ST”) reserve the right to make changes, corrections, enhancements, modifications, and
improvements to ST products and/or to this document at any time without notice. Purchasers should obtain the latest relevant information on
ST products before placing orders. ST products are sold pursuant to ST’s terms and conditions of sale in place at the time of order
acknowledgement.

Purchasers are solely responsible for the choice, selection, and use of ST products and ST assumes no liability for application assistance or
the design of Purchasers’ products.

No license, express or implied, to any intellectual property right is granted by ST herein.

Resale of ST products with provisions different from the information set forth herein shall void any warranty granted by ST for such product.

ST and the ST logo are trademarks of ST. All other product or service names are the property of their respective owners.

Information in this document supersedes and replaces information previously supplied in any prior versions of this document.

AN4841 Rev 2 25/25

Pfizer Brand Standards
No ratings yet
Pfizer Brand Standards
25 pages
THE LTSPICE XVII SIMULATOR: Commands and Applications
From Everand
THE LTSPICE XVII SIMULATOR: Commands and Applications
Gilles Brocard
5/5 (1)
DSP by Avatar Singh PDF
80% (15)
DSP by Avatar Singh PDF
355 pages
CAN and FPGA Communication Engineering: Implementation of a CAN Bus based Measurement System on an FPGA Development Kit
From Everand
CAN and FPGA Communication Engineering: Implementation of a CAN Bus based Measurement System on an FPGA Development Kit
Yu Zhu
No ratings yet
Home Designer Pro 2021 Users Guide PDF
No ratings yet
Home Designer Pro 2021 Users Guide PDF
156 pages
Practical Considerations in Fixed-Point FIR Filter Implem
No ratings yet
Practical Considerations in Fixed-Point FIR Filter Implem
15 pages
(Att) 4647
No ratings yet
(Att) 4647
25 pages
Smart Card Applications: Design models for using and programming smart cards
From Everand
Smart Card Applications: Design models for using and programming smart cards
Wolfgang Rankl
No ratings yet
AN4044 Application Note: Floating Point Unit Demonstration On STM32 Microcontrollers
No ratings yet
AN4044 Application Note: Floating Point Unit Demonstration On STM32 Microcontrollers
31 pages
Signal Processing Examples Using The TMS320C67x Digital Signal Processing Library (DSPLIB)
No ratings yet
Signal Processing Examples Using The TMS320C67x Digital Signal Processing Library (DSPLIB)
18 pages
chap15
No ratings yet
chap15
61 pages
DSP
No ratings yet
DSP
190 pages
Root File
No ratings yet
Root File
84 pages
Fixed point Signal Processors 1st Edition Wayne T. Padgett download
100% (2)
Fixed point Signal Processors 1st Edition Wayne T. Padgett download
49 pages
Practical Considerations in Fixed-Point FIR Filter Implementations
No ratings yet
Practical Considerations in Fixed-Point FIR Filter Implementations
12 pages
Fixed Point Signal Processing by W Paddget
100% (1)
Fixed Point Signal Processing by W Paddget
133 pages
Fixed-Point Signal Processing
No ratings yet
Fixed-Point Signal Processing
133 pages
Labview Digital Filter Design Toolkit Api Reference 2024-04-19-01-39-07
No ratings yet
Labview Digital Filter Design Toolkit Api Reference 2024-04-19-01-39-07
149 pages
Typical Digital Signal Processing Operations!: Prof. Shankar Prakriya! Indian Institute of Technology, Delhi!
No ratings yet
Typical Digital Signal Processing Operations!: Prof. Shankar Prakriya! Indian Institute of Technology, Delhi!
36 pages
BITS Pilani: Digital Signal Processing
No ratings yet
BITS Pilani: Digital Signal Processing
73 pages
DSP Lab With Ti c6x DSP and c6713 DSK 6.3
No ratings yet
DSP Lab With Ti c6x DSP and c6713 DSK 6.3
116 pages
DSP Floating Point Formats
No ratings yet
DSP Floating Point Formats
29 pages
FIR FILTER Implementation in ARM Instruction
No ratings yet
FIR FILTER Implementation in ARM Instruction
19 pages
Spra 948
No ratings yet
Spra 948
13 pages
DSP Cours V2 PDF
No ratings yet
DSP Cours V2 PDF
90 pages
Notes (2)
No ratings yet
Notes (2)
11 pages
Sprugh 7
No ratings yet
Sprugh 7
1,013 pages
DSPAA Notes
No ratings yet
DSPAA Notes
42 pages
DSP Processors
No ratings yet
DSP Processors
114 pages
DSP Notes
0% (1)
DSP Notes
26 pages
Summary STM32F4 DSP - English
No ratings yet
Summary STM32F4 DSP - English
5 pages
Real-Time DSP: ECE 5655/4655 Lecture Notes
No ratings yet
Real-Time DSP: ECE 5655/4655 Lecture Notes
34 pages
Real-Time DSP: ECE 5655/4655 Lecture Notes
No ratings yet
Real-Time DSP: ECE 5655/4655 Lecture Notes
34 pages
DSP Processors Theory
No ratings yet
DSP Processors Theory
9 pages
DSP Arithmetic
No ratings yet
DSP Arithmetic
33 pages
Xtremedsp Dsp48A For Spartan-3A DSP Fpgas: User Guide
No ratings yet
Xtremedsp Dsp48A For Spartan-3A DSP Fpgas: User Guide
52 pages
Advanced C Programming: Real Time Programming Like The Pros
No ratings yet
Advanced C Programming: Real Time Programming Like The Pros
41 pages
DSP Module 5
No ratings yet
DSP Module 5
24 pages
Real-Time Signal Processing Implementation For 100 Gb/s Fibre Communication
No ratings yet
Real-Time Signal Processing Implementation For 100 Gb/s Fibre Communication
37 pages
Tutorial 09 DSP IO Transceivers
No ratings yet
Tutorial 09 DSP IO Transceivers
161 pages
ShiWal95A
No ratings yet
ShiWal95A
8 pages
DSP_presentation_Sumit 3
No ratings yet
DSP_presentation_Sumit 3
63 pages
dsp
No ratings yet
dsp
117 pages
FULLTEXT01
No ratings yet
FULLTEXT01
134 pages
Family Manual: 16-Bit Digital Signal Controllers
No ratings yet
Family Manual: 16-Bit Digital Signal Controllers
448 pages
DSP Architecture
100% (1)
DSP Architecture
71 pages
Digital Signal Processors: Inderdeep Kaur Aulakh Asst. Prof. (IT), UIET Pu, CHD
No ratings yet
Digital Signal Processors: Inderdeep Kaur Aulakh Asst. Prof. (IT), UIET Pu, CHD
19 pages
DSP Module 5 2018 Scheme
No ratings yet
DSP Module 5 2018 Scheme
104 pages
DSP56800ERM - Reference Manual
No ratings yet
DSP56800ERM - Reference Manual
728 pages
Embedded Systems Notes
No ratings yet
Embedded Systems Notes
13 pages
01 Introduction
No ratings yet
01 Introduction
29 pages
Basic Research and Technologies for Two-Stage-to-Orbit Vehicles: Final Report of the Collaborative Research Centres 253, 255 and 259
From Everand
Basic Research and Technologies for Two-Stage-to-Orbit Vehicles: Final Report of the Collaborative Research Centres 253, 255 and 259
Dieter Jacob
No ratings yet
Sanjay - High Performance DSP Architectures
No ratings yet
Sanjay - High Performance DSP Architectures
38 pages
VHDL Fir
No ratings yet
VHDL Fir
21 pages
UG - EC303 DSP Part-9 FIR in C55x PDF
No ratings yet
UG - EC303 DSP Part-9 FIR in C55x PDF
27 pages
Dspa Course File
No ratings yet
Dspa Course File
10 pages
Digital Signal Processing Laboratory
No ratings yet
Digital Signal Processing Laboratory
177 pages
DSP Architecture
100% (1)
DSP Architecture
31 pages
Open Data Structures: An Introduction
From Everand
Open Data Structures: An Introduction
Pat Morin
4/5 (4)
Injection Molding Processing Data
From Everand
Injection Molding Processing Data
Alberto Naranjo C.
No ratings yet
DC/DC Converter Handbook: SMPS topologies from an EMC point of view
From Everand
DC/DC Converter Handbook: SMPS topologies from an EMC point of view
Andreas Nadler
No ratings yet
Control Systems
From Everand
Control Systems
Francisco Luis Pagola y de las Heras
No ratings yet
Paper Characterizing The Behavior of Greenhouse Climate
No ratings yet
Paper Characterizing The Behavior of Greenhouse Climate
6 pages
Stm32Cubef3: Stm32Cube Mcu Package For Stm32F3 Series With Hal, Low-Layer Drivers and Dedicated Middleware
No ratings yet
Stm32Cubef3: Stm32Cube Mcu Package For Stm32F3 Series With Hal, Low-Layer Drivers and Dedicated Middleware
4 pages
An2820 Driving Bipolar Stepper Motors Using A Mediumdensity Stm32f103xx Microcontroller Stmicroelectronics
No ratings yet
An2820 Driving Bipolar Stepper Motors Using A Mediumdensity Stm32f103xx Microcontroller Stmicroelectronics
23 pages
GALÁPAGOS
No ratings yet
GALÁPAGOS
4 pages
SAC - Course Overview
No ratings yet
SAC - Course Overview
3 pages
BUL-77 Videoregistratori DN - Manuale Installazione
No ratings yet
BUL-77 Videoregistratori DN - Manuale Installazione
53 pages
1 OneDrive For Business User Guide
No ratings yet
1 OneDrive For Business User Guide
3 pages
Ict Practical 3
No ratings yet
Ict Practical 3
5 pages
A500 Diagnostic Tool SOP - v1.0
No ratings yet
A500 Diagnostic Tool SOP - v1.0
23 pages
Disease Detection in Plants - Report
No ratings yet
Disease Detection in Plants - Report
78 pages
Glosario Aramis
No ratings yet
Glosario Aramis
13 pages
JD_Internship - 5G_6G Wireless.docx
No ratings yet
JD_Internship - 5G_6G Wireless.docx
2 pages
Chapter 1 - Basic Concepts and Computer Evolution
No ratings yet
Chapter 1 - Basic Concepts and Computer Evolution
23 pages
DS-MP5604N Series Mobile Video Recorder: Key Feature Packing List
No ratings yet
DS-MP5604N Series Mobile Video Recorder: Key Feature Packing List
4 pages
AutoCAD Commands 1
No ratings yet
AutoCAD Commands 1
6 pages
SURFCAM 6 Whats New
No ratings yet
SURFCAM 6 Whats New
41 pages
Fortimanager Informations
No ratings yet
Fortimanager Informations
6 pages
Ui and Ux Design B2 - T2
No ratings yet
Ui and Ux Design B2 - T2
69 pages
Unit-1 Hardik Sir SPU
No ratings yet
Unit-1 Hardik Sir SPU
57 pages
CadWorx How To
No ratings yet
CadWorx How To
7 pages
Pro Builder Documentation
No ratings yet
Pro Builder Documentation
20 pages
Bim and Smart Contracts For Smart Cities
No ratings yet
Bim and Smart Contracts For Smart Cities
37 pages
A350 EFB Operational Procedure
No ratings yet
A350 EFB Operational Procedure
7 pages
Automated Optimized Classification Techniques For Magnetic Resonance Brain Images
No ratings yet
Automated Optimized Classification Techniques For Magnetic Resonance Brain Images
26 pages
Tri Force
No ratings yet
Tri Force
6 pages
Project File Of: IT: Writer Styles and Image Editing
No ratings yet
Project File Of: IT: Writer Styles and Image Editing
8 pages
BASIC COMPUTER Operations
No ratings yet
BASIC COMPUTER Operations
23 pages
Manual KV Studio V9 PDF
No ratings yet
Manual KV Studio V9 PDF
542 pages
GIS Practical No 4
No ratings yet
GIS Practical No 4
21 pages
Implementation of FPGA-based Accelerator For CNN
No ratings yet
Implementation of FPGA-based Accelerator For CNN
7 pages
AJP Question Bank
No ratings yet
AJP Question Bank
4 pages
UPSC BPSC All State PCS Pervious Year Questions Test-2
No ratings yet
UPSC BPSC All State PCS Pervious Year Questions Test-2
5 pages

An4841 Digital Signal Processing For Stm32 Microcontrollers Using Cmsis Stmicroelectronics

Uploaded by

An4841 Digital Signal Processing For Stm32 Microcontrollers Using Cmsis Stmicroelectronics

Uploaded by

AN4841

February 2018 AN4841 Rev 2 1/25

1 Basic DSP notions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

2 Cortex® DSP instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

4 DSP application development . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

2/25 AN4841 Rev 2

Table 1. Pros and cons of number formats in DSP applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

AN4841 Rev 2 3/25

Figure 1. Single precision number format . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

4/25 AN4841 Rev 2

1 Basic DSP notions

1.1 Data types

1.1.1 Floating point

Figure 1. Single precision number format

Figure 2. Double precision number format

AN4841 Rev 2 5/25

1.1.2 Fixed point

Figure 3. 32 bits fixed point number format

,QWHJHUSDUW ELWV )UDFWLRQ ELWV

6/25 AN4841 Rev 2

1.1.3 Fixed-point vs. floating-point

Table 1. Pros and cons of number formats in DSP applications

Advantages Fast implementation Supports a much wider range of values

AN4841 Rev 2 7/25

2 Cortex® DSP instructions

2.1 Saturation instructions

Table 2. Saturating instructions

QADD8 Saturating four 8-bit integer additions

2.2 MAC instructions

2.3 SIMD instructions

8/25 AN4841 Rev 2

Table 3 lists some SIMD instructions.

Table 3. SIMD instructions

The following example explains how the __shadd8 instruction works.

The __shadd8 intrinsic returns:

AN4841 Rev 2 9/25

10/25 AN4841 Rev 2

4 DSP application development

4.1 CMSIS library

4.2 DSP demonstration overview

AN4841 Rev 2 11/25

4.2.1 FFT demonstration

12/25 AN4841 Rev 2

4.2.2 FFT performance

Figure 4. FFT size calculation performance on STM32F429

Figure 5. FFT size calculation performance on STM32F746

AN4841 Rev 2 13/25

14/25 AN4841 Rev 2

4.2.3 FIR filter demonstration

Figure 8. Block diagram of the FIR example

AN4841 Rev 2 15/25

Figure 9. Generated input (sum of two sine waves)

16/25 AN4841 Rev 2

Figure 10. Magnitude spectrum of the input signal

4.2.4 FIR filter design specification

Table 4. FIR filter specifications

AN4841 Rev 2 17/25

Note: FIR filter order is equal to the number of coefficients -1.

18/25 AN4841 Rev 2

DSP application development

4.2.5 FIR performance

Figure 12. FIR filter computation performance for STM32F429

Figure 13. FIR filter computation performance for STM32F746

4.2.6 FIR example software overview

20/25 AN4841 Rev 2

Figure 14. FIR demonstration results on STM32F429I-DISCO

Figure 15. FIR demonstration results on STM32F746-DISCO

AN4841 Rev 2 21/25

4.3 Overview of STM32 product lines performance

22/25 AN4841 Rev 2

Table 5. FFT performance

1024 783106 16314

AN4841 Rev 2 23/25

Table 6. Revision history

23-Mar-2016 1 Initial release

24/25 AN4841 Rev 2

,QWHJHUSDUWELWV )UDFWLRQELWV