BUA1921d Introduction to Electronic Stethoscopes Acoustics and Signal Processing
BUA1921d Introduction to Electronic Stethoscopes Acoustics and Signal Processing
Author
Struer: Gimsinglundvej 20 Copenhagen: Arne Jacobsens Allé 7 Phone +45 70 301 600 www.medicomnordic.com
DK-7600 Struer, Denmark DK-2860 Søborg, Denmark Fax +45 70 301 700
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
Table of Contents
1 Introduction ......................................................................................................... 3
2 Hardware key specifications ................................................................................... 3
2.1 General introduction to electronic auscultation .......................................................... 3
2.2 Auscultation sensor pickup of signal from the human thorax ....................................... 4
2.3 Introduction to typical auscultation signals and their perception................................... 6
2.4 Implications of A/D-conversion ............................................................................... 7
2.5 Electronic stethoscope key system features and specifications ..................................... 9
2.5.1 Frequency range and filtering ....................................................................... 9
2.5.2 Dynamic range, amplification and electro acoustic signal-to-noise ratio .............. 9
2.5.3 Amplification rate, volume control and maximum output limitation .................. 10
2.5.4 Handling noise and ergonomics .................................................................. 11
2.5.5 Ambient noise.......................................................................................... 13
2.5.6 Sampling frequency, resolution (bit depth) and compression .......................... 14
2.5.7 Hand-ear signal perception effects .............................................................. 14
2.6 Summary table ................................................................................................... 16
3 References ......................................................................................................... 16
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 2 of 16
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
1 Introduction
B&OM have worked within electronic auscultation for almost two decades and among other
achievements has developed and manufactured three generations 3M Littmann digital
stethoscopes. This has included e.g. inventing and implementing state-of-the-art auscultation
sensors as well as inventing and implementing real-time embedded software algorithms to
analyze for e.g. stethoscope use status and heart rate and to digitally record and wirelessly
transfer to third party devices e.g. using Bluetooth. Apart from this B&OM has been conducting
independent research and development within automated assisted auscultation diagnostics
both in the areas of cardiovascular and respiratory disease areas.
The present report provides a summary of the most ‘critical-to-design’ aspects influencing the
sensing of sound from the human (or animal) body as well as the signal processing applied to
help preserve signal integrity while amplifying, digitizing and reproducing such signals.
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 3 of 16
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
According to [3] and [4] the thorax impedance elements may be estimated based on sensor
application surface area and application force (pressure). In example, for a Ø30 mm sensor
applied with 0.6 N force the parameters Rt=4.8 Ns/m, Mt=5.4 g and Ct=0.62 mm/N may be
used for calculating a representative thorax impedance. In order to apply two distinct thorax
impedances to either couple to housing or to the diaphragm it could be assumed that these
individually will yield the same value (accurate enough for enclosed examples):
1
Zt = Rt + jMt +
jCt
The influence from hand/arm holding the transducer housing may be relevant in some
situations, and the loading contribution from Zha may then be implemented by applying it in
series with Mht and Mh. A ‘typical’ holding force would best be represented by a resistive
element of ~40 Ns/m. Furthermore an inclusion of an enclosed air-cavity directly in connection
with the sensor diaphragm is implemented by adding this loading contribution in series with
Cd.
The force acting on the sensor diaphragm which controls its deflection and hence the
generation of electrical signal to convey to further processing may be calculated using the
following expression:
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 4 of 16
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
resonance between the tissue and the diaphragm impedance which affects the high-end
frequency sensitivity.
The following figure demonstrates these effects (using arbitrary but realistic component
values) either where the sensor is positioned freely on the thorax or where the physician is
holding the sensor in place (applying more back-support to drive amplitude up at low
frequencies). Furthermore, a graph showing the influence from basic to twice as stiff sensor
diaphragm (half the resulting compliance) which increases sensitivity at high frequencies.
As shown through these examples, the effect of hand-holding the sensor onto the chest while
performing auscultation) has the positive effect (which is quite normal) of not only extending
the frequency range towards very low frequencies as the housing backing restricts the housing
from moving with the chest vibration below the resonance point but at the same time also
effectively dampen the noticeable low frequency resonance thereby typically improves overall
sound ‘clarity’.
Furthermore, the more ‘stiff’ diaphragm, which very well could be a result of optimizing an air-
cavity between the diaphragm and the microphone in the above illustration to just allow the
diaphragm deflection (smallest possible air-cavity), has the positive effect of increasing
sensitivity towards higher frequencies.
In short summary, when further down in this report the frequency range of a stethoscope or
sensor system is addressed it is of significance to understand how such performance is
achieved. E.g. as long as acoustic behaviour during auscultation is stable then filtering may be
applied whereby ‘any outer’ frequency response function can be achieved, however only if the
core sensor (and its application) is diligently optimized then this will also provide best possible
dynamic range and signal-to-noise performance. Furthermore, with respect to future sourcing,
collaborating or development of a novel auscultation sensor system for picking up delicate
respiratory signals it is therefore essential to not rely only on provided specification data but
combine such information carefully with an understanding of the core sensor principle to
secure high-quality data acquisition.
The simple simulation/modelling provided above is not intended to necessarily be highly
accurate but will be used in the following to point out the influence of specific design elements
affecting the performance of the stethoscope or auscultation sensor system.
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 5 of 16
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 6 of 16
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
Masking phenomena is however merely a problem of the human auditory perception, for digital
signal processing algorithms the collective assessment of the entire frequency range and
dynamic range is not an issue. However before being able to make available the full signal to
e.g. a digital signal processing algorithm, is has to transform from the analogue domain and
into the digital (A/D-conversion) and here the ‘extreme’ dynamics of the auscultation signals
also complicates registration that does not critically deteriorate the signal qualities.
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 7 of 16
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
The below figure demonstrates the situation where a normal respiratory auscultation recording
is registered either without (left side) or with 1st order pre-emphasis of the higher frequency
components.
When comparing to the digitization upper and lower boundaries it is obvious that with under
normal (neutral) conditions then the high-frequency signal components lies very close to lower
boundary and hence are represented in the digital domain with fairly poor SNR, assuming a
firm 100 dB dynamic range ranging from -80 to +20 dB (which even is slightly higher than
typical 16 bit precision of 96 dB), and even some details around 700 Hz appears to fall entirely
below the lower limit. If however applying pre-emphasis to the analogue signal and then
digitizing, then these same higher frequency signal components are ‘lifted up higher’ into the
dynamic range whereby the SNR is preserved (e.g. the 700 Hz details now appear to be
represented with ~42 dB distance to the lower limit).
For later playback of the pre-emphasized and recorded signal, obviously the weighting has to
be reversed for a true sound comparable to the original analogue signal and this may be done
by applying analogue filtering after the final D/A-conversion.
The concept of pre-emphasizing the picked-up physiological signal prior to digitization is
particularly useful when developing an auscultation system for general purposes, e.g. both
low-frequency cardiac sounds and higher-pitched respiratory components and may be of
secondary relevance if narrowing the frequency range significantly. In other words, the
concept may not be critical for initial indications related e.g. ‘only’ with IPF (if its signal
components exist only in a narrow frequency range). However later expansion of therapeutic
scope might stress performance such that optimal digital representation will only be
accomplished by applying some sort of frequency weighting/shaping to preserve sufficient
upper/lower calculation margin in the digital domain.
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 8 of 16
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 9 of 16
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
To reflect the relevance of taking into account the frequency dependent threshold of hearing,
usually signal-to-noise ratio will often be expressed in terms of maximum undistorted (e.g.
below 2% total harmonic distortion for a pure tone played back) output level’s elevation over
the system A-weighted noise level.
The higher SNR the better performance but generally a number higher than 75-80 dB will
provide acceptable user listening comfort and details audibility. Again, if not listening to the
signal but rather sending to external analysis, the SNR performance is not directly relevant as
long as signal processing is able to extract relevant, e.g. IPF, signal components. Instead, the
‘fundamental’ pick-up and digitization of this signal (as discussed above) are the key features
to pursue.
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 10 of 16
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
While applying amplification and then changing input signal may result in sudden increase in
signal amplitude which may be associated with signal distortion due to amplitude restrictions
and/or uncomfortable or even hazardous signal levels (which also may be further pronounced
due to presence of distortion). For that reason it may be desirable to incorporate signal
compression so that e.g. towards higher level (absolute level when compared to the electronic
stethoscope capabilities) the amplification rate is automatically lowered. Also it may be
desirable to incorporate a hard upper limit for sound reproduction, such that under no
circumstances will the acoustical signal reproduced by the stethoscope system exceed e.g. 140
dB SPL (impulse measurement). As much as such an upper limit might sound very loud indeed
it is worth noting that conventional stethoscopes are perfectly capable of producing peak levels
well exceeding 150 dB SPL (e.g. tapping on the diaphragm or rapidly placing it on the chest of
a patient).
The above figure shows the native features of such limiting either by a ‘soft clipping’ to lower
amplification rate for signals exceeding a predefined threshold or entirely capping the signal
amplitude for such higher levels.
For the ‘soft clipping’ approach many different implementations would be possible, e.g.
optimizing for lowest resulting distortion or for least risk of exceeding a ‘hard’ threshold. In
general terms, the sooner (lower levels) such limiting begins the least audible the result will
be, however non-linear amplification may possibly influence the character of certain
auscultation sounds which then could be critical to clinical utilization (i.e. compared to
conventional mechanical stethoscope).
Furthermore such a clipping feature may be common to the entire signal, e.g. based on the
highest amplitude present, or subject to individual clipping strategies for signals in different
frequency ranges, e.g. a soft compression and/or hard limiting feature for frequencies in the
20-200 Hz range but no compression or limitation for signals beyond 200 Hz. There are no
rules or standards for such details, and all comes down to the specific requirements rising from
perceived operator listening sound quality and no such output limitation would influence any
remote signal analysis.
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 11 of 16
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
however it is very difficult to specify and quantify how exactly a device should be designed to
ensure minimal disturbance from handling noises as this will depend much on the specific use
situation being faced.
The sources to handling noise come from a long list of interactions between device, operator
and patient and the following table will highlight some of these while attaching brief comments
on means to optimize design.
a) Obviously one of the most central sources to handling noise is scratching directly at the
interface between the stethoscope diaphragm and the patient skin where the main body
of auscultation sound is assessed. Not only friction of larger surfaces here but also the
presence of any edges etc. will produce noise. In the presence of such movement over
the patient to stethoscope interface one of the most direct ways to limit generation
diaphragm scratching noise is to minimize the friction e.g. by selecting optimal
materials and/or surface treatments, coatings etc.
b) Another type of input to handling noise is scratching in the operator to stethoscope
interface while holding and operating the stethoscope. Part of this is associated with
e.g. finger scratching on the surface of the stethoscope structure thereby due to friction
generating noise which can travel through the material (or even through air) to be
picked-up by the sensor. Again, in the presence of such operator handling stethoscope
surface one of the most direct ways to limit generation diaphragm scratching noise is to
minimize the friction e.g. by selecting optimal materials and/or surface treatments,
coatings etc. Furthermore however, the pick-up of such noise in the sensor can be
minimized through designing the device mechanical structure for minimal transduction,
e.g. using very stiff or dampened materials and possibly integrating vibration ‘traps’
(e.g. a physical node over which a structural wave cannot transmit its energy).
c) Another significant part of operator handling noise is associated vibration which pushes
the stethoscope towards the patient skin whereby relative vibratory signals are
generated directly on the diaphragm for pick-up by the stethoscope sensor. There are
no ‘easy fixes’ to eliminate such noise generation. However, as typically these noises
are associated with very low-frequency components originating from operator muscle
tremor (predominantly below 100 Hz and most energy below 20 Hz), such noise may be
handled through amplification and filtering, e.g. during subsequent processing in the
stethoscope (before presenting to the operator) or not at all if ‘only’ sending for remote
signal analysis where any noise will not disturb the listening experience). If it can be
determined that such filtering will be performed because no relevant physiological
information will ever be present here, analog filtering may be integrated directly in the
sensor thereby possibly relaxing dynamic range requirements for digital representation
of the signal (which again may result in a more efficient and lower-cost system).
On top of specifically suggested mitigation approaches the device styling and ergonomics plays
an important role in limiting the handling noise contributions, the more easy and natural it is
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 12 of 16
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
for the operator to hold and control the stethoscope, the better he will be able to do so in a
controlled and calm manner. Moreover, if further silence is required it may be solved e.g. by
strapping the sensor head to the patient thereby completely eliminating the operator influence
during auscultation.
Testing and quantification and comparison of performance is highly challenging for handling
noises as styling and ergonomics plays a big role, meaning that comparison is ‘only’ possible if
narrowing down the isolated effect of single design elements, e.g. different types of coating
applied in various positions on the stethoscope. In addition, also the rate of amplification and
filtering approach plays a significant part as in some systems frequency areas of significant
noise contribution might not be amplified similarly.
Figure 9 Entry paths for ambient noise into a stethoscope sensor system
Unless dedicated technology is available, as e.g. demonstrated in [4] for a passive approach,
the options to reduce ambient noise through design are limited as a major contributor is
associated with the in-coupling of noise to the stethoscope sensor housing structure. Even so
supplementary parameters such as stethoscope mass (the higher the better), closeness of
interface to patient skin (the tighter the better) may prove significant in the practical setting.
For a radical approach one could consider acoustic insulation of the sensor head (applying a
solid shell around the entire auscultation site) but such solutions may not be commercially
viable (e.g. due to size and handling) and no marketed devices are identified demonstrating
such principles.
On top of isolated mechanical and acoustical approaches however, amplification and filtering
may help to attenuate unwanted ambient noises and then of course there exist active noise
cancelling approaches to cancel such ambient noise, e.g. based on the auscultation sensor
input combined with an alternative stethoscope microphone not subject to body sounds.
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 13 of 16
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
The reason for such delay lies in the fact, that for robust and efficient software execution the
digital data will typically not be processed sample by sample but in fact processed in package
structure whereby (at minimum) the delay of ‘filling’ one package will be induced onto the
audio stream. Moreover, if the signal processing requires any minimum number of samples in
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 14 of 16
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
order to analyze specific frequency contents etc. this may in itself tie up a much higher
number of samples before reproducing over the stethoscope speaker.
Figure 11 Basic configuration for a buffered digital audio system reading multiple
samples for processing
While under normal (e.g. music audio) any added delay may not represent any problem it
might when performing auscultation as the audio pathway is not the only transmission of
signal from the patient body to the physician’s ears. When resting his and on the patient chest
the physician may feel a heartbeat which then is mentally registered, the associated sound
from that heartbeat will however only arrive following the processing delay as illustrated
below.
In rough terms the human brain will not be able to isolate two such signals, thereby
disconnecting the tactile event from the parallel audio signal, for delays below ~30-50 ms and
it should always be ensured that delays are below 100 ms as longer periods not only will be
noticeable but also disturbing to the physician. Such limited delays should normally be easy to
maintain in confined and integrated digital stethoscope systems.
However, if instead of relying on wired connection between system peripherals e.g. the
connection between sensor and DSP, the DSP and headset or both are to be implemented by
wireless interface (e.g. Bluetooth) then yet additional delay builds onto the resulting overall
delay from body to ear. In that case it should preferably be investigated clinically if the
resulting usability (validated safe and efficacious use) is still maintained and comparable to
e.g. a normal conventional stethoscope.
In supplement to these aspects, if based on the signal present inside a digital confined
stethoscope system enabling a wireless link to an outside device, e.g. a mobile App, for visual
graphical signal representation then effects similar to the tactile-audio issue may again
become relevant. In this situation however, the delay over wireless will likely be closer to the
direct sound reproduction through the stethoscope and hence disturbing delay might not be
excessive or even disturbing to the effective use.
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 15 of 16
Andersen BK CONFIDENTIAL
Introduction to electronic stethoscope acoustics and signal processing
3 References
[1] Kaniusas E.: ‘Fundamentals of Biosignals’ - Biomedical signals and sensors I, Biomedical
and Medical Physics, Biomedical Engineering, Springer 2012
[2] Bohadana A. et al: ‘Fundamentals of Lung auscultation’ - The new England Journal of
Medicine 370;8, February 20, 2014
[3] Vermarien H. and Vollenhoven E.: ‘The recording of heart vibrations: a problem of
vibration measurement on soft tissue’. Medical & Biological Engineering & Computing,
1984.
[4] Andersen BK.: ‘Transducer for Bioacoustic signals’ – Patent US7593534B2
File name: BUA1921d Introduction to electronic stethoscopes acoustics and signal processing Page 16 of 16