Olink Data Normalization White Paper v2.0
Olink Data Normalization White Paper v2.0
Immuno/incubation controls
Extension controls
Detection control
Fig 1. Main steps and controls in a PEA assay with qPCR readout.
Sample controls Plate QC
There are six required and two recommended external controls The internal controls are also used in plate QC. This assesses the
that are added to separate wells on the plate. variation over the plate for each of Incubation Control 1 and 2
and the Detection Control. If the variation for one of the controls
Inter-plate controls
Inter-plate Control (IPC) is included in triplicate on each plate and these is too large, the entire plate is considered unreliable.
are run as normal samples. The IPC is a pool of 92 antibodies, each with
one pair of unique DNA-tags positioned in fixed proximity and can be
seen as a synthetic sample, expected to give a high signal for all assays. Normalization using inter-plate controls
The median of the IPC triplicates is used to normalize each assay, to
compensate for potential variation between runs and plates. Calculation of NPX
Negative controls
Olink uses an arbitrary, relative quantification unit called
Negative Control is also included in triplicate on each plate and consists of
buffer run as a normal sample. These are used to monitor any background Normalized Protein Expression (NPX). In qPCR, the x-axis value
noise generated when DNA-tags come in close proximity without prior of the point where the reaction curve intersects the threshold line
binding to the appropriate protein. The negative controls set the background is called the Ct, or “threshold cycle.” This indicates the number
levels for each protein assay and are used to calculate the limit of detection.
of cycles needed for the signal to surpass the fluorescent signal
Sample controls threshold line. NPX is derived from the Ct values obtained from
When running more than 90 samples, it is recommended to run a pooled
plasma sample in duplicate. These are used to assess potential variation the qPCR using the following equations:
between runs and plates, for example to calculate inter-assay and intra-
Extension Control:
assay CV.
CtAnalyte – CtExtension Control = dCtAnalyte
Inter-plate Control:
dCtAnalyte – dCtInter-plate Control = ddCtAnalyte
TERMINOLOGY
The correction factor is calculated by Olink during the validation
of the panels. The value is pre-determined, using Negative Control,
and used to invert the scale so that a higher value corresponds to
Fig 2. Recommended control setup for a PEA run.
a higher signal. It is also used to ensure that background levels are
approximately zero.
Sample QC
Each of the internal controls is spiked into the samples at the same
concentration. The signals for these are therefore expected to be the NPX is a relative quantification unit logarithmically related to
same over the plate. Sample QC is performed using the Detection protein concentration. Even if two different proteins have the
Control and Incubation Control 2, on the initially calculated NPX same NPX values, their absolute concentrations may differ.
values. Within each plate, the levels of these controls are monitored NPX should be compared for each assay separately between
for each sample and compared against the median of all samples. samples within a run. NPX should not be compared between
If either of the controls differ by more than the acceptance criteria runs without proper inter-plate normalization due to the risk of
±0.3 NPX, the sample is considered deviant and fails QC. Deviating falsely interpreting shifts in median between runs as a biological
values for the internal controls can be caused by factors such as difference. However, relative differences in NPX can be compared
errors in pipetting or pre-analytical variations in the samples that more easily, often without additional normalization steps.
affect the performance, for example matrix effects.
TERMINOLOGY
Matrix effects are defined as changes in the analytical readout that
can be caused by all other sample components except the specific
analyte to be quantified.
2
Example curves Normalized against IPCs
To improve inter-assay repeatability, the data is then normalized
Raw data
against the IPCs. This is done per assay and plate and may
Below is an example of raw data for three runs with a calibrator
be followed by intensity or reference sample normalization
curve. There are deviating samples in the middle, and one curve
depending on the study characteristics.
has a parallel shift.
dCtAnalyte – dCtInter-plate Control = ddCt
Experiment 1 Experiment 2 Experiment 3
22 14
20 12
18 10
16 8
14 6
Ct
ddCt
12 4
10 2
8 0
6 -2
4 -4
0.15 0.6 2.4 9.2 37 147 588 2.35 9.4 37.7 151 600 ... IPC
-6
(pg/mL) (ng/mL) 0.15 0.6 2.4 9.2 37 147 588 2.35 9.4 37.7 151 600 ... IPC
Samples
(pg/mL) (ng/mL)
Samples
Fig 3. Raw data for three runs.
Fig 5. Adjusted against IPCs.
Adjusted against Extension Control
The raw data is then adjusted against the Extension Control per Adjusted against correction factor
sample to improve intra-assay repeatability by reducing technical The curve is then inverted so that a high NPX value means high
variation introduced in the extension step. protein concentration which allows more intuitive interpretation.
A difference of 1 NPX approximates to a doubling of the protein
CtAnalyte – CtExtension Control = dCtAnalyte
concentration regardless of protein.
Experiment 1 Experiment 2 Experiment 3 Correction factor – ddCt = NPX
14
Experiment 1 Experiment 2 Experiment 3
12
10 16
8 14
6 12
dCt
4 10
2 8
NPX
0
6
-2
4
-4
2
-6
0.15 0.6 2.4 9.2 37 147 588 2.35 9.4 37.7 151 600 ... IPC 0
(pg/mL) (ng/mL) -2
Samples
0.15 0.6 2.4 9.2 37 147 588 2.35 9.4 37.7 151 600 ... IPC
3
Inter-plate normalization methods Intensity normalization method
The assumption behind intensity normalization is that on
Olink recommends one of two normalization methods depending
average, there is no expected difference between the median
on the study design. For randomized studies, IPC normalization
signal for an assay on one plate compared to another. If any such
should be replaced with intensity normalization, which will
difference is seen between plates, it can be construed as technical
increase statistical power and reduce technical variation between
bias and be safely removed, resulting in more comparable values.
plates and projects. Intensity normalization can also be used
when combining studies from different sources as long as the Olink uses intensity normalization, with the median as the
sample distribution can be considered comparable between the normalization factor, when samples are randomized across plates
data sets. and projects. The intensity normalization adjusts the data so that
the median value for an assay on each plate is equal to the overall
For non-randomized studies, Olink recommends reference sample
median across all plates. This method assumes that the actual
normalization. Running reference samples on all plates is a good
median of each plate is the same. The way to ensure this is to
strategy to minimize technical variation. When applied correctly,
randomize the samples beforehand. If complete randomization
both intensity normalization and reference sample normalization
can be assumed, this is a robust and high-performing
can increase the power in a given study by reducing technical
normalization method. Intensity normalization is performed in
variation, since they are based on real samples in contrast to the
the following way:
IPC samples.
Step Description
1 For each assay, calculate the overall median value for all samples
TERMINOLOGY
and plates.
Randomization in this context applies to the sample placement
across the plates. A sample set is randomized if the relevant 2 For each plate and assay, calculate the plate specific median value.
experimental variables can be considered evenly distributed across 3 For each assay, subtract the plate specific median from every value
plates. The variables to consider must be decided for each study. for the plates (equals centralizing to median 0).
They can for example include study groups, treatment, time points
4 For each assay, add the overall median value (equals centralizing to
or demographics. If the randomization is not appropriate with
the overall median).
regards to experimental variables, the normalization might remove
true biological variation that otherwise could have been identified.
4
Evaluation
To evaluate how effective a normalization method is, Olink investigates the NPX distributions and
compares the average %CV for different normalization methods.
Plate 1
Plate 2
Plate 3
4
NPX
Plate 1
Plate 2
Plate 3
4
NPX
Fig 8. NPX across plates for each protein after intensity normalization.
In properly randomized studies, intensity normalization typically improves average %CV by a few percent.
5
Reference sample normalization evaluation
In the following example, the samples are poorly randomized, and the plate in the middle contains mostly
samples from one group. This means that technical variation between plates is mixed with the biological
variation that can be investigated by comparing the sample groups. In this case, intensity normalization of
the plates would remove some of the relevant biological variation when correcting for technical variation
between plates. The illustration shows the NPX values of the proteins in poorly randomized samples and
the NPX values of the reference samples. The reference samples are shown in purple.
Plate 1
Plate 2
Plate 3
Reference
4
NPX
Fig 9. NPX across plates for each protein including reference samples.
The reference samples are then used for normalization. After reference sample normalization, the median
NPX can still differ between plates, reflecting different compositions of sample groups.
Plate 1
Plate 2
Plate 3
4 Reference
NPX
Fig 10. NPX across plates for each protein after reference sample normalization.
6
Misinterpreted data
To further illustrate issues with improper normalization, the image below gives an example of the bias
introduced into a dataset when normalization factors are driven by variance introduced by biological
factors, and not simply by inappropriate sample preparation or instrument-based variability. The
illustration shows a poorly randomized set of samples where the difference between the two groups is
statistically significant (p<0.05). In this case, intensity normalization would remove that difference.
4
NPX
The second plate contains samples from mainly one group. Here the technical variable (plate) is mixed
with the biological variable (sample group), and intensity normalization cannot separate between the
two, as shown in the following image.
4
NPX
4
NPX
7
Olink can help
Normalization is important to remove systematic variation, but needs to be applied carefully to minimize
the risk of removing true biological variation. As a service, Olink can discuss normalization approaches
and study design with you before analysis starts.
If you send your samples to Olink's Analysis Service team, the data analysis steps will be taken care of.
They will provide you with a comprehensive NPX data and QC report after your study is completed.
If more extensive data analysis services are required, our biostatisticians in the Olink Data Science team
can help you with customized statistical analysis.
www.olink.com
For Research Use Only. Not for Use in Diagnostic Procedures.
This product includes a license for non-commercial use of Olink products. Commercial users may require additional licenses. Please contact Olink Proteomics AB
for details. There are no warranties, expressed or implied, which extend beyond this description. Olink Proteomics AB is not liable for property damage, personal
injury, or economic loss caused by this product.
The following trademark is owned by Olink Proteomics AB: Olink®.
This product is covered by several patents and patent applications available at https://www.olink.com/patents/.
Components in the Olink® Target 96 Probe Kits utilize Thunder-Link technology and are provided under license from Expedeon Ltd.
© Copyright 2018-2021 Olink Proteomics AB. All third party trademarks are the property of their respective owners.
Olink Proteomics, Dag Hammarskjölds väg 52B , SE-752 37 Uppsala, Sweden
1096, v2.0, 2021-04-08