Memory management 101811_0100_00
Version 1.0

Memory management
Learn the architecture guide
Copyright © 2019 Arm Limited (or its affiliates). All rights reserved.
Release Information
Document History

Version Date Confidentiality Change

1.0 20 June 2019 Non-Confidential First release

Non-Confidential Proprietary Notice

This document is protected by copyright and other related rights and the practice or implementation of the information
contained in this document may be protected by one or more patents or pending patent applications. No part of this document
may be reproduced in any form by any means without the express prior written permission of Arm. No license, express or
implied, by estoppel or otherwise to any intellectual property rights is granted by this document unless specifically stated.

Your access to the information in this document is conditional upon your acceptance that you will not use or permit others to use
the information for the purposes of determining whether implementations infringe any third party patents.

THIS DOCUMENT IS PROVIDED “AS IS”. ARM PROVIDES NO REPRESENTATIONS AND NO WARRANTIES, EXPRESS,
IMPLIED OR STATUTORY, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF MERCHANTABILITY,
SATISFACTORY QUALITY, NON-INFRINGEMENT OR FITNESS FOR A PARTICULAR PURPOSE WITH RESPECT TO THE
DOCUMENT. For the avoidance of doubt, Arm makes no representation with respect to, and has undertaken no analysis to
identify or understand the scope and content of, patents, copyrights, trade secrets, or other rights.

This document may include technical inaccuracies or typographical errors.

TO THE EXTENT NOT PROHIBITED BY LAW, IN NO EVENT WILL ARM BE LIABLE FOR ANY DAMAGES, INCLUDING
WITHOUT LIMITATION ANY DIRECT, INDIRECT, SPECIAL, INCIDENTAL, PUNITIVE, OR CONSEQUENTIAL DAMAGES,
HOWEVER CAUSED AND REGARDLESS OF THE THEORY OF LIABILITY, ARISING OUT OF ANY USE OF THIS
DOCUMENT, EVEN IF ARM HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.

This document consists solely of commercial items. You shall be responsible for ensuring that any use, duplication or disclosure
of this document complies fully with any relevant export laws and regulations to assure that this document or any portion
thereof is not exported, directly or indirectly, in violation of such export laws. Use of the word “partner” in reference to Arm’s
customers is not intended to create or refer to any partnership relationship with any other company. Arm may make changes to
this document at any time and without notice.

If any of the provisions contained in these terms conflict with any of the provisions of any click through or signed written
agreement covering this document with Arm, then the click through or signed written agreement prevails over and supersedes
the conflicting provisions of these terms. This document may be translated into other languages for convenience, and you agree
that if there is any conflict between the English version of this document and any translation, the terms of the English version of
the Agreement shall prevail.

The Arm corporate logo and words marked with ® or ™ are registered trademarks or trademarks of Arm Limited (or its
subsidiaries) in the US and/or elsewhere. All rights reserved. Other brands and names mentioned in this document may be the
trademarks of their respective owners. Please follow Arm’s trademark usage guidelines at
http://www.arm.com/company/policies/trademarks.

Arm Limited. Company 02557590 registered in England.

110 Fulbourn Road, Cambridge, England CB1 9NJ.

LES-PRE-20349

Confidentiality Status
This document is Non-Confidential. The right to use, copy and disclose this document may be subject to license restrictions in
accordance with the terms of the agreement entered into by Arm and the party that Arm delivered this document to.

Unrestricted Access is an Arm internal classification.

Product Status
The information in this document is Final, that is for a developed product.

Web Address
https://developer.arm.com


Contents
1 Overview

2 What is memory management?
2.1. Why is memory management needed?

3 Virtual and physical addresses

4 The Memory Management Unit (MMU)
4.1. Table entry
4.2. Table lookup
4.3. Multilevel translation

5 Address spaces in Armv8-A
5.1. Address sizes
5.1.1 Size of virtual addresses
5.1.2 Size of physical addresses
5.1.3 Size of intermediate physical addresses
5.2. Address Space Identifiers - Tagging translations with the owning process
5.3. Virtual Machine Identifiers - Tagging translations with the owning VM
5.4. Common not Private

6 Controlling address translation
6.1. Translation table format

7 Translation granule
7.1. The starting level of address translation
7.2. Registers that control address translation
7.3. MMU disabled

8 Translation Lookaside Buffer maintenance
8.1. Format of a TLB operation

9 Address translation instructions

10 Check your knowledge

11 Related information

12 Next steps


1 Overview
This guide introduces memory translation in Armv8-A, which is key to memory management. It explains how virtual addresses
are translated to physical addresses, the translation table format, and how software manages the Translation Lookaside Buffers
(TLBs).

This information is useful for anyone who is developing low-level code, such as boot code or drivers. It is particularly relevant to
anyone who is writing code to set up or manage the Memory Management Unit (MMU).

At the end of this guide, you can check your knowledge. You will have learned how a virtual address is translated to a physical
address. You will be able to name the different address spaces, and describe how the address spaces map onto the stages of
translation. You will also have learned when software must perform TLB maintenance, and the syntax of TLB maintenance
commands.

2 What is memory management?

Memory management describes how access to memory in a system is controlled. The hardware performs memory management
every time that memory is accessed by either the OS or applications. Memory management is a way of dynamically allocating
regions of memory to applications.

2.1. Why is memory management needed?

Application processors are designed to run a rich OS, such as Linux, and to support virtual memory systems. Software that
executes on the processor only sees virtual addresses, which the processor translates into physical addresses. These physical
addresses are presented to the memory system and point to the actual physical locations in memory.

3 Virtual and physical addresses

The benefit of using virtual addresses is that it allows management software, such as an Operating System (OS), to control the
view of memory that is presented to software. The OS can control what memory is visible, the virtual address at which that
memory is visible, and what accesses are permitted to that memory. This allows the OS to sandbox applications (hiding the
resources of one application from another application) and to provide abstraction from the underlying hardware.

One benefit of using virtual addresses is that an OS can present multiple fragmented physical regions of memory as a single,
contiguous virtual address space to an application.

Virtual addresses also benefit software developers, who will not know a system's exact memory addresses when writing their
application. With virtual addresses, software developers do not need to concern themselves with the physical memory. The
application knows that it is up to the OS and the hardware to work together to perform the address translation.

In practice, each application can use its own set of virtual addresses that will be mapped to different locations in the physical
system. As the operating system switches between different applications it re-programs the map. This means that the virtual
addresses for the current application will map to the correct physical location in memory.

Virtual addresses are translated to physical addresses through mappings. The mappings between virtual addresses and physical
addresses are stored in translation tables (sometimes referred to as page tables) as this diagram shows:

Translation tables are in memory and are managed by software, typically an OS or hypervisor. The translation tables are not
static, and the tables can be updated as the needs of software change. This changes the mapping between virtual and physical
addresses.

4 The Memory Management Unit (MMU)

The Memory Management Unit (MMU) performs translations.

The MMU contains the following:

• The table walk unit, which contains logic that reads the translation tables from memory.
• Translation Lookaside Buffers (TLBs), which cache recently used translations.

All memory addresses that are issued by software are virtual. These memory addresses are passed to the MMU, which checks
the TLBs for a recently used cached translation. If the MMU does not find a recently cached translation, the table walk unit reads
the appropriate table entry, or entries, from memory, as shown here:

A virtual address must be translated to a physical address before a memory access can take place (because we must know which
physical memory location we are accessing). This need for translation also applies to cached data, because on Armv6 and later
processors, the data caches store data using the physical address (addresses that are physically tagged). Therefore, the address
must be translated before a cache lookup can complete.

Note: Architecture is a behavioral specification. The caches must behave as if they are physically tagged. An implementation
might do something different, as long as this is not software-visible.
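
The lookup order described above (check the TLBs first, fall back to a table walk on a miss) can be sketched as a small Python model. This is an illustration only, not how any implementation is built: the class name ToyMMU, the 4KB block size, the dictionary-based table, and the example addresses are all assumptions made for the sketch.

```python
PAGE_SHIFT = 12  # assumption for this model: 4KB blocks


class ToyMMU:
    """Toy model: a TLB in front of a single-level translation table."""

    def __init__(self, table):
        self.table = table   # virtual block number -> physical block base
        self.tlb = {}        # cached translations, keyed the same way
        self.walks = 0       # number of times the table walk was needed

    def translate(self, va):
        block = va >> PAGE_SHIFT
        offset = va & ((1 << PAGE_SHIFT) - 1)
        if block not in self.tlb:                 # TLB miss
            self.walks += 1                       # table walk reads memory
            if block not in self.table:
                raise LookupError(f"translation fault at {va:#x}")
            self.tlb[block] = self.table[block]   # cache the translation
        return self.tlb[block] + offset


# Hypothetical mapping: virtual block 0x8 -> physical 0x4000_0000.
mmu = ToyMMU({0x8: 0x4000_0000})
first = mmu.translate(0x8010)    # miss, so the table is walked
second = mmu.translate(0x8FF0)   # same block, served from the TLB
```

After both accesses, mmu.walks is 1, because the second access reuses the cached translation.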

4.1. Table entry

The translation tables work by dividing the virtual address space into equal-sized blocks and by providing one entry in the table
per block.

Entry 0 in the table provides the mapping for block 0, entry 1 provides the mapping for block 1, and so on. Each entry contains
the address of a corresponding block of physical memory and the attributes to use when accessing the physical address.

4.2. Table lookup

A table lookup occurs when a translation takes place. When a translation happens, the virtual address that is issued by the
software is split in two, as shown in this diagram:

This diagram shows a single-level lookup.

The upper-order bits, which are labeled 'Which entry' in the diagram, are used as an index into the table and select which
entry to read. This entry contains the physical address of the block that the virtual address maps to.

The lower-order bits, which are labeled 'Offset in block' in the diagram, are an offset within that block and are not changed by
the translation.
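
As a rough sketch of the split described above, the following Python model separates a virtual address into a table index and an offset, then uses the index to find the output block. The 4KB block size, the function names, and the table contents are assumptions chosen for the example.

```python
BLOCK_SHIFT = 12  # assumption: 4KB blocks, so bits [11:0] are the offset


def split_va(va):
    """Return (table index, offset in block) for a virtual address."""
    index = va >> BLOCK_SHIFT                  # upper-order bits: which entry
    offset = va & ((1 << BLOCK_SHIFT) - 1)     # lower-order bits: unchanged
    return index, offset


def lookup(table, va):
    """Translate va using a list with one output block address per entry."""
    index, offset = split_va(va)
    return table[index] + offset


# Hypothetical table: virtual block 0 -> PA 0x9000, block 1 -> PA 0x3000.
table = [0x9000, 0x3000]
pa = lookup(table, 0x1234)   # index 1, offset 0x234
```

Only the upper bits change: the offset 0x234 is carried straight through into the physical address.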

4.3. Multilevel translation

In a single-level lookup, the virtual address space is split into equal-sized blocks. In practice, a hierarchy of tables is used.

The first table (Level 1 table) divides the virtual address space into large blocks. Each entry in this table can point to an equal-
sized block of physical memory or it can point to another table which subdivides the block into smaller blocks. We call this type of
table a 'multilevel table'. Here we can see an example of a multilevel table that has three levels:

In Armv8-A, the maximum number of levels is four, and the levels are numbered 0 to 3. This multilevel approach allows both
larger blocks and smaller blocks to be described. The characteristics of large and small blocks are as follows:

• Large blocks require fewer levels of reads to translate than small blocks. Plus, large blocks are more efficient to cache in
the TLBs.
• Small blocks give software fine-grain control over memory allocation. However, small blocks are less efficient to cache in
the TLBs, because each cached entry covers less memory, and they require more levels of reads to translate.

To manage this trade-off, an OS must balance the efficiency of using large mappings against the flexibility of using smaller
mappings for optimum performance.

Note: The processor does not know the size of the translation when it starts the table lookup. The processor works out the size
of the block that is being translated by performing the table walk.
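
The following Python sketch models a multilevel walk and shows how the block size only becomes known when the walk reaches a block entry. It assumes a 4KB granule (9 index bits per level), starts at level 1, and represents each table as a dictionary. The entry labels "table" and "block" and all addresses are invented for the illustration.

```python
GRANULE_SHIFT = 12    # assumption: 4KB granule
BITS_PER_LEVEL = 9    # 512 entries per table with a 4KB granule
LAST_LEVEL = 3


def level_shift(level):
    """Lowest virtual address bit used to index a table at this level."""
    return GRANULE_SHIFT + (LAST_LEVEL - level) * BITS_PER_LEVEL


def walk(table, va, level=1):
    """Walk from `level`; return (physical address, block size in bytes).

    Each table is a dict: index -> ("table", next_table) or ("block", base).
    A missing index raises KeyError, which stands in for a translation fault.
    """
    while True:
        shift = level_shift(level)
        index = (va >> shift) & ((1 << BITS_PER_LEVEL) - 1)
        kind, value = table[index]
        if kind == "block":
            size = 1 << shift                # size is discovered here
            return value + (va & (size - 1)), size
        if level == LAST_LEVEL:
            raise ValueError("level 3 cannot point to another table")
        table = value                        # descend to the next level
        level += 1


# Hypothetical tables: one 4KB page and one 2MB block.
level3 = {0x8: ("block", 0x8000_0000)}
level2 = {0: ("table", level3), 1: ("block", 0x4020_0000)}
level1 = {0: ("table", level2)}
```

With these tables, address 0x8010 needs reads at three levels and resolves to a 4KB block, while address 0x201234 stops one level earlier at a 2MB block.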

5 Address spaces in Armv8-A

There are several independent virtual address spaces in Armv8-A. This diagram shows these virtual address spaces:

The diagram shows three virtual address spaces:

• NS.EL0 and NS.EL1 (Non-secure EL0/EL1).

• NS.EL2 (Non-secure EL2).
• EL3.

Each of these virtual address spaces is independent, and has its own settings and tables. We often call these settings and tables
'translation regimes'. There are also virtual address spaces for Secure EL0, Secure EL1 and Secure EL2, but they are not shown
in the diagram.

Note: Support for Secure EL2 was added in Armv8.4-A.

Because there are multiple virtual address spaces, it is important to specify which address space an address is in. For example,
NS.EL2:0x8000 refers to the address 0x8000 in the Non-secure EL2 virtual address space.

The diagram also shows that the virtual addresses from Non-secure EL0 and Non-secure EL1 go through two sets of tables.
These tables support virtualization and allow the hypervisor to virtualize the view of physical memory that is seen by a virtual
machine (VM).

In virtualization, we call the set of translations that are controlled by the OS Stage 1. The Stage 1 tables translate virtual
addresses to intermediate physical addresses (IPAs). The OS treats the IPAs as if they were physical addresses. However,
the hypervisor controls a second set of translations, which we call Stage 2. This second set of translations translates IPAs to
physical addresses. This diagram shows how the two sets of translations work:

Although there are some minor differences in the table format, the process of Stage 1 and Stage 2 translation is usually the same.
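
A minimal Python sketch of the two stages, assuming 4KB pages and single-level dictionary tables in both stages. The function names and all addresses are hypothetical. The model also simplifies by translating only the final address, whereas in hardware the Stage 1 table reads are themselves subject to Stage 2.

```python
PAGE_SHIFT = 12  # assumption: 4KB pages in both stages


def translate(table, addr):
    """Single lookup: page number -> output page base, offset unchanged."""
    page = addr >> PAGE_SHIFT
    offset = addr & ((1 << PAGE_SHIFT) - 1)
    return table[page] + offset   # KeyError stands in for a translation fault


def two_stage(stage1, stage2, va):
    """VA -> IPA with the guest OS tables, then IPA -> PA with the hypervisor's."""
    ipa = translate(stage1, va)
    return translate(stage2, ipa)


stage1 = {0x8: 0x4_0000}        # guest OS: VA 0x8000 -> IPA 0x40000
stage2 = {0x40: 0x8000_0000}    # hypervisor: IPA 0x40000 -> PA 0x8000_0000
pa = two_stage(stage1, stage2, 0x8004)
```

The guest OS only ever sees the Stage 1 table, so it cannot tell that IPA 0x40000 is backed by a different physical address.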

Note: At Arm, we use the address 0x8000 in many of our examples. 0x8000 is also the default address for linking with the Arm
linker, armlink. The address comes from an early microcomputer, the BBC Micro Model B, which had ROM (and sideways RAM)
at the address 0x8000. The BBC Micro Model B was built by a company called Acorn, which developed the Acorn RISC Machine
(ARM), and later became Arm.

5.1. Address sizes

Armv8-A is a 64-bit architecture, but this does not mean that all addresses are 64-bit.

5.1.1 Size of virtual addresses

Virtual addresses are stored in a 64-bit format. As a result, the address in load instructions (LDR) and store instructions (STR) is
always specified in an X register. However, not all of the addresses in the X register are valid.

This diagram shows the layout of the virtual address space in AArch64:

There are two regions for the EL0/EL1 virtual address space: kernel space and application space. These two regions are shown
on the left-hand side of the diagram, with kernel space at the top, and application space, which is labeled 'User space', at the
bottom of the address space. Kernel space and user space have separate translation tables and this means that their mappings
can be kept separate.

There is a single region at the bottom of the address space for all other Exception levels. This region is shown on the right-hand
side of the diagram and is the box with no text in it.

Note: Setting HCR_EL2.E2H to 1 enables a configuration where a host OS runs in EL2, and the applications of the host OS
run in EL0. In this scenario, EL2 also has an upper and a lower region.

Each region of address space has a size of up to 2^52 bytes. However, each region can be independently shrunk to a smaller size.
The TnSZ fields in the TCR_ELx registers control the size of the virtual address space. For example, this diagram shows that
TCR_EL1 controls the EL0/EL1 virtual address space:

The virtual address size is encoded as:

virtual address size in bytes = 2^(64 - TCR_ELx.TnSZ)

The virtual address size can also be expressed as a number of address bits:

Number of address bits = 64 – TnSZ

Therefore, if TCR_EL1.T1SZ is set to 32, the size of the kernel region in the EL0/EL1 virtual address space is 2^32 bytes
(0xFFFF_FFFF_0000_0000 to 0xFFFF_FFFF_FFFF_FFFF). Any access to an address that is outside of the configured range or
ranges generates an exception, reported as a translation fault. The advantage of this configuration is that we only need to
describe as much of the address space as we want to use, which saves time and space. For example, imagine that the OS kernel
needs 1GB of address space (30-bit address size) for its kernel space. If the OS sets T1SZ to 34, then only the translation table
entries to describe 1GB are created, as 64 - 34 = 30.
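
The size arithmetic above can be checked with a short Python sketch. The function names are invented for the example, and the model only covers the EL0/EL1 case with a lower region sized by T0SZ and an upper region sized by T1SZ. It ignores every other control that affects address checking.

```python
def region_bits(tnsz):
    """Number of address bits in a region: 64 - TnSZ."""
    return 64 - tnsz


def region_size(tnsz):
    """Region size in bytes: 2^(64 - TnSZ)."""
    return 1 << region_bits(tnsz)


def classify(va, t0sz, t1sz):
    """Return 'lower', 'upper', or 'fault' for a 64-bit virtual address."""
    if va < region_size(t0sz):
        return "lower"                       # user space region
    if va >= (1 << 64) - region_size(t1sz):
        return "upper"                       # kernel space region
    return "fault"                           # outside both configured ranges


kernel_bytes = region_size(34)   # 30 address bits, that is 1GB
where = classify(0xFFFF_FFFF_0000_0000, t0sz=32, t1sz=32)
```

With T1SZ set to 32, the first address of the upper region is 0xFFFF_FFFF_0000_0000, and the address immediately below it falls in the gap between the two regions.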

Note: All Armv8-A implementations support 48-bit virtual addresses. Support for 52-bit virtual addresses is optional and
reported by ID_AA64MMFR2_EL1. At the time of writing, none of the Arm Cortex-A processors support 52-bit virtual
addresses.

5.1.2 Size of physical addresses

The size of a physical address is IMPLEMENTATION DEFINED, up to a maximum of 52 bits. The ID_AA64MMFR0_EL1 register
reports the size that is implemented by the processor. For Arm Cortex-A processors, this will usually be 40 bits or 44 bits.

Note: In Armv8.0-A, the maximum size for a physical address is 48 bits. This was extended to 52 bits in Armv8.2-A.

5.1.3 Size of intermediate physical addresses

If you specify an output address in a translation table entry that is larger than the implemented maximum, the Memory
Management Unit (MMU) will generate an exception as an address size fault.

The size of the IPA space can be configured in the same way as the virtual address space. VTCR_EL2.T0SZ controls the size. The
maximum size that can be configured is the same as the physical address size that is supported by the processor. This means that
you cannot configure a larger IPA space than the supported physical address space.

5.2. Address Space Identifiers - Tagging translations with the owning process
In many modern OSs, all applications seem to run from the same address region, which is what we have described as user
space. In practice, different applications require different mappings. This means, for example, that the translation for VA 0x8000
depends on which application is currently running.

Ideally, we would like the translations for different applications to coexist within the Translation Lookaside Buffers (TLBs), to
prevent the need for TLB invalidates on a context switch. But how would the processor know which version of the VA 0x8000
translation to use? In Armv8-A, the answer is Address Space Identifiers (ASIDs).

For the EL0/EL1 virtual address space, translations can be marked as Global (G) or Non-Global (nG) using the nG bit in the
attributes field of the translation table entry. For example, kernel mappings are Global translations, and application mappings are
Non-Global translations. Global translations apply regardless of which application is currently running. Non-Global translations
apply only to a specific application.

Non-Global mappings are tagged with an ASID in the TLBs. On a TLB lookup, the ASID in the TLB entry is compared with the
currently selected ASID. If they do not match, then the TLB entry is not used. This diagram shows a Global mapping in the kernel
space with no ASID tag and a non-Global mapping in user space with an ASID tag:

The diagram shows that TLB entries for multiple applications are allowed to coexist in the cache, and the ASID determines which
entry to use.

The ASID is stored in one of the two TTBRn_EL1 registers. Usually TTBR0_EL1 is used for user space. As a result, a single
register update can change both the ASID and the translation table that it points to.
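
The ASID matching rule can be sketched in Python as follows. The TlbEntry type, its field names, and the page numbers are hypothetical. A Global entry is modeled as having no ASID tag, so it matches whichever ASID is current.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class TlbEntry:
    va_page: int
    pa_page: int
    asid: Optional[int]   # None models a Global entry (no ASID tag)


def tlb_lookup(tlb, va_page, current_asid):
    """Return the output page for va_page, or None on a TLB miss."""
    for entry in tlb:
        if entry.va_page != va_page:
            continue
        # Global entries always match; Non-Global entries need the same ASID.
        if entry.asid is None or entry.asid == current_asid:
            return entry.pa_page
    return None


# Two applications map the same virtual page differently; the kernel page is Global.
tlb = [
    TlbEntry(va_page=0x8, pa_page=0x40000, asid=5),
    TlbEntry(va_page=0x8, pa_page=0x90000, asid=7),
    TlbEntry(va_page=0xFFFF0, pa_page=0x100, asid=None),
]
```

Looking up page 0x8 returns a different output page for ASID 5 and for ASID 7, and misses for any other ASID, without any entry being invalidated.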

Note: ASID tagging is also available in EL2, when HCR_EL2.E2H==1.

5.3. Virtual Machine Identifiers - Tagging translations with the owning VM

EL0/EL1 translations can also be tagged with a Virtual Machine Identifier (VMID). VMIDs allow translations from different VMs to
coexist in the cache. This is similar to the way in which ASIDs work for translations from different applications. In practice, this
means that some translations will be tagged with both a VMID and an ASID, and that both must match for the TLB entry to be
used.

Note: When virtualization is supported for a security state, EL0/EL1 translations are always tagged with a VMID – even if Stage
2 translation is not enabled. This means that if you are writing initialization code and are not using a hypervisor, it is important to
set a known VMID value before setting up the Stage 1 MMU.

5.4. Common not Private

If a system includes multiple processors, do the ASIDs and VMIDs used on one processor have the same meaning on other
processors?

For Armv8.0-A the answer is that they do not have to mean the same thing. There is no requirement for software to use a given
ASID in the same way across multiple processors. For example, ASID 5 might be used by the calculator on one processor and by
the web browser on another processor. This means that a TLB entry that is created by one processor cannot be used by another
processor.

In practice, it is unlikely that software will use ASIDs differently across processors. It is more common for software to use ASIDs
and VMIDs in the same way on all processors in a given system. Therefore, Armv8.2-A introduced the Common not Private (CnP)
bit in the Translation Table Base Register (TTBR). When the CnP bit is set, the software promises to use the ASIDs and VMIDs in
the same way on all processors, which allows the TLB entries that are created by one processor to be used by another.

Note: We have been talking about processors. However, technically, we should be using the term Processing Element (PE). PE is a
generic term for any machine that implements the Arm architecture. It is important here because there are microarchitectural
reasons why sharing TLBs between processors would be difficult. But within a multithreaded processor, where each hardware
thread is a PE, it is much more desirable to share TLB entries.

6 Controlling address translation

6.1. Translation table format
Here we can see the different formats that are allowed for translation table entries:

Note: For purposes of clarity, this diagram does not specify the width of bit fields. You can find this information in the Arm
Architecture Reference Manual Armv8, for Armv8-A architecture profile: The VMSAv8-64 translation table format descriptors.

Each entry is 64 bits and the bottom two bits determine the type of entry.

Notice that some of the table entries are only valid at specific levels. The maximum number of levels of tables is four, which is
why there is no Table descriptor for level 3 (the fourth level) tables. Similarly, there are no Block descriptors or Page
descriptors for level 0. Because a level 0 entry covers a large region of virtual address space, it does not make sense to allow
blocks.

Note: The encoding for the Table descriptor at levels 0-2 is the same as the Page descriptor at level 3. This encoding allows
'recursive tables', which point back to themselves. This is useful because it makes it easy to calculate the virtual address of a
particular page table entry so that it can be updated.
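
As a sketch of how the bottom two bits and the level together select the entry type, the following Python function follows the simplified rules in this section. The function name and the example descriptor values are invented. The bit values used (bit 0 as the valid bit, 0b11 for Table or Page, 0b01 for Block) are the VMSAv8-64 encodings, and granule-specific restrictions on which levels allow blocks are deliberately ignored.

```python
def descriptor_type(descriptor, level):
    """Classify a 64-bit translation table entry by its bottom two bits."""
    low = descriptor & 0b11
    if low & 0b01 == 0:
        return "invalid"                      # bit 0 clear: entry not valid
    if low == 0b11:
        # Same encoding: Table at levels 0-2, Page at level 3.
        return "page" if level == 3 else "table"
    # low == 0b01: Block, which this section allows only at levels 1 and 2.
    return "block" if level in (1, 2) else "invalid"


kind_l0 = descriptor_type(0x4000_0003, level=0)
kind_l3 = descriptor_type(0x4000_0003, level=3)
```

The same 64-bit value is read as a Table descriptor at level 0 and as a Page descriptor at level 3, which is the property that makes recursive tables possible.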

7 Translation granule
A translation granule is the smallest block of memory that can be described. Nothing smaller can be described, only larger blocks,
which are multiples of the granule.

Armv8-A supports three different granule sizes: 4KB, 16KB, and 64KB.

The granule sizes that a processor supports are IMPLEMENTATION DEFINED and are reported by ID_AA64MMFR0_EL1. All
Arm Cortex-A processors support 4KB and 64KB. The selected granule is the smallest block that can be described, and it is
described in the last level of table. Larger blocks can also be described. This table shows the different block sizes for each
level of table based on the selected granule:

Level of   4KB granule               16KB granule              64KB granule
table      Size per    Bits used     Size per    Bits used     Size per    Bits used
           entry       to index      entry       to index      entry       to index
0          512GB       47:39*        128TB       47*           -           -
1          1GB         38:30         64GB        46:36         4TB         51:42
2          2MB         29:21         32MB        35:25         512MB       41:29
3          4KB         20:12         16KB        24:14         64KB        28:16

* There are restrictions on using 52-bit addresses. When the selected granule is 4KB or 16KB, the maximum virtual address
region size is 48 bits. Similarly, output physical addresses are limited to 48 bits. It is only when the 64KB granule is used that the
full 52 bits can be used.
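
The "Size per entry" values in the table can be rederived with a little arithmetic. The sketch below assumes that each table occupies exactly one granule and holds 8-byte (64-bit) entries, so a table has granule/8 entries and each level up multiplies the coverage by that count. The function name size_per_entry is hypothetical, and the model does not check whether a given level exists for a given granule.

```python
def size_per_entry(granule_bytes: int, level: int) -> int:
    """Bytes of virtual address space covered by one entry at the given level."""
    if not 0 <= level <= 3:
        raise ValueError("level must be 0..3")
    entries_per_table = granule_bytes // 8      # 64-bit entries, one granule per table
    # A level 3 entry covers one granule; each level above covers a whole table below it.
    return granule_bytes * entries_per_table ** (3 - level)

KB, MB, GB, TB = 2**10, 2**20, 2**30, 2**40
```

With the 4KB granule this gives 4KB, 2MB, 1GB, and 512GB for levels 3 down to 0. The lowest index bit for a level is the base-2 logarithm of its size, for example bit 21 for a 2MB entry.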

Note: TCR_EL1 has two separate fields that control the granule size for the kernel space and the user space virtual address
ranges. These fields are called TG1 for kernel space and TG0 for user space. A potential problem for programmers is that these
two fields have different encodings.
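
To make the difference in encodings concrete, the following sketch lists the TG0 and TG1 values as we understand them from the TCR_EL1 description in the Arm ARM (TG0 in bits [15:14], TG1 in bits [31:30]). The dictionary and function names are hypothetical, and the values should be checked against the Arm ARM before use.

```python
# Granule size encodings, per the TCR_EL1 description in the Arm ARM.
TG0_ENCODING = {"4KB": 0b00, "64KB": 0b01, "16KB": 0b10}   # TCR_EL1[15:14]
TG1_ENCODING = {"16KB": 0b01, "4KB": 0b10, "64KB": 0b11}   # TCR_EL1[31:30]

def tcr_granule_bits(tg0: str, tg1: str) -> int:
    """Return the TG0 and TG1 fields positioned within a TCR_EL1 value."""
    return (TG1_ENCODING[tg1] << 30) | (TG0_ENCODING[tg0] << 14)
```

Note that selecting the 4KB granule for both ranges requires 0b00 in TG0 but 0b10 in TG1, so copying one field value to the other selects a different granule.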

7.1. The starting level of address translation

Together, the granule and the size of the virtual address space control the starting level of address translation.

The previous table summarized the block size (size of virtual address range covered by a single entry) for each granule at each
level of table. From the block size, you can work out which bits of the virtual address are used to index each level of table.

Let us take the 4KB granule as an example. This diagram shows the bits that are used to index the different levels of table for a
4KB granule:

Imagine that, for a configuration, you set the size of the virtual address space, TCR_ELx.T0SZ, to 32. Then the size of the virtual
address space, in address bits, is calculated as:

64 - T0SZ = 32-bit address space (address bits 31:0)

If we look at the previous 4KB granule diagram again, level 0 is indexed by bits 47:39. With a 32-bit address space you do not
have these bits. Therefore, the starting level of translation for your configuration is level 1.

Next, imagine you set T0SZ to 34:

64 - T0SZ = 30-bit address space (address bits 29:0)

This time, you do not have any of the bits that are used to index the level 0 table or the level 1 table, so the starting level of
translation for your configuration is level 2.

As the previous diagram shows, when the size of the virtual address space reduces, you need fewer levels of tables to describe it.

These examples are based on using the 4KB granule. The same principle applies when using 16KB and 64KB granules, but the
address bits change.
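
The reasoning in this section can be written as a short calculation. The sketch below is an illustrative model rather than an architectural definition: it takes the lowest index bit of each level from the table in section 7 and returns the first level that has at least one index bit inside the configured address space. The names INDEX_LOW_BIT and starting_level are hypothetical, and the model covers a single stage of translation without concatenated tables.

```python
# Lowest virtual address bit used to index each level, per granule,
# taken from the "Bits used to index" columns of the table in section 7.
INDEX_LOW_BIT = {
    "4KB":  {0: 39, 1: 30, 2: 21, 3: 12},
    "16KB": {0: 47, 1: 36, 2: 25, 3: 14},
    "64KB": {1: 42, 2: 29, 3: 16},
}

def starting_level(granule: str, tnsz: int) -> int:
    """Return the starting level of translation for a granule and TnSZ value."""
    va_bits = 64 - tnsz                      # address bits va_bits-1:0 are available
    for level in sorted(INDEX_LOW_BIT[granule]):
        if va_bits > INDEX_LOW_BIT[granule][level]:
            return level                     # first level with an index bit in range
    raise ValueError("address space too small for this granule")
```

For the two examples above, starting_level("4KB", 32) returns 1 and starting_level("4KB", 34) returns 2.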

7.2. Registers that control address translation

Address translation is controlled by a combination of system registers:

• SCTLR_ELx
o M - Enable Memory Management Unit (MMU).
o C - Enable for data and unified caches.
o EE - Endianness of translation table walks.
• TTBR0_ELx and TTBR1_ELx
o BADDR - Physical address (PA) (or intermediate physical address, IPA, for EL0/EL1) of start of translation
table.
o ASID - The Address Space Identifier for Non-Global translations.
• TCR_ELx
o PS/IPS - Size of PA or IPA space, the maximum output address size.
o TnSZ - Size of address space covered by table.
o TGn - Granule size.
o SH/IRGN/ORGN - Cacheability and shareability to be used by MMU table walks.
o EPDn - Disabling of table walks to a specific table.
• MAIR_ELx
o Attr - Controls the Type and cacheability in Stage 1 tables.
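
As a small illustration of how these control bits are read, the sketch below decodes the three SCTLR_ELx fields listed above from a register value. The bit positions (M at bit 0, C at bit 2, EE at bit 25) follow the SCTLR_EL1 description in the Arm ARM; the constant and function names are hypothetical.

```python
SCTLR_M = 1 << 0     # MMU enable
SCTLR_C = 1 << 2     # enable for data and unified caches
SCTLR_EE = 1 << 25   # endianness of translation table walks (1 = big-endian)

def describe_sctlr(value: int) -> dict:
    """Decode the translation-related control bits of an SCTLR_ELx value."""
    return {
        "mmu_enabled": bool(value & SCTLR_M),
        "data_cache_enabled": bool(value & SCTLR_C),
        "big_endian_walks": bool(value & SCTLR_EE),
    }
```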

7.3. MMU disabled

When the MMU is disabled for a stage of translation, all addresses are flat-mapped. Flat mapping means that the input and
output addresses are the same.

8 Translation Lookaside Buffer maintenance
The Translation Lookaside Buffers (TLBs) cache recently used translations. This caching allows the translations to be reused by
subsequent lookups without needing to reread the tables.

Note: The TLBs are caches of translations, not caches of the translation tables. The difference is subtle. Several register fields
control how the translation table entries are interpreted. What is in a TLB entry is the interpretation of the translation table
entry given the configuration at the point that the tables were walked. In the Arm Architecture Reference Manual (Arm ARM), such
register fields are described as 'permitted to be cached in a TLB'.

If you change a translation table entry, or the controls that affect how entries are interpreted, then you need to invalidate the
affected entries in the TLB. If you do not invalidate those entries, then the processor might continue to use the old translation.

The processor is not permitted to cache a translation into the TLBs that results in any of the following faults:

• A translation fault (unmapped address).

• An address size fault (address outside of range).
• An access flag fault.

As a result, you do not need to issue a TLB invalidate when mapping an address for the first time. However, you do need to issue
a TLB invalidate if you want to do any of the following:

• Unmap an address
Take an address that was previously valid or mapped and mark it as faulting.

• Change the mapping of an address
Change the output address or any of the attributes. For example, change an address from read-only to read-write
permissions.

• Change the way the tables are interpreted
This is less common. But, for example, if the granule size was changed, then the interpretation of the tables also changes.
Therefore, a TLB invalidate would be necessary.
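
The rules above can be illustrated with a toy software model. This is not how a hardware TLB is implemented; it only assumes two behaviors described in this section: successful translations can be cached, and faulting translations are never cached. The class name ToyTLB and its methods are hypothetical.

```python
class ToyTLB:
    """Toy model of a TLB in front of a dictionary acting as translation tables."""

    def __init__(self, tables: dict):
        self.tables = tables    # input page -> output page (the "translation tables")
        self.cache = {}         # cached translations

    def translate(self, page: int):
        if page in self.cache:              # hit: the tables are not reread
            return self.cache[page]
        result = self.tables.get(page)      # table walk; None models a translation fault
        if result is not None:
            self.cache[page] = result       # faulting walks are never cached
        return result

    def invalidate(self, page: int):
        self.cache.pop(page, None)          # models a TLB invalidate by address
```

In this model, mapping a page for the first time is visible immediately, because the earlier fault was not cached. After a page is unmapped, translate() keeps returning the old output page until invalidate() is called for it.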

8.1. Format of a TLB operation

The TLBI instruction is used to invalidate entries in the TLBs. The syntax of this instruction is:

TLBI <type><level>{IS|OS} {, <Xt>}

Where:

• <type> Which entries to invalidate.
o ALL - All entries
o VA - Entry matching VA and ASID in Xt
o VAA - Entry matching VA in Xt, for any ASID
o ASID - Any entry matching the ASID in Xt
o and many more
• <level> Which address space to operate on.
o E1 = EL0/1 virtual address space
o E2 = EL2 virtual address space
o E3 = EL3 virtual address space
• <IS|OS> Whether operation is Inner Shareable or Outer Shareable.
o When IS is added to the operation, it is broadcast to the other cores in the Inner Shareable domain.
o When OS is added to the operation, it is broadcast to the other cores in the Outer Shareable domain (added in
Armv8.4-A).
• <Xt> Which address or ASID to operate on.
o Only used for operations by address or ASID.

Consider, for example, an OS that is updating an entry in its kernel translation tables. A typical TLB invalidate sequence would
look like this:

STR X1, [X5]          // Write to translation table entry
DSB ISH               // Barrier instructions - not covered in this guide
TLBI VAAE1IS, X0      // Invalidate VA specified by X0, in EL0/1
                      // virtual address space for all ASIDs
DSB ISH               // Barrier instructions - not covered in this guide
ISB                   // Synchronize context on this processor
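
The Xt operand of a TLBI by address is not the raw virtual address. As we understand the Arm ARM, bits [43:0] of Xt hold VA[55:12] and bits [63:48] hold the ASID, which the VAA variants ignore. The sketch below packs such an operand; the function name tlbi_va_operand is hypothetical, and the layout should be confirmed against the Arm ARM for the specific operation.

```python
def tlbi_va_operand(va: int, asid: int = 0) -> int:
    """Pack the Xt operand for a TLBI-by-VA operation (illustrative sketch).

    Assumed layout: Xt[43:0] = VA[55:12], Xt[63:48] = ASID, other bits zero.
    """
    if not 0 <= asid < (1 << 16):
        raise ValueError("ASID must fit in 16 bits")
    page_number = (va >> 12) & ((1 << 44) - 1)    # VA[55:12]
    return (asid << 48) | page_number
```

For a TLBI VAAE1IS of the page at 0x40203000, the operand would be tlbi_va_operand(0x40203000), which is 0x40203.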

9 Address translation instructions

An Address Translation (AT) instruction lets the software query the translation for a specific address. The translation that results,
including the attributes, is written to the Physical Address Register, PAR_EL1.

The syntax of the AT instruction lets you specify which translation regime to use. For example, EL2 can query the EL0/EL1
translation regime. However, EL1 cannot use the AT instruction to query the EL2 translation regime, as this is a breach of
privilege.

If the requested translation would have caused a fault, no exception is generated. Instead, the type of fault that would have been
generated is recorded in PAR_EL1.
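
Software usually reads PAR_EL1 after an AT instruction and checks whether the translation succeeded. The sketch below decodes a PAR_EL1 value under the field layout we understand from the Arm ARM: bit 0 (F) indicates a fault, bits [6:1] hold the fault status when F is 1, and bits [47:12] and [63:56] hold the output address and memory attributes when F is 0. The function name decode_par_el1 is hypothetical, and the sketch ignores the remaining fields and physical addresses wider than 48 bits.

```python
def decode_par_el1(par: int) -> dict:
    """Decode the main fields of a PAR_EL1 value (illustrative sketch)."""
    if par & 1:                                   # F == 1: translation would have faulted
        return {"fault": True, "fault_status": (par >> 1) & 0x3F}
    return {
        "fault": False,
        "pa_page_base": par & 0x0000_FFFF_FFFF_F000,   # PA[47:12], page-aligned
        "attr": (par >> 56) & 0xFF,                    # memory attributes
    }
```

To form the full physical address, combine pa_page_base with bits [11:0] of the virtual address that was queried.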

10 Check your knowledge

Q. What is the difference between a stage and a level in address translation?

A. A stage is the process of translating an input address to an output address. For Stage 1 this is the process of going from VA to
IPA and for Stage 2 going from IPA to PA.

A level refers to the tables in a given stage of translation. It is also how a larger block can be subdivided into smaller blocks.

Q. What is the maximum size of a physical address?

A. The maximum size of the physical address space is IMPLEMENTATION DEFINED, and up to 52 bits (since Armv8.2-A).

Q. Which register field controls the size of the virtual address space?

A. TCR_ELx.TnSZ, or VTCR_EL2.T0SZ for Stage 2.

Q. What is a translation granule, and what are the supported sizes?

A. It is the smallest block of memory that can be described.

The supported sizes are 4KB, 16KB, and 64KB.

Q. What does TLBI ALLE3 do?

A. It invalidates all the TLB entries for the EL3 virtual address space.

Q. Can a translation table entry that causes a Translation Fault be cached in the TLBs?

A. No, it cannot be stored in the TLBs.

Q. How are addresses mapped when the MMU is disabled?

A. Addresses are flat mapped, so that the input and output addresses are the same.

Q. What is an ASID and when does a TLB entry include an ASID?

A. An ASID is an Address Space Identifier. It identifies which application a translation is associated with. Non-Global mappings
(nG=1) are tagged with an ASID in the TLBs.

11 Related information
Here are some resources related to material in this guide:

Caches
• This topic is covered in the Caches and coherency guide (coming soon).

Virtualization
• For Armv8-A, this topic is covered in Virtualization.
• This topic is also covered in our R-profile guide, Armv8-R Virtualization.

Useful links to training:

• Introduction to Armv8-A
• Memory model overview
• Memory types overview
• What does architecture consist of?

12 Next steps
This guide has introduced the concept of memory management, explaining the mapping of virtual to physical addresses.
Understanding this information will help you to create your own bare-metal page tables and understand the processes your OS
performs when allocating memory.

Memory types and attributes, such as access permissions, are covered in the Memory Model guide.

As well as the Memory Management Unit (MMU) in the processor, it is increasingly common to have MMUs for non-processor
masters, such as Direct Memory Access (DMA) engines. These are referred to as System MMUs (SMMUs) in Arm systems, and
elsewhere as IOMMUs.

To keep learning about the Armv8-A architecture, see more in our series of guides.
