linux

mirror of https://github.com/torvalds/linux.git synced 2025-12-07 20:06:24 +00:00


Jason Gunthorpe 879ced2bab iommupt: Add the AMD IOMMU v1 page table format

AMD IOMMU v1 is unique in supporting contiguous pages with a variable size
and it can decode the full 64 bit VA space. Unlike other x86 page tables
this explicitly does not do sign extension as part of allowing the entire
64 bit VA space to be supported.
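
As a rough illustration of what sign extension means here (this is not code
from the patch), the sketch below checks whether an address is "canonical"
for a given number of decoded VA bits, i.e. whether all bits above the top
decoded bit are copies of it. The helper name va_is_canonical() and the
48 bit width used in the comments are assumptions chosen for the example.

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * Hypothetical helper: returns true if every bit from (bits - 1) up to
 * bit 63 of va has the same value, which is what a sign-extending page
 * table format requires. Valid for 1 <= bits <= 64.
 */
static bool va_is_canonical(uint64_t va, unsigned int bits)
{
	/* Bit (bits - 1) and everything above it. */
	uint64_t upper = va >> (bits - 1);
	/* The same field with every bit set. */
	uint64_t all_ones = UINT64_MAX >> (bits - 1);

	return upper == 0 || upper == all_ones;
}
```

With bits = 48 the address 0x0000800000000000 is rejected, because bit 47
is set but bits 63:48 are clear. With bits = 64 the check collapses to a
single bit and every address passes, which mirrors a format that decodes
the full 64 bit space without sign extension.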

The general design is quite similar to the x86 PAE format, except with a
6th level and quite different PTE encoding.

This format is the only one that uses the PT_FEAT_DYNAMIC_TOP feature in
the existing code as the existing AMDv1 code starts out with a 3 level
table and adds levels on the fly if more IOVA is needed.
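
A minimal sketch of how the number of levels could be chosen for a growing
top, assuming 4 KiB pages and 9 index bits per level. The constants and the
function name levels_for_iova() are hypothetical and not taken from the
patch; real code would also have to allocate and install the new top table.

```c
#include <stdint.h>

#define TOY_PAGE_SHIFT     12u	/* assumed 4 KiB base page */
#define TOY_BITS_PER_LEVEL  9u	/* assumed 512 entries per table */
#define TOY_MAX_LEVELS      6u	/* the "6th level" mentioned above */

/*
 * Starting from `levels`, add levels until last_iova fits in the address
 * bits the table can decode, or the maximum depth is reached.
 */
static unsigned int levels_for_iova(uint64_t last_iova, unsigned int levels)
{
	while (levels < TOY_MAX_LEVELS) {
		unsigned int bits = TOY_PAGE_SHIFT + levels * TOY_BITS_PER_LEVEL;

		/* Nothing set above the decoded bits: deep enough. */
		if (bits >= 64 || (last_iova >> bits) == 0)
			break;
		levels++;
	}
	return levels;
}
```

Under these assumptions 3 levels decode 39 bits, 4 decode 48, 5 decode 57,
and only the 6th level reaches the full 64 bits.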

Comparing the performance of several operations to the existing version:

iommu_map()
   pgsz  ,avg new,old ns, min new,old ns  , min % (+ve is better)
     2^12,     65,64    ,      62,61      ,  -1.01
     2^13,     70,66    ,      67,62      ,  -8.08
     2^14,     73,69    ,      71,65      ,  -9.09
     2^15,     78,75    ,      75,71      ,  -5.05
     2^16,     89,89    ,      86,84      ,  -2.02
     2^17,    128,121   ,     124,112     , -10.10
     2^18,    175,175   ,     170,163     ,  -4.04
     2^19,    264,306   ,     261,279     ,   6.06
     2^20,    444,525   ,     438,489     ,  10.10
     2^21,     60,62    ,      58,59      ,   1.01
 256*2^12,    381,1833  ,     367,1795    ,  79.79
 256*2^21,    375,1623  ,     356,1555    ,  77.77
 256*2^30,    356,1338  ,     349,1277    ,  72.72

iommu_unmap()
   pgsz  ,avg new,old ns, min new,old ns  , min % (+ve is better)
     2^12,     76,89    ,      71,86      ,  17.17
     2^13,     79,89    ,      75,86      ,  12.12
     2^14,     78,90    ,      74,86      ,  13.13
     2^15,     82,89    ,      74,86      ,  13.13
     2^16,     79,89    ,      74,86      ,  13.13
     2^17,     81,89    ,      77,87      ,  11.11
     2^18,     90,92    ,      87,89      ,   2.02
     2^19,     91,93    ,      88,90      ,   2.02
     2^20,     96,95    ,      91,92      ,   1.01
     2^21,     72,88    ,      68,85      ,  20.20
 256*2^12,    372,6583  ,     364,6251    ,  94.94
 256*2^21,    398,6032  ,     392,5758    ,  93.93
 256*2^30,    396,5665  ,     389,5258    ,  92.92

The ~5-17x speedup when working with multi-PTE map/unmaps is because the
AMD implementation rewalks the entire table on every new PTE while this
version retains its position. The same speedup will be seen with dirty
tracking as well.
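
The difference between rewalking and retaining position can be sketched
with a toy two-level table in userspace C. Everything here (struct
toy_table, toy_leaf(), map_rewalk(), map_retained(), the 512 entry table
size) is hypothetical and only models the idea; it is not the kernel
implementation. The descents counter records how often the top level is
consulted.

```c
#include <stdint.h>
#include <stdlib.h>

#define TOY_BITS    9u
#define TOY_ENTRIES (1u << TOY_BITS)

struct toy_table {
	uint64_t *leaf[TOY_ENTRIES];	/* top level: pointers to leaf tables */
	unsigned long descents;		/* number of top-to-leaf lookups */
};

/* Walk from the top to the leaf table covering idx, allocating it if needed. */
static uint64_t *toy_leaf(struct toy_table *t, uint64_t idx)
{
	uint64_t top = (idx >> TOY_BITS) % TOY_ENTRIES;

	t->descents++;
	if (!t->leaf[top])
		t->leaf[top] = calloc(TOY_ENTRIES, sizeof(uint64_t));
	return t->leaf[top];
}

/* Rewalk style: start from the top of the table for every single PTE. */
static int map_rewalk(struct toy_table *t, uint64_t idx, uint64_t n,
		      uint64_t val)
{
	for (uint64_t i = 0; i < n; i++) {
		uint64_t *leaf = toy_leaf(t, idx + i);

		if (!leaf)
			return -1;
		leaf[(idx + i) % TOY_ENTRIES] = val + i;
	}
	return 0;
}

/* Retained position: descend only when crossing into a new leaf table. */
static int map_retained(struct toy_table *t, uint64_t idx, uint64_t n,
			uint64_t val)
{
	uint64_t *leaf = NULL;

	for (uint64_t i = 0; i < n; i++) {
		uint64_t cur = idx + i;

		if (!leaf || cur % TOY_ENTRIES == 0) {
			leaf = toy_leaf(t, cur);
			if (!leaf)
				return -1;
		}
		leaf[cur % TOY_ENTRIES] = val + i;
	}
	return 0;
}
```

Mapping 256 consecutive entries costs 256 descents with map_rewalk() and a
single descent with map_retained(), while both leave identical contents in
the leaf table. A real table has more levels, so each avoided rewalk saves
correspondingly more work.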

The old implementation triggers a compiler optimization that ends up
generating a "rep stos" memset for contiguous PTEs. Since AMD can have
contiguous PTEs that span 2 KiB of table this is a huge win compared to
a normal movq loop. It is why the unmap side has a fairly flat runtime as
the contiguous PTE size increases. This version makes it explicit with a
memset64() call.
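
To make the idea concrete, here is a userspace sketch of filling a
contiguous run with one 64 bit value. fill64() is a stand-in written for
this example that mimics what the kernel's memset64() does; the 2 KiB run
size comes from the text above, and write_contiguous() is a hypothetical
name. The sketch assumes every entry in the run holds the same value,
which is what a memset-style fill implies.

```c
#include <stddef.h>
#include <stdint.h>

/* Userspace stand-in for a 64 bit wide memset: store value count times. */
static uint64_t *fill64(uint64_t *dst, uint64_t value, size_t count)
{
	for (size_t i = 0; i < count; i++)
		dst[i] = value;
	return dst;
}

/* A 2 KiB span of 8 byte entries is 256 PTEs. */
#define TOY_CONT_BYTES 2048u
#define TOY_CONT_PTES  (TOY_CONT_BYTES / sizeof(uint64_t))

/* Write one contiguous run starting at table[first] with a single call. */
static void write_contiguous(uint64_t *table, size_t first, uint64_t pte)
{
	fill64(&table[first], pte, TOY_CONT_PTES);
}
```

Expressing the fill as one call over the whole run, rather than relying on
the compiler to recognize a store loop, keeps the fast path from depending
on an optimization that may or may not trigger.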

Reviewed-by: Kevin Tian <kevin.tian@intel.com>
Reviewed-by: Vasant Hegde <vasant.hegde@amd.com>
Tested-by: Alejandro Jimenez <alejandro.j.jimenez@oracle.com>
Tested-by: Pasha Tatashin <pasha.tatashin@soleen.com>
Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>

2025-11-05 09:07:08 +01:00

arch

iommu: Pass in old domain to attach_dev callback functions

2025-10-27 13:55:35 +01:00

block

Merge tag 'block-6.18-20251023' of git://git.kernel.org/pub/scm/linux/kernel/git/axboe/linux

2025-10-24 12:48:19 -07:00

certs

sign-file,extract-cert: use pkcs11 provider for OPENSSL MAJOR >= 3

2024-09-20 19:52:48 +03:00

crypto

Merge tag 'v6.18-p3' of git://git.kernel.org/pub/scm/linux/kernel/git/herbert/crypto-2.6

2025-10-10 08:56:16 -07:00

Documentation

genpt: Add Documentation/ files

2025-11-05 09:07:07 +01:00

drivers

iommupt: Add the AMD IOMMU v1 page table format

2025-11-05 09:07:08 +01:00

Merge tag 'x86_urgent_for_v6.18_rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

2025-10-26 09:57:18 -07:00

include

iommupt: Add the AMD IOMMU v1 page table format

2025-11-05 09:07:08 +01:00

init

Merge tag 'printk-for-6.18' of git://git.kernel.org/pub/scm/linux/kernel/git/printk/linux

2025-10-04 11:13:11 -07:00

io_uring

io_uring: fix buffer auto-commit for multishot uring_cmd

2025-10-23 19:41:31 -06:00

ipc

Merge tag 'namespace-6.18-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs

2025-09-29 11:20:29 -07:00

kernel

Merge tag 'irq_urgent_for_v6.18_rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

2025-10-26 09:54:36 -07:00

lib

lib/crypto: poly1305: Restore dependency of arch code on !KMSAN

2025-10-22 10:52:10 -07:00

LICENSES

LICENSES: Replace the obsolete address of the FSF in the GFDL-1.2

2025-07-24 11:15:39 +02:00

Merge tag 'slab-for-6.18-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/vbabka/slab

2025-10-24 12:40:51 -07:00

net

Merge tag 'net-6.18-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net

2025-10-23 07:03:18 -10:00

rust

Merge tag 'driver-core-6.18-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/driver-core/driver-core

2025-10-25 11:03:46 -07:00

samples

Merge tag 'char-misc-6.18-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/char-misc

2025-10-04 16:26:32 -07:00

scripts

Merge tag 'kbuild-fixes-6.18-1' of git://git.kernel.org/pub/scm/linux/kernel/git/kbuild/linux

2025-10-11 15:47:12 -07:00

security

Merge tag 'integrity-v6.18' of git://git.kernel.org/pub/scm/linux/kernel/git/zohar/linux-integrity

2025-10-05 10:48:33 -07:00

sound

ALSA: hda/realtek: Fix mute led for HP Omen 17-cb0xxx

2025-10-17 16:37:21 +02:00

tools

Merge tag 'objtool_urgent_for_v6.18_rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

2025-10-26 09:44:36 -07:00

usr

gen_init_cpio: Ignore fsync() returning EINVAL on pipes

2025-10-07 09:53:05 -07:00

virt

Merge tag 'kvm-x86-fixes-6.18-rc2' of https://github.com/kvm-x86/linux into HEAD

2025-10-18 10:25:43 +02:00

.clang-format

genpt: Generic Page Table base API

2025-11-05 09:07:04 +01:00

.clippy.toml

rust: clean Rust 1.88.0's warning about clippy::disallowed_macros configuration

2025-05-07 00:11:47 +02:00

.cocciconfig

…

.editorconfig

.editorconfig: remove trim_trailing_whitespace option

2024-06-13 16:47:52 +02:00

.get_maintainer.ignore

MAINTAINERS: remove Alyssa Rosenzweig

2025-09-18 21:17:31 +02:00

.gitattributes

.gitattributes: set diff driver for Rust source code files

2023-05-31 17:48:25 +02:00

.gitignore

.gitignore: ignore compile_commands.json globally

2025-08-12 15:53:55 -07:00

.mailmap

MAINTAINERS: Update Alex Williamson's email address

2025-10-20 15:45:03 -06:00

.pylintrc

tools: docs: parse-headers.py: move it from sphinx dir

2025-08-29 15:54:42 -06:00

.rustfmt.toml

rust: add .rustfmt.toml

2022-09-28 09:02:20 +02:00

COPYING

COPYING: state that all contributions really are covered by this file

2020-02-10 13:32:20 -08:00

CREDITS

Merge tag 'usb-6.18-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/usb

2025-10-04 16:07:08 -07:00

Kbuild

sched: Make migrate_{en,dis}able() inline

2025-09-25 09:57:16 +02:00

Kconfig

io_uring: Rename KConfig to Kconfig

2025-02-19 14:53:27 -07:00

MAINTAINERS

Merge tag 'io_uring-6.18-20251023' of git://git.kernel.org/pub/scm/linux/kernel/git/axboe/linux

2025-10-24 12:44:31 -07:00

Makefile

Linux 6.18-rc3

2025-10-26 15:59:49 -07:00

README

README: Fix spelling

2024-03-18 03:36:32 -06:00

README

Linux kernel
============

There are several guides for kernel developers and users. These guides can
be rendered in a number of formats, like HTML and PDF. Please read
Documentation/admin-guide/README.rst first.

In order to build the documentation, use ``make htmldocs`` or
``make pdfdocs``.  The formatted documentation can also be read online at:

    https://www.kernel.org/doc/html/latest/

There are various text files in the Documentation/ subdirectory,
several of them using the reStructuredText markup notation.

Please read the Documentation/process/changes.rst file, as it contains the
requirements for building and running the kernel, and information about
the problems which may result from upgrading your kernel.

Languages

C 97.1%

Assembly 1%

Shell 0.6%

Rust 0.4%

Python 0.4%

Other 0.3%