LinuxMMFAQ - linux-mm.org Wiki

What are the design internals behind zap_xxx_range() APIs?

Rik van Riel to Wu, kernelnewbies

On Sun, 30 Mar 2008 17:55:26 +0800 "Wu Yu" < vestige.lug@gmail.com > wrote:

> Hi all,

> AFAIK, for IA-32 arch, two-level paging is sufficient. But the

> positions of PUD and PMD are kept.

This is done so all architectures can use the same iterative functions, like unmap_page_range -> zap_pud_range -> zap_pmd_range -> zap_pte_range.

> I don't quite understand the word "mapping". Does that mean there > is one entry in PGD for both PUD and PMD?

It means that the kernel pretends that the pgd entry points to a pud (with one entry), even though it really points to a page table.

By pretenting it points to itself unmap_page_range can call the next function down, zap_pud_range.

We do the same thing in zap_pud_range - it calls zap_pmd_range on the same pgd entry, except now it thinks it points to a pmd.

Finally, zap_pmd_range interprets the entry and sees that it points to a page table. Zap_pmd_range invokes zap_pte_range and everything works as normal.

Because, on i386, we pretend that the PUD and PMD page tables only have 1 entry, the loop does not loop, but simply invokes the next function down once.