Lecture: Address Translation & Paging
Physical Memory
  - byte addressable (can refer to each byte in memory), limited size

  - ~200 cycles access latency (for reference, common instructions complete within ~7 cycles)

  - a process's code and data need to be in memory to execute
 
Physical Memory Management
  - another resource allocation problem: limited physical memory, multiple processes
  
  - so how do we allocate memory?
  
    - simple case: one process at a time
    
      - give the entire physical memory to the process
 
      - no translation needed, process's address = physical address
 
      - pros vs cons?
 
    
     - actual case: multiple processes
    
      - attempt 1:
 
      
        - if we know how much memory a process needs, we can just put each process's memory into disjoint sections of the physical memory

        - do we need address translation now? how do we support fork?

          - virtual memory: every process has its own view of memory

          - virtual address vs physical address

          - hw support: base and bound registers

        - pros vs cons? what are the limitations? (see the base-and-bound sketch below)
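        - a minimal sketch (not from the lecture) of base-and-bound translation in C; the struct and field names are made up for illustration:

            #include <stdint.h>
            #include <stdbool.h>

            // hypothetical per-process relocation registers, loaded by the kernel on context switch
            struct base_bound {
                uint64_t base;   // start of the process's region in physical memory
                uint64_t bound;  // size of the region in bytes
            };

            // translate a process address to a physical address,
            // or report a fault if it falls outside the process's region
            static bool translate(const struct base_bound *bb, uint64_t vaddr, uint64_t *paddr) {
                if (vaddr >= bb->bound)
                    return false;            // out of bounds: raise a protection fault
                *paddr = bb->base + vaddr;   // in bounds: relocate by the base
                return true;
            }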
 
      
      - attempt 2:
 
      
        - do programs need all of their memory at once?

        - how can we make more efficient use of physical memory?

        - paging: divide a process's memory into fixed-size chunks, only keep the ones currently needed in physical memory
 
      
    
   
 
Paging
  - divide a process's memory into fixed-size pages (typically 4KB; see the address-split sketch below)

  - only keep the pages we currently need in memory (what might those be?)

  - dynamically load other pages into memory as needed
 
  
    - access to a page not in memory causes a page fault
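  - sketch (not from the lecture): with 4KB pages, a virtual address splits into a virtual page number and an offset within the page; the names below are made up for illustration:

      #include <stdint.h>

      #define PAGE_SHIFT 12                      // 4KB pages: 2^12 bytes
      #define PAGE_SIZE  (1UL << PAGE_SHIFT)
      #define PAGE_MASK  (PAGE_SIZE - 1)

      // virtual page number: which page the address falls in
      static inline uint64_t vpn(uint64_t vaddr)    { return vaddr >> PAGE_SHIFT; }

      // offset within the page: unchanged by translation
      static inline uint64_t offset(uint64_t vaddr) { return vaddr & PAGE_MASK; }

      // once the page's frame is known, the physical address is frame number + offset
      static inline uint64_t phys_addr(uint64_t frame, uint64_t off) {
          return (frame << PAGE_SHIFT) | off;
      }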
  
 
Address Translation With Paging
  - how would we implement this purely in software?
    
      - divide up physical memory into page-sized chunks

        - each chunk of physical memory is called a frame, page frame, or physical page
  
      - track which page is mapped to which frame (physical memory)
      
        - is this information per process or per entire system? 
        
        - what data structure can we use to store this info?

        - what's the cost for accessing the data structure?

        - where do we store the data structure?

        - how many of these translation mappings would we need to store?
      
 
      - on every memory access, transfer control to the kernel and ask it to perform the address translation
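      - one possible answer (a sketch, not the lecture's design): a flat per-process array indexed by virtual page number; simple, but it wastes space for sparse address spaces and, done purely in software, it would cost a trap on every memory access:

          #include <stdint.h>
          #include <stdbool.h>

          // hypothetical flat mapping for a small address space (4KB pages)
          #define NPAGES     (1UL << 20)
          #define NOT_MAPPED UINT64_MAX

          struct proc_mappings {
              uint64_t frame_of[NPAGES];     // vpn -> frame number, or NOT_MAPPED
          };

          // the kernel would have to run something like this on every load/store
          static bool sw_translate(struct proc_mappings *m, uint64_t vaddr, uint64_t *paddr) {
              uint64_t vpn = vaddr >> 12;
              if (vpn >= NPAGES || m->frame_of[vpn] == NOT_MAPPED)
                  return false;                              // no mapping: page fault
              *paddr = (m->frame_of[vpn] << 12) | (vaddr & 0xfff);
              return true;
          }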
 
    
   - how is it actually done?
  
    - page table
 
     
      - data structure for storing page to frame mappings
      
       - single level
 
      
      - multilevel
 
      
      - indirection can help with space saving (when does it not?) 
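       - sketch (not from the lecture) of why indirection can save space: in a two-level table, second-level tables are only allocated for parts of the address space actually in use, so a sparse address space costs little more than a top level full of NULLs (the saving disappears when the address space is densely used); the layout below is a simplified 32-bit-style split, not x86-64's:

           #include <stdbool.h>
           #include <stdint.h>
           #include <stdlib.h>

           #define L1_ENTRIES 1024
           #define L2_ENTRIES 1024

           struct l2_table { uint64_t frame[L2_ENTRIES]; };        // leaf: vpn -> frame number
           struct l1_table { struct l2_table *l2[L1_ENTRIES]; };   // top level: mostly NULL

           // map one 4KB page, allocating the second-level table lazily
           static bool map_page(struct l1_table *pt, uint64_t vaddr, uint64_t frame) {
               uint64_t i1 = (vaddr >> 22) & 0x3ff;   // top 10 bits of the vpn
               uint64_t i2 = (vaddr >> 12) & 0x3ff;   // low 10 bits of the vpn
               if (pt->l2[i1] == NULL) {
                   pt->l2[i1] = calloc(1, sizeof(struct l2_table));
                   if (pt->l2[i1] == NULL)
                       return false;                  // out of memory
               }
               pt->l2[i1]->frame[i2] = frame;
               return true;
           }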
 
    
    - how often do we need to perform address translation?
 
    - how can we speed it up?
 
    
      - cache the translation lookup! Translation Lookaside Buffer (TLB)
      
        - upon a memory access, the hardware checks if the translation for the page is cached in the TLB
 
        - if not, walk the page table to find the corresponding frame, and add that to the TLB
 
        
      
       - have hardware perform the translation lookup (page table walk); see the lookup sketch below
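       - a tiny software model (not the lecture's code) of the lookup order: check the TLB first, fall back to a page table walk on a miss and cache the result; walk_page_table is a hypothetical stand-in for whatever table structure the kernel set up:

           #include <stdbool.h>
           #include <stdint.h>

           #define TLB_ENTRIES 16

           struct tlb_entry { bool valid; uint64_t vpn, frame; };
           static struct tlb_entry tlb[TLB_ENTRIES];

           // stand-in for the real walk over the page table structure
           extern bool walk_page_table(uint64_t vpn, uint64_t *frame);

           static bool lookup(uint64_t vpn, uint64_t *frame) {
               // 1. TLB hit: no extra memory accesses needed for translation
               for (int i = 0; i < TLB_ENTRIES; i++)
                   if (tlb[i].valid && tlb[i].vpn == vpn) { *frame = tlb[i].frame; return true; }

               // 2. TLB miss: walk the page table, then cache the translation
               if (!walk_page_table(vpn, frame))
                   return false;                                    // no valid mapping: page fault
               tlb[vpn % TLB_ENTRIES] = (struct tlb_entry){ true, vpn, *frame };  // trivial replacement policy
               return true;
           }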
    
 
  
 
x86-64 Address Translation
  - the architecture specification defines the format of the page table
x86-64 page table format
  - 4 level page table
  
    - PML4: Page Map Level 4, top level page table, each entry stores the address of a PDPT
 
    - PDPT: Page Directory Pointer Table, 2nd level page table, each entry stores the address of a PDT
 
    - PDT: Page Directory Table, 3rd level page table, each entry stores the address of a PT
 
    - PT: Page Table, last level page table, each entry stores the address of the mapped frame
 
  
  
   - each table is 4KB in size and each table entry is 8 bytes
 
  
    - 4096 (table size) / 8 (entry size) = 512 (entries)
 
    - each table is indexed with 9 bits of the virtual address
 
    - what does the 8 byte page table entry look like? 
    
    - page table entry (decoded in the sketch below):

        - bits 0-11 contain information about the page (bit 0: present, bit 1: writable, bit 2: user accessible)

        - bits 12-47 contain the physical page number of the frame

        - bits 48-63 are reserved or hold other permission info about the page (bit 63: execute disable / NX)
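        - sketch (constants mine, not from the lecture) of how the four 9-bit indices and the entry fields fall out of the layout above; it assumes the kernel can dereference physical addresses directly (e.g. via an identity mapping) and omits present-bit checks:

            #include <stdint.h>

            // indices into the four levels: bits 47-39, 38-30, 29-21, 20-12 of the virtual address
            #define PML4_INDEX(va) (((va) >> 39) & 0x1ff)
            #define PDPT_INDEX(va) (((va) >> 30) & 0x1ff)
            #define PDT_INDEX(va)  (((va) >> 21) & 0x1ff)
            #define PT_INDEX(va)   (((va) >> 12) & 0x1ff)

            // page table entry: low 12 bits are flags, bits 12-47 are the frame address
            #define PTE_PRESENT    (1UL << 0)
            #define PTE_WRITABLE   (1UL << 1)
            #define PTE_USER       (1UL << 2)
            #define PTE_ACCESSED   (1UL << 5)
            #define PTE_DIRTY      (1UL << 6)
            #define PTE_NX         (1UL << 63)                       // execute disable
            #define PTE_ADDR(pte)  ((pte) & 0x0000fffffffff000UL)    // bits 12-47

            // conceptual 4-level walk; in hardware each step is one memory read
            static uint64_t walk(uint64_t *pml4, uint64_t va) {
                uint64_t *pdpt = (uint64_t *)PTE_ADDR(pml4[PML4_INDEX(va)]);
                uint64_t *pdt  = (uint64_t *)PTE_ADDR(pdpt[PDPT_INDEX(va)]);
                uint64_t *pt   = (uint64_t *)PTE_ADDR(pdt[PDT_INDEX(va)]);
                return PTE_ADDR(pt[PT_INDEX(va)]) | (va & 0xfff);    // frame address + page offset
            }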
 
        
      
 
  - why do we care about the format if hardware does the walk and permission checking?
 
  
    - the kernel is responsible for setting up the page tables and filling out the entries
 
    
    - the kernel can use these bits to make paging policy decisions
        
        - e.g. bit 5 indicates whether the page has been accessed, bit 6 indicates whether it has been written to
 
 
  
  
Page Faults
  - an exception raised by the hardware when something goes wrong during the page table walk

  - could be a missing translation mapping or an access permission violation

  - how does the kernel handle a page fault? (see the handler skeleton after the list below)
  
  
    - identify and handle valid page faults
 
    
      - stack or heap growth
 
      - memory mapped files
 
      - known permission mismatch
 
      - memory pressure (access to swapped pages)
 
    
    - terminate threads with invalid page faults 
    
      - nullptr, random address in unallocated virtual memory
 
      - actual permission mismatch
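    - a hedged skeleton (helper names are hypothetical, not xk's actual functions) showing the shape of that decision:

        #include <stdbool.h>
        #include <stdint.h>

        // hypothetical helpers; only the decision structure matters here
        extern bool in_stack_region(uint64_t addr);
        extern bool is_cow_page(uint64_t addr);
        extern bool is_swapped_out(uint64_t addr);
        extern void grow_stack(uint64_t addr);
        extern void copy_cow_page(uint64_t addr);
        extern void swap_in(uint64_t addr);
        extern void kill_current_process(void);

        // called from the trap handler with the faulting address (on x86-64, read from %cr2)
        void handle_page_fault(uint64_t fault_addr, bool is_write) {
            if (in_stack_region(fault_addr)) {
                grow_stack(fault_addr);          // valid: allocate and map a new stack page
            } else if (is_write && is_cow_page(fault_addr)) {
                copy_cow_page(fault_addr);       // valid: known permission mismatch (copy-on-write)
            } else if (is_swapped_out(fault_addr)) {
                swap_in(fault_addr);             // valid: page was evicted under memory pressure
            } else {
                kill_current_process();          // invalid: null/unallocated address or real violation
            }
            // on return, the faulting instruction is retried and should now succeed
        }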
 
    
   
  - needs bookkeeping structures to track information (unrelated to address translation) about each page
 
  - machine independent bookkeeping structures vs machine dependent page table
 
  
    - machine independent structures in xk: vspace, vregion, vpage_info

      - track the size of each region (stack, heap, code), whether a page is associated with any file, whether a page is copy-on-write (cow)

    - machine dependent structures in xk: the x86-64 page table (x86_64vm.c)

      - used for the actual translation information

    - you can update just the vspace and generate a new machine dependent page table with vspaceinvalidate
  
  - last thing: how does the TLB interact with page fault handling?
 
  
    - if we change the permission of a page while handling a page fault (e.g. copy-on-write), is the cached translation in the TLB still valid?

    - if we add a new mapping while handling a page fault (e.g. stack growth), do we need to do anything to the TLB?
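    - a sketch of the usual answers (my summary, not the lecture's): a changed mapping must be flushed from the TLB because the stale translation may still be cached, while a brand-new mapping needs no flush since the faulting access already missed the TLB and the retry will simply walk the updated page table; on x86-64 a single entry can be flushed with invlpg:

        #include <stdint.h>

        // flush one virtual address from the TLB (x86-64 invlpg instruction)
        static inline void tlb_invalidate(void *va) {
            asm volatile("invlpg (%0)" :: "r"(va) : "memory");
        }

        // e.g. after handling a copy-on-write fault by pointing the PTE at a private,
        // writable copy of the frame, flush the old translation before returning:
        //     *pte = new_frame | PTE_PRESENT | PTE_WRITABLE | PTE_USER;
        //     tlb_invalidate((void *)fault_addr);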