Scheduling

problem: how to share the CPU?
more generally: multiplex tasks onto limited resources
- cloud service, limited machines, lots of requests
- supermarket, limited cashiers, lots of customers
terms/metrics

latency/response time
- user-perceived time to finish a task (including time waiting)
throughput
- the rate of task completion (# of tasks done per period of time)
scheduling overhead
- time it takes to perform scheduling (run scheduling policy + context switch)
fairness
- are tasks given similar times on the CPU & wait for similar amount of time
predictability
- how consistent is the latency
starvation
- lack of progress for one task due to higher priority tasks

first in first out (FIFO)
- run each task to completion, in the order they come in
- pros/cons?
- average latency: best case? worst case?
shortest job first (SJF)
- schedule the shortest job first
- if a new shorter task arrives, preempts the current task and switches to the new task
- pros/cons?
- average latency?
round robin (RR)
- fifo but each task only gets a fixed amount of time (time slice/quantum)
- how to pick the quantum?
- pros/cons?
- average latency
- seems fair but not all tasks need CPU equally
multilevel feedback queue (MLFQ)
- multiple levels of RR queues, with increasing the quantum
- improves latency for I/O tasks, fair to tasks that use less CPU