Overflow Resolution in the DPJoin
Requires a bunch of ugly bookkeeping!
Need to mark tuples depending on state of opposite bucket - this lets us know whether they need to be joined later
Tukwila “Incremental left flush” strategy
- Pause reading from outer relation, swap some of its buckets
- Finish reading from inner; still join with left-side hash table if possible, or swap to disk
- Read outer relation, join with inner’s hash table
- Read from overflow files and join as in hybrid hash join