The goal is to swap out the page of memory that we'll need in the longest amount
of time, but of course this requires knowing the future, so since no one has yet
written the crystal ball algorithm, we have to do other things.
One common algorithm is LRU, where the least recently used page gets swapped to
disk. The problem with this algorithm is that if a multi gigabyte dataset is
being dealt with, all the other applications that are actually used get swapped.
Least frequently used (LFU) works better in that respect but there are also
problems. For instance multi pass compilers will touch the same data set several
times, and it can cause the compiler itself to be swapped.
You also have NRU, Not recently used (LRU approximation where you don't evict
pages which were just used) and NFU, Not Frequently Used is an LFU approximation
where pages which are used over and over again are kept in memory.
It's hard to get the best algorithm because it depends on the workload. Linux has page aging which is a mix of NRU and NFU that people can tweak to match what they're doing.
With streaming IO, data is typically used only once and pages are put in an inactive list so the goal is to detect that kind of memory access and not try to cache it.
As for memory management itself:
Linux 2.2 introduced do_try_to_free_pages() which was a simple algorithm with
low overhead, but it didn't work very well in some cases. It didn't take into
account the relative activity of the differnt cache sizes and didn't work well
under wildly varying or heavy VM load.
2.4 attempt to fix those problems.
It has balanced page aging, multiqueue VM and smarter page flushing.
As a result, it is more robust under heavy loads, shared memory is integrated
into the page cache, streaming IO is detected and dealt with accordingly.
Multiqueue VM works as follows: you have an active list, an inactive_dirty list
(pages that might become reclaimable), and an inactive_clean list (pages that
are certainly immediately reclaimable)
Other improvements allow a configurable RSS per process where you can limit the amount of resident memory that is being used by a process and the rest will be swapped.
Rik talked about a lot more, but being constrained with time, he flew through
some of his slides a bit faster than we could follow.
Hopefully, he'll get 2 hours or more next time .
Picture library | Back to Main Page | Next page |
2001/01/28 (14:01): Version 1.0