Why mmap is faster than system calls

budabudimir · on Dec 19, 2019

I get the point, but mmap is a syscall as well. Perhaps a better title would be, "Why using mmap is faster than using read and write syscalls".

Would using O_DIRECT flag result in similar timings for mmap and read/write?

fedorova · on Dec 19, 2019

I doubt it. Using O_DIRECT is essentially bypassing the buffer cache. Similar to the “cold” experiments where the file is not cached. Mmap is still faster. I have also done experiments on a NVRAM machine with DAX file system (no buffer cache). Mmap is still several times faster.

budabudimir · on Dec 20, 2019

It is skipping the buffer cache that is true, but, if I understand things correctly, it allows kernel to use user provided buffers directly, thus skipping the copying of the data from kernel to user land. That is why buffers used in O_DIRECT context have to be aligned properly.

It would be fun to run the experiment non the less.

From open(2) man pages on O_DIRECT: Try to minimize cache effects of the I/O to and from this file. In general this will degrade performance, but it is useful in special situations, such as when applications do their own caching. File I/O is done directly to/from user- space buffers

fedorova · on Dec 19, 2019

I do agree with you about the title!

jojo9978 · on Dec 19, 2019

A system call is a special function that lets you cross protection domains