[Issue 9027] LMDB: API to retrieve map address

5 Jun 2021


      https://bugs.openldap.org/show_bug.cgi?id=9027
--- Comment #3 from doug@hcsw.org ---
Hi! I would like to voice my support for this feature, and explain how this API
would help me.
First of all, a bit of background: I'm maintaining C++ bindings for LMDB
(https://github.com/hoytech/lmdbxx), and have been a contributor to the perl
bindings and others. I have published two interface layers that use LMDB,
https://github.com/hoytech/quadrable and https://github.com/hoytech/rasgueadb
and as my colleagues would attest, am a relentless LMDB advocate.
Application 1: Testing zero-copy
That previous link, RasgueaDB, is an indexing and query layer for LMDB. It uses
flatbuffers for its data format, which allows zero-copy access to fields. Since
there are several layers involved, I wanted to make sure that no copying was
happening, and also to add a test to the test-suite to make sure this remains
the case.
To do this, I have a library called assert_zerocopy:
https://github.com/hoytech/hoytech-cpp/blob/master/hoytech/assert_zerocopy.h
(and also a perl equivalent here: https://metacpan.org/pod/Test::ZeroCopy )
However, in order to verify the returned value is actually a zero-copy
reference, I need the pointer to the memory map (and the map's size). I have
added a terrible hack to the lmdbxx bindings to retrieve this:
https://github.com/hoytech/lmdbxx/blob/08eddafcc4613c7fc8ebd88f5db87c7d7bfb9...
But this is obviously not a good approach (as mentioned by Nic in this thread)
and it would be much better if there was an API to retrieve this value (like
there is for me_mapsize).
Application 2: Virtual memory controls
In general I agree that mlock()ing large files like databases is
counter-productive. However, there are other controls that can be applied to
the virtual memory of an application.
For example, rsync attempts to preserve the filesystem cache state so that
rsync invocations have as small an impact as possible on the cache (since it's
a shared resource): https://insights.oetiker.ch/linux/fadvise/
Another example, before restarting a server, Instagram uses my utility vmtouch
to snapshot the virtual memory state (basically the set of "hot" files). After
rebooting, the state is restored before the server is added back to the active
pool of servers.
I have generalised this with my application/library vmprobe:
https://vmprobe.com/filesystem-cache
If I had access to me_map within my application, I would be able to integrate
libvmprobe more easily and portably. For instance, this would allow me to take
a snapshot of which pages are resident in memory, perform some large read query
that touches many pages, then restore the original VM page residency set after
the query completes.
Another variant of this: If you have multiple customers who use the same server
(at different times) then you can restore a customer's previous VM state when
they login. If you know approximately the pages that will be accessed in
advance, sequentially pre-paging them in can have significant performance
benefits. I did some experiments with Postgres pre-paging you can see here:
https://vmprobe.com/database-speedup
Conclusion
Although certainly a user could abuse me_map if it were exposed, one of the
reasons why I enjoy LMDB so much is that it generally assumes its users know
what they are doing and doesn't compromise on flexibility or performance for
the sake of users who don't. For example, although it is not recommended to use
MDB_WRITEMAP in most cases, it nevertheless exists.
As explained above, I have several use-cases for me_map that I feel are
legitimate. I am already accessing it in a hacky way but naturally would prefer
a more portable and future-proof API.
If this API is added, the documentation should of course make clear that
accessing or modifying (if MDB_WRITEMAP) data through me_map is not supported
in any way, and regular use-cases have no need for this value and should stick
to the supported APIs.
Thank you!
-- 
You are receiving this mail because:
You are on the CC list for the issue.

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

[Issue 9027] LMDB: API to retrieve map address