Re: LMDB crash consistency, again

5 Jan 2015


      Hallvard Breien Furuseth wrote:
...
On 15/11/14 02:57, Howard Chu wrote:
...
Ah good point. If you check out their slides, #103 of 106 asks the
question; the only failure they found in LMDB occurred on ext3 (and not
on XFS) so we may just chalk this up to a flaw in ext3 instead.
Given that ext3 has already been superseded by ext4, this result of
theirs may not be all that useful in the real world. We already have
disrecommended ext3 for performance reasons, perhaps we should just note
this and move on.
No, ext4 breaks too.  Their paper's page 459, 5.1.2 LightningDB:
 The fact that the journal commit block (op#402) is
 flushed with the next pwrite64 in the same thread
 means fdatasync on ext3 does not wait for the comple-
 tion of journaling (similar behavior has been observed
 on ext4).


I guess O_DSYNC and fdatasync() should not be the lmdb
defaults yet, at least not on Linux:-(
We need a bug report to Linux or to the distro they were
using, noting that the power faults are simulated.  And is
this O_DSYNC, fdatasync or both?  The problem might be with
only one of them.  I'm not reading the paper in detail yet.
This appears to be quite old news.
https://lkml.org/lkml/2012/9/3/83
It has references going back to at least 2008.
-- 
   -- Howard Chu
   CTO, Symas Corp.           http://www.symas.com
   Director, Highland Sun     http://highlandsun.com/hyc/
   Chief Architect, OpenLDAP  http://www.openldap.org/project/

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

Re: LMDB crash consistency, again