Hi,
unfortunately i was not able to reproduce the exact problem with the segfault, but, after
a few updates,
we still have the problem that with replication enabled the slapd freezes during a write
operation.
SETUP DESCRIPTION:
Openldap Version 2.4.36
Back-MDB (we have issues for quite a while, even when we where running on bdb)
All write and read requests are directed to the active node, so the passive
node is replicating.
So, if I did not understand something wrong I have two threads: The main thread,
and the one which is doing the replication.
Netstat of TCP Replication connections, the second is initiated by the
passive system polling from the active
tcp 0 53 10.169.127.13:389 10.169.126.13:43340 ESTABLISHED
tcp 1905336 0 10.169.127.13:52384 10.169.126.13:389 ESTABLISHED
top -H of the LDAP Processes:
7767 ldap 20 0 84.4g 7.1g 6.9g S 1 10.1 1:02.13 slapd
7768 ldap 20 0 84.4g 7.1g 6.9g S 0 10.1 7:54.44 slapd
8023 ldap 20 0 84.4g 7.1g 6.9g S 0 10.1 0:32.31 slapd
7766 ldap 20 0 84.4g 7.1g 6.9g S 0 10.1 0:00.00 slapd
7769 ldap 20 0 84.4g 7.1g 6.9g S 0 10.1 0:32.81 slapd
7770 ldap 20 0 84.4g 7.1g 6.9g S 0 10.1 7:44.94 slapd
8024 ldap 20 0 84.4g 7.1g 6.9g t 0 10.1 0:32.53 slapd
PASTEBIN:
I Pastebinned all the backtraces to:
http://pastebin.com/vVGEqEUt
I hope this helps to track back the problem.
Kind regards - Mit freundlichen Grüßen
i.A. Hans Freitag
» Linux Administrator
ENTIRETEC AG . Pforzheimer Strasse 33 . 01189 Dresden . Germany
T: +49.351.41355.0 . M: . F: +49.351.41355.99
E: hans.freitag(a)entiretec.com
ENTIRETEC |
http://www.entiretec.com
Germany | Switzerland | United Arab Emirates | Malaysia | United States of America
ENTIRETEC AG
Vorstand: Thomas Herrmann (Vorsitzender), Thomas Wetzel, Carsten Klemm .
Aufsichtsratsvorsitzende: Dr. Jutta Horezky
Sitz der Gesellschaft: Dresden . Amtsgericht Dresden HRB 24915 . USt-IdNr. DE227705033
-----Ursprüngliche Nachricht-----
Von: openldap-bugs-bounces(a)OpenLDAP.org [mailto:openldap-bugs-
bounces(a)OpenLDAP.org] Im Auftrag von quanah(a)zimbra.com
Gesendet: Montag, 5. August 2013 05:15
An: openldap-its(a)openldap.org
Betreff: Re: (ITS#7655) segfault during initial mirror of multimaster
delta replication
--On Sunday, August 04, 2013 4:27 PM +0000 hans.freitag(a)entiretec.com
wrote:
> Full_Name: Hans Freitag
> Version: 2.4.35 and 33
> OS: SLES 11SP2
> URL:
ftp://ftp.openldap.org/incoming/
> Submission from: (NULL) (193.200.138.3)
>
>
> I have a Multimaster Delta replication setup here with bdb on a 18 GB
> Database.
>
> After a crash due to a full disk I made a new database on one node
ans
> started over.
>
> The empty node started to replicate, from the full one but after a
while
> (approx. 2GB) it crashed with a segfault:
>
> Aug 4 11:45:32 mhr-dd-lda-01 kernel: [52189.476209] slapd[10158]:
> segfault at 20 ip 00007ff97ebfabc0 sp 00007ff6e57e6b38 error 4 in
> libc-2.11.1.so[7ff97eb79000+155000]
>
> So i thought, maybe it is not e good Idea to put in a package for SP2
in a
> machine running SP1 so my first attempt to solve was an upgrade.
After the
> upgrade I got this:
>
> Aug 4 12:46:29 mhr-dd-lda-01 kernel: [ 1414.757587] slapd[3704]:
> segfault at 20 ip 00007fc82eee6182 sp 00007fc592e0acf0 error 4 in
> slapd[7fc82ee7a000+1e6000]
>
> So I created a brandnew openldap RPM 2.4.35 rpm to try out if the
problem
> is maybe related to the 2.4.33 version I am running. But fail:
>
> Aug 4 13:47:19 mhr-dd-lda-01 kernel: [ 5063.074410] slapd[8749]:
> segfault at 20 ip 00007fcbc1b537dc sp 00007fc92624fb88 error 4 in
> slapd[7fcbc1ac8000+1ea000]
>
> At the moment I deactivated the accesslogging on the node which seems
to
> work. I will know for sure in a few hours. ;-) I can try to reproduce
> that on a backup node next week. Whenn all the main nodes are up and
> running again. :)
I would suggest you build with debugging symbols, enable core files,
and
provide a backtrace of the problem. What you have provided does not
give
any useful information for debugging purposes. You also fail to state
the
backend you are using (back-bdb or back-hdb).
For information on how to provide a backtrace:
<
http://www.openldap.org/faq/data/cache/59.html>
Regards,
Quanah
--
Quanah Gibson-Mount
Lead Engineer
Zimbra, Inc
--------------------
Zimbra :: the leader in open source messaging and collaboration