Sent: Wed May 23 2012 18:28:08 GMT-0400 (EDT)
From: Patrick Hemmer <openldap(a)stormcloud9.net>
Subject: slapd hangs upon performing modification of cn=config
I have a database which was working fine, but now whenever I go to
perform any sort of modification on the database, the modification
just hangs. No error, no timeout, just sits there. However while that
first modification is hung, I can perform modifications of another
database (other than cn=config) and they work just fine. I can even do
further searches against cn=config while the modification is still hung.
Also when I run slapd in the foreground, and then send it a SIGINT, it
says "slapd shutdown: waiting for X operations/tasks to finish" and
never ends. I end up having to SIGKILL it.
The database is part of a MMR group of servers I was building, but I
shut down all the other servers to troubleshoot the issue (it makes no
difference whether the other servers are up or down, and they all
exhibit the same problem).
Where should I start looking to figure out whats going on (the
specific operation/task thats hanging)? I can run slapd in debug mode
with '-1', but theres a ton of info, and I dont know whats relevant,
or which debug mode I should use other than '-1'.
Ok, so the problem went away. Didnt change a thing, just came back after
a few hours, started slapd up, and it behaved (though I am still
interested to know how to find stuck operations/tasks).
However a new (maybe related?) issue has popped up. I tried to add
olcSpReloadHint=TRUE to the syncprov overlay (all the replicas are >
2.3.11) and this change isnt replicating (other changes, including
attribute adds, to other DNs in cn=config replicate fine, just not this
running the consumer slapd with `-d sync` I get the following:
4fbd9472 syncrepl_message_to_entry: rid=510 DN:
4fbd9472 syncrepl_entry: rid=510 LDAP_RES_SEARCH_ENTRY(LDAP_SYNC_ADD)
4fbd9472 syncrepl_entry: rid=510 be_search (0)
4fbd9472 syncrepl_entry: rid=510
4fbd9472 syncprov_matchops: skipping original sid 033
4fbd9472 null_callback : error code 0x50
4fbd9472 syncrepl_entry: rid=510 be_modify
4fbd9472 syncrepl_entry: rid=510 be_modify failed (80)
4fbd9472 do_syncrepl: rid=510 rc 80 retrying