Syncrepl error causes consumers to freeze - openldap-technical

24 Nov 2011


      Hi,
I am using 2.4.26 on syncrepl master (provider) (package on CentOS 5.7 
x86_64) and 2.4.22, 2.4.26 on two consumers respectively.
Last night, I edited a user account (hosted in LDAP) and when this tried 
to replicate to two consumers, both froze. This did not happen on 
another consumer (also 2.4.26) which was using replication over the 
Manager account. The two ones which froze are using a limited-privileged 
BindDN for replication which does not have access to user accounts (so, 
the user account should/would not be replicated on those two consumers).
On the master:
Nov 23 23:12:04 ldap slapd[2295]: syncprov_sendresp: 
cookie=rid=333,csn=20111123211204.601542Z#000000#000#000000
Nov 23 23:12:04 ldap slapd[2295]: syncprov_sendresp: 
cookie=rid=222,csn=20111123211204.601542Z#000000#000#000000
On slave 222:
Nov 23 23:12:04 vdns slapd2.4[2145]: do_syncrep2: rid=222 
cookie=rid=222,csn=20111123211204.601542Z#000000#000#000000
Nov 23 23:12:04 vdns slapd2.4[2145]: syncrepl_entry: rid=222 
LDAP_RES_SEARCH_ENTRY(LDAP_SYNC_MODIFY)
Nov 23 23:12:04 vdns slapd2.4[2145]: syncrepl_entry: rid=222 be_search (0)
Nov 23 23:12:04 vdns slapd2.4[2145]: syncrepl_entry: rid=222 
uid=userx,ou=people,dc=example,dc=com
Nov 23 23:12:04 vdns slapd2.4[2145]: slap_queue_csn: queing 
0x2aaab0019970 20111123211204.601542Z#000000#000#000000
and
/var/log/messages:
Nov 23 23:12:04 vdns kernel: slapd2.4[2164]: segfault at 
00000001075c61a8 rip 0000000000480ecb rsp 00000000424e04c0 error 4
On slave 333:
Nov 23 23:12:04 dns2 slapd[2364]: do_syncrep2: rid=333 
cookie=rid=333,csn=20111123211204.601542Z#000000#000#000000
Nov 23 23:12:04 dns2 slapd[2364]: syncrepl_entry: rid=333 
LDAP_RES_SEARCH_ENTRY(LDAP_SYNC_MODIFY)
Nov 23 23:12:04 dns2 slapd[2364]: syncrepl_entry: rid=333 be_search (0)
Nov 23 23:12:04 dns2 slapd[2364]: syncrepl_entry: rid=333 
uid=userx,ou=people,dc=example,dc=com
Nov 23 23:12:04 dns2 slapd[2364]: slap_queue_csn: queing 0x19e8cfa0 
20111123211204.601542Z#000000#000#000000
and
/var/log/messages:
Nov 23 23:12:04 dns2 kernel: slapd[2736] general protection rip:4b5342 
rsp:43c54530 error:0
I have not seen this behavior in months and months of use.
Any advice?
Thanks,
Nick