--On Thursday, December 29, 2016 7:55 PM -0800 "Paul B. Henson" henson@acm.org wrote:
On Thu, Dec 29, 2016 at 04:48:42PM -0800, Quanah Gibson-Mount wrote:
I'm able to reproduce the problem with my test script, so I think things are set from here.
Excellent; in that case please let me know when there is a fix available :). Thanks much...
Well, it seems to be some sort of race condition. I've only had it occur one time out of some 700+ attempts. I don't think it is really related to 4-way MMR at all, but it may be the larger the number of consumers of the accesslog, the more likely it is to occur (so you'll see it more often in 4-way MMR).
I did want to confirm that you see this on servers that are long running (I.e., they've been running for a long time, and had other group deletes that went through w/o issue during that time). If so, then I can modify the test to randomly add and delete groups as a part of the test, increasing the likelyhood of triggering the issue within the test.
Thanks, Quanah
--
Quanah Gibson-Mount Product Architect Symas Corporation Packaged, certified, and supported LDAP solutions powered by OpenLDAP: http://www.symas.com
On Fri, Dec 30, 2016 at 02:41:06PM -0800, Quanah Gibson-Mount wrote:
Well, it seems to be some sort of race condition.
Yes, I'd agree; probably also load dependent as I never triggered it on my dev systems which are mostly idle other than my test load. It only showed up on my prod systems which tend to have continuous load from various other things.
I did want to confirm that you see this on servers that are long running (I.e., they've been running for a long time, and had other group deletes that went through w/o issue during that time). If so, then I can modify the test to randomly add and delete groups as a part of the test, increasing the likelyhood of triggering the issue within the test.
I don't have too many deletions of group objects themselves in production, mostly just deletions of the members of groups. I didn't see any issues with group deletions in dev, or during some basic initial testing in prod. I'll go ahead and make a new test group, add some members to it, and then delete it and see what happens now that I've been running this code for about 3 weeks...
I didn't see any errors deleting a group, although there were these syncrepl messages that I don't believe used to show up:
Dec 30 21:23:29 themis slapd[2607]: syncrepl_message_to_op: rid=001 be_delete uid=ldaptest5,ou=group,dc=cpp,dc=edu (32) Dec 30 21:23:29 themis slapd[2607]: syncrepl_message_to_op: rid=003 be_delete uid=ldaptest5,ou=group,dc=cpp,dc=edu (32) Dec 30 21:23:29 themis slapd[2607]: syncrepl_message_to_op: rid=002 be_delete uid=ldaptest5,ou=group,dc=cpp,dc=edu (32)
The group and memberOf attributes are gone on all four servers, so other than noise in the logs I'm not sure what these messages meant.
openldap-technical@openldap.org