Hi all,
After compiling 2.3.34, make test failed at test007, which tests replication via slurpd:
>>>>> Starting test007-replication ...
running defines.sh
Starting master slapd on TCP/IP port 9011...
Starting slave slapd on TCP/IP port 9012...
Using ldapsearch to check that master slapd is running...
Using ldapsearch to check that slave slapd is running...
Starting slurpd...
Using ldapadd to populate the master directory...
Waiting 15 seconds for slurpd to send changes...
Using ldapmodify to modify master directory...
Waiting 15 seconds for slurpd to send changes...
Stopping the slave...
Waiting 5 seconds for slave slapd to die...
Applying more changes to the master slapd...
Stopping slurpd...
Waiting 5 seconds for slurpd to die...
Applying more changes to the master slapd...
Restarting slave slapd on TCP/IP port 9012...
Using ldapsearch to check that slave slapd is running...
Waiting 5 seconds for slapd to start...
Waiting 5 seconds for slapd to start...
Waiting 5 seconds for slapd to start...
Waiting 5 seconds for slapd to start...
Waiting 5 seconds for slapd to start...
Waiting 5 seconds for slapd to start...
Restarting slurpd...
Waiting 15 seconds for slurpd to send changes...
Try updating the slave slapd...
ldapmodify failed (1)!
kill: 2730635: no such process
>>>>> ./scripts/test007-replication failed (exit 1)
*** Error code 1 (bu21)
*** Error code 1 (bu21)

From the slurpd log file:
ldap_connect_to_host: Trying 127.0.0.1:9012
ldap_connect_timeout: fd: 6 tm: -1 async: 0
ldap_close_socket: 6
ldap_err2string
Error: ldap_simple_bind_s for localhost:9012 failed: Can't contact LDAP server
ldap_unbind
Retrying operation for DN cn=James A Jones 1,ou=Alumni Association,ou=People,dc=example,dc=com on replica localhost:9012
end replication thread for localhost:9012
fm: exiting
slurpd: terminated.

and from the slapd.2 log:
backend_startup_one: starting "dc=example,dc=com"
bdb_db_open: dc=example,dc=com
bdb_db_open: database already in use
backend_startup_one: bi_db_open failed! (-1)
slapd shutdown: initiated
====> bdb_cache_release_all
slapd destroy: freeing system resources.
slapd stopped.
connections_destroy: nothing to destroy.

It looked to me as if the old slave slapd was still running, so I checked the process table. Not only was it still there, but the slapd processes from all the previous tests were still running as well.

If I'm not mistaken, the test scripts try to shut down slapd with a HUP signal. However, from my reading of the Administrator's Guide, slapd should be stopped with INT. Is that right, and should I change HUP to INT, or is something else wrong?

Best regards,
Gerard Ranke
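
P.S. In case it helps anyone reproduce this: besides reading the process table, a rough way to see whether an old slapd is still listening is to try a TCP connection to the test ports. The sketch below is only an illustration, not part of the test suite. The helper name port_in_use is made up here, and the ports 9011 and 9012 are simply the ones shown in the test output above. A slapd that has stopped listening but not yet exited would not be caught this way, so the process table remains the more reliable check.

```python
import socket


def port_in_use(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if something accepts TCP connections on host:port.

    Hypothetical helper for illustration; it only detects a listener,
    not a process that is still alive but no longer listening.
    """
    try:
        # create_connection raises OSError (e.g. connection refused,
        # timeout) when nothing accepts the connection.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # Ports taken from the test007 output: master on 9011, slave on 9012.
    for port in (9011, 9012):
        state = "still in use" if port_in_use("127.0.0.1", port) else "free"
        print(f"port {port}: {state}")
```

If one of the ports reports "still in use" after a test has finished, a slapd from that test is probably still around.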