mslby@deshaw.com wrote:
Full_Name: Mark Selby
Version: 2.4.25
OS: Solaris 10 x86
URL: ftp://ftp.openldap.org/incoming/
Submission from: (NULL) (149.77.104.214)
My company uses OpenLDAP 2.4.25 with Berkeley DB 4.8.30, compiled on Solaris 10 x86 using Sun Studio. OpenLDAP is used as the backend for generic naming services (passwd, group, netgroup, etc.) as well as holding mail routing and some custom data. We have master and slave servers and are using syncrepl in refreshAndPersist mode.
Lately we have been experiencing hangs with slapd and I cannot figure out what the cause is. Things will be humming along and then slapd will simply stop accepting connections and stop answering any read or write requests. We have set the loglevel to 256 and there is nothing in the logs that indicates what the issue is. At the time the process goes catatonic, all syslog output from slapd stops.
I have not found a way to reproduce this on demand.
Today we have had three hangs and the truss output is exactly the same on all of the processes. Once the slapd process gets into this state, a simple kill does not work. The syslog always says that slapd is waiting for tasks to complete, but this never happens. I need to kill -9 the pid.
I am going to include all of the debug info that I have collected, and hopefully someone will have some idea what is going on. I also have a gcore of the process if anyone wants any info from that.
Any and all help is greatly appreciated.
This appears to be a dup of ITS#6833. Your OS is broken. Closing this ITS.