I just upgraded to 2.4.8, and I experience a stability problem with synrepl and schema replication.
My replicas get cn=schema,cn=config through syncrepl. If the master is restarted, the replicas tend to crash (not always, but very frequently). The problem disapear if I don't replicate cn=schema,cn=config.
Below is a log of what happens at master restaty time, and the backtrace I collected in GDB. Any suggestion of what should I try next?
Apr 17 22:16:01 khazad-dum slapd[6292]: conn=55 fd=13 closed (connection lost) Apr 17 22:16:02 khazad-dum slapd[6292]: do_syncrep2: rid=222 (-1) Can't contact LDAP server Apr 17 22:16:02 khazad-dum slapd[6292]: do_syncrep2: rid=022 (-1) Can't contact LDAP server Apr 17 22:16:02 khazad-dum slapd[6292]: do_syncrepl: rid=022 retrying Apr 17 22:16:02 khazad-dum slapd[6292]: do_syncrepl: rid=222 retrying
Program received signal SIGSEGV, Segmentation fault. [Switching to LWP 1] 0x08098b62 in at_next () (gdb) bt #0 0x08098b62 in at_next () #1 0x080506fe in config_generic () #2 0x0805b4e7 in config_set_vals () #3 0x0805bc9f in config_parse_add () #4 0x080579d8 in config_modify_add () #5 0x0805820e in config_modify_internal () #6 0x080585d0 in config_back_modify () #7 0x080af683 in syncrepl_entry () #8 0x080abd18 in do_syncrep2 () #9 0x080acf32 in do_syncrepl () #10 0x08063c25 in connection_read_thread () #11 0xbdbbbd61 in ldap_int_thread_pool_wrapper () from /usr/pkg/lib/libldap_r-2.4.so.2 #12 0xbd95a17d in pthread_create () from /usr/lib/libpthread.so.0
Emmanuel Dreyfus wrote:
I just upgraded to 2.4.8, and I experience a stability problem with synrepl and schema replication.
A large number of syncrepl bugs have been fixed since 2.4.8. It would be helpful for you to shift to RE24 at this point. Also, if the problem still occurs, get a trace with debug info present. The trace you attached below isn't very useful.
My replicas get cn=schema,cn=config through syncrepl. If the master is restarted, the replicas tend to crash (not always, but very frequently). The problem disapear if I don't replicate cn=schema,cn=config.
Below is a log of what happens at master restaty time, and the backtrace I collected in GDB. Any suggestion of what should I try next?
Apr 17 22:16:01 khazad-dum slapd[6292]: conn=55 fd=13 closed (connection lost) Apr 17 22:16:02 khazad-dum slapd[6292]: do_syncrep2: rid=222 (-1) Can't contact LDAP server Apr 17 22:16:02 khazad-dum slapd[6292]: do_syncrep2: rid=022 (-1) Can't contact LDAP server Apr 17 22:16:02 khazad-dum slapd[6292]: do_syncrepl: rid=022 retrying Apr 17 22:16:02 khazad-dum slapd[6292]: do_syncrepl: rid=222 retrying
Program received signal SIGSEGV, Segmentation fault. [Switching to LWP 1] 0x08098b62 in at_next () (gdb) bt #0 0x08098b62 in at_next () #1 0x080506fe in config_generic () #2 0x0805b4e7 in config_set_vals () #3 0x0805bc9f in config_parse_add () #4 0x080579d8 in config_modify_add () #5 0x0805820e in config_modify_internal () #6 0x080585d0 in config_back_modify () #7 0x080af683 in syncrepl_entry () #8 0x080abd18 in do_syncrep2 () #9 0x080acf32 in do_syncrepl () #10 0x08063c25 in connection_read_thread () #11 0xbdbbbd61 in ldap_int_thread_pool_wrapper () from /usr/pkg/lib/libldap_r-2.4.so.2 #12 0xbd95a17d in pthread_create () from /usr/lib/libpthread.so.0