I just remove the syncprov-checkpoint and also remove the interval=00:00:01:00

I have restarted the LDAP servers and try again, result is still the same.

Thanks

Oliver Liebel wrote:
i just took a look at your first mail in the thread, that i had not read
yet.
just two points:

  
syncprov-checkpoint 100 10
  
    
remove checkpoint
  
syncprov-sessionlog 100

# syncrepl directives
syncrepl      rid=002
             provider=ldap://10.166.23.218:389/
             bindmethod=simple
             binddn="cn=Manager"
             credentials=secret
             searchbase="o=HKSARG"
             schemachecking=off
             type=refreshAndPersist
             attrs="*,+"
             retry="1 +"
             interval=00:00:01:00
    
dont use refreshAndPersist AND interval=  together as the
second one is for refreshOnly Mode.




Paul Lee schrieb:
  
Hi Oliver,

I have checked by using the command ntpq -p <host1 host2>, and the
result is below :

[root@webserver openldap]# ntpq -p 10.0.1.34 202.245.193.128
ser      remote           refid      st t when poll reach   delay  
offset  jitter
==================================================================================
10.0.1.34 *202.245.193.128 LOCAL(0)        11 u  131 1024  377   
0.410    1.423   3.083
ser      remote           refid      st t when poll reach   delay  
offset  jitter
==================================================================================
202.245.193.128 *LOCAL(0)        LOCAL(0)        10 l   43   64 
377    0.000    0.000   0.004
[root@webserver openldap]# ntpq -p 10.0.1.34 202.245.193.128
ser      remote           refid      st t when poll reach   delay  
offset  jitter
==================================================================================
10.0.1.34 *202.245.193.128 LOCAL(0)        11 u  133 1024  377   
0.410    1.423   3.083
ser      remote           refid      st t when poll reach   delay  
offset  jitter
==================================================================================
202.245.193.128 *LOCAL(0)        LOCAL(0)        10 l   45   64 
377    0.000    0.000   0.004

The offset value seems small, but when I add records to 10.0.1.34 LDAP
server, the other LDAP server 202.245.193.128 can't see that records,
but found "CSN too old" in the ldap log.

Any suggestion to solve this problem ?  Is there any way to increase
the tolerance, or any other alternative way ?

Thanks


Oliver Liebel wrote:
    
Paul Lee schrieb:
      
Hi Oliver,

I have updated to version 2.4.11 and the problem still exists.  Do I
need to remove all data in other servers (leaving 1 as master) and
then let them synchronized from the master server ?
        
thats what i would try, but before you resync, check/sync the time
exactly on all nodes (see below).
      
I have checked the ntp is running and ntpq -p and check the jlitter
value is less than 10.
        
checked the offset on all correspondings nodes ? ntpq -p <host1 host2
...>
once again, timesync is very sensitive to mmr (as i said - very small
timeslices,
2.4 csns have additionally 6 digits for microseconds) .
there can be an offset to the (external) ntp, but the offset should
be the same an all nodes.
if youre using  virtual machines, they can cause lots of problems in
syncing time.

      
Thanks


Oliver Liebel wrote:

        
take a look at the changelogs. many reasons to upgrade,
especially syncrepl/syncrov and contextcsn -related bugfixes.

          
Is there any larger tolerance can be configured ?

Besides upgrade to 2.4.11, any other alternative method ?  It's
because we have gone through many testing in 2.4.9, if change to
2.4.11, need to re-test all again.

Thanks

            
absolutely sure?
timeslices needed for mmr in 2.4 are very small
so all servers have to be in exact sync.
maybe you should check/recalibrate and refill the dit again.


Paul Lee schrieb:

          
yes, all servers' clock are in sync...

Thanks


Oliver Liebel wrote:

            
time-sync ok?
and you should upgrade to 2.4.11.


Paul Lee schrieb:

              
Dear all,

I am using openldap version 2.4.9, I am using 4 ldap servers
with 4-way master configured.

Data can be synronized initially, but when the records is
getting more and more, now is around 100k records, I always find
data is not synronized between the four ldap servers.

I find that "CSN is too old, ignoring" in the LDAP log file.

Sep 24 04:04:03 disb01 slapd[28779]: do_syncrep2:
cookie=rid=003,sid=001,csn=20080923200403.713637Z#000000#003#000000
Sep 24 04:04:03 disb01 slapd[28779]: do_syncrep2: rid=003 CSN
too old, ignoring 20080923200403.713637Z#000000#003#000000
Sep 24 04:04:03 disb01 slapd[28779]: do_syncrep2:
cookie=rid=002,sid=001,csn=20080923200403.713637Z#000000#003#000000
Sep 24 04:04:03 disb01 slapd[28779]: do_syncrep2: rid=002 CSN
too old, ignoring 20080923200403.713637Z#000000#003#000000
Sep 24 04:04:03 disb01 slapd[28779]: do_syncrep2:
cookie=rid=002,sid=001,csn=20080923200403.739461Z#000000#002#000000
Sep 24 04:04:03 disb01 slapd[28779]: syncrepl_entry: rid=002
LDAP_RES_SEARCH_ENTRY(LDAP_SYNC_MODIFY)
Sep 24 04:04:03 disb01 slapd[28779]: syncrepl_entry: rid=002
be_search (0)
Sep 24 04:04:03 disb01 slapd[28779]: syncrepl_entry: rid=002
cn=pwdfail,ou=SCIG,ou=Govt-Dept,o=HKSARG

Any idea of this kind of error and how to fixed it ?

Thanks

Confidential Communication - This e-mail (including any
attachments) is confidential and may be legally privileged. If
this e-mail has been sent to you by mistake please inform us by
reply e-mail and then delete the e-mail, destroy any printed
copy and do not disclose or use the information in it.

                
____________
Virus checked by G DATA AntiVirusKit
Version: AVK 19.689 from 24.09.2008
Virus news: www.antiviruslab.com


              
Confidential Communication - This e-mail (including any
attachments) is confidential and may be legally privileged. If
this e-mail has been sent to you by mistake please inform us by
reply e-mail and then delete the e-mail, destroy any printed copy
and do not disclose or use the information in it.


            
____________
Virus checked by G DATA AntiVirusKit
Version: AVK 19.690 from 24.09.2008
Virus news: www.antiviruslab.com


          
Confidential Communication - This e-mail (including any attachments)
is confidential and may be legally privileged. If this e-mail has
been sent to you by mistake please inform us by reply e-mail and
then delete the e-mail, destroy any printed copy and do not disclose
or use the information in it.


        
____________
Virus checked by G DATA AntiVirusKit
Version: AVK 19.700 from 25.09.2008
Virus news: www.antiviruslab.com


      
Confidential Communication - This e-mail (including any attachments) is confidential and may be 
legally privileged. If this e-mail has been sent to you by mistake please inform us by reply 
e-mail and then delete the e-mail, destroy any printed copy and do not disclose or use the 
information in it.

    

  
Confidential Communication - This e-mail (including any attachments) is confidential and may be 
legally privileged. If this e-mail has been sent to you by mistake please inform us by reply 
e-mail and then delete the e-mail, destroy any printed copy and do not disclose or use the 
information in it.