Hello,
I'm using a simple setup on CentOS Linux release 7.2.1511
(Core)/openldap-servers-2.4.40-9 /cn=config/mdb : one provider with the
syncprov overlay and 2 syncrepl consumers. The DIT itself is about 10000
dn in size (about 3000 active users).
Everything works fine except that sometimes, some clients report
(temporary) failure to reach the consumers (NAS servers for instance).
All I see in the logs is that when this happens, the time windows
loosely match a moment where log rate limiting is dropping messages (for
debugging purpose, I disabled journald ratelimiting and doubled default
rsyslog one - still some drops occur on a regular basis).
So I assume it happens when slapd is kind of busy...
Here are some question about that :
- First tuning attempt :
-------------------------
I noticed that olcDbMaxReaders value was set to 0 (not by me!)
I changed it to olcDbMaxReaders: 512
I thought the problem didn't occur anymore but I was wrong.
-> Is there any guideline about how to setup MaxReaders ?
- Second tuning attempt :
-------------------------
I thought maybe replication was responsible (I'm using refreshAndPersist
mode) so I raised the size of the sessionLog (from 100 to 800).
I read the doc again and I'd like to know if the following understanding
is correct :
- thanks to contextCSN and the sync cookies, replication CAN be stateless
- if we want it stateless, syncprov HAS TO use the present phase, which
basically is like sending the whole DIT except that for unchanged
entries, only names are sent
- if and ONLY IF a state is used (in the form of a sessionLog), then
delete phase can be used (and if the sessionLog can hold enough since
the last sync)
As a matter of fact, at the opposite of the present phase, in the delete
phase, syncprov has to 'remember' (i.e store in a sessionLog) which
entries has been deleted.
-> this assumes that delete phase is more efficient than the present
phase, right ?
-> if for some reason (for instance sessionLog being too small, delete
phase can not be used) syncprov HAS TO fall back to present phase, correct ?
-> does using the sessionLog MAKE SENSE AT ALL when using
refreshAndPersist mode ?
-> is there any guideline to choose the right size for the sessionLog ?
- Third tuning attempt
-------------------------
I also raised the checkpoint values.
-> Is there any guideline here as well ?
Thanks.
--
Thomas HUMMEL