Hi Quanah,

Understood. Unfortunately, we are experiencing the issue. Here is my configuration. 

On Master:
database        mdb
suffix          "dc=georgetown,dc=edu"
.
dbnosync  FALSE
checkpoint 128 5
maxsize 108993459200

# Replication
overlay syncprov
syncprov-checkpoint 10 5
syncprov-sessionlog 100

On the Replica:
syncrepl rid=132
        provider=ldaps://aaa-master-1.uis.georgetown.edu:636/
        type=refreshAndPersist
        keepalive=300:5:5
        retry="5 5 300 +"

How do we go about fixing this?

Thanks,
Suresh


On Fri, Jan 17, 2025 at 2:30 PM Quanah Gibson-Mount <quanah@fast-mail.org> wrote:


--On Friday, January 17, 2025 10:51 AM -0700 Thomas Pritchard
<pritchardtw@gmail.com> wrote:

>
> I have not experienced the master crashing but I have also experienced
> stalled replication regularly over the last 2 years. We've been on 2.46
> and have keep alive enabled. We were hoping an upgrade would resolve the
> problem but it seems to exist on latest versions as well. I am also on
> aws ec2 instances. 


I've been running OpenLDAP in AWS on EC2 instances for the last 3 years at
two different companies, and have never encountered this behavior.

--Quanah



--
Suresh Veliveli
Sr. UNIX Systems Engineer
Georgetown University
University Information Services | Security Infrastructure and Policy-Identity and Collaboration
202-262-6676 (cell) | 202-687-3108 (work)