Hi,
I'm trying to figure out the configuration options available for
approximate/phonetic/accent-insensitive search on OpenLDAP (2.4).
First off, is there any documentation that I might have missed? I
only found mention of an approx index without further details...
nothing about what needs to be enabled etc.
In some older questions (~2002) an option "--enable-phonetic" was
mentioned... was this removed respectively is it obsolete (e.g. the
default)?
I did some tests using the approximate match operator ~= once on a
fresh LDAP with no approx index and once with the approx index set
and I got the same result... is this expected behaviour that
approximate matching also works without defining an index? If it's
not necessary, what's the advantage of setting the approx index,
improved performance? (Mind that the attributes I search do have
other indices set e.g. sub. Maybe approximate search implementation
also considers that index?)
Since data in our LDAP is mostly French, Italian and German I was
wondering if any language specific settings can be made? I came
across a related question from 2015 which then was answered with no,
is that still true?
> Is there any possibility to configure phontic search specially for Austria?
ITS is for bug reports/enhancements. Use the openldap-technical
mailinglist for help reqests.
Anyway: No, soundex/metaphone is hardcoded in slapd approx match.
See <openldap source>/servers/slapd/phonetic.c. Clients can do
a little of this "by hand" by inserting many variants of a name
(e.g. o instead of รถ) as invisible attribute values in the
directory, see the x-hidden examples in man slapd.conf.
http://www.openldap.org/lists/openldap-bugs/201506/msg00002.html
Could matching rules be used in any way to influence the search
behaviour with regard to localization e.g. using collation? I came
across "caseIgnoreOrderingMatch" (RFC 4517) but as far as I can see
it's only good for sorting...
Is there a way to search only ignoring the order or in other words
performing an accent-insensitive search?
Any other hints/best practices for configuring OpenLDAP to support
approximate/phonetic/accent-insensitive search in general and
specifically for languages other than English?
Thanks for your consideration.
Clauds