Make unaccent handle all diacritics known to Unicode, and expand ligatures correctly

ID 301
Title Make unaccent handle all diacritics known to Unicode, and expand ligatures correctly
Topic Miscellaneous
Created 2015-07-13 19:53:29
Last modified 2016-03-15 21:45:42
Latest email 2016-03-16 13:52:31
Status 2015-09: Committed
Target version
Authors Thomas Munro (macdice)
Reviewers
Committer Fedor Sigaev (sigaev)
Links CFbot results (CirrusCI) CFbot GitHub
Checkout latest CFbot patchset
Go to your local checkout of the PostgreSQL repository and run:
git remote add commitfest https://github.com/postgresql-cfbot/postgresql.git
git fetch commitfest cf/301
git checkout commitfest/cf/301
Emails
BUG #13440: unaccent does not remove all diacritics
First at 2015-06-14 06:37:07 by mike at busbud.com
Latest at 2016-03-16 13:52:31 by Teodor Sigaev <teodor at sigaev.ru>
Latest attachment (improve-unaccent-default-rules-generation-script-v5.patch) at 2016-03-15 17:44:57 from Léonard Benedetti <benedetti at mlpo.fr>
    Attachment (improve-unaccent-default-rules-generation-script-v5.patch) at 2016-03-15 17:44:57 from Léonard Benedetti <benedetti at mlpo.fr> (Patch: Yes)
    Attachment (improve-unaccent-default-rules-generation-script-v4.patch) at 2016-03-15 17:01:39 from Teodor Sigaev <teodor at sigaev.ru> (Patch: Yes)
    Attachment (improve-unaccent-default-rules-generation-script-v3.patch) at 2016-03-10 14:44:17 from Léonard Benedetti <benedetti at mlpo.fr> (Patch: Yes)
    Attachment (contrib_unaccent_generate_unaccent_rules.py) at 2016-03-10 14:35:00 from Léonard Benedetti <benedetti at mlpo.fr> (Patch: No)
    Attachment (improve-unaccent-default-rules-generation-script-v2.patch) at 2016-02-11 21:13:47 from Léonard Benedetti <benedetti at mlpo.fr> (Patch: Yes)
    Attachment (contrib_unaccent_generate_unaccent_rules.py) at 2016-02-11 21:05:41 from Léonard Benedetti <benedetti at mlpo.fr> (Patch: No)
    Attachment (improve-unaccent-default-rules-generation-script.patch) at 2016-01-24 03:47:40 from Léonard Benedetti <benedetti at mlpo.fr> (Patch: Yes)
    Attachment (unaccent.rules) at 2016-01-24 03:18:07 from Léonard Benedetti <benedetti at mlpo.fr> (Patch: No)
    Attachment (unaccent-rules-with-all-diacritics-and-ligatures.patch) at 2015-09-03 03:16:38 from Thomas Munro <thomas.munro at enterprisedb.com> (Patch: Yes)
    Attachment (unaccent.rules) at 2015-06-23 01:00:43 from Thomas Munro <thomas.munro at enterprisedb.com> (Patch: No)
    Attachment (unaccent.rules-with-ligatures) at 2015-06-19 05:28:40 from Thomas Munro <thomas.munro at enterprisedb.com> (Patch: No)
    Attachment (make_rules_v3.py) at 2015-06-19 02:00:21 from Thomas Munro <thomas.munro at enterprisedb.com> (Patch: No)
    Attachment (make_rules_decompose.py) at 2015-06-17 01:25:53 from Thomas Munro <thomas.munro at enterprisedb.com> (Patch: No)
    Attachment (make_rules.py) at 2015-06-16 03:30:53 from Thomas Munro <thomas.munro at enterprisedb.com> (Patch: No)

Annotations

When Who Mail Annotation
2016-01-24 03:49:34 Léonard Benedetti (mlpo) From Léonard Benedetti <benedetti@mlpo.fr> at 2016-01-24 03:47:40 Patch attached
2016-02-11 21:16:45 Léonard Benedetti (mlpo) From Léonard Benedetti <benedetti@mlpo.fr> at 2016-02-11 21:13:47 Patch of v2 attached
2016-03-10 14:47:58 Léonard Benedetti (mlpo) From Léonard Benedetti <benedetti@mlpo.fr> at 2016-03-10 14:44:17 Patch of v3 attached (Python 2 -> Python 3)
2016-03-15 21:45:42 Léonard Benedetti (mlpo) From Léonard Benedetti <benedetti@mlpo.fr> at 2016-03-15 17:44:57 Patch of v5 attached (Python 2, only ASCII characters in source code)
History
When Who What
2016-03-15 21:45:42 Léonard Benedetti (mlpo) Added annotation "Patch of v5 attached (Python 2, only ASCII characters in source code)" to 56E84A19.4010901@mlpo.fr
2016-03-10 14:47:58 Léonard Benedetti (mlpo) Added annotation "Patch of v3 attached (Python 2 -> Python 3)" to 56E18841.2000406@mlpo.fr
2016-02-11 21:16:45 Léonard Benedetti (mlpo) Added annotation "Patch of v2 attached" to 56BCF98B.3040302@mlpo.fr
2016-01-24 03:49:34 Léonard Benedetti (mlpo) Added annotation "Patch attached" to 56A4495C.8020705@mlpo.fr
2015-09-04 09:53:00 Fedor Sigaev (sigaev) Closed in commitfest 2015-09 with status: Committed
2015-09-04 09:50:48 Fedor Sigaev (sigaev) Added sigaev as committer
2015-09-02 22:49:47 Andres Freund (andresfreund) New status: Ready for Committer
2015-07-13 22:11:47 Thomas Munro (macdice) Changed name to Make unaccent handle all diacritics known to Unicode, and expand ligatures correctly
2015-07-13 22:10:50 Thomas Munro (macdice) Attached mail thread 20150614063707.2734.30270@wrigleys.postgresql.org
2015-07-13 19:59:40 Thomas Munro (macdice) Changed authors to Thomas Munro (macdice)
2015-07-13 19:53:29 Thomas Munro (macdice) Created patch record