path: root/admin/unidata/blocks.awk
Commit message (Author, Age, Files, Lines)
* ; Add 2026 to copyright years. (Sean Whitton, 2026-01-01, 1 file, -1/+1)
* Update to Unicode 17.0 (Eli Zaretskii, 2025-09-11, 1 file, -0/+1)
    * test/manual/BidiCharacterTest.txt:
    * admin/unidata/BidiBrackets.txt:
    * admin/unidata/BidiMirroring.txt:
    * admin/unidata/Blocks.txt:
    * admin/unidata/IVD_Sequences.txt:
    * admin/unidata/IdnaMappingTable.txt:
    * admin/unidata/NormalizationTest.txt:
    * admin/unidata/PropertyValueAliases.txt:
    * admin/unidata/ScriptExtensions.txt:
    * admin/unidata/Scripts.txt:
    * admin/unidata/SpecialCasing.txt:
    * admin/unidata/UnicodeData.txt:
    * admin/unidata/confusables.txt:
    * admin/unidata/emoji-data.txt:
    * admin/unidata/emoji-sequences.txt:
    * admin/unidata/emoji-test.txt:
    * admin/unidata/emoji-variation-sequences.txt:
    * admin/unidata/emoji-zwj-sequences.txt: Import from Unicode 17.0.
    * etc/NEWS:
    * test/lisp/international/ucs-normalize-tests.el (ucs-normalize-tests--failing-lines-part1) (ucs-normalize-tests--failing-lines-part2):
    * lisp/international/mule-cmds.el (ucs-names):
    * lisp/international/fontset.el (script-representative-chars) (otf-script-alist):
    * lisp/international/characters.el:
    * admin/unidata/blocks.awk: Update for Unicode 17.0.
* Update copyright year to 2025 (Paul Eggert, 2025-01-01, 1 file, -1/+1)
    Run "TZ=UTC0 admin/update-copyright".
* Improve accuracy of character categories (Eli Zaretskii, 2024-09-13, 1 file, -2/+6)
    * lisp/international/characters.el: Assign 'digit' category to all the characters whose Unicode 'general-category' is Nd.
    * admin/unidata/blocks.awk: Add code to assign 'symbol' category to all characters belonging to the 'symbol' script.
    * etc/NEWS: Announce the above changes
* Update Emacs sources for Unicode 16.0 (Eli Zaretskii, 2024-09-12, 1 file, -0/+1)
    * lisp/international/ucs-normalize.el (check-range): Update ranges of character codes with decompositions.
    * lisp/international/mule-cmds.el (ucs-names): Update unused ranges.
    * lisp/international/fontset.el (script-representative-chars) (otf-script-alist, setup-default-fontset): Add new scripts.
    * lisp/international/characters.el:
    * admin/unidata/blocks.awk:
    * test/manual/BidiCharacterTest.txt:
    * test/lisp/international/ucs-normalize-tests.el (ucs-normalize-tests--failing-lines-part2): Update for Unicode 16.0.
    * etc/NEWS: Announce support for Unicode 16.0.
* Fix script for characters in 3300..3357 range (Eli Zaretskii, 2024-08-17, 1 file, -0/+13)
    * admin/unidata/blocks.awk: The characters in the 3300..3357 range are Katakana according to Unicode's Scripts.txt.
* ; Add 2024 to copyright years (Po Lu, 2024-01-02, 1 file, -1/+1)
* Fix script for some characters (Eli Zaretskii, 2023-12-20, 1 file, -1/+2)
    * lisp/international/characters.el (char-script-table): Fix script for 2 characters.
    * admin/unidata/blocks.awk: Fix script for Yijing Hexagram Symbols. (Bug#67924)
* ; Add 2023 to copyright years. (Eli Zaretskii, 2023-01-01, 1 file, -1/+1)
* ; Prefer HTTPS to HTTP in many URLs (Stefan Kangas, 2022-10-15, 1 file, -1/+1)
* Mark only VS-16 as emoji (Robert Pluim, 2022-08-19, 1 file, -3/+8)
    * admin/unidata/blocks.awk: Only mark U+FE0F as belonging to the 'emoji' script. (Bug#57072)
* Fix minor inconsistencies in 'char-script-table' (Eli Zaretskii, 2022-01-17, 1 file, -2/+4)
    * admin/unidata/blocks.awk (name2alias): Map some obscure blocks to their native scripts, to follow Scripts.txt.
    * lisp/international/characters.el (char-script-table): Add few exceptions.
    * lisp/international/fontset.el (script-representative-chars): Remove scripts no longer used.
* ; Add 2022 to copyright years. (Eli Zaretskii, 2022-01-01, 1 file, -1/+1)
* Handle VS-16 correctly for non-emoji codepoints (Robert Pluim, 2021-10-19, 1 file, -24/+2)
    * admin/unidata/blocks.awk: Remove emoji overrides for codepoints with Emoji_Presentation = No, they're no longer necessary.
    * lisp/composite.el: Remove #xFE0F (VS-16) from the range handled by `compose-gstring-for-variation-glyph' so it can be handled by `font_range'.
    * src/composite.c (syms_of_composite): New variable `auto-composition-emoji-eligible-codepoints'.
    * admin/unidata/emoji-zwj.awk: Generate value for `auto-composition-emoji-eligible-codepoints'. Add `composition-function-table' entries for 'codepoint + U+FE0F' for them.
    * src/font.c (codepoint_is_emoji_eligible): New function to check if we should try to use the emoji font for a codepoint.
    (font_range): Use it.
* Enhance font_range to check for emoji composition triggers (Robert Pluim, 2021-09-27, 1 file, -0/+6)
    If the codepoint that triggered composition is from the emoji script, use the emoji font to check the string being composed, rather than the font of the first character of the string. This makes e.g. "emoji codepoint with Emoji_Presentation = No followed by VS-16 (FE0F)" display the emoji version of the glyph for that codepoint.

    * admin/unidata/blocks.awk: Add VS-1 through VS-16 to the emoji script.
    * src/composite.c (autocmp_chars): Accept additional argument CH for the codepoint that triggered composition, pass it to font_range.
    (composition_reseat_it, find_automatic_composition): Pass codepoint that triggered composition to autocmp_chars.
    * src/font.c (font_range): Accept additional argument CH for the triggering codepoint. If the codepoint is from the 'emoji' script, use Vscript_representative_chars to find the font to use for the composition attempt.
    (syms_of_font): Add Qemoji symbol.
    * src/font.h: Update font_range prototype for argument CH.
    * etc/NEWS: Announce change.
* Add a few more missing emoji overrides (Robert Pluim, 2021-09-24, 1 file, -1/+4)
    * admin/unidata/blocks.awk: Add some more emoji overrides.
* Update provenance comment in charscript.el (Robert Pluim, 2021-09-24, 1 file, -1/+1)
    * admin/unidata/blocks.awk: Update comment about sources used to generate charscript.el.
* Remove the emoji script overrides for U+2xxx codepoints (Robert Pluim, 2021-09-21, 1 file, -12/+0)
    * admin/unidata/blocks.awk: Remove the overrides for U+261D, U+26F9, U+270C..U+270D, and U+2764. They don't belong in the emoji script.
* Support for Unicode emoji sequences (Robert Pluim, 2021-09-20, 1 file, -0/+40)
    This covers both sequences using Zero-Width-Joiner codepoints and those without. Bug#39799, I hope.

    * .gitignore: Add emoji-zwj.el
    * admin/notes/unicode: Add emoji-zwj-sequences.txt and emoji-sequences.txt references. Describe how to test after updating to a newer Unicode version.
    * admin/unidata/Makefile.in (all): add emoji-zwj.el as a dependency.
    (emoji-zwj.el): Add target plus rules for building.
    (gen-clean): Add emoji-zwj.el.
    * admin/unidata/README: Add emoji-zwj-sequences.txt and emoji-sequences.txt references.
    * admin/unidata/blocks.awk: Force emoji script to be used for certain codepoints that are used by the Unicode sequences.
    * admin/unidata/emoji-sequences.txt: New file.
    * admin/unidata/emoji-zwj-sequences.txt: New file.
    * admin/unidata/emoji-zwj.awk: New file. Derives composition-function-table rules from emoji-zwj-sequences.txt, plus hardcodes some derived manually from emoji-sequences.txt.
    * etc/NEWS: Announce change.
    * lisp/international/characters.el: Load the generated emoji-zwj.el
    * src/Makefile.in (emoji-zwj): New target.
    (temacs): Add emoji-zwj as a dependency.
* Base emoji script membership on Emoji_Presentation (Robert Pluim, 2021-09-20, 1 file, -6/+1)
    The Emoji property describes which codepoints can be displayed as emoji, but Emoji_Presentation governs which are displayed as emoji by default.

    * admin/notes/unicode: Adjust check-emoji-coverage to look in the Emoji_Presentation sections of emoji-data.txt
    * admin/unidata/blocks.awk: Assign emoji script using the Emoji_Presentation section.
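
    To illustrate the distinction drawn in this commit message, here is a minimal Python sketch that keeps only the ranges tagged Emoji_Presentation from text in the `codepoints ; property # comment` layout of emoji-data.txt. It is not the logic in blocks.awk: the sample excerpt, the regular expression, and the function name `emoji_presentation_ranges` are assumptions made for illustration.

```python
import re

# Illustrative excerpt in the "codepoints ; property # comment" layout
# of emoji-data.txt (hand-written sample, not the real file).
SAMPLE = """\
# emoji-data.txt (excerpt, illustrative)
0023          ; Emoji                # number sign
231A..231B    ; Emoji_Presentation   # watch..hourglass done
23F0          ; Emoji_Presentation   # alarm clock
2764          ; Emoji                # red heart
"""

# A single codepoint or a START..END range, then the property name.
LINE_RE = re.compile(r"^([0-9A-F]+)(?:\.\.([0-9A-F]+))?\s*;\s*(\w+)")


def emoji_presentation_ranges(text):
    """Return (start, end) pairs for lines whose property is Emoji_Presentation."""
    ranges = []
    for line in text.splitlines():
        m = LINE_RE.match(line)
        if m and m.group(3) == "Emoji_Presentation":
            start = int(m.group(1), 16)
            end = int(m.group(2) or m.group(1), 16)  # single codepoint: end == start
            ranges.append((start, end))
    return ranges
```

    In this sample, U+0023 and U+2764 carry only the Emoji property, so they are skipped; only the Emoji_Presentation ranges would be assigned to the 'emoji' script.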
* Fix blocks.awk wrt to Emoji characters (Eli Zaretskii, 2021-09-19, 1 file, -1/+1)
    * admin/unidata/blocks.awk: Fix emoji-data.txt processing for older Awks.
* Fix emoji-induced build breakage (Robert Pluim, 2021-09-17, 1 file, -3/+3)
    * admin/unidata/blocks.awk: Cater for out-of-tree builds, match the name of the file using regexp rather than exact match.
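
    The reason an exact match fails out of tree is that the input file is then passed with a directory prefix. A minimal Python sketch of the idea follows; the actual awk pattern is not shown in this log, so the regular expression and the function name `is_emoji_data` are hypothetical.

```python
import re


def is_emoji_data(path):
    """True if PATH names emoji-data.txt, with or without a directory prefix."""
    # Anchor at the end only, so "../src/admin/unidata/emoji-data.txt" still matches.
    return re.search(r"emoji-data\.txt$", path) is not None


# An exact comparison only works when the file is named without a directory:
in_tree = "emoji-data.txt"
out_of_tree = "../emacs/admin/unidata/emoji-data.txt"  # hypothetical build layout
exact_in_tree = in_tree == "emoji-data.txt"
exact_out_of_tree = out_of_tree == "emoji-data.txt"
```

    With these inputs, the exact comparison succeeds only for the in-tree name, while the suffix match accepts both.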
* Split Unicode emoji into their own script (Robert Pluim, 2021-09-17, 1 file, -2/+25)
    * admin/notes/unicode: Describe how to update emoji for new Unicode release.
    * admin/unidata/Makefile.in: Pass emoji-data.txt to blocks.awk script.
    * admin/unidata/README: Add pointer to emoji-data.txt file.
    * admin/unidata/blocks.awk: Parse emoji-data.txt, add emoji codepoints to the 'emoji' script (except for the ASCII ones).
    * admin/unidata/emoji-data.txt: New file.
    * etc/NEWS: Describe new 'emoji' script.
    * etc/TODO: Update item about 'emoji' script.
    * lisp/international/fontset.el (script-representative-chars): Add 'emoji' script.
    (setup-default-fontset): Add 'emoji' script. Use "Noto Color Emoji" as default font for it.
* Use lexical-binding in the remaining preloaded files (Stefan Monnier, 2021-01-04, 1 file, -1/+1)
    * lisp/widget.el:
    * lisp/w32-fns.el:
    * lisp/textmodes/fill.el:
    * lisp/term/common-win.el:
    * lisp/scroll-bar.el:
    * lisp/rfn-eshadow.el:
    * lisp/menu-bar.el:
    * lisp/language/tibetan.el:
    * lisp/language/thai.el:
    * lisp/language/misc-lang.el:
    * lisp/language/lao.el:
    * lisp/language/korean.el:
    * lisp/language/japanese.el:
    * lisp/language/indian.el:
    * lisp/language/hebrew.el:
    * lisp/language/european.el:
    * lisp/language/ethiopic.el:
    * lisp/language/english.el:
    * lisp/language/cyrillic.el:
    * lisp/language/chinese.el:
    * lisp/jka-cmpr-hook.el:
    * lisp/international/ucs-normalize.el:
    * lisp/international/mule.el:
    * lisp/international/mule-conf.el:
    * lisp/international/iso-transl.el:
    * lisp/international/fontset.el:
    * lisp/international/characters.el:
    * lisp/format.el:
    * lisp/facemenu.el:
    * lisp/electric.el:
    * lisp/dos-w32.el:
    * lisp/dos-fns.el:
    * lisp/disp-table.el:
    * lisp/cus-face.el:
    * lisp/composite.el:
    * lisp/bindings.el:
    * admin/unidata/blocks.awk:
    * admin/charsets/eucjp-ms.awk:
    * admin/charsets/cp51932.awk: Use `lexical-binding`.
* Update copyright year to 2021 (Paul Eggert, 2021-01-01, 1 file, -1/+1)
    Run "TZ=UTC0 admin/update-copyright".
* ; Maintainer header updates (Glenn Morris, 2020-07-31, 1 file, -0/+1)
* Support Unicode 13.0 (Eli Zaretskii, 2020-03-14, 1 file, -0/+1)
    * admin/unidata/BidiBrackets.txt:
    * admin/unidata/BidiMirroring.txt:
    * admin/unidata/Blocks.txt:
    * admin/unidata/NormalizationTest.txt:
    * admin/unidata/SpecialCasing.txt:
    * admin/unidata/UnicodeData.txt:
    * admin/unidata/copyright.html:
    * test/manual/BidiCharacterTest.txt: Updated files imported from Unicode v13.0.
    * admin/unidata/blocks.awk: Add "Symbols for Legacy Computing" to known aliases.
    * lisp/international/fontset.el (script-representative-chars) (setup-default-fontset): Add new scripts.
    * lisp/international/characters.el: Set syntax for Symbols for Legacy Computing characters. Update setting of char-width-table.
    * lisp/international/mule-cmds.el (ucs-names): Update ranges of characters for which we want names in 'ucs-names'.
    * test/lisp/international/ucs-normalize-tests.el (ucs-normalize-tests--failing-lines-part1) (ucs-normalize-tests--failing-lines-part2): Update according to 'ucs-normalize-check-failing-lines'.
* Update copyright year to 2020 (Paul Eggert, 2020-01-01, 1 file, -1/+1)
    Run "TZ=UTC0 admin/update-copyright $(git ls-files)".
* Import Unicode 12.0 data files (Eli Zaretskii, 2019-03-09, 1 file, -2/+3)
    * admin/unidata/copyright.html:
    * admin/unidata/UnicodeData.txt:
    * admin/unidata/SpecialCasing.txt:
    * admin/unidata/NormalizationTest.txt:
    * admin/unidata/Blocks.txt:
    * admin/unidata/BidiMirroring.txt:
    * admin/unidata/BidiBrackets.txt: New versions from Unicode 12.0.
    * admin/unidata/unidata-gen.el (unidata-gen-file):
    * admin/unidata/blocks.awk (name2alias): Adapt to changes in new data files.
    * admin/notes/unicode: Update and improve instructions for importing a new Unicode Standard.
    * lisp/international/characters.el (char-width-table): Update lists of characters according to Unicode 12.0.
    * lisp/international/fontset.el (script-representative-chars): Add characters from new scripts to 'script-representative-chars'.
    (otf-script-alist): Update according to data on the MS site.
    * lisp/international/mule-cmds.el (ucs-names): Update unused ranges of codepoints according to Unicode 12.0.
    * test/lisp/international/ucs-normalize-tests.el (ucs-normalize-tests--failing-lines-part1) (ucs-normalize-tests--failing-lines-part2): Update for the new NormalizationTest.txt file.
    * test/manual/BidiCharacterTest.txt: Update with the new version from Unicode 12.0.
* Update copyright year to 2019 (Paul Eggert, 2019-01-01, 1 file, -1/+1)
    Run 'TZ=UTC0 admin/update-copyright $(git ls-files)'.
* Update copyright year to 2018 (Paul Eggert, 2018-01-01, 1 file, -1/+1)
    Run admin/update-copyright.
* Prefer HTTPS to FTP and HTTP in documentation (Paul Eggert, 2017-09-13, 1 file, -2/+2)
    Most of this change is to boilerplate commentary such as license URLs. This change was prompted by ftp://ftp.gnu.org's going-away party, planned for November. Change these FTP URLs to https://ftp.gnu.org instead. Make similar changes for URLs to other organizations moving away from FTP. Also, change HTTP to HTTPS for URLs to gnu.org and fsf.org when this works, as this will further help defend against man-in-the-middle attacks (for this part I omitted the MS-DOS and MS-Windows sources and the test tarballs to keep the workload down). HTTPS is not fully working to lists.gnu.org so I left those URLs alone for now.
* Update copyright year to 2017 (Paul Eggert, 2016-12-31, 1 file, -1/+1)
    Run admin/update-copyright.
* Update copyright year to 2016 (Paul Eggert, 2016-01-01, 1 file, -1/+1)
    Run admin/update-copyright.
* Generate char-script-table from Unicode source. (Bug#20789) (Glenn Morris, 2015-06-16, 1 file, -0/+225)
    * admin/unidata/Makefile.in (AWK): New, set by configure.
    (all): Add charscript.el.
    (blocks): New variable.
    (charscript.el, ${unidir}/charscript.el): New targets.
    (extraclean): Also remove generated charscript.el.
    * admin/unidata/blocks.awk: New script.
    * admin/unidata/Blocks.txt: New data file, from unicode.org.
    * lisp/international/characters.el: Load charscript.
    * src/Makefile.in (charscript): New variable.
    (${charscript}): New target.
    (${lispintdir}/characters.elc): Depend on charscript.elc.
    (temacs$(EXEEXT)): Depend on charscript.
    ; * admin/unidata/README: Mention Blocks.txt.
    ; * .gitignore: Add lisp/international/charscript.el.