Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unicode RGI Emoji ZWJ Sequences / WhatWG URL / CheckJoiners #510

Closed
adraffy opened this issue Sep 9, 2023 · 1 comment
Closed

Unicode RGI Emoji ZWJ Sequences / WhatWG URL / CheckJoiners #510

adraffy opened this issue Sep 9, 2023 · 1 comment

Comments

@adraffy
Copy link

adraffy commented Sep 9, 2023

How do you encode an Emoji ZWJ Sequence with Ada?

There are 3K+ RGI emoji and 39% of them involve ZWJ sequences. From Unicode's front page:

92% of the world’s online population use emoji in their communications

The WhatWG URL decision to use CheckJoiners was shortsighted.

What is the feature you are proposing to solve the problem?

CheckJoiners should be false.

@lemire
Copy link
Member

lemire commented Mar 21, 2024

The WHATWG URL standard states:

Let result be the result of running Unicode ToASCII with domain_name set to domain, UseSTD3ASCIIRules set to beStrict, CheckHyphens set to false, CheckBidi set to true, CheckJoiners set to true, Transitional_Processing set to false, and VerifyDnsLength set to beStrict. [UTS46]

The ada library is an implementation of the WHATWG URL standard. We do not set the standard nor do we set out to change the standard.

You may disagree with the standard, and be right that it is incorrect. If so, you should get the standard to change. If the standard changes, we will adapt the implementation.

I am closing this issue. The reported should open an issue with the WHATWG URL standard instead.

@lemire lemire closed this as completed Mar 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants