Hi everyone

We're considering a change to the format of the new Annotations refsets, in order to allow for Dialects. We're including the notice in the Early Visibility page + the Jan 2024 Release Notes, but wanted to run it by you direct as well:

NOTICE
PROPOSED CHANGE(s) TO THE COVERAGE OF LANGUAGE IN ANNOTATIONS REFSET

After discussions with the community, SNOMED International is proposing to change the coverage of language in the Annotations refsets in the following way:
The langueCode field was specified as the two characters of ISO-639-1 code for the language of the annotation text. The change is to include support for specifying dialect when it is applicable by adhering to RFC 5646, which allows the combination of two characters of ISO 639-1 code and two uppercase letters of country code (ISO 3166) separated by a hyphen.
The "languageCode" will be changed to "languageDialectCode" in the field column. A new metadata c "Language dialect", a subconcept of "Language Code", will be used for the metadata of this column in the descriptor refset.

Please provide any feedback by Friday 26th January 2024 at the absolute latest, as lack of response will be taken as implicit acceptance, and the changes will proceed as planned.

 

-------------------

If you have any feedback please let us know asap, otherwise we'll proceed as planned. Thanks very much for your help!

Kind regards,

Yong

Comments

  1. Guillermo Reynoso
    2024-01-23 08:17

    We are not against allowing the OPTIONAL usage of language+dialect in alignment with RFC 5646, using the languageCode column or the corresponding metadata representation as proposed. However, we would not expect the dialect granularity to be required or even desirable in most situations (e.g. we would not expect an "es" text annotation to be localized to different Spanish dialects, that would likely require a more robust mechanism, or the dialect used to indicate a particular realm or resource that could otherwise be represented by the moduleId). The text annotations are not a replacement for the lexical representation of content or the eventual addition of new descriptionTypes.

    Reply
  2. Add new comment