Page tree

A tracker item has been created to clarify the behaviour of $validate-code

https://gforge.hl7.org/gf/project/fhir/tracker/?action=TrackerItemEdit&tracker_item_id=17218

This came from a discussion of the ambiguity regarding the validation that should be done for the display term. For SNOMED CT for example, should this return true (i.e. valid) if the term provided to $validate-code matches

  • the term specified in the ValueSet, if provided?
  • the preferred term? If so, in which language reference set?
  • any of the acceptable synonyms? Again, if so in which language reference set?
  • any of the active synonyms?
  • any active description? (i.e. any active synonym or FSN)
  • any description, current or historic?

This comes partly down to the use case for performing this validation. Validating a code against a ValueSet makes sense to determine if someone has sent a valid code for the binding.

Validating the display is perhaps a bit more subtle. The most obvious use case is to validate if it is likely there is a mistake or not. For example it may be useful to know if a coded element's display term matches the code in the terminology as confirmation.

  • code = 387517004, term = Paracetamol - matches preferred term
  • code = 387517004, term = Acetaminophen - not the preferred term, but still a valid term
  • code = 387517004, term = Amoxicillin - a mistake has probably been made!

In the above case it is useful to know from $validate-code whether the term is a valid description for the code - at least all active descriptions. Receiving the FSN as the display is perhaps odd, and probably wrong (wouldn't imagine many clinical systems where it is valid for it to be the display term), but not unsafe (it is supposed to be unambiguous) and still useful for validation/confirmation.

In this scenario, requiring the specified term to be the preferred term is problematic - which language reference set is the first problem, but also if the version of the code system being used isn't specified it may not be possible to determine the right value even if the language reference set is known. Similarly if the version of the terminology isn't known, then matching inactive descriptions may be useful when dealing with old versions...but then descriptions are inactivated for a reason.

Perhaps a ternary valued response of "valid", "matches inactive description", and "invalid" would solve this issue...but adds complexity for something that is perhaps an edge case.

Questions

  • what are the use cases for $validate-code involving validating the term?
  • based on that, what should be considered "valid" for an input code and term generally for a FHIR CodeSystem and ValueSet combination?
  • then specifically, for SNOMED CT what should the interpretation be and is any supporting clarification necessary in the Using SNOMED CT with FHIR page?

Group Discussion

26 June 2018 - Basically we really need to know what SNOMED Edition is being used, then we can come up with sensible behaviour. 

In general, valid codes should return "valid = true" and term issues should be identified as a warning in the message.  Any matching term should result in positive validation.

RH: If we had an option to specify a "strict" flag, then we could validate the term in a more exacting way.

DM: Developers want finer grained responses to enumerate the validation result of the term, distinct from the code.   Ontoserver currently validates the term if it's either the PT or FSN.  If not specified, the latest "default" edition for the server is used ie AU langrefset.   Note: Potential to make two calls - one without the term to check the code and then another with the term to check that also.

PJ: The value-set will define the language used, but this would not apply when validating against the code-system. 

If the URI that is passed is an edition, then hopefully we could look up a default (set?) of language reference sets.

Case sensitivity: To be discussed.


10 July 2018

Determined that it would be best to have multiple response parameters to capture the various possible scenarios so the client can choose the correct behaviour. Specifically discussed was adding a well known set of extension parameters which can be considered as optional response parameters in the specification in future.

The table below is intended to be built out to cover the various scenarios. Note that the philosophy with the boolean result parameter from validate-code is that it should return true if the code is correct, and only false if the text provided does not match at all, this is considered most fault tolerant for clients that don't use the additional parameters to discern the result more closely.

scenarioresultmessagedisplaycode validity codedisplay validity code
codetermValueSet term
valid codeno termnotrue
code system preferred displayvalid code in ValueSet-
valid codeno termyestrue
ValueSet display for codevalid code in ValueSet-
code not in ValueSetno termn/afalseexplanation the code is invalidcode system preferred displayvalid code not in ValueSet-
code not in ValueSetpreferred displayn/afalseexplanation the code is invalidcode system preferred displayvalid code not in ValueSetcorrect display term
code not in ValueSetacceptable termn/afalseexplanation the code is invalid and warning that the display was not the preferred displaycode system preferred displayvalid code in ValueSetacceptable display term
code not in ValueSetactive but unacceptable term for localen/afalseexplanation the code is invalid and warning that the display matches an active term for the code, but not an acceptable onecode system preferred displayvalid code in ValueSetactive display term
code not in ValueSetinactive/historical term for coden/afalseexplanation the code is invalid and warning the provided term is historical for the code and not activecode system preferred displayvalid code in ValueSethistorical display term
code not in ValueSetunknown term for code systemn/afalseexplanation the code is invalid and explanation that the term was unknowncode system preferred display
valid code in ValueSetunknown term
valid codepreferred displaynotrue
code system preferred displayvalid code in ValueSetcorrect display term
valid code preferred displayyestrueexplanation the display was valid for the code but didn't match the ValueSetValueSet display for codevalid code in ValueSetcorrect display term
valid codeacceptable termnotruewarning that the display was not the preferred displaycode system preferred displayvalid code in ValueSetacceptable display term
valid codeacceptable termyestrueexplanation the display was not preferred for the code and didn't match the ValueSetValueSet display for codevalid code in ValueSetcorrect display term
valid codeactive but unacceptable term for localenotruewarning that the display matches an active term for the code, but not an acceptable onecode system preferred displayvalid code in ValueSetactive display term
valid codeactive but unacceptable term for localeyestruewarning the display was not acceptable for the code and didn't match the ValueSetValueSet display for codevalid code in ValueSetcorrect display term
valid codeinactive/historical term for codenotruewarning the provided term is historical for the code and not activecode system preferred displayvalid code in ValueSethistorical display term
valid codeinactive/historical term for codeyestruewarning the provided term is historical for the code and didn't match the ValueSetValueSet display for codevalid code in ValueSetcorrect display term
valid codeunknown term for code systemnofalseexplanation that the term was unknowncode system preferred display
valid code in ValueSetunknown term
valid codeunknown term for code systemyestrue
ValueSet display for code
valid code in ValueSetcorrect display term
known code for code system family, but not in code system versionno termn/afalseexplanation the code was not valid for the code system versioncode system preferred displayknown code not in code system version-
known code for code system family, but not in code system version
preferred displayn/afalseexplanation the code was not valid for the code system versioncode system preferred displayknown code not in code system version
correct display term
known code for code system family, but not in code system version
acceptable termn/afalseexplanation the code was not valid for the code system versioncode system preferred displayknown code not in code system version
acceptable display term
known code for code system family, but not in code system version
active but unacceptable term for localen/afalseexplanation the code was not valid for the code system versioncode system preferred displayknown code not in code system version
active display term
known code for code system family, but not in code system version
inactive/historical term for coden/afalseexplanation the code was not valid for the code system versioncode system preferred displayknown code not in code system version
historical display term
known code for code system family, but not in code system version
unknown term for code systemn/afalseexplanation the code was not valid for the code system versioncode system preferred display
known code not in code system version
unknown term
syntactically valid code (i.e. concept SCTID) but not known to any version on the serverunknown or no termn/afalseexplanation the code was not valid for any known version for this code system-unknown but syntactically valid code
valid identifier for component other than a concept (e.g. description or relationship)unknown or no termn/afalseexplanation the code is not for the right type of component-incorrect identifier type
syntactically invalid codeunknown or no termn/afalseexplanation the code is syntactically invalid, doesn't meet the code system's scheme-code not valid for code system scheme

code validity and display validity are intended to be extensible bindings to a Code datatype.

This table needs further refinement and consideration of particularly whether all scenarios are worth the effort. Description handling is somewhat SNOMED CT specific, but generalisable to other code systems as most would fall under a subset of the scenarios (i.e. may not draw all the distinctions SNOMED CT does wrt term preferences).

Case significance status in SNOMED CT should also be taken into account, so "Heart structure" and "heart structure" would be matched equally for 80891009 and get the same display validity code as they are equivalent once case significance is taken into account.

  • No labels

12 Comments

  1. Oh...and should the validate-code functionality honour the case significance indicator?

    For example if the preferred term is "White blood cell count" with initial character status of 900000000000020002 "Only initial character case insensitivewhat should the response be (valid or invalid) for the text

    • white blood cell count
    • White blood cell count
    • WHITE BLOOD CELL COUNT

    Intuitively I'd think the answer is that the case significance should be considered in the response to the validation of the term being provided, and that the answers to above are

    • valid
    • valid
    • invalid
    1. That sounds reasonable. I would consider your example |Case insensitive| so all of the above would be valid, but yeah the logic sounds good. It would be an issue if a preferred term like |mg| (for milligram (qualifier value)) was represented as |MG|.

       


  2. In terms of checking case, I think we really do have to look at the case significance indicator of the term before rejecting - for example - terms that are all capitals.   62479008 |Acquired immune deficiency syndrome (disorder)| is the obvious example that has an all caps preferred term and the case sensitivity there is 900000000000017005 |Entire term case sensitive (core metadata concept)| Peter Jordan

    1. I've made some changes to the implementation of the $validate-code operations, for SCT, on 'my' server (Terminz) that will be deployed at the weekend. In particular...

      a. display values passed in requests will be validated using the Case Significance Id of the relevant description

      b. an out parameter 'result' of 'true' will be returned in respect of any display value passed in a request that validates successfully against an active description for the relevant code

      c. if no display value, or a display value other than the preferred term, is passed in the request, the preferred term is returned in the 'display' out parameter of a successful response

      1. Could you provide some sample requests to make it tangible for us?

        1. Some simple examples...

          a. will return true and the preferred term as no display value is provided

            BaseURL /CodeSystem/$validate-code?system=http://snomed.info/sct&code=19030005

          b. will return true and the preferred term as the display value provided is not the preferred term

          BaseURL /CodeSystem/$validate-code?system=http://snomed.info/sct&code=19030005&display=HIV

          c. will return true only as the preferred term is provided as the display value.

            BaseURL /CodeSystem/$validate-code?system=http://snomed.info/sct&code=19030005&display=Human immunodeficiency virus

           d. will return false as the code is invalid

            BaseURL /CodeSystem/$validate-code?system=http://snomed.info/sct&code=19030005xxxxx&display=HIV

          e. will return false as the display value fails validation because the correct value is "HIV" and case sensitive

          BaseURL /CodeSystem/$validate-code?system=http://snomed.info/sct&code=19030005&display=hiv

  3. The more I think about this the more I think the most useful thing is two result parameters, one for the code and one for the display.

    The code is boolean - either it is valid or it isn't.

    The display is either

    • the preferred term
    • an acceptable synonym
    • an active term
    • an historical term
    • not a known term for the code

    and I think the text testing should take into account the case significance in the case of SNOMED CT.

    Overall that would be a much more useful set of responses for the client to decide what needs to be done in the circumstances they are calling $validate-code.

    However that's quite a change. Short of that I can imagine saying something like this is the desired result

    Scenarioresultmessagedisplay
    valid code no termtrue
    preferred term
    invalid code no termfalseexplanation the code is invalidpreferred term
    invalid code with termfalseexplanation the code is invalidpreferred term
    valid code preferred termtrue
    preferred term
    valid code acceptable synonymtruewarningpreferred term
    valid code active unacceptable synonymtruesterner warningpreferred term
    valid code inactive termtruereally stern warningpreferred term
    valid code unknown termfalseexplanation that the term was unknownpreferred term

    All of the above I'd think should obey the case significance when the test is done.

    My rationale for only warning on the inactive term and unacceptable synonym is that the code system version the code/text being tested was from may not be known, therefore it may be that the text was the preferred term at the time but isn't anymore.

    1. There's another scenario and that is where the code is valid and the term matches one supplied in the ValueSet definition itself and not equal/matching to any of the acceptable or otherwise SNOMED-sourced descriptions.

      This sort of thing crops up when the context of use may require a short (unambiguous) string for display purposes, when an interface terminology has been interposed without using RF2 and the SNOMED extension mechanism, or with certain well-known large hospital EMR systems that allow the user to edit / augment the display text after selecting a code (common scenarios are to add a "left", "right", or other qualifying text).


    2. Minor correction to table above - if the code is invalid you can't provide display text (4th column)

      Also, on the binary nature of the code validity, I think there's a third (and maybe fourth or more) case:

      1. Known Concept SCTID
      2. non-SCTID (eg includes non-digits, too long, too short, check-digit doesn't match)
      3. Syntactically correct Concept SCTID, but not known to be used (eg an SCTID from the UK Edition being checked against the AU Edition)
      4. Known Description SCTID
      5. Unknown Description SCTID
      6. Known / Unknown Relationship SCTID

      Note, 4 is clearly called out as an invalid use, so we can probably put 4, 5, and 6 in the same basket as 2 but perhaps with a useful message.

      The main reason I'm calling these cases out is that $validate-code was originally defined with respect to a ValueSet – the supplied code could be a perfectly known and valid SCTID, but just not a member of the ValueSet. However it is now also an operation on CodeSystem and thus case 3 is not quite so clear.

    3. ...and collation support in particular for non-english languages. E.g. in Swedish "Lasègues test" and "Lasegues test" woudl be considered equal (or close enough) but maybe not in other languages.

  4. That is quite a change and pretty specific to SNOMED CT, whereas FHIR has to consider concepts from a multitude of code systems.

  5. Misc text saved off from agenda page:

    Current behaviour doesn't allow for distinction to be made in responding to quality of term queried.

    Note: Since server returns display term, if the query just checks for membership then the client could check its term against that returned. This waters down the usefulness of the server but would simplify if functionality is not in the 80% of features required.

    See GForge issue #17218 (ML's). Also #16586.

    See also https://chat.fhir.org/#narrow/stream/48-terminology/subject/Validation.20for.20foreign.20language.20resources

    Discussion

    14 Aug Update - Discussion planned for Vancouver Meeting Agenda. Note that Grahame Grieve stated "already done" on tracker item 11 Aug.