Page tree

Title


Narrative description:

The concept is considered to be ambiguous when there is the potential for a user to interpret the meaning in more than one way. This may arise due to inherent conflict between or within the descriptions, where the modelling of the concept does not reflect the meaning of the FSN or where the subtypes or supertypes imply that the concept has been misplaced.

Details:

What is being inactivated (concept/description/any component)?The concept
What is the reason for inactivation (description)?

Ambiguity may be apparent for a number of reasons:

  • The FSN is ambiguous, where "ambiguous" carries the strict interpretation of: has more than one distinct and individually unambiguous interpretation. The most obvious form of ambiguous FSN is one that contains an explicit disjunction, for example 178841003 Complex reconstruction operations on hand and foot, where 'and' almost certainly really means 'or'. The ambiguous nature of this code is perhaps most easily revealed if we consider the universe of all patients with an EPR entry of 178841003, and then ask which should be returned in response to a query for all patients who have had a <<112746006 Operative procedure on hand ? Of course, we don't know; some had a foot operation. This thought experiment is "diagnostic" of an ambiguous term.

    Note that this therefore precludes using Ambiguous in cases where nobody knows what the FSN means at all, and so no active concept(s) can be identified that precisely correspond to even one of its distinct meanings. For example, 194964003 Milk spots of pericardium is clearly some kind of pericardial disorder or sign, but exactly what has long been lost in the mists of medical time. However, as far as we know, this term only ever had one clear and unambiguous meaning  - we just don't know what it was. The reason for inactivation of such concepts should be stated to be Outdated, or possibly "reason not stated". But not Ambiguous.

  • A special case of a concept with more than one possible unambiguous interpretation concerns where there is a mismatch between an apparently unambiguous FSN, and the modelling assigned to the same concept. In this scenario, the set of ancestors and descendants (if any) computed by a reasoner for the concept will be inconsistent with what is implied by the FSN. Mismatch patterns will therefore include: 
    • Modelling is for a completely different category to the FSN meaning; this obviously includes where a concept needs to be moved to a different top-level hierarchy (except when between permitted pairs of top-level hierarchy)
    • Modelling is semantically more precise than the FSN meaning
    • The FSN meaning is semantically more specific than the modelling; inactivation is determined case-by-case as this could simply be a primitive concept which cannot be defined
  • Where the existing FSN is for the common name of an organism, changing it to the scientific name in the scenario where the same common name is in fact also associated with more than one other species provided that the scientific name was not already present as a synonym on the concept. Even if the scientific name was already also present but as a synonym, and even if the taxonomic position of the concept clearly also implies the same specific species, the concept remains intrinsically ambiguous because the "common name" form of FSN is not specific.
Which inactivation value should be used?Ambiguous component
Which historical association reference set should be used?

POSSIBLY_EQUIVALENT_TO association reference set (foundation metadata concept)

Known issues:

  1. The presence of ambiguity implies that the concept could have more than one meaning, however, there may not be available concepts within SNOMED CT which match exactly all of the possible meanings.
  2. While a concept may have more than one meaning it is possible that one or more of the meanings are considered not to be clinically useful and therefore only one target concept is considered to be appropriate.
  3. Where there is ambiguity between the FSN and attached synonyms a judgement needs to be made as to whether it is better to inactivate the concept and create new concept(s) or inactivate the non-synonymous synonyms and link them to concepts which are semantically equivalent

Examples

Simple Example
269388002 Burn of eye (& [cornea]) (disorder)
MAY BE 282752000 Injury of eye region (disorder)
MAY BE 284542003 Burn of eye structure (disorder)
MAY BE 274204004 Corneal burn (disorder)
112524000 Northern barred owl (organism)
MAY BE 396690003 Strix varia (organism)
MAY BE 423927009 Strix varia varia (organism)
16704000 Repair AND revision of stoma of esophagus (procedure)
MAY BE 386697002 Repair AND revision of stoma of esophagus (procedure)
MAY BE 386695005 Repair of stoma of esophagus (procedure)
MAY BE 386698007 Revision of esophagostomy (procedure)
Complex Example
267940001 Joint effusion of the forearm (disorder)
MAY BE 202373004 Elbow joint effusion (disorder)
MAY BE 202375006 Wrist joint effusion (disorder)
239801005 Juvenile rheumatoid arthritis (disorder)
MAY BE 410795001 Juvenile rheumatoid arthritis (disorder)
MAY BE 410796000 Juvenile seropositive polyarthritis (disorder)
236568008 Renal transplant disorder (disorder)
MAY BE 58797008 Complication of transplanted kidney (disorder)
MAY BE 429451003 Disorder related to renal transplantation (disorder)
Erroneous Example
85062008 Isopropyl alcohol (substance)
MAY BE 259268001 Isopropyl alcohol (substance)
228318004 Regular drinker (finding)
MAY BE 365968000 Finding of drinking habits (finding)
MAY BE 228320001 Habitual drinker (finding)
78765007 Intrauterine device (physical object)
MAY BE 268460000 Intrauterine contraceptive device (physical object)
MAY BE ???
195746005 Recurrent chest infection (disorder)
MAY BE 448739000 Recurrent lower respiratory tract infection (disorder)
MAY BE ???
28746002 Recurrent apnea (finding)
MAY BE 1023001 Apnea (finding)
MAY BE 416945002 Recurrent apnea (finding)

Impact:

  • Authors
Care is needed to identify all possible associations in order to ensure semantic equivalence
  • Release Management Team

  • Developers
  1. Review and update refsets
  2. Provide an appropriate user interface to enable the user to choose the appropriate replacement concept from the options provided
  • End Users

1. If the end user agrees the component is ambiguous then a decision needs to be made as to which option to choose on a case by case basis

2. If the end user disagrees that the component is ambiguous, then a suitable replacement SNOMED component will be needed.

Proposed improvement:

  1. Resolve non-synonymous synonyms by inactivation of the descriptions and re-assign to either existing concepts which are semantically equivalent or by creating new concepts
  2. Establish whether the ambiguous FSN requires 2 or more clinically useful concepts in order to maintain semantic equivalence. If 2 or more concepts are required use the association type of "POSSIBLY_EQUIVALENT_TO.
  3. If after due diligence only one target is considered to be clinically useful then the association type should be 'REPLACED_BY'
  4. Update the tooling to support the allocation of a "REPLACED_BY" association with a text field to explain the reason(s)

Supporting resources:

<url><comment>
  • No labels

9 Comments

  1. In "Details" section, under "What is the reason for inactivation (description)?" = "Changing the common name to the scientific name", please see my comment for Duplicate Component. It is only applicable if a common name applies to more than one organism. Thanks, Farzaneh

  2. The reasons for inactivating listed above are tighter than what is currently applied. Some reasons are clear cut and others (more than currently noted here) are a judgement call. Modelling changes and correction of placement in the hierarchy do occur without inactivation. A review of some concept changes that have been made, could demonstrate this. 

    1. I would agree that this is stipulating a tighter use of both AMBIGUOUS and, far more importantly, of the closely related POSSIBLY_EQUIVALENT_TO association than is evident in both historically and recently authored content.

      But that's the point: there is both a need and an opportunity for a much clearer specification as to what the point of all this 'reason for inactivation' and 'historical association' information actually is. 

      What, and who, is it all for?

      If its all really no more than arcane jottings in the margins by the authors, but of no practical computational value thereafter or externally, then fine. But then, leaving aside that this would mean that SNOMED offers nothing to help mitigate the false negative reporting and data repair problem that all concept inactivation engenders, why publish it at all, if it can't meaningfully be used in machine processing to achieve such data repair, because it does not follow predictable semantics? 

      The notion of true semantic categorial  ambiguity/disjunction is a relatively simple one and the inferences that can be drawn from it are also relatively simple to define. I would therefore strongly recommend locking down use of the AMBIGUOUS reason and POSSIBLY_EQUIVALENT_TO association to that tighter context.

      From the comments so far, I sense that the proposed tightening up of the AMBIGUOUS/POSSIBLY_EQUIVALENT_TO mechanism in particular is one of the more contentious parts of this draft. But the concern seems to have less to do with which content would still be permitted to be inactivated in this way and more to do is with all the other content in need of inactivation that would not now clearly or cleanly fit into any of the currently available but more tightly defined reason/association buckets. My request, however, would be not to pollute/degrade/overload the AMBIGUOUS/POSSIBLY_EQUIVALENT_TO mechanism as a dumping ground for these. The acid test for AMBIGUITY should not be either 'my brain hurts thinking about this' or 'its none of the other reasons so it must be ambiguous as the reason of last resort'.

      A clearer test could be one based on the thought experiment of considering the universe of patients already given a code now marked for inactivation, and their clinically expected future behaviour in response to the universe of all possible query abstractions. If and only if you can imagine a valid query that should properly return only some of the patients given the code about to be inactivated, then it is AMBIGUOUS. Essentially, the entire mindset for concept inactivation needs to be turned on its head: start from the data repair question, work backward from there to the required association and its targets, and from there to the Reasons for Inactivation - which, incidentally, probably nobody outside authoring centres really care about; they probably really ARE functionally mere jottings in the margins!

      Farzaneh's point is therefore an interesting one to consider against this stricter definition. I think her query concerns whether Ambiguous should still be permitted as a reason for inactivation of an (organism) code whose FSN or PT is a common name, but where that same common name can also signify at least one other completely different species and so is concurrently available as a synonym on different code(s). 65534001 Gerbil (organism) and 11534008 Pipit (organism) seem to have undergone that treatment, but I note that 58269001 Hamster (organism) is the same problem but was handled differently.

      But consider in this vein a clinician who, having searched for 'peanut', is presented with both 125260015 Peanut and 3635997012 Peanut (note: description IDs). Because the clinical coding interface does not provide further information to allow disambiguation, the clinician accidentally selects the plant genus over the nut as a food allergen - whereas the latter is what they actually intended to record: The patient is allergic to (ingesting) the nut, not (touching) the plant it comes from. In this scenario, the two underlying concepts themselves were not truly and intrinsically ambiguous within SNOMED, but rather the clinical system has rendered them functionally ambiguous at run time by suppressing from display all the information within SNOMED needed to make the distinction clear. Were we minded to try and compensate for the clinical system's behaviour by altering either underlying concept, I don't think either should properly be inactivated as "ambiguous". The clinician miscoded: based on what they actually said (even though it was wrong), there is no doubt about how records with the 75413007 'peanut plant' code within should behave in queries. In the same way, if a clinician records species:X when they mean species:Y, because the common name is not unique to either, this is a miscode not intrinsic ambiguity.

      Unfortunately, of course, this is all in turn likely to create a requirement for much of the historical use of this mechanism to be refactored to comply with the newer guidance.


  3. Hi Jeremy Rogers

    My comments actually just intended to raise the question "is inactivation always the best approach when the logical model is different to the FSN?" I know if the logical definition is broader then it's a judgement call and if the concept is in the wrong top level hierarchy then it is inactivated. I'd been considering the judgement call aspect for the broader definitions and also when the logical definition is narrower in meaning. For example, if the finding site is slightly narrower in meaning than that expressed in the FSN, would it be better to inactivate or change the modelling? There will be impacts on the developers and users whichever course of action is taken. 

    Whilst saying that, which historical association and reason for inactivation should be chosen needs to be as clear cut as possible (for authors, developers and users), so I also support making a hardline where it's feasible. We need to be able to be consistent in application across authors regardless of which edition/extension we work in. Examples help provide this clarity both for authors and those teaching others to author. 

    'Content in need of inactivation that would not now clearly or cleanly fit into any of the currently available'... these concepts don't have to be 'ambiguous' rather it's the need for guidance on how to manage them that is needed. Specific examples aren't easy to pull out so perhaps when these arise they need to be discussed at the Internal Editors call and a group decision made. That can then be added to documentation to support future decisions.

    Starting with data repair question and working back - totally agree. 

    Cheers,

    Cathy

    1. The question of what should or can safely happen when there is a mismatch between the FSN for a concept and its underlying modelling is one that Paul raised (so I shall blame him!). At heart, of course, this represents a challenge to a long established SNOMED position: the Fully Specified Name is the ONLY arbiter of the meaning of a concept identifier, and therefore you both should and also can change the underlying modelling to better fit the FSN but without ever needing to inactivate the concept.

      In truth, this mantra of "FSN Primacy" and the freedom to simply remodel wherever modelling doesn't fit the FSN have not been problematic in SNOMED's life to date. So I can certainly sympathise with an "if it ain't broken, don't fix it" pushback.

      However, on reflection, it is possible that this apparent lack of a problem may be merely an accident of the fact that - to date - there has been only very low use of the facility to create novel postcoordinated expressions and record these as primary clinical data. Once that begins to happen at any scale, there appears to be scope for "FSN-Model mismatches" to cause some clinically "interesting" consequences.

      When users assemble ad hoc postcoordinated modelled expressions, these of course don't come with any FSN. But if the user expression happens to be equivalent to the normal form of an existing precoordinated code, then should both this code and the associated FSN be substituted for the original expression? If yes, when would it matter if in fact that FSN was either more specific or more general than the semantics of its underlying modelling and so, now, also the user postcoordinated expression that has been coerced to it?

      Another example: consider a query designer tasked with defining an 'all patients with X' variety of report specification and who accordingly selects the node corresponding to X from the reporting taxonomy on the basis of its FSN alone. However, the underlying modelling is more general than implied by that FSN alone. Postcoordinated expressions are then classified below that modelled definition even though clinically they are not subtypes of the notion conveyed by the FSN alone, and so the designer's query returns false positives.

      In these examples, the fundamental problem is that the FSN of course plays no part in determining the relative meanings between precoordinated codes published by The Centre within the body of its  SNOMED releases, and any novel postcoordinated expressions assembled at the clinical coalface periphery. It may fundamentally be a convenient oversimplifcation that the FSN alone could ever be entirely safely considered universally the only arbiter of the meaning embodied by a concept identifier, and we may need to rethink what can safely happen when there is a mismatch between FSN and the modelling.

      By way of a final and more concrete example, consider a query framed as:

      <<266887003 Family history: Raised blood lipids (situation)

      and what a classifier will currently do with patients assigned the following finding, expressed in the EPR only as an ad hoc postcoordinated expression:

      416471007|Family history of clinical finding|:246090004|Associated finding|=
        (118245000|Measurement finding|:
          {363713009|Has interpretation|=281302008|Above reference range|,
          363714003|Interprets|=391464008|Urine propionic acid level|)

      At present, these "raised urine propionic acid" levels will I think always be misclassified and returned as subtypes of "raised blood lipids" because the modelling of the latter is more general than its FSN. Specifically, the current modelling does not restrict the sample to blood.

      Conventionally, the approved solution to this problem is simply to remodel 266887003 in place so that its definition aligns with its FSN, and so changing from the current:

      416471007|Family history of clinical finding|:246090004|Associated finding|=124042003|Increased lipid|

      to:

      416471007|Family history of clinical finding|:246090004|Associated finding|=
        (118245000|Measurement finding|:
          {363713009|Has interpretation|=281302008|Above reference range|,
          363714003|Interprets|=(104780002|Lipids measurement|:116686009|Has specimen|=119297000|Blood specimen|)})

      Theoretically, however, this remodelling could have other undesirable clinical consequences on historical clinical data.

      Consider now any patient for whom, prior to any remodelling of 266887003, a clinician had previously assembled this expression:

      416471007|Family history of clinical finding|:246090004|Associated finding|=124042003|Increased lipid|

      ...but behind the scenes, and before the clinical utterance was committed to the EPR, this expression had been identified by normal form comparison to be equivalent to the old definition of 

      266887003 Family history: Raised blood lipids (situation)

      ...and so it was that precoordinated code - and not the longer expression - that was persisted to the EPR.

      By changing the modelling in the reference data, you would now be asserting for all these patients that the "raised lipid level" entered by the clinicians had in fact all along been statements specifically about blood lipid levels, even though the original clinical statements were explicitly not that specific.

      A possible alternative resolution, therefore,  might be to require an inactivation of the 266887003 concept as ambiguous with POSSIBLY_EQUIVALENT_TO associations pointing at both possible meanings. Whether this approach would have to be invoked for all cases of FSN:Model mismatch remains to be discussed, but I would suspect pragmatically not.

      However, the first step down this road is to debate whether the above exposition is a reasonable argument for re-examining the 'FSN Primacy' mantra.

  4. I promised myself I wouldn't enter this discussion, but here goes...

    For the record I remain unconvinced that an FSN/modelling mismatch, where the FSN is otherwise blameless, is a justification for concept inactivation. If the modelling is wrong, correct the modelling.

    On the specific example above that includes "...and so it was that precoordinated code - and not the longer expression - that was persisted to the EPR..." - this approach to recording is in breach of SI's own current guidance which states:

    This guidance exists to mitigate against the sort of consequences described.

    Kind regards

    Ed

    1. Thanks Ed, that's very helpful.

      Even as I crafted the already slightly laboured example above, I was aware that it depended on an expression-for-precoordinatedId swap operation that felt instinctively like an unwise thing to be doing in the first place, and so I'm very glad you've dug out the official guidance specifically advising against it. It could perhaps usefully be given more force than it currently has; must not should, for example. And perhaps be linked to an explanation of why it matters: its not simply an echo of the usual medicolegal guidance that the clinician's original utterance should be immutably stored and transferred 'as is'.

      I would however be worried that not everybody will follow this guidance; pretty much this operation would underpin any templated GUI crafted to support clinicians in gathering detailed multiaxial "who, what, when, where, why, wherefore, whence, whither..." information about e.g. certain procedures, and the technical preferences of at least some centres for such compositional detail to be subsequently extractable as a smaller number of (excessively) precoordinated codes. Or, to convert NLP extracts to streams of codes.

      But personally I remain ambivalent as to whether or not there is really an issue here; although its not so hard to find examples of fully defined concepts whose FSN and current modelling are not entirely in agreement, I'm not aware of (m)any where the difference has proved clinically significant. So I'm still personally minded to accept this phenomenon as an example of a tolerably inexact model and so, mostly, to allow it to be fixed by remodelling in place and without inactivation, with the whole thus incrementally but asymptotically approaching Truth and Beauty.

      But equally I have a gut feeling that there are examples out there where the FSN:Model mismatch is "more significant", and so now seems as good opportunity as any to revisit and have a discussion about what problems could at least theoretically arise in some future world where more expression building was going on, and whether we have it all locked down enough. Some examples would, of course, probably help the debate.

      1. Thanks Jeremy

        Sorry for the delay in responding. I’m particularly pleased to see the work going on with the REFERS TO CONCEPT association refset. As SNOMED CT is used more, as more local terms are created, as more translations are done, and as more mappings from interface terminologies are performed, it is inevitable that otherwise FSN-blameless concepts will develop ‘acquired ambiguity’, and we need a well-understood mechanism for governance and resolution whilst leaving underlying concepts active.

        Regarding FSN/definition mismatch as a justification for inactivation, I’d view this as another form of ‘acquired ’ or ‘imposed’ ambiguity, and would still prefer resolution to be achieved without inactivation. Part of the challenge is the non-locality of modelling errors and consequent false inferencing. The attached D3.js/Graphviz animations (example.zip*) for a ‘fairly’ uncontroversial concept such as 91936005 | Allergy to penicillin (disorder) show just how turbulent its ancestry, local definition (intention) and descendants (extension) have been over time. If any of the available inferences at any point in time were incorrect, it would be hard to figure out where the crime was committed, and thus which concept(s) to inactivate.

        I do wonder if a possible exception is where a highly-templated authoring workflow included the automatic generation of FSNs from the templates. If a bug led to a mismatch between the FSN and the simultaneously-generated local modelling then from a symbolic/semiotic perspective it could be validly argued that the initial modelling was the ‘arbiter of meaning’, the simultaneously-created FSN referred to something different, and this may need to be treated as ‘inherent ambiguity’ and managed by inactivation.

        Ed

        * Just unzip the folder locally and open each of the html files – necessary Javascript files are in the lib folder and the diagram notation is explained in the NOTES.txt file. I *think* confluence has allowed this to be posted.

        example.zip

  5. I like the proposed improvement here, where the association type depends 1..1 or 2..*  associations.
    Could even view "replaced by" as "probably equivalent to" - imply a sense of confidence.