Skip to content

Harmonize the use of biolink:supporting_text #524

@isbluis

Description

@isbluis

The biolink:supporting_text attribute is used to provide contextual evidence of text-mined associations between a subject and a publication. According to the Biolink Model documentation, it has a value of type String, which can be multivalued (i.e. a List).

Various KPs and ARAs, however, do not seem to follow this specification. This may cause an issue for tools that parse these results downstream, with a likely outcome of not displaying to the user the supporting text from the publication presented as evidence for a given edge / relationship.
Anecdotal evidence suggests that this is already happening in production.

Should TRAPI explicitly specify and enforce the allowed value types of this attribute?

Summary (examples follow):

  • RTX-KG2 encodes all as a dict with the key being the PMID of the source publication
  • ARAX encodes it as a list, but it always(?) seems to have a length of one, and that single array element is the kg2-style dict
  • BTE's is a simple string array with no context (though it seems to correspond 1:1 with the items in the biolink:publications array).
  • Text Mining Provider embeds this in sub-attributes of biolink:has_supporting_study_result

Examples

RTX-KG2: First edge in first Analysis of this response:
Image

ARAX: In result 1 -> Support Graph -> edge in this response:
Image

BTE: under the "preventative_for_condition" edge in first result here:
Image

Text Mining: In result 8, click on the one edge, then its support graph, and look at the fourth edge from the top -- "biolink:treats_or_applied_or_studied_to_treat" -- it has a bunch of sub-attributes:
Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions