-
Notifications
You must be signed in to change notification settings - Fork 28
Description
The biolink:supporting_text attribute is used to provide contextual evidence of text-mined associations between a subject and a publication. According to the Biolink Model documentation, it has a value of type String, which can be multivalued (i.e. a List).
Various KPs and ARAs, however, do not seem to follow this specification. This may cause an issue for tools that parse these results downstream, with a likely outcome of not displaying to the user the supporting text from the publication presented as evidence for a given edge / relationship.
Anecdotal evidence suggests that this is already happening in production.
Should TRAPI explicitly specify and enforce the allowed value types of this attribute?
Summary (examples follow):
- RTX-KG2 encodes all as a dict with the key being the PMID of the source publication
- ARAX encodes it as a list, but it always(?) seems to have a length of one, and that single array element is the kg2-style dict
- BTE's is a simple string array with no context (though it seems to correspond 1:1 with the items in the biolink:publications array).
- Text Mining Provider embeds this in sub-attributes of
biolink:has_supporting_study_result
Examples
RTX-KG2: First edge in first Analysis of this response:

ARAX: In result 1 -> Support Graph -> edge in this response:

BTE: under the "preventative_for_condition" edge in first result here:

Text Mining: In result 8, click on the one edge, then its support graph, and look at the fourth edge from the top -- "biolink:treats_or_applied_or_studied_to_treat" -- it has a bunch of sub-attributes:
