Skip to content

the repetition of attribute descriptions in TRAPI messages can adversely affect system performance #303

@bill-baumgartner

Description

@bill-baumgartner

Attributes have a description field allowing documentation of the attribute to accompany its value as part of a TRAPI message.

description:
type: string
description: >-
Human-readable description for the attribute and its value.
example: Assertion Authored By Dr. Trans L. Ator
nullable: true

For messages that contain many edges or for edges that have many attributes, e.g. a text-mined assertion may have 100's of evidence sentences represented as individual edge attributes, the attribute descriptions can add significantly to the size of the message which may contribute adversely to system performance. For example, the repeated attribute descriptions in a recent large (40MB) result from the Text Mining Provider were estimated to account for ~27% of the message size.

What is the best practice for the use of attribute descriptions? One suggestion has been to replace the text of the descriptions with URLs linking to external documentation. Another option could be to rely on the Biolink definition of each attribute_type_id to serve as the description, and to remove the description field entirely. Are there alternative suggestions/recommendations? Is there a preferred approach?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions