senotype_editor

Senotype Editor

User Documentation

Introduction

Senotype

A senotype, or senescence associated secretory phenotype, is a cellular senescence functional definition and associated multidimensional description. A senotype associates a phenotype with a disease; a set of markers (genes or proteins) expressed by cell types in an anatomic location; and an assay.

A senotype can be represented as a knowledge graph of relationships between entities. The entities and relationships of a senotype can be encoded from standard biomedical vocabularies. For example, the markers associated with a senotype can be identified with codes from HGNC and UniProt, with a relationship defined by the Relations Ontology.

img.png

The Senlib Database

Senotype definitions are maintained in the Senotype library. Senotype definitions are stored in Senlib in JSON format and conform to the schema defined in the senlib GitHub repository.

As described in the senlib provenance model, a senotype corresponds to a set of submission files that represent different versions of the evolving definition of the senotype. A senotype submission corresponds to a version of a senotype definition.

Because each version of a senotype can be published by associating it with a Digital Object Identifier (DOI) in DataCite, the fundamental level of organization in the Senlib database is the submission. Each submission has a unique SenNet identifier in both SenNet and the Senlib database.

The Senotype Editor

The Senotype Editor application allows users to manage senotype definitions in the Senlib database. Because the majority of the data in a senotype definition is categorical, most of the work of defining a senotype will involve the selection of values from lists.

{screen capture of Edit page}

With the Senotype Editor, users can:

  1. review a senotype definition
  2. create a new senotype definition
  3. revise an unpublished senotype definition
  4. publish a senotype definition by associating it with a DOI
  5. create a new version of a published senotype definition

Components

The Senotype Editor offers two types of tools:

  1. a tool to navigate the Senotype library of senotype submissions.
  2. tools to maintain senotype submissions

{annotated screen capture: Editor page, pointing out Navigator and Definition sections}

Senotype Library Navigator

The Senotype Library Navigator represents the Senotype library with an expandable tree view. The Navigator organizes the different submissions of a senotype definition in version order.

Senotype Definition

The Senotype Definition composes the remainder of the Senotype Editor. The Senotype Definition section allows the user to build or revise a senotype definition. The components of the Definition section are linked to both the data of a particular senotype submission and to possible values for data.

Because most of a senotype definition is encoded, another role of the Definition section is to translate the codes of a senotype definition into descriptive terms.

Business Rules of the Senotype Editor

Two rules govern whether a user can edit a senotype submission.

  1. A user can only edit a senotype submission file for which they are authorized. Authorization is controlled by means of the SenNet Consortium’s Globus environment.
  2. Only unpublished submissions can be edited. Once a submission is associated with a DOI, it becomes read-only in the Senotype Editor. To change a senotype definition, a user must create a new version of the definition–i.e., build another submission.

Using the Senotype Library Navigator

A senotype definition is represented in the Navigator tree with a folder node ( img_1.png ). Expanding the node for a senotype definition displays the senotype submissions associated with the definition.

Senotype submission files are represented with a file node (img_2.png ). Submission nodes are organized hierarchically in terms of provenance in the Senlib database. If a submission has a predecessor, the node for the submission can be expanded to view the predecessor.

When the user selects a submission file node in the Navigator, the Senotype Editor will load data for the submission into the Definition section.

img_3.png

Edit state

The icon next to a submission node indicates whether the submission can be edited.

  1. A lock icon ( img_4.png) indicates that the submission has been published and is read-only. The controls in the Definition section will be disabled.
  2. A pencil icon (img_6.png) indicates that the submission has not been published and can be edited.

A senotype definition will have, at most, one version (submission file) that can be edited.

Authorization state

Even if a senotype submission is potentially editable, only authorized users may edit it. If a senotype shows a “prohibited” icon ( img_8.png), the user is not authorized to edit the submission file. Although the data for the submission will still display in the Definition section, controls will be disabled.

img_7.png

Creating a new senotype

The last node in the Navigator allows the user to define a new senotype. The Editor will load into the Definition section defaults for the new submission, including:

  1. a new SenNet ID
  2. the Globus name and email of the user, as submitter

Creating a new version of an existing senotype

If all of the versions (submission files) of a senotype definition have been published, the user can use the new version button at the bottom right of the Navigator to create a new, editable version. The Editor will load into the Definition section:

  1. a new SenNet ID
  2. the Globus name and email of the user as submitter
  3. the data from the latest version of the senotype definition

A user will be able to create a new version of a senotype definition even if the latest version of the senotype was created by another user.

Using the Senotype Definition tools

Submission information

The user can edit the name and description for the submission. The Navigator will use the name for the latest version of a senotype definition as the name of the definition node.

DOI

The DOI section allows the user to associate the senotype submission with a DOI in DataCite.

Clicking the img_9.png button will launch a search window.

img_11.png

The search window will search DataCite for a DOI with ID that matches the value that the user enters in the Enter query… input. Because all senotype DOIs will have the same DataCite provider (e.g, 10.6586), the search window only needs the unique portion of the DOI.

For example, if the DOI’s full URL is https://doi.org/10.60586/snt259.dzbl.489, only snt259.dzbl.489 is required as a search term.

If the search window finds a matching DOI in DataCite, it will display the title of the citation as a link. If the user clicks the link, the Editor will associate the senotype submission with the selected DOI.

The img_12.png button in the search window opens the DataCite Commons search window.

When the user associates a DOI with a senotype submission, the Editor warns that saving the DOI with the submission will make the submission read-only.

The img_13.png button allows the user to remove the DOI from the submission.

Assertions

The bulk of the Definition section allows the definition of assertions.

Each input in the Definition section corresponds to an assertion in the senotype definition. For example, the Taxon input corresponds to the senotype definition’s in_taxon assertion.

There are four types of assertions. Each type has its own tool for selection.

Valueset-based assertions

The majority of a senotype definition’s assertions will be categorical–i.e., the possible values for the code of the object of the assertion will be in a valueset. For example, the categories for a senotype definition’s taxon might be the valueset {NCBI:9606, NCBI:10088}, corresponding to the codes for human and mouse.

Valueset-based assertions include:

assertion vocabulary
taxon NCBI
location UBERON
cell type CL
hallmark SENOTYPE
molecular microenvironment SENOTYPE
inducer SENOTYPE
assay OBI

Because there can be multiple assertions of the same type, the inputs for valueset-based assertions are lists.

img_14.png

The img_9.png button opens a select window that displays a list with the members of the valueset associated with the assertion. img_15.png Selecting an element in the list in the select window adds the selected value to the list for the assertion. The img_13.png button removes an element from the assertion list.

Context assertions

Context assertions allow the user to define ranges and units for an assertion. img_16.png. For example, the age assertion can be bounded to apply only to the range of 18 to 89 years.

External assertions

The citation, origin, and dataset assertions are also encoded; however, the codes are not stored in valuesets, but are maintained in external sources.

img_17.png

The Senotype Editor queries APIs to obtain information on:

The external assertion input functions similarly to the DOI:

The search window for an external assertion finds matches in the external site for IDs:

assertion source type of ID
citation PubMed PMID
origin SciCrunch Resolver RRID
dataset SenNet Data Portal SenNet ID

The img_12.png button in an external search window links to the corresponding external site to facilitate finding an appropriate identifier.

Marker assertions

img_18.png

A senotype definition has two types of markers:

Because it is anticipated that a senotype will be associated with many markers, the entire Markers section is collapsible.

Marker assertion management for both specified and regulating markers is similar:

  1. The img_9.png button opens a search window that allows the addition of a single marker.
  2. The img_19.png button opens a window for bulk import of a group of markers.

Search windows (individual addition)

A marker Search window searches for gene or protein markers.

The user can enter different types of identifiers for markers, including:

  1. HGNC numbers (e.g., 7178)
  2. HGNC approved symbols (e.g., MMRN1)
  3. HGNC aliases (ECM)
  4. HGNC prior symbols (e.g., MMRN)
  5. UniProtKB ID (e.g., Q13201)
  6. UniProtKB entry name (e.g., MMRN1_HUMAN)

img_20.png

The Search window for regulating markers includes a field for type of regulating:

img_21.png

Bulk Addition windows

Bulk addition windows allow the user to load a large number of markers from a local CSV file that the user specifies.

Specified markers

The CSV used for bulk upload of specified markers must have the following structure:

column values
type either gene or protein
id * if a gene, the HGNC symbol
* if a protein, the UniProtKB ID

Example:

type,id
gene,BRCA1
protein,Q13201

Regulating markers

The CSV used for bulk upload of regulating arkers must have the following structure:

column values
type either gene or protein
id * if a gene, the HGNC symbol
* if a protein, the UniProtKB ID
action one of the following:
* 1 for up regulation
* 0 for inconclusive regulation
* -1 for down regulation

Example:

type,id,action
gene,BRCA1,1
protein,Q13201,0
gene,BRAF,-1

Validation

When the user clicks the Update/Create button at the bottom of the Definition section, the Editor validates the inputs.

If the data is invalid or incomplete, the Editor:

{screen capture: validation error}

The Editor verifies that:

  1. All required fields have at least one value.
  2. Context assertions are ordered such that lowerbound <= value <= upperbound.

If the data passed validation, the Editor: