Overview:
The first subsystem identifies important entities related to a free text query. After retrieving documents for a query, the system uses a statistical test to rank all entities of each type, based on their overrepresentation in returned documents against the entire collection. As one scenario, a user may query for a biological process and the system will retrieve related documents with the recognized entities highlighted (and color coded); the system will suggest a list of important genes involved in this process derived from the retrieved documents.
Running the system:
To run the entity subsystem, enter your query in the search box in the BeeSpace Semantic Search tab and press enter or the go button (magnifying glass icon). Once the area below
populates
with the pertinent abstracts, you can look for enriched entities. For example, if you are interested in genes associated with your query click on the “Show Enriched Genes” button along the left side of the interface, and then proceed to click on the tab “
Enriched Genes”.
Similarly if you are interested in enriched
Behaviors
or
Chemicals
or
Anatomy associated with your query term, click on the relevant button along the left and proceed to the relevant tab. Within the tabs of the enriched entity there are three columns; entity identity, Score and Doc List. The score denotes statistical significance (based on their overrepresentation in returned documents against the entire collection). The scoring is directly correlated to the significance, i.e. higher the score, higher the significance. The Doc List contains the PMID numbers of abstracts that denotes the abstract linking the query with the entity.
The BeeSpace Semantic Search tab contains all the abstracts returned with your query and the abstract can be viewed by clicking the “+” on the left of the abstract title. Within the abstract, the relevant entities will be highlighted, orange for anatomy, blue for behavior, yellow for chemical and green for gene. All the highlighted entities are hyperlinked to the respective database; to Flybase (for
gene
and
anatomy),
to PubMed (for
behavior)
and to PubChem (for
chemical).
Examples to try on the entity enrichment subsystem:
- courtship
- chemosensory
- juvenile hormone