|
TRIM
-
documents categorization
examples:
renewable
energyTRIM - information summarization examples: health care policy gun control colonial America customer opinions home buyer preferences illegal aliens seeking missing children Oregon history, african-americans |
Topical Research
Information Modules (TRIM) should fit your needs when you have
permanent and/or in-depth research topics. TRIMs can be viewed as
specialized semantic
modules, tuned up to your topic and preloaded with relevant documents
to be analyzed. Unlike the iResearch Reporter, TRIMs
are typically used without a search query. Once a documents set
has been analyzed, the same TRIM can
be loaded with a fresh portion of documents, collected using a modified
search query. TRIMs provide multi-perspective, panoramic view of
information in your research area. Make
TRIMs
your response to the information overload, by using them as
a
front-end to online
databases, search engines, etc. Browse the example TRIMs (listed in
the left column) and learn more from more detailed description below. Email us
in order to build your
personalized free TRIM today!
Learn About The
Two Main TRIM Variants
1. TRIM-summarizationMulti-document summarization with semantic searchSemantic search capability is applied to identification of text passages needed to be excerpted from the relevant documents. Semantic search implementation is based on the dynamically created lists of concepts, characteristic for the user-indicated topic and its automatically identified subtopics. E.g., for the topic GUN VIOLENCE the system would apply such concepts as "shootings", "firearm related deaths", and for the topic GUN RIGHTS - such concepts as "Second Amendment", "keep and bear arms", "gun restrictions". Breakdown of the summarized information according to the main theme subtopics The TRIM way of multi-document summarization includes addressing the topic area as a whole, as well as with regards to its specific perspectives (subtopics). The same documents set is reprocessed in order to obtain specific information coverage from each of multiple perspectives. Organizing the extracted information The collected passages are ordered by their semantic closeness. The resulting summary is subdivided into two sections basing on the degree of pertinence. These sections - Most Pertinent and Possibly Useful - are structured further, introducing thematic sections and paragraphs, thus making the summary even more well-readable. Assigning headings to thematic sections The basic requirement for selecting a section heading is that it should represent a specific concept term most frequently occurring in a given text section. Its synonyms and related concepts are taken into consideration as well. Glancing through the created table of contents, the user can quickly glean the overall thematic structure, allowing him/her to quickly access a section of interest. Ranking the source documents by relatedness to topic & subtopics The documents related to each subtopic or the theme as a whole are ranked according to amount of specific informative passages they contain. Updating the documents set Once a documents set has been analyzed, the same TRIM can be loaded with a fresh portion of documents, collected using (optionally) a modified search query. 2. TRIM-categorizationCategorizationThe search results (retrieved documents) are organized into thematic groups (categories). The categories are NOT predefined, but established dynamically - they are labeled by specific concept terms semantically close to the user's query. The categories set is created with the concept-based approach - categories represent the most core and widely used concepts occurring in the publications on the query subject. Such categories are identified through a cascade-like multi-stage mechanism involving search and analysis of information found on the Internet and/or other sources according to a user's topic in general. The candidate categories are further refined by assessing their significance in documents found according to a user's query. This way, the retrieved relevant documents are categorized by concepts, which are core for a given theme, as verified on large masses of information. Categorization Example: Theme: RENEWABLE ENERGY Categories: CLEAN ENERGY GEOTHERMAL ENERGY WIND ENERGY RENEWABLE ENERGY EFFICIENCY ENERGY USE DEPARTMENT OF ENERGY RECURRENT ENERGY ENERGY ASSOCIATION ENERGY CONSUMPTION ENERGY POLICY ALTERNATIVE ENERGY ENERGY ECONOMY Ranking the documents The documents assigned to each category are ranked by their amount of category-specific content. Each document can be assigned to more than one category - typically to many. Summarization For every category, a user can get a cumulative summary of all documents fitting into this category, and also obtain the category-specific information for every document independently. |
| © Copyright 1998-2010, Power Text Solutions, All Rights Reserved. |