Definitions of terms used in Information Extraction

a property of an entity such as its name, alias, descriptor, or type
mark up of a text span in a specific format that indicates a feature or features of the text within the span
assessment of performance according to standard measures
textual input for an information extraction system
a set of newswire texts chosen according to pre-specified conditions and meant to represent a rich text stream
data in tabular format stored with the assistance of a relational database management system
a researcher who implements a system
Dry Run
an end-to-end practice run of an evaluation
an object of interest such as a person or organization
assessment of performance according to agreed upon measures
an activity or occurrence of interest such as a terrorist act or an airline crash
a relationship held between two or more entities
Formal Test Material
a blind dataset, task definitions, test procedure, answer keys, and scoring software
Formal Run
the "official" evaluation
Information Extraction
the extraction or pulling out of pertinent information from large volumes of texts
Information Extraction Systems
an automated system to extract pertinent information from large volumes of text
Information Extraction Technologies
techniques used to automatically extract specified information from text
pre-defined measures of performance calculable by comparison of system output with human-generated answer keys
Message Understanding Conference held at the end of the evaluation and attended only by participants and invited potential customers
Named Entity
a named object of interest such as a person, organization, or location
Science Applications International Corporation
Scoring Software
fully automated software for the comparison of system performance against answer keys that tallies and reports metrics and error types for developers and evaluators
Search Engine
software which gives relevance rankings to documents in a collection based on a user query
Sources of News
edited electronic feeds from established news organizations such as the Wall Street Journal and the New York Times News Service
Statistical Algorithm
algorithm to determine the statistical significance of evaluation results
Systems Integration
building a system from off-the-shelf components to accomplish a job previously not automated
Systems Integrator
builder of a system from off-the-shelf components
Task Definition
document which defines the format and criteria for annotation or extraction of text and placement into a database or template. For example, task definitions give general guidelines and examples for the extraction of named entities, attributes, facts, and events from texts.
electronically encoded alphabetic material from some human language
process by which a system learns about a dataset

For more website information contact: Ellen Voorhees
For more evaluation information contact: Nancy Chinchor
Last updated: Tuesday, 08-Mar-2005 13:16:36 MST
Date created: Friday, 12-Jan-01