glazing.references.extractor¶
Extracting references from datasets.
extractor
¶
Reference extraction from linguistic datasets.
This module provides functionality to extract cross-references from FrameNet, PropBank, VerbNet, and WordNet data models and build efficient indices for mapping lookups.
| CLASS | DESCRIPTION |
|---|---|
ReferenceExtractor |
Main class for extracting and indexing cross-dataset references. |
Notes
The extractor builds bidirectional indices for efficient lookup of mappings between datasets. All extracted references include confidence scores and metadata where available.
Classes¶
ReferenceExtractor()
¶
Extract and index cross-references from linguistic datasets.
This class provides methods to extract cross-references from loaded dataset models and build efficient indices for mapping lookups.
| ATTRIBUTE | DESCRIPTION |
|---|---|
mapping_index |
Bidirectional index for all extracted mappings.
TYPE:
|
verbnet_refs |
VerbNet member cross-references by verbnet_key.
TYPE:
|
propbank_refs |
PropBank roleset cross-references by roleset_id.
TYPE:
|
framenet_relations |
FrameNet frame relations by frame_id.
TYPE:
|
wordnet_sense_index |
WordNet sense key to synset offset mapping.
TYPE:
|
| METHOD | DESCRIPTION |
|---|---|
extract_all |
Extract references from all datasets. |
extract_verbnet_references |
Extract VerbNet member cross-references. |
extract_propbank_references |
Extract PropBank roleset cross-references. |
extract_framenet_relations |
Extract FrameNet frame and FE relations. |
extract_wordnet_mappings |
Build WordNet sense and synset indices. |
Initialize the reference extractor.
Source code in src/glazing/references/extractor.py
Functions¶
extract_all(framenet: list[Frame] | None = None, propbank: list[Frameset] | None = None, verbnet: list[VerbClass] | None = None, wordnet: tuple[list[Synset], list[Sense]] | None = None) -> None
¶
Extract references from all provided datasets.
| PARAMETER | DESCRIPTION |
|---|---|
framenet
|
FrameNet frames to process.
TYPE:
|
propbank
|
PropBank framesets to process.
TYPE:
|
verbnet
|
VerbNet classes to process.
TYPE:
|
wordnet
|
WordNet synsets and senses to process. |
Source code in src/glazing/references/extractor.py
extract_framenet_relations(frames: list[Frame]) -> None
¶
Extract frame relations and FE mappings from FrameNet.
Processes frame-to-frame relations and frame element mappings.
| PARAMETER | DESCRIPTION |
|---|---|
frames
|
FrameNet frames to process.
TYPE:
|
Source code in src/glazing/references/extractor.py
extract_propbank_references(framesets: list[Frameset]) -> None
¶
Extract cross-references from PropBank framesets.
Processes rolesets to extract VerbNet and FrameNet mappings via rolelinks and lexlinks.
| PARAMETER | DESCRIPTION |
|---|---|
framesets
|
PropBank framesets to process.
TYPE:
|
Source code in src/glazing/references/extractor.py
extract_verbnet_references(verb_classes: list[VerbClass]) -> None
¶
Extract cross-references from VerbNet classes.
Processes VerbNet members to extract FrameNet, PropBank, and WordNet mappings. Handles subclasses recursively.
| PARAMETER | DESCRIPTION |
|---|---|
verb_classes
|
VerbNet classes to process.
TYPE:
|
Source code in src/glazing/references/extractor.py
extract_wordnet_mappings(synsets: list[Synset], senses: list[Sense]) -> None
¶
Build WordNet sense and synset indices.
Creates mappings between sense keys and synset offsets for cross-reference resolution.
| PARAMETER | DESCRIPTION |
|---|---|
synsets
|
WordNet synsets to index.
TYPE:
|
senses
|
WordNet senses to index.
TYPE:
|
Source code in src/glazing/references/extractor.py
get_mappings_for_entity(entity_id: str, source_dataset: DatasetType) -> list[CrossReference]
¶
Get all mappings for a specific entity.
| PARAMETER | DESCRIPTION |
|---|---|
entity_id
|
Entity identifier in the source dataset.
TYPE:
|
source_dataset
|
Source dataset type.
TYPE:
|
| RETURNS | DESCRIPTION |
|---|---|
list[CrossReference]
|
All mappings from the specified entity. |
Source code in src/glazing/references/extractor.py
get_reverse_mappings(entity_id: str, target_dataset: DatasetType) -> list[CrossReference]
¶
Get all mappings targeting a specific entity.
| PARAMETER | DESCRIPTION |
|---|---|
entity_id
|
Entity identifier in the target dataset.
TYPE:
|
target_dataset
|
Target dataset type.
TYPE:
|
| RETURNS | DESCRIPTION |
|---|---|
list[CrossReference]
|
All mappings to the specified entity. |