Automatic Extraction of Subcategorization Frames for Czech

Published 2000 in International Conference on Computational Linguistics

ABSTRACT

We present some novel machine learning techniques for the identification of subcategorization information for verbs in Czech. We compare three different statistical techniques applied to this problem. We show how the learning algorithm can be used to discover previously unknown subcategorization frames from the Czech Prague Dependency Treebank. The algorithm can then be used to label dependents of a verb in the Czech treebank as either arguments or adjuncts. Using our techniques, we are able to achieve 88% precision on unseen parsed text.

PUBLICATION RECORD

Publication year
2000
Venue
International Conference on Computational Linguistics
Publication date
2000-07-31
Fields of study
Linguistics, Computer Science
Identifiers
DOI 10.3115/992730.992746 arXiv cs/0009003
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

The Automatic Acquisition of Frequencies of Verb Subcategorization Frames from Tagged Corpora
2002influential reference
Automatic Verb Classification Using Distributions of Grammatical Features
1999cited by this paper
Supervised Learning of Lexical Semantic Verb Classes Using Frequency Distributions
1999cited by this paper
Using Subcategorization to Resolve Verb Class Ambiguity
1999cited by this paper
Acquiring Lexical Generalizations from Corpora: A Case Study for Diathesis Alternations
1999cited by this paper
Can Subcategorisation Probabilities Help a Statistical Parser
1998influential reference
Valence Induction with a Head-Lexicalized PCFG
1998cited by this paper
Tagging Inflective Languages: Prediction of Morphological Categories for a Rich Structured Tagset
1998cited by this paper
Automatic Extraction of Subcategorization from Corpora
1997influential reference
A statistical syntactic disambiguation program and what it learns
1995cited by this paper
Accurate Methods for the Statistics of Surprise and Coincidence
1993cited by this paper
From Grammar to Lexicon: Unsupervised Learning of Lexical Syntax
1993influential reference
Automatic Acquisition of a Large Sub Categorization Dictionary From Corpora
1993influential reference
Automatic Acquisition of Subcategorization Frames from Tagged Text
1991influential reference
Automatic Acquisition of Subcategorization Frames From Untagged Text
1991cited by this paper
Automatic Acquisition of the Lexical Semantics of Verbs From Sentence Frames
1989cited by this paper
Mathematical Statistics: Basic Ideas and Selected Topics
1977cited by this paper
Mathematical Statistics
1944cited by this paper
Tagging In ective Languages: Prediction of Morphological Categories for a Rich, Structured Tagset
year unknowncited by this paper

CITED BY

A Frequency-Based Algorithm for Argument Extraction from Russian Treebanks
2025influential citation
Use words, not constructions!
2022cites this paper
Surface Realisation from Knowledge Bases. (Bases de Connaissances et Réalisation de Surface)
2016cites this paper
A Domain Agnostic Approach to Verbalizing n-ary Events without Parallel Corpora
2015cites this paper
Enhancing FreeLing Rule-Based Dependency Grammars with Subcategorization Frames
2015cites this paper
A Hebrew verb–complement dictionary
2014influential citation
Synergistic development of grammatical resources: a valence dictionary, an LFG grammar, and an LFG structure bank for Polish
2014cites this paper
Morphologically and Syntactically Annotated Corpora of Many Languages Current State , the Problem and its Significance
2014cites this paper
Subcategorisation Acquisition from Raw Text for a Free Word-Order Language
2014cites this paper
Towards Automatic Detection of Applicable Diatheses
2013influential citation
Using subcategorization knowledge to improve case prediction for translation to German
2013cites this paper
A Hebrew verb–complement dictionary
2013influential citation
한국어 용언 위계구조 자동구축
2012cites this paper
Incorporating Linguistic Knowledge in Statistical Machine Translation: Translating Prepositions
2012cites this paper
Extracción automática de los patrones de rección de verbos de los diccionarios explicativos
2012cites this paper
Content Extraction based on Hierarchical Relations in DOM Structures
2012cites this paper
Automatic Extraction of Semantic Valences of Verbs from Explanatory Dictionaries
2012cites this paper
Subcategorization Acquisition and Classes of Predication in Urdu
2011cites this paper
Inferring Subcat Frames of Verbs in Urdu
2010cites this paper
IRASubcat, a highly parametrizable, language independent tool for the acquisition of verbal subcategorization information from corpus
2010cites this paper
IRASubcat, a highly parametrizable, language independent tool for the acquisition of verbal subcategorization information from corpus
2010cites this paper
German clause-embedding predicates : an extraction and classification approach
2010cites this paper
An experiment in verb valency frame extraction from Croatian Dependency Treebank
2010cites this paper
Analysis of Definitions of Verbs in an Explanatory Dictionary for Automatic Extraction of Actants Based on Detection of Patterns
2010cites this paper
Fully Unsupervised Core-Adjunct Argument Classification
2010cites this paper
A treebank-driven investigation of predicative complements in Dutch : An efficient, practical, actually usable approach
2009cites this paper
Acquiring Verb Subcategorization Frames in Bengali from Corpora
2009cites this paper
Bengali Verb Subcategorization Frame Acquisition - A Baseline Model
2009cites this paper
The effect of borderline examples on language learning
2009cites this paper
Chinese Subcategorization Annotation Based on Machine Learning
2009influential citation
Chinese Verb Subcategorization Acquisition from Noisy Data on Sentence Level
2009cites this paper
Automatic extraction of subcategorization frames for Italian
2008influential citation
The Xavier Module – Information Processing of Treebanks
2008cites this paper
Exploiting Linguistic Data in Machine Translation
2008cites this paper
A procedure to automatically enrich verbal lexica with subcategorization frames
2008cites this paper
Learning verb complements for Modern Greek: balancing the noisy dataset
2008influential citation
Automatic Acquisition of Hungarian Subcategorization Frames
2008cites this paper
Automatic construction of Korean verbal type hierarchy using Treebank
2008cites this paper
Hybrid Methods for Acquisition of Lexical Information: the Case for Verbs
2008influential citation
Automatic Acquisition of Subcategorization Frames for Turkish with Purely Statistical Methods
2007cites this paper
Inducción de Clases de Comportamiento Verbal a partir del Corpus SENSEM
2007cites this paper
Valence extraction using EM selection and co-occurrence matrices
2007influential citation
Improving English Subcategorization Acquisition with Diathesis Alternations as Heuristic Information
2006cites this paper
Automatic extraction of subcategorization frames for French
2006cites this paper
Romanian Valence Dictionary in XML Format
2006cites this paper
Unsupervised Learning of Verb Argument Structures
2006cites this paper
Parsing and Subcategorization Data
2006cites this paper
Probabilistic word sense disambiguation: analysis and techniques for combining knowledge sources
2006cites this paper
Two-Fold Filtering for Chinese Subcategorization Acquisition with Diathesis Alternations Used as Heuristic Information
2006cites this paper
Robust Extraction of Subcategorization Data from Spoken Language
2005cites this paper
Automatic Extraction of Subcategorization Frames for Bulgarian
2005influential citation
Automatic Extraction of Subcategorization Frames from Spoken Corpora
2005cites this paper
Baseline Experiments in the Extraction of Polish Valence Frames
2005cites this paper
Large-Scale Induction and Evaluation of Lexical Resources from the Penn-II and Penn-III Treebanks
2005cites this paper
Métodos para análise discursiva automática
2005cites this paper
Collaborative and corpus-driven approaches towards lexicalized grammar-based natural language processing
2005cites this paper
Automatic Extraction of Polish Verb Subcategorization An Evaluation of Common Statistics
2005cites this paper
Informatická Sekce Matematicko–fyzikální Fakulta Rdf Vizualizátor * Rodičovské Investice Uměl´ych Bytostí Implementace Unifikační Gramatiky pro Strojov´y Překlad Learning of Multilayer Perceptrons with Piecewise-linear Activation Functions Faculty of Nuclear Science and Physical Engineering, ˇ Cvut
2005cites this paper
Generalizing Subcategorization Frames Acquired from Corpora Using Lexicalized Grammars
2004cites this paper
Subcategorization Acquisition and Evaluation for Chinese Verbs
2004cites this paper
Learning Greek Verb Complements: Addressing the Class Imbalance
2004cites this paper
Automatic Extraction of Subcategorization Frames from the Bulgarian Tree Bank
2004cites this paper
Czech Syntactic Analysis Constraint-based - XDG: One Possible Start
2004cites this paper
Problems of Inducing Large Coverage Constraint-Based Dependency Grammar for Czech
2004cites this paper
Towards Automatic Extraction of Verb Frames
2003cites this paper
Improving Subcategorization Acquisition Using Word Sense Disambiguation
2003cites this paper
Application of finite-state transducers to the acquisition of verb subcategorization information
2003cites this paper
Improving Subcategorization Acquisition with WSD
2002cites this paper
Learning Verb Argument Structure from Minimally Annotated Corpora
2002influential citation
Subcategorization acquisition
2002influential citation
Combining labeled and unlabeled data in statistical natural language parsing
2002cites this paper
Semantically Motivated Subcategorization Acquisition
2002cites this paper
Learning Argument/Adjunct Dictinction for Basque
2002cites this paper
Combining Bayesian and Support Vector Machines Learning to automatically complete Syntactical Information for HPSG-like Formalisms
2002cites this paper
Can Subcategorization Help a Statistical Dependency Parser?
2002influential citation
Extracting semantic classes and morphosyntactic features for English-Polish machine translation.
2002cites this paper
Subcategorization Acquisition as an Evaluation Method for WSD
2002cites this paper
Learning automatic acquisition of subcategorization frames using Bayesian inference and support vector machines
2001cites this paper
Influence of Conditional Independence Assumption on Verb Subcategorization Detection
2001cites this paper
Extracting Dependency Frames from Existing Lexical Resources
2001cites this paper
Hybrid Filtering for Extraction of Term Candidates from German Technical Texts
2001cites this paper
Smoothing a probablistic lexicon via syntactic transformations
2001cites this paper
From dictionary to corpus to self-organizing dictionary: learning valency associations in the face of variation and change
2001cites this paper
Statistical Filtering and Subcategorization Frame Acquisition
2000cites this paper
Using Semantically Motivated Estimates to Help Subcategorization Acquisition
2000cites this paper
Building Sub-corpora Suitable for Extraction of Lexico-Syntactic Information
year unknowncites this paper
Tools and procedures for the acquisition of morphological and syntactic information from corpora 1
year unknowncites this paper