SUNDAY, JUNE 3 | ||||
---|---|---|---|---|
8:45 | 9:00 | Welcome | ||
Session
I: Learning
Session Chair: Paola Velardi |
||||
9:00 | 9:25 | Limitations of Co-Training for Natural Language Learning from Large Datasets | ||
9:25 | 9:50 | A Sequential Model for Multi-Class Classification | ||
9:50 | 10:15 | Learning Within-Sentence Semantic Coherence | ||
10:15 | 10:45 | Break | ||
Session
II: Machine Translation
Session Chair: Kenneth Church |
||||
10:45 | 11:10 | Knowledge Sources for Word-Level Translation Models | ||
11:10 | 11:35 | Improving Lexical Mapping Model of English-Korean Bitext Using Structural Features | ||
11:35 | 11:45 | Short break | ||
11:45 | 12:45 | Invited
talk, Eric Brill
"Paucity Shmaucity -- What Can We Do With A Trillion Words?" |
||
12:45 | 2:00 | LUNCH | ||
Session
III: Text Categorization
Session Chair: Wessel Kraaij |
||||
2:00 | 2:25 | Stacking Classifiers for Anti-Spam Filtering of E-Mail | ||
2:25 | 2:50 | Feature Space Restructuring for SVMs with Application to Text Categorization | ||
2:50 | 3:15 | Using Bins to Empirically Estimate Term Weights for Text Categorization | ||
3:15 | 3:25 | Short break | ||
Session
IV: Question Answering and Information Extraction
Session Chair: Roman Yangarber |
||||
3:25 | 3:50 | Question Answering Using a Large Text Database: A Machine Learning Approach | ||
3:50 | 4:15 | Information Extraction Using the Structured Language Model | ||
4:15 | 4:30 | Short break | ||
4:30 | 5:30 | Panel:
When does EM work? (includes an introduction to the Expectation-Maximization
algorithm)
Eugene Charniak, Kevin Knight, Ted Pedersen, Stefan Riezler |
MONDAY, JUNE 4 | ||||
---|---|---|---|---|
Session
V: Lexical Acquisition and Text Segmentation
Session Chair: David Yarowsky |
||||
8:35 | 9:00 | Classifying the Semantic Relations in Noun Compounds via a Domain-Specific Lexical Hierarchy | ||
9:00 | 9:25 | The Unknown Word Problem: a Morphological Analysis of Japanese Using Maximum Entropy Aided by a Dictionary | ||
9:25 | 9:50 | Is Knowledge-Free Induction of Multiword Unit Dictionary Headwords a Solved Problem? | ||
9:50 | 10:15 | Latent Semantic Analysis for Text Segmentation | ||
10:15 | 10:45 | Break | ||
Session
VI: Applications
Session Chair: Marti Hearst |
||||
10:45 | 11:10 | Detecting Short Passages of Similar Text in Large Document Collections | ||
11:10 | 11:35 | Hybrid Text Mining for Finding Abbreviations and their Definitions | ||
11:35 | 11:45 | Short break | ||
11:45 | 12:45 | Panel:
What Works and What Doesn't? Industrial Perspectives
Adam Berger, David Evans, Joshua Goodman, Lynette Hirschman |
||
12:45 | 2:00 | LUNCH | ||
Session
VII: Spoken Language Output
Session Chair: Roni Rosenfeld |
||||
2:00 | 2:25 | Automatic Corpus-based Tone Prediction using K-ToBI Representation | ||
2:25 | 2:50 | Probabilistic Context-Free Grammars for Syllabification and Grapheme-to-Phoneme Conversion | ||
2:50 | 3:00 | Short break | ||
Session
VIII: POS Tagging and Corpus Analysis
Session Chair: Thorsten Brants |
||||
3:00 | 3:25 | Comparing Data-Driven Learning Algorithms for PoS Tagging of Swedish | ||
3:25 | 3:50 | Impact of Quality and Quantity of Corpora on Stochastic Generation | ||
3:50 | 4:15 | Corpus Variation and Parser Performance | ||
4:15 | 4:30 | Refreshments (close) |
Note: the formatting of this page comes from the version generated for the NAACL CDROM.