Sunita Sarawagi
My topics of interest span several fields including databases, data mining, machine learning and statistics. A good idea about my research interests can be obtained by following my publications. Some specific problems and projects on which I have worked are listed below.
* World Wide Tables: The goal of this project is to answer table queries by tapping partially structured sources like tables and lists on the web.
* Information Extraction and data integration: Recently, I have been interested in graphical models and their use for various extraction and integration problems. As part of this effort, I have developed a package for Conditional Random Fields (CRF) that can be downloaded from sourceforge.
* ALIAS: This is a prototype of an interesting and fairly compelling application of the use of machine learning techniques like Active Learning to ease the duplicate elimination task that arise in data cleaning.
* DATAMOLD: is a tool for Information Extraction (more l