CS Colloq: Manish Bhide, IBM Research (India)
Fri, Mar 05, 2010 @ 10:00 AM - 11:00 AM
Thomas Lord Department of Computer Science
Conferences, Lectures, & Seminars
Talk Title:
Part 1: IBM Research - India Overview
Part 2: Keyword Search over Dynamic Categorized Information
Speaker: Manish Bhide, IBM Research (India)
Abstract:
My talk will be in two parts. In the first part, I will give an overview of IBM Research - India, outlining the kind of work we do, the job opportunities, internship options, etc. In the second part, I will present one of my research projects, titled "Keyword Search over Dynamic Categorized Information".
The abstract of the technical talk is given below:
Consider an information repository whose content is categorized. A data item (in the repository) can belong to multiple categories and new data is continuously added to the system. In this talk, I will describe a system, CS*, which takes a keyword query and returns the relevant top-K categories.
In contrast, traditional keyword search returns the top-K documents (i.e., data items) relevant to a user query. The need to dynamically categorize new data and also update the meta-data required for fast responses to user queries poses interesting challenges. The brute-force approach of updating the meta-data by comparing each new data item with all the categories is impractical due to (i) the large cost involved in finding the categories associated with a data item and (ii) the high rate of arrival of new data items. We show that a sampling-based approach that provides statistical guarantees on the reported results is also impractical. We hence develop the CS* approach, whose effectiveness results from its ability to focus on a strategically chosen subset of categories on the one hand, and a subset of new data on the other. Given a query, CS* finds the top-K categories with high accuracy even in time-constrained situations.
Bio:
Manish Bhide is a Research Staff Member at IBM Research - India. He joined IBM Research in 2002 after finishing his master's at IIT Bombay. He is currently pursuing a part-time PhD at IIT Bombay (expected completion December 2010). His research interests are primarily in the area of information management. At IBM Research he has worked on areas such as XML, policy-based data management, information management issues in cloud computing, etc.
Location: Seaver Science Library (SSL) - 150
Audiences: Everyone Is Invited
Contact: CS Front Desk