Logo: University of Southern California

Events Calendar



Select a calendar:



Filter February Events by Event Type:


SUNMONTUEWEDTHUFRISAT
26
27
28
29
30
1

2
3
4
5
6
7
8

9
10
11
12
13
15

16
17
18
19
21
22

23
24
25
26
27
1


Conferences, Lectures, & Seminars
Events for February

  • AI Seminar-Fabio Rinaldi:

    AI Seminar-Fabio Rinaldi:

    Fri, Feb 14, 2014 @ 11:00 AM - 12:00 PM

    Information Sciences Institute

    Conferences, Lectures, & Seminars


    Speaker: Fabio Rinaldi, Senior Researcher, Lecturer, and PI at the University of Zurich, Switzerland

    Talk Title: OntoGene & SASEBio: biomedical text mining research at UZH

    Series: AISeminar

    Abstract: There are vast amounts of knowledge encoded in the scientific literature which could be made more easily accessible and useful to a broader range of users through the application of more effective software tools. Text mining is a new discipline which seeks to provide ways to find, extract and manipulate the knowledge which still remains to a large extent hidden in the literature.

    Text mining tools can already provide a very effective way to extract some specific types of information, but are not yet so advanced that their results can be used without human verification by domain experts. Therefore one very promising area of application of text mining technologies is within the process of database curation.

    The need to efficiently retrieve key information derived from experimental results, and published in the scientific literature, is of fundamental importance in biology. In order to help biologists, as well as in some cases medical practitioners, to efficiently find such
    information in the enormous quantity of published articles, several public and private institutions fund the construction and maintenance of specialized databases, which have the role to collect specific knowledge items and provide them in an easily accessible format. There are several dozens of such databases, each specializing in a
    particular domain of the life sciences [1].

    In this talk I will describe text mining activities conducted by my research group at the University of Zurich (OntoGene: www.ontogene.org). The OntoGene group is supported by the Swiss National Science Foundation (project SASEBIO: Semi-Automated Semantic
    Enrichment of the Biomedical Literature) and by Roche Pharmaceuticals. The SASEBio project focuses in particular on applications of text mining technologies to the process of biomedical database curation.

    The OntoGene team has participated in several competitive evaluations of biomedical text mining technologies, obtaining competitive results in all of them. Some of these results will be discussed in the talk. Additionally, I will present ODIN (OntoGene Document Inspector), an interactive tool which allows database curators to leverage upon the results of the OntoGene text mining system and use them in their
    curation tasks.

    ---
    [1] Xose M. Fernandez-Suarez, Daniel J. Rigden, and Michael Y. Galperin. The 2014 nucleic acids research database issue and an updated NAR online molecular biology database collection. Nucleic Acids Research, 42(D1):D1-D6, 2014

    The OntoGene text mining system is based on a scalable entity recognition component with a semi-automated organism-based disambiguation module, an in-house dependency parser, and a flexible relation mining approach. The OntoGene team has participated in several biomedical text mining challenges (BioCreative, BioNLP,
    CALBC), obtaining competitive results in all of them. Some of these results will be discussed in the talk.

    The OntoGene Document Inspector (ODIN) is an interactive tool which allows database curators to leverage upon the results of the OntoGene text mining system and use them in their curation tasks. One recent version of the system has been tested in the curation process of the Pharmacogenomics Knowledge Base (PharmGKB), and another version
    adapted for the Comparative Toxicogenomics Database in the context of
    a BioCreative challenge.

    Biography: Fabio Rinaldi is the leader of the OntoGene research group at the University of Zurich and the principal investigator of the SASEBio project. He holds an MSc in Computer Science (University of Udine, Italy) and a PhD in Computational Linguistics (University of Zurich, Switzerland). He is author of more than 100 scientific publications (including 19 journal papers) dealing with topics such as Ontologies, Text Mining, Text Classification, Document and Knowledge Management, Language Resources and Terminology.

    Host: David Chiang

    Webcast: http://webcasterms1.isi.edu/mediasite/Viewer/?peid=7bf4d5a5d7404d249254a2b96006ea6e1d

    Location: Information Science Institute (ISI) - 11th fl Large CR

    WebCast Link: http://webcasterms1.isi.edu/mediasite/Viewer/?peid=7bf4d5a5d7404d249254a2b96006ea6e1d

    Audiences: Everyone Is Invited

    Contact: Alma Nava / Information Sciences Institute


    This event is open to all eligible individuals. USC Viterbi operates all of its activities consistent with the University's Notice of Non-Discrimination. Eligibility is not determined based on race, sex, ethnicity, sexual orientation, or any other prohibited factor.

  • NL Seminar- Hal Daume: "Predicting Linguistic Structures Accurately and Efficiently"

    Fri, Feb 14, 2014 @ 03:00 PM - 04:00 PM

    Information Sciences Institute

    Conferences, Lectures, & Seminars


    Speaker: Hal Daume, University of Maryland

    Talk Title: "Predicting Linguistic Structures Accurately and Efficiently"

    Series: Natural Language Seminar

    Abstract: Many classic problems in natural language processing can be cast as building mapping from a complex input (e.g., a sequence of words) to a complex output (e.g., a syntax tree or semantic graph). This task is challenging both because language is ambiguous (learning difficulties) and represented with discrete combinatorial structures (computational difficulties). Often these are at odds: the features you want to add to decrease learning difficulties cause nontrivial additional structure yielding worse computational difficulties.

    I will begin by discussing algorithms that side-step the issue of combinatorial blowup and aim to predict an output structure directly. I will then present approaches that explicitly learn to trade-off accuracy and efficiency, applied to a variety of linguistic phenomena. Moreover, I will show that in some cases, we can actually obtain a model that is faster and more accurate by exploiting smarter learning algorithms.



    Biography: http://www.umiacs.umd.edu/~hal/

    Host: Yang Gao

    More Info: http://nlg.isi.edu/nl-seminar/

    Location: Information Science Institute (ISI) - 11th Flr Conf Rm # 1135, Marina Del Rey

    Audiences: Everyone Is Invited

    Contact: Peter Zamar

    Event Link: http://nlg.isi.edu/nl-seminar/


    This event is open to all eligible individuals. USC Viterbi operates all of its activities consistent with the University's Notice of Non-Discrimination. Eligibility is not determined based on race, sex, ethnicity, sexual orientation, or any other prohibited factor.

  • AI Seminar- Big Data Curation

    Thu, Feb 20, 2014 @ 02:30 PM - 03:30 PM

    Information Sciences Institute

    Conferences, Lectures, & Seminars


    Speaker: Renee Miller , Bell Canada Chair of Information Systems University of Toronto

    Talk Title: Big Data Curation

    Series: Artificial Intelligence Seminar

    Abstract: A new mode of inquiry, problem solving, and decision making has become pervasive in our society, consisting of applying computational, mathematical, and statistical models to infer actionable information from large quantities of data. This paradigm, often called Big Data Analytics or simply Big Data, requires new forms of data management to deal with the volume, variety, and velocity of Big Data. Many of these data management problems can be described as data curation. Data curation includes all the processes needed for principled and controlled data creation, maintenance, and management, together with the capacity to add value to data. In this talk, I describe our experience in curating several open data sets. I overview how we have adapted some of the traditional solutions for aligning data and creating semantics to account for (and take advantage of) Big Data.




    Biography: Renée J. Miller received BS degrees in Mathematics and in Cognitive Science from the Massachusetts Institute of Technology. She received her MS and PhD degrees in Computer Science from the University of Wisconsin in Madison, WI. She is a Fellow of the Royal Society of Canada (Canada's National Academy) and the Bell Canada Chair of Information Systems at the University of Toronto. She received the US Presidential Early Career Award for Scientists and Engineers (PECASE) , the highest honor bestowed by the United States government on outstanding scientists and engineers beginning their careers and the National Science Foundation Career Award. She is a Fellow of the ACM, the President of the VLDB Endowment, and was the Program Chair for ACM SIGMOD 2011 in Athens, Greece. Her work has focused on the long-standing open problem of data integration and has achieved the goal of building practical data integration systems. She and her co-authors received the ICDT Test-of-Time Award for their influential 2003 paper establishing the foundations of data exchange.

    http://dblab.cs.toronto.edu/~miller/

    Host: Craig Knoblock

    Webcast: TBA

    Location: Information Science Institute (ISI) - 11th Flr Conf Rm # 1135, Marina Del Rey

    WebCast Link: TBA

    Audiences: Everyone Is Invited

    Contact: Peter Zamar


    This event is open to all eligible individuals. USC Viterbi operates all of its activities consistent with the University's Notice of Non-Discrimination. Eligibility is not determined based on race, sex, ethnicity, sexual orientation, or any other prohibited factor.

  • NL Seminar- Kenji Sagae:Dependency parsing with directed graph output

    Fri, Feb 28, 2014 @ 03:00 PM - 04:00 PM

    Information Sciences Institute

    Conferences, Lectures, & Seminars


    Speaker: Kenji Sagae, USC/ ICT

    Talk Title: Dependency parsing with directed graph output

    Series: Natural Language Seminar

    Abstract: Most data-driven dependency parsing approaches assume that the structure of sentences is represented as trees. Although trees have several desirable properties from a computational perspective, the structure of linguistic phenomena that go beyond shallow syntax often cannot be fully captured by tree representations. I will describe data-driven dependency parsing approaches that produce more general graphs as output, and present results obtained with these approaches on predicate-argument structures extracted from CCG and HPSG datasets.



    Biography: Kenji Sagae is a Research Scientist in the Institute for Creative Technolgies at the University of Southern California, and a Research Assistant Professor in the USC Computer Science Department. He received his PhD from Carnegie Mellon University in 2006. Prior to joining USC in 2008, he was a research associate at the University of Tokyo. His main area of research is Natural Language Processing, focusing on data-driven approaches for syntactic parsing, predicate-argument analysis and discourse processing. His current work includes the application of these techniques in analysis of personal narratives in blog posts, the study of child language, spoken dialogue systems, and multimodal processing.

    Home Page
    http://ict.usc.edu/profile/kenji-sagae/

    Host: Kevin Knight & Yang Gao

    More Info: http://nlg.isi.edu/nl-seminar/

    Location: 11th Flr Conf Rm # 1135, Marina Del Rey @ ISI-Info Sciences Inst.

    Audiences: Everyone Is Invited

    Contact: Peter Zamar

    Event Link: http://nlg.isi.edu/nl-seminar/


    This event is open to all eligible individuals. USC Viterbi operates all of its activities consistent with the University's Notice of Non-Discrimination. Eligibility is not determined based on race, sex, ethnicity, sexual orientation, or any other prohibited factor.