DSpace Repository

Language Grounding in Massive Online Data

Show simple item record

dc.contributor.advisor Warren, David S en_US
dc.contributor.author Chen, Jianfu en_US
dc.contributor.other Department of Computer Science. en_US
dc.date.accessioned 2017-09-20T16:52:19Z
dc.date.available 2017-09-20T16:52:19Z
dc.date.issued 2015-12-01 en_US
dc.identifier.uri http://hdl.handle.net/11401/77273 en_US
dc.description 93 pg. en_US
dc.description.abstract Truly understanding natural language requires grounding language to perceptions and actions in the physical and social world. This goes beyond studying the textual modality alone. Today's web not only has sheer volume of data, but also increasingly multi-modal data, intertwining text with videos, images, audios, and ontologies that are perceptions or abstractions of people's everyday life. Hence the web provides rich and ever growing resources for studying grounded language. This thesis presents a series of investigations of language woven into various types of online data, ranging from ontology and images to time series. We contribute data distillation approaches and large-scale datasets connecting language to vision, a collection of models and algorithms, and multiple novel applications in hierarchical product classification, image description, and photo album summarization. en_US
dc.description.sponsorship This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree. en_US
dc.format Monograph en_US
dc.format.medium Electronic Resource en_US
dc.language.iso en_US en_US
dc.publisher The Graduate School, Stony Brook University: Stony Brook, NY. en_US
dc.subject.lcsh Computer science en_US
dc.subject.other big data, computer vision, language grounding, Natural Langue Processing, web en_US
dc.title Language Grounding in Massive Online Data en_US
dc.type Dissertation en_US
dc.mimetype Application/PDF en_US
dc.contributor.committeemember Warren, David en_US
dc.contributor.committeemember Fodor, Paul en_US
dc.contributor.committeemember Ramakrishnan, I.V. en_US
dc.contributor.committeemember Choi, Yejin en_US
dc.contributor.committeemember Hajishirzi, Hannaneh. en_US

Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace

Advanced Search


My Account