Skip to main navigation Skip to search Skip to main content

Detecting visual text

  • Jesse Dodge
  • , Amit Goyal
  • , Xufeng Han
  • , Alyssa Mensch
  • , Margaret Mitchell
  • , Karl Stratos
  • , Kota Yamaguchi
  • , Yejin Choi
  • , Hal Daumé
  • , Alexander C. Berg
  • , Tamara L. Berg

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

When people describe a scene, they often include information that is not visually apparent; sometimes based on background knowledge, sometimes to tell a story. We aim to separate visual text - descriptions of what is being seen - from non-visual text in natural images and their descriptions. To do so, we first concretely define what it means to be visual, annotate visual text and then develop algorithms to automatically classify noun phrases as visual or non-visual. We find that using text alone, we are able to achieve high accuracies at this task, and that incorporating features derived from computer vision algorithms improves performance. Finally, we show that we can reliably mine visual nouns and adjectives from large corpora and that we can use these effectively in the classification task.

Original languageAmerican English
Title of host publicationProceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics
Subtitle of host publicationHuman Language Technologies
PublisherAssociation for Computational Linguistics (ACL)
Pages762-772
Number of pages11
ISBN (Electronic)1937284204, 9781937284206
StatePublished - 2012
Externally publishedYes
Event2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2012 - Montreal, Canada
Duration: Jun 3 2012Jun 8 2012

Publication series

NameNAACL HLT 2012 - 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference

Conference

Conference2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2012
Country/TerritoryCanada
CityMontreal
Period6/3/126/8/12

ASJC Scopus subject areas

  • Language and Linguistics
  • Computer Science Applications
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Detecting visual text'. Together they form a unique fingerprint.

Cite this