Déjà Image-Captions: A Corpus of Expressive Descriptions in Repetition

Jianfu Chen, Polina Kuznetsova, D. Warren, Yejin Choi

Published 2015 in North American Chapter of the Association for Computational Linguistics

ABSTRACT

We present a new approach to harvesting a large-scale, high-quality image-caption corpus that makes better use of existing web data with no additional human effort. The key idea is to focus on Déjà Image-Captions: naturally occurring image descriptions that are repeated almost verbatim by more than one individual for different images. The resulting corpus provides an association structure between 4 million images and 180K unique captions, capturing a rich spectrum of everyday narratives, including figurative and pragmatic language. Exploring the use of the new corpus, we also present the new conceptual tasks of visually situated paraphrasing, creative image captioning, and creative visual paraphrasing.
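The key idea in the abstract, keeping only captions that several people wrote almost verbatim for different images, can be illustrated with a minimal sketch. This is not the authors' pipeline: the `normalize` function (lowercasing and stripping punctuation as a stand-in for "almost verbatim"), the `find_repeated_captions` helper, the `min_users` threshold, and the toy records are all assumptions made for illustration.

```python
import re
from collections import defaultdict


def normalize(caption: str) -> str:
    """Lowercase, drop punctuation, and collapse whitespace.

    Assumed stand-in for 'almost verbatim' matching; the paper's
    actual matching criteria may differ.
    """
    text = re.sub(r"[^\w\s]", "", caption.lower())
    return " ".join(text.split())


def find_repeated_captions(records, min_users=2):
    """Group (image_id, user_id, caption) records by normalized caption.

    Returns {normalized caption: sorted image ids} for captions written
    by at least `min_users` distinct users on at least two distinct images.
    """
    images = defaultdict(set)
    users = defaultdict(set)
    for image_id, user_id, caption in records:
        key = normalize(caption)
        if not key:  # skip captions that are empty after normalization
            continue
        images[key].add(image_id)
        users[key].add(user_id)
    return {
        key: sorted(images[key])
        for key in images
        if len(users[key]) >= min_users and len(images[key]) >= 2
    }


# Hypothetical toy data: only the first caption is repeated by two users.
records = [
    ("img1", "alice", "The sky is on fire!"),
    ("img2", "bob", "the sky is on fire"),
    ("img3", "alice", "My cat"),
    ("img4", "alice", "my cat."),
    ("img5", "carol", "Sunday brunch"),
]
repeated = find_repeated_captions(records)
print(repeated)
```

In this sketch, "my cat" is dropped because both instances come from the same user, which mirrors the abstract's requirement that a caption be repeated by more than one individual.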

PUBLICATION RECORD

Publication year
2015
Venue
North American Chapter of the Association for Computational Linguistics
Fields of study
Linguistics, Computer Science
Identifiers
DOI 10.3115/v1/N15-1053

REFERENCES

Refer-to-as Relations as Semantic Knowledge
2015, cited by this paper
TreeTalk: Composition and Compression of Trees for Image Descriptions
2014, cited by this paper
Metaphor Detection with Cross-Lingual Model Transfer
2014, cited by this paper
Semantic Parsing via Paraphrasing
2014, cited by this paper
Improving Image-Sentence Embeddings Using Large Weakly Annotated Photo Collections
2014, cited by this paper
Grounded Compositional Semantics for Finding and Describing Images with Sentences
2014, cited by this paper
Microsoft COCO: Common Objects in Context
2014, cited by this paper
Comparing Automatic Evaluation Measures for Image Description
2014, cited by this paper
Nonparametric Method for Data-driven Image Captioning
2014, cited by this paper
Understanding and Quantifying Creativity in Lexical Composition
2013, cited by this paper
Generalizing Image Captions for Image-Text Parallel Corpus
2013, cited by this paper
From Large Scale Image Categorization to Entry-Level Categories
2013, cited by this paper
PPDB: The Paraphrase Database
2013, cited by this paper
Harvesting Parallel News Streams to Generate Paraphrases of Event Relations
2013, cited by this paper
Data-Driven Metaphor Recognition and Explanation
2013, cited by this paper
Sentence-Based Image Description with Scalable, Explicit Models
2013, cited by this paper
Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics
2013, influential reference
Detecting Visual Text
2012, cited by this paper
A Computational Approach to the Automation of Creative Naming
2012, cited by this paper
Visual and semantic similarity in ImageNet
2011, cited by this paper
Im2Text: Describing Images Using 1 Million Captioned Photographs
2011, influential reference
Creative Language Retrieval: A Robust Hybrid of Information Retrieval and Linguistic Creativity
2011, cited by this paper
Collecting Highly Parallel Data for Paraphrase Evaluation
2011, cited by this paper
Models of Metaphor in NLP
2010, cited by this paper
A new approach to cross-modal multimedia retrieval
2010, cited by this paper
Automatic Attribute Discovery and Characterization from Noisy Web Data
2010, cited by this paper
Collecting Image Annotations Using Amazon’s Mechanical Turk
2010, cited by this paper
ImageNet: A large-scale hierarchical image database
2009, cited by this paper
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition
2008, cited by this paper
In defense of Nearest-Neighbor based image classification
2008, cited by this paper
Bridging the Gap: Query by Semantic Example
2007, cited by this paper
Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis
2005, cited by this paper
METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments
2005, cited by this paper
Canonical Correlation Analysis: An Overview with Application to Learning Methods
2004, cited by this paper
Unsupervised Construction of Large Paraphrase Corpora: Exploiting Massively Parallel News Sources
2004, cited by this paper
Syntax-based Alignment of Multiple Translations: Extracting Paraphrases and Generating New Sentences
2003, cited by this paper
Bleu: a Method for Automatic Evaluation of Machine Translation
2002, cited by this paper
Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope
2001, cited by this paper
Extracting Paraphrases from a Parallel Corpus
2001, cited by this paper
WordNet: A Lexical Database for English
1995, cited by this paper

CITED BY

Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy and Novel Ensemble Method
2024, cites this paper
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends, and Metrics Analysis
2024, cites this paper
Image Encoder and Sentence Decoder Based Video Event Description Generating Model: A Storytelling
2022, cites this paper
PINEAPPLE: Personifying INanimate Entities by Acquiring Parallel Personification Data for Learning Enhanced Generation
2022, cites this paper
Machine-in-the-Loop Rewriting for Creative Image Captioning
2021, influential citation
Deep Learning-Based Short Story Generation for an Image Using the Encoder-Decoder Structure
2021, cites this paper
Image Captioning through Cognitive IOT and Machine-Learning Approaches
2021, cites this paper
Chapter 9. New developments in corpus approaches to social media
2020, cites this paper
Chapter 7. Constructing corpora from images and text
2020, cites this paper
"I Hope This Is Helpful"
2020, cites this paper
Robust Image Captioning
2020, influential citation
Automatic Image Description Generation Using Deep Multimodal Embeddings
2020, influential citation
Captioning Images Taken by People Who Are Blind
2020, cites this paper
Vision to Language: Methods, Metrics and Datasets
2020, cites this paper
Chapter 8. Working with images and emoji in the 🦆 Dukki Facebook Corpus
2020, cites this paper
Generating Diverse and Descriptive Image Captions Using Visual Paraphrases
2019, cites this paper
Evaluation of Multiple Approaches for Visual Question Reasoning
2018, cites this paper
From image to language and back again
2018, cites this paper
Stories for Images-in-Sequence by using Visual and Narrative Components
2018, cites this paper
Defoiling Foiled Image Captions
2018, cites this paper
IntPhys: A Framework and Benchmark for Visual Intuitive Physics Reasoning
2018, cites this paper
Exploring the Internal Statistics: Single Image Super-Resolution, Completion and Captioning
2017, cites this paper
Задачи и методы ресурсосберегающей оптимизации в электроэнергетической системе [Problems and Methods of Resource-Saving Optimization in the Electric Power System]
2017, cites this paper
Building a Non-Trivial Paraphrase Corpus Using Multiple Machine Translation Systems
2017, cites this paper
Computer Vision for Political Science Research: A Study of Online Protest Images
2017, cites this paper
Using Internet based paraphrasing tools: Original work, patchwriting or facilitated plagiarism?
2017, cites this paper
Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation
2017, cites this paper
Self-Guiding Multimodal LSTM—When We Do Not Have a Perfect Training Dataset for Image Captioning
2017, cites this paper
Leveraging Captions in the Wild to Improve Object Detection
2016, cites this paper
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
2016, cites this paper
Album Story 1 Description for Images in Isolation & in Sequences Re-telling Story 1 Caption in Sequence Storytelling Story 2 Story 3 Re-telling Preferred Photo Sequence Story 4 Story
2016, cites this paper
1 Million Captioned Dutch Newspaper Images
2016, cites this paper
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures
2016, influential citation
A Corpus of Images and Text in Online News
2016, cites this paper
Generating Natural Questions About an Image
2016, cites this paper
Cross-validating Image Description Datasets and Evaluation Metrics
2016, influential citation
Learning Prototypical Event Structure from Photo Albums
2016, cites this paper
Visual Storytelling
2016, cites this paper
Exploring Nearest Neighbor Approaches for Image Captioning
2015, cites this paper
Microsoft COCO Captions: Data Collection and Evaluation Server
2015, cites this paper
A Survey of Current Datasets for Vision and Language Research
2015, cites this paper
On Available Corpora for Empirical Methods in Vision & Language
2015, cites this paper
Images as Data: Computer Vision for Social Science Research
year unknown, cites this paper