Extended bag-of-words formalism for image classification | Theses.fr

Sandra Eliza Fontes De Avila

Le moteur de recherche
des thèses françaises

Désactiver l'aide à la saisie

Extension du modèle par sac de mots visuels pour la classification d'images

FR |

EN

Auteur / Autrice :	Sandra Eliza Fontes De Avila
Direction :	Matthieu Cord
Type :	Thèse de doctorat
Discipline(s) :	Informatique
Date :	Soutenance en 2013
Etablissement(s) :	Paris 6

Mots clés

FR

Mots clés contrôlés

Classification automatique

Télédétection

Traitement d'images

Reconnaissance des formes (informatique)

Vision par ordinateur

Résumé

FR |

EN

In this dissertation, we have addressed the problem of representing images based on their visual information. Our aim is content-based concept detection in images and videos, with a novel representation that enriches the Bag-of-Words model. Relying on the quantization of highly discriminant local descriptors by a codebook, and the aggregation of those quantized descriptors into a single pooled feature vector, the Bag-of-Words model has emerged as the most promising approach for image classification. We propose BossaNova, a novel image representation which offers a more information-preserving pooling operation based on a distance-to-codeword distribution. The experimental evaluations on many challenging image classification benchmarks, such as ImageCLEF Photo Annotation, MIRFLICKR, PASCAL VOC and 15-Scenes, have shown the advantage of BossaNova when compared to traditional techniques, even without using complex combinations of different local descriptors. An extension of our approach has also been studied. It concerns the combination of BossaNova representation with another representation very competitive based on Fisher Vectors. The results consistently reaches other state-of-the-art representations in many datasets. It also experimentally demonstrate the complementarity of the two approaches. This study allowed us to achieve, in the competition ImageCLEF 2012 Flickr Photo Annotation Task, the 2nd among the 28 visual submissions.

Le moteur de recherche
des thèses françaises

Les thèses

Les personnes
liées aux thèses

Extension du modèle par sac de mots visuels pour la classification d'images

Mots clés

Mots clés contrôlés

Résumé

Le moteur de recherche des thèses françaises

Les thèses

Les personnes liées aux thèses

Recherche Avancée

Extension du modèle par sac de mots visuels pour la classification d'images

Mots clés

Mots clés contrôlés

Résumé

Le moteur de recherche
des thèses françaises

Les personnes
liées aux thèses