Automatic tag correction in videos : an approach based on frequent pattern mining

Hoang Tung Tran

Thesis, Year: 2014

Correction automatique d’annotations de vidéos : une approche à base de fouille de motifs fréquents

Hoang Tung Tran

Role: Author
PersonId : 784604
IdRef : 204754801

Abstract

This thesis presents a new video auto-tagging system that aims to correct the tags users provide for videos uploaded to the Internet. Most existing auto-tagging systems rely mainly on textual information and learn a large number of classifiers (one per possible tag) to tag new videos. However, existing user-provided video annotations are often incorrect and incomplete. Users uploading videos may want to boost their view counts quickly by adding popular tags that are irrelevant to the video. They may also forget an obvious tag that would greatly help indexing. In this thesis, we limit the use of this questionable textual information and do not build a supervised model to perform tag propagation. We propose to directly compare the visual content of the videos, described by different sets of features such as SIFT-based bags of visual words or frequent patterns built from them. We then propose an original tag correction strategy based on the frequency of the tags in the visual neighborhood of each video. We also introduce a number of strategies and datasets to evaluate our system. The experiments show that our method can effectively improve the existing tags and that frequent patterns built from bags of visual words are useful for constructing accurate visual features.

This thesis presents a system for automatically correcting the annotations (tags) supplied by users who upload videos to multimedia sharing sites on the Internet. Most existing automatic annotation systems rely mainly on the textual information that users provide alongside the video, and learn a large number of classifiers to label a new video. However, user-provided annotations are often incomplete and incorrect. Indeed, a user may want to artificially inflate a video's view count by adding irrelevant tags. In this thesis, we limit the use of this questionable textual information and do not learn a model to propagate annotations between videos. We propose to compare the visual content of videos directly, using different sets of features such as bags of visual words based on SIFT descriptors, or frequent patterns built from these bags. We then propose an original annotation correction strategy based on the frequency of the annotations of the videos that are visually close to the video we seek to correct. We also propose evaluation strategies and datasets to assess our approach. Our experiments show that our system can effectively improve the quality of the provided annotations and that frequent patterns built from bags of visual words are relevant visual features.
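
The correction strategy described in the abstract (using the frequency of tags among visually similar videos) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the thesis: the function name `correct_tags`, the use of Euclidean distance on L1-normalised bag-of-visual-words histograms, the neighborhood size `k`, the threshold `add_ratio`, and the rule that drops a tag when no neighbor carries it.

```python
from collections import Counter
import math


def normalise(hist):
    """L1-normalise a bag-of-visual-words histogram (assumes a non-zero sum)."""
    total = sum(hist)
    return [v / total for v in hist]


def correct_tags(histograms, tags, query, k=2, add_ratio=0.6):
    """Hypothetical neighbourhood-frequency tag correction for one video.

    histograms: one visual-word count vector per video
    tags:       one set of user-provided tags per video
    query:      index of the video whose tags are corrected
    """
    feats = [normalise(h) for h in histograms]

    # Rank all other videos by visual distance to the query (assumed: Euclidean).
    others = [i for i in range(len(feats)) if i != query]
    others.sort(key=lambda i: math.dist(feats[query], feats[i]))
    neighbours = others[:k]

    # Count how many neighbours carry each tag.
    counts = Counter(t for n in neighbours for t in tags[n])

    # Keep a user tag only if at least one neighbour also has it.
    kept = {t for t in tags[query] if counts[t] > 0}
    # Add a tag when a large enough share of the neighbours has it.
    added = {t for t, c in counts.items() if c / len(neighbours) >= add_ratio}
    return kept | added


# Toy data: three visual words, five videos; video 0 carries an irrelevant tag.
histograms = [[9, 1, 0], [8, 2, 0], [7, 3, 0], [0, 1, 9], [1, 0, 9]]
tags = [{"music"}, {"cat"}, {"cat", "pet"}, {"car"}, {"car", "race"}]

print(sorted(correct_tags(histograms, tags, query=0, k=2)))  # prints ['cat']
```

In this toy example the two nearest neighbors of video 0 both carry `cat`, so that tag is added, while `music` is dropped because no neighbor supports it. A real system would replace the raw histograms with the features the thesis evaluates, such as frequent patterns mined from the bags of visual words.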

Keywords

Video tag correction; Bag of visual words; Frequent pattern mining; Tag propagation; Data mining; KRIMP; SLIM; Automatic tagging

Video annotation correction; Bag of visual words; Frequent patterns; Annotation propagation; Data mining; KRIMP; SLIM; Automatic annotation

Domains

Data Structures and Algorithms [cs.DS]

Main file

These-Tran-Hoang_-_Tung-2014.pdf (7.88 MB)

Origin: Version validated by the jury (STAR)

https://theses.hal.science/tel-01623441

Submitted on: Wednesday, 25 October 2017, 12:05:08

Last modified on: Wednesday, 1 February 2023, 03:56:58

Long-term archiving on: Friday, 26 January 2018, 13:45:20

Dates and versions

tel-01623441 , version 1 (25-10-2017)

Identifiers

HAL Id : tel-01623441 , version 1

Cite

Hoang Tung Tran. Automatic tag correction in videos : an approach based on frequent pattern mining. Data Structures and Algorithms [cs.DS]. Université Jean Monnet - Saint-Etienne, 2014. English. ⟨NNT : 2014STET4028⟩. ⟨tel-01623441⟩

Collections

STAR PARISTECH

155 views

153 downloads