Thèse soutenue

Fouille de Données dans les Réseaux Sociaux et d’Information : Dynamiques et Applications

FR  |  
EN
Auteur / Autrice : Fragkiskos Malliaros
Direction : Michalis Vazirgiannis
Type : Thèse de doctorat
Discipline(s) : Informatique
Date : Soutenance en 2015
Etablissement(s) : Palaiseau, Ecole polytechnique

Mots clés

FR

Mots clés contrôlés

Résumé

FR

Networks (or graphs) have become ubiquitous as data from diverse disciplines can naturally be mapped to graph structures. The problem of extracting meaningful information from large scale graph data in an efficient and effective way has become crucial and challenging with several important applications and towards this end, graph mining and analysis methods constitute prominent tools. This dissertation contributes models, tools and observations to problems that arise in the area of mining social and information networks. We built upon computationally efficient graph mining methods in order to: (i) design models for analyzing the structure and dynamics of real-world networks towards unraveling properties that can further be used in practical applications; (ii) develop algorithmic tools for large-scale analytics on data with inherent (e. G. , social networks) or without inherent (e. G. , text) graph structure. In particular, for the former point we show how to model the engagement dynamics of large social networks and how to assess their vulnerability with respect to user departures from the network. In both cases, by unraveling the dynamics of real social networks, regularities and patterns about their structure and formation can be identified; such knowledge can further be used in various applications including churn prediction, anomaly detection and building robust social networking systems. For the latter, we examine how to identify influential users in complex networks, having direct applications to epidemic control and viral marketing and how to utilize graph mining techniques in order to enhance text analytics tasks and in particular the one of text categorization.