Data Mining and classification

From Bioinformatique théorique, Fouille de données et Optimisation stochastique
Jump to: navigation, search
Classification and Data Mining theme

The Classification and Data Mining (CDM) theme, lead by Pierre Gançarski, focuses on machine learning and knowledge extraction from complex data (eg. images, databases, etc.). Our research have two aims: on the one hand, it consists in conceiving and implementing knowledge extraction methods, and, on the other hand, to apply those methods to analyse databases and numerical images. Our approaches belongs to the machine learning area, in particular to clustering and relational data mining. Our main application domains are remote sensing or medical images, biochemical data, and also customer relationship management.

Our theme is articulated around different aspects:


Contents


Main collaborators

Former PhD students

Operations

FODOMUST: Multistrategy data mining

Our works on multistrategy data mining follow three axes:

search for the partition built from local solutions proposed by individuals that minimizes the LKM cost function. Each individual from a population is evaluated according to individuals from other populations: the better the best result that can build using its solution, the better the individual evaluation.

All aspects are implemented in a common platform Samarah (Multi-agent learning system for the automatic refinement of hierarchies).

Many of these works have been realized in collaboration with the Laboratoire Image et Ville (UMR CNRS/UDS 7011) and have been validated in the remote sensing domain. Thus, the main application domain of our methods is the clustering of remote sensing images, and generally the clustering of images. Our main application area is the automatic remote sensing classification.

In parallel, we work on the automatic structuration of sets of images and video sequences.

FODOREL: Relational data mining

Relational data mining deals with the knowledge extraction from (relational, of course) databases, and more generally with inductive learning from data that cannot naturally be represented as a single attribute-value table, e.g. chemical reactions.

Our application areas include:

Our research topics are:

FODOST: Structural data mining

Structural data mining concerns the knowledge extraction from complex data structured by spatial, semantic, or temporal dimensions. Our aim is to make use of the structure between objects to cluster.

We work from multisource, multiview, multiresolution and multitemporal data, mainly in the remote sensing domain.

FODOGECO: Data mining and knowledge management

We are concerned here with the semantic interpretation of high-resolution satellite images. To enable the identification of high-level structured objects (house, street…), it is necessary to merge the classifications the regions coming from the analysis of the images with from inferences based on a geographical ontology. This ontology needs to describe not only the urban objects, but also their spatial qualitative and quantitative relationships.

Main projects and collaborations


Former trainees

Publications in international journals

Other scientific communications

Bioinformatique théorique, Fouille de données et Optimisation stochastique
Views
Actions
BFO Team
Research themes
Research
Personal tools