High-Dimensional Incremental Divisive Clustering under Population Drift

Event type: 
HIIT seminar
Event time: 
06.06.2014 - 10:15 - 11:15
Lecturer : 
Nicos Pavlidis
Kumpula, Exactum B119

Title: High-Dimensional Incremental Divisive Clustering under Population Drift

Abstract: Clustering is a central problem in data mining and statistical pattern recognition with a long and rich history. The advent of Big Data has introduced important challenges to existing clustering methods in the form of high-dimensional, high-frequency, time-varying streams of data. Up-to-date research on Big Data clustering has been almost exclusively focused on addressing individual aspects of the problem in isolation, largely ignoring whether and how the proposed methods can be extended to address the overall problem. We will discuss an incremental divisive clustering approach for high-dimensional data that has storage requirements that are low and more importantly independent of the stream size, and can identify changes in the population distribution that require a revision of the clustering result.

Bio: Dr Nicos Pavlidis is a lecturer in the department of Management Science at Lancaster University, UK. His research interests are in the fields of statistics, data mining, and machine learning. His research focuses on the development of methods that are able to automatically handle time-varying population distributions, and can reveal temporal structure in the data. His most recent work is on applications which involve data arriving sequentially and at a high frequency relative to the available storage and processing capabilities.

04.06.2014 - 10:41 Sotiris Tasoulis
26.05.2014 - 11:52 Sotiris Tasoulis