University homepage Suomenkielinen versio puuttuu Inte på svenska In english
University of Helsinki Department of Computer Science
 

Department of Computer Science

58308109 Seminar: Predicting Structured Data (Spring 2008)

Time: periods III-IV, Thursdays 16-18, room C220
Organizers: PhD Huizhen Janey Yu , prof. Juho Rousu

BACKGROUND

Complex learning targets such as sequences, taxonomies and graphs are frequent in real-world applications, for example, hand-writing recognition (target is a sequence of letters), hierarchical classification (tree), and gene function prediction (poset), to only name a few.

The mainstream of machine learning research, in particular that of kernels methods, has been succesful in developing flexible and powerful methods for treating complex inputs. The complementary methods for complex outputs have so far received significantly less attention. The chief approachs towards complex targets has been to decompose the target (e.g. a hand-written word) prior to learning and learning each component (e.g. a character) indepedently. With this approach, dependencies between the components are not utilized.

During last five years, research in complex and structured output learning has emerged as one of the mega-trends in machine learning. In particular, methods marrying kernel methods and graphical models have received significant attention.

SEMINAR GOALS

The purpose of the seminar is to explore the recent progress in machine learning for complex and structured outputs

PREREQUISITES AND SEMINAR POSITION

The seminar is an elective advanced level seminar. It is also well-suited for post-graduate studies.

Prequisite knowledge for the semimar is basic knowledge about probabilistic modelling and machine learning. Familiarity with kernel methods and graphical models will be helpful.

COMPLETING THE SEMINAR

  • Each participant will prepare a slide presentation of approximately 45 minutes (rule of thumb: 2 minutes per slide => approx. 22-23 slides). The pdf file of the draft presentation is sent no later than Monday preceding the presentation time via email to Janey.
  • In addition, succesfull completion of the course requires active participation in the seminar
  • Grading will be pass/fail

TOPICS

Kernels for Structured Data

String kernels

Rational kernels

Tree kernels

Graph kernels

Structured Prediction Models


  • G. Bakir, T. Hofmann, Schölkopf, A. Smola, B. Taskar, S.V.N. Vishwanathan: Modeling Structure via Graphical Models. Chapter 3 in Predicting Structured Data, MIT Press, 2007
  • J. Weston, G. Bakir, O. Bousquet, T. Mann, W.S. Noble and B. Schölkopf: Joint Kernel Maps. Chapter 4 in Predicting Structured Data, MIT Press, 2007
  • B. Taskar, C. Guestrin, D. Koller. Max-Margin Markov Networks. Advances in Neural Information Processing Systems 16, 2004
  • J. Rousu, C. Saunders, S. Szedmak and J. Shawe-Taylor: Efficient algorithms for Max-Margin Structured Classification. Chapter 6 in Predicting Structured Data, MIT Press, 2007

Structured Prediction Applications

Sequence annotation

Hierarchical multilabel classification

Supervised network inference & completion


Optimization algorithms for structured prediction

  • Y. Altun, T. Hofmann, I. Tsochantaridis: Support Vector Machine Learning for Interdependent and Structured Output Spaces. Chapter 5 in Predicting Structured Data, MIT Press, 2007
  • H. Daume and D. Marcu: Learning as Search optimization. Chapter 9 in Predicting Structured Data, MIT Press, 2007
  • A. Smola, S.V.N. Vishwanathan, Quoc Le. Bundle Methods for Machine Learning. Advances in Neural Information Processing Systems 20, 2007

Generalization error analysis for structured output

  • D. McAllester. Generalization Bounds and Consistency for Structured Labeling. Chapter 11 in Predicting Structured Data, MIT Press, 2007



SCHEDULE