Application of data mining to a data analysis problem. The project covers the whole data mining process, and includes either implementing a data mining algorithm or using a wider range of available implementations. The project is completed by a research report describing and justifying the steps taken and decisions made, and discussing the results obtained. Prerequisites: The course Data Mining. The project can only be taken during the specified period. There are no final exams.
Year Semester Date Period Language In charge
2016 spring 14.03-20.05. 4-4 English Hannu Toivonen


Time Room Lecturer Date
Mon 15-16 C221 Hannu Toivonen 14.03.2016-14.03.2016

Registration for this course starts on Tuesday 16.2. at 9.00. The first lecture, on Mon 14.3. at 15-16 in C221, is mandatory for everyone!


Participants in the data mining project will either take part in at least one phase of the current KDD Cup, using the KDD Cup data, or work on a topic of their own choosing. The project is done either individually or in teams of 2-4 people. Teams for those who wish to work in one will be formed during the first meeting.

Course duration and grading

  • The project is worth 2 credits, but larger projects for extra credit can also be undertaken. If you choose to do so, ask Arto whether the topic you are considering is suitable, and keep track of the hours you spend. The 2-credit projects should be finished by the end of the 4th period, and the larger projects by the end of the 5th period.
  • The project will be graded fail / pass / 5, where 5 corresponds to excellent and pass to good.

Project timeline

  • Finding a team (or deciding to work alone)
  • Selecting a topic
  • Working on the topic to decide whether it is feasible within a few credits
  • Reporting the topic of the project to Arto by the end of 31.3.
  • Working on the topic -- individual or project guidance hours can be reserved with or requested from Arto
  • Coming to present your work on 4.5. at 16:30 at the CS Dept (Location: D234). There will be a computer with an internet connection, on which you can download and show PDFs. Each group's presentation should last around 10-15 minutes, after which there will be time for questions. Overall, the presentation should be at a level that anyone who has taken the DM course can follow.
  • Submitting a report on the project. Use as the Latex template
  • Reviewing reports from others
  • Finish