# 2020 project for the data mining course
* [link to assignment](http://www.cs.uu.nl/docs/vakken/mdm/assignment1-2020.pdf)

This project is a from-scratch implementation of a classification tree, based on
the theory we learned in the course. We were awarded a 10/10 for the coding
part and an 8.5/10 for the report. The code was said to be very readable, and we
avoided runtime bottlenecks by using numpy.

We used the concepts of impurity reduction and the Gini index to construct an
algorithm that computes the "best split" at each step. We were also required to
implement ensembling methods.
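
As a rough illustration of the "best split" idea, here is a minimal sketch for a
single numeric feature. It is not the code from this repository: the function
names `gini` and `best_split` are hypothetical, the Gini index is assumed to be
the common `1 - sum(p_k^2)` form, and candidate thresholds are assumed to be the
midpoints between consecutive distinct feature values.

```python
import numpy as np


def gini(labels):
    """Gini impurity 1 - sum_k p_k^2 of a label vector (0.0 if empty)."""
    if labels.size == 0:
        return 0.0
    _, counts = np.unique(labels, return_counts=True)
    p = counts / labels.size
    return 1.0 - np.sum(p ** 2)


def best_split(x, y):
    """Return (threshold, impurity_reduction) for one numeric feature.

    Returns (None, 0.0) if no threshold reduces the impurity, for example
    when x has fewer than two distinct values.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y)
    values = np.unique(x)  # sorted distinct feature values
    if values.size < 2:
        return None, 0.0
    # Assumption: candidate thresholds are midpoints between neighbours.
    candidates = (values[:-1] + values[1:]) / 2.0
    parent = gini(y)
    best_threshold, best_reduction = None, 0.0
    for c in candidates:
        left = y[x <= c]
        right = y[x > c]
        # Impurity of the children, weighted by their share of the samples.
        weighted = (left.size * gini(left) + right.size * gini(right)) / y.size
        reduction = parent - weighted
        if reduction > best_reduction:
            best_threshold, best_reduction = c, reduction
    return best_threshold, best_reduction
```

For example, `best_split([1, 2, 3, 4], [0, 0, 1, 1])` picks the threshold 2.5,
which separates the two classes perfectly and reduces the impurity from 0.5 to 0.
A full tree would repeat this search over every feature at every node.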