Index of /teaching/dav_20/labs/lab12/

      Name                                                                             Last modified         Size  Description 
   
up Parent Directory 25-May-2020 14:17 - directory titanic 25-May-2020 14:17 - unknown test.csv 25-May-2020 14:20 28k unknown train.csv 25-May-2020 14:20 60k

====================================================================
                            TITANIC
                            Part 2
====================================================================

Use the passenger data from Titanic shipwreck to answer question 
"what sorts of people were more likely to survive?”

You will be given: name, age, gender, socio-economic class, etc) 

====================================================================

The data has been split into two groups:

- training set (train.csv)
- test set (test.csv)

Class description:

pclass: A proxy for socio-economic status (SES)
1st = Upper, 2nd = Middle, 3rd = Lower

age: Age is fractional if less than 1. If the age is estimated, is it in the form of xx.5

sibsp: The dataset defines family relations in this way...
Sibling = brother, sister, stepbrother, stepsister
Spouse = husband, wife

parch: The dataset defines family relations in this way...
Parent = mother, father
Child = daughter, son, stepdaughter, stepson
Some children traveled only with a nanny, therefore parch=0 for them.

====================================================================

3) ML models building
a) train Nearest Neighbors
b) train Support Vector Machine (RBF & GridSearchCV)
c) train MLPClassifier (check Adam vs. lbfgs and relu vs. tanh)

In all cases make some scripts and summaries in the form of tables 
(e.g. with scores from different setups)


Again, this week no homework (yet)
Proudly Served by LiteSpeed Web Server at bioinformatics.netmark.pl Port 80