Analyzing cancer clinical trial data using an LPM

Dataset

TODO: explain PDS

Exploring data using an LPM: Mutual Information and Conditional Probability

We have access to all this data. Now what? A natural first step is to assess the relationships between variables, and one way to do this is to compute mutual information. Mutual information measures the dependence between two variables: it is zero if and only if the variables are independent, and positive otherwise, with larger values indicating stronger dependence. By plotting the mutual information between all pairs of variables, we can get a sense of how strongly different variables are related:
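To make the quantity concrete, here is a minimal sketch of how mutual information between discrete variables can be estimated directly from counts, using I(X; Y) = sum over (x, y) of p(x, y) * log(p(x, y) / (p(x) * p(y))). This is a plain plug-in estimate on the raw data and is not necessarily how the LPM computes it. The function names and the toy columns ("arm", "response", "coin") are invented for illustration and do not come from the dataset.

```python
import numpy as np


def mutual_information(x, y):
    """Plug-in estimate of I(X; Y), in nats, for two discrete sequences."""
    # Map each distinct value to an integer code.
    _, x_codes = np.unique(np.asarray(x), return_inverse=True)
    _, y_codes = np.unique(np.asarray(y), return_inverse=True)
    x_codes, y_codes = x_codes.ravel(), y_codes.ravel()

    # Empirical joint distribution from a contingency table of counts.
    joint = np.zeros((x_codes.max() + 1, y_codes.max() + 1))
    np.add.at(joint, (x_codes, y_codes), 1)
    joint /= joint.sum()

    # Marginals, and the joint we would see under independence.
    px = joint.sum(axis=1, keepdims=True)
    py = joint.sum(axis=0, keepdims=True)
    independent = px @ py

    # Sum only over cells with nonzero probability (0 * log 0 is taken as 0).
    nz = joint > 0
    return float(np.sum(joint[nz] * np.log(joint[nz] / independent[nz])))


def pairwise_mutual_information(columns):
    """Mutual information between every pair of columns in a dict of sequences."""
    names = list(columns)
    mi = np.zeros((len(names), len(names)))
    for i, a in enumerate(names):
        for j, b in enumerate(names):
            mi[i, j] = mutual_information(columns[a], columns[b])
    return names, mi


# Hypothetical toy data: "response" is fully determined by "arm",
# while "coin" is independent of both.
toy = {
    "arm": ["A", "A", "B", "B"],
    "response": ["yes", "yes", "no", "no"],
    "coin": ["h", "t", "h", "t"],
}
names, mi = pairwise_mutual_information(toy)
```

In this toy example the entry for ("arm", "coin") is zero because the two columns are independent, while the entry for ("arm", "response") is positive because one column determines the other. The resulting matrix is the kind of object a pairwise mutual information plot displays.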