A quick peek into potential correlations
host_since column, which indicates when this Airbnb host first logged in, different correlations appear.
host_is_superhost correlates more strongly. This means that hosts need quite a bit of time before actually becoming superhosts.
first_review correlates more strongly instead. This makes sense, since having a first review is the most common event among new hosts.
We are skipping the first two entries on purpose. Correlations on an ID column (host_id) are generally not useful, and a strong correlation of a variable with itself is to be expected.
Graphext evaluates all columns anyways!
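Outside of Graphext, the same filtering can be sketched with pandas. This is only an illustration under stated assumptions: the listing data is made up, Pearson correlation stands in for whatever measure Graphext computes internally, and the helper correlations_against and the column host_since_year are hypothetical names.

```python
import pandas as pd


def correlations_against(df, target, id_columns=()):
    """Rank numeric columns by absolute Pearson correlation with `target`.

    Hypothetical helper: it drops ID columns and the target itself,
    since neither correlation is informative.
    """
    numeric = df.select_dtypes(include="number")
    corr = numeric.corr()[target]
    corr = corr.drop(labels=[target, *id_columns], errors="ignore")
    # Order by strength of the relationship, keeping the sign of each value.
    return corr.reindex(corr.abs().sort_values(ascending=False).index)


# Made-up listing data, for illustration only.
listings = pd.DataFrame({
    "host_id": [101, 102, 103, 104, 105, 106],
    "host_since_year": [2010, 2012, 2014, 2016, 2018, 2020],
    "host_is_superhost": [1, 1, 1, 0, 0, 0],
    "number_of_reviews": [250, 180, 90, 60, 20, 3],
})

ranked = correlations_against(listings, "host_since_year", id_columns=["host_id"])
print(ranked)
```

The ID column and the target are removed after the correlation matrix is computed, which mirrors the idea that every column is evaluated and the uninformative entries are simply skipped when reading the result.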
month, which has the numbers 1–12 for each month, and a column season, which has the values “Summer”, “Autumn”, “Winter”, “Spring”.
We all know that Spring falls roughly in March, April and May. If we select the category “Spring”, we can see that months 3, 4 and 5 show blue spikes, whereas the rest of the months do not.
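The month and season example can be reproduced with a few lines of pandas. This is a minimal sketch on made-up data, assuming a simple Northern Hemisphere mapping from month to season; it mimics the effect of selecting “Spring” and is not a description of how Graphext computes its charts.

```python
import pandas as pd

# Assumed month-to-season rule for this sketch (Northern Hemisphere).
season_of = {
    12: "Winter", 1: "Winter", 2: "Winter",
    3: "Spring", 4: "Spring", 5: "Spring",
    6: "Summer", 7: "Summer", 8: "Summer",
    9: "Autumn", 10: "Autumn", 11: "Autumn",
}

# Made-up rows: every month appears ten times.
df = pd.DataFrame({"month": list(range(1, 13)) * 10})
df["season"] = df["month"].map(season_of)

# Distribution of month over all rows, for comparison.
overall = df["month"].value_counts().sort_index()

# Distribution of month within the "Spring" selection,
# which is what the blue spikes reflect.
selected = df.loc[df["season"] == "Spring", "month"].value_counts()
selected = selected.reindex(overall.index, fill_value=0)

spring_months = selected[selected > 0].index.tolist()
print(spring_months)  # [3, 4, 5]
```

Only months 3, 4 and 5 keep any rows once the selection is applied, so those are the values where the selected distribution departs from zero.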
month). Or, basically, the grey-ish bar behind the blue one.
month AND the value “Spring” in season.
month in the significant variables section: