Anscombe’s quartet highlights the importance of data visualization and warns against relying solely on summary statistics for data analysis:
Dowload the datasaurus datat set and set up your working environment.
datasaurus |>
group_by(dataset) |>
summarize(mean_x = mean(x),
sd_x = sd(x),
mean_y = mean(y),
sd_y = sd(y))
dataset | mean_x | sd_x | mean_y | sd_y |
---|---|---|---|---|
away | 54.26610 | 16.76983 | 47.83472 | 26.93974 |
bullseye | 54.26873 | 16.76924 | 47.83082 | 26.93573 |
circle | 54.26732 | 16.76001 | 47.83772 | 26.93004 |
dino | 54.26327 | 16.76514 | 47.83225 | 26.93540 |
dots | 54.26030 | 16.76774 | 47.83983 | 26.93019 |
h_lines | 54.26144 | 16.76590 | 47.83025 | 26.93988 |
high_lines | 54.26881 | 16.76670 | 47.83545 | 26.94000 |
slant_down | 54.26785 | 16.76676 | 47.83590 | 26.93610 |
slant_up | 54.26588 | 16.76885 | 47.83150 | 26.93861 |
star | 54.26734 | 16.76896 | 47.83955 | 26.93027 |
v_lines | 54.26993 | 16.76996 | 47.83699 | 26.93768 |
wide_lines | 54.26692 | 16.77000 | 47.83160 | 26.93790 |
x_shape | 54.26015 | 16.76996 | 47.83972 | 26.93000 |
Let’s start with some definitions regarding color. “Color” refers to three components:
Try to find the type B
data point in the three plots in the next slides
Try to find the type B
Try to find the type B
Try to find the type B
type B
data point should have been easier to find in the first and last plots, and harder in the middle plot.see the two plots on the next slides
Other encoding channels that are even less easily processed than shape are:
Visual elements are perceived as being related to each other when these elements are close to each other – i.e., Proximity principle:
Visual elements are perceived as being related to each other when these elements are similar in shape, color and size – i.e., Similarity principle:
Visual elements are perceived as being related to each other when these elements are similar in shape, color and size – i.e., Similarity principle:
Visual elements are perceived as being related to each other when these elements are visually tied – i.e., Connection principle:
Visual elements are perceived as being related to each other when these elements are perceived as being part paths, lines, curves, even when some elements are “hidden” – i.e., Continuity principle: