What are the three channels of color in visualization?

The three channels are luminance, saturation, and hue.

How many discriminable bins are recommended for categorical color?

Only between 6 and 12 bins are recommended.

Why should one avoid using rainbow color maps for ordered data?

Rainbows lack a perceptual intrinsic ordering and can cause nonlinear visual interpretation.

What is a useful rule of thumb for using categorical color in visualization?

Use only between 6 and 12 bins, considering the need for background and highlight colors.

How are sequential and diverging color maps different?

Sequential maps go from min to max, while diverging maps emphasize a semantically meaningful midpoint.

How does size affect color perception in visualization?

Smaller regions require highly saturated colors to be noticeable.

Why is transparency difficult to use for encoding data?

Transparency is not separable from luminance and saturation.

What should be remembered when using bivariate color maps?

Bivariate color maps with multiple levels in each direction are difficult to interpret.

What are some key considerations when designing color palettes?

Consider if the data is segmented or continuous, and choose sequential, diverging, or cyclic properties.

What tool is recommended for generating color palettes?

Colorbrewer is recommended for generating informed color palettes.

Color (Ch 10) I, Visualization Analysis & Design, 2021

00:18:55

https://www.youtube.com/watch?v=QNDd_hvdORw

Zusammenfassung

TLDRIn this discussion, the focus is on the effective use of color in visualization. The video emphasizes decomposing color into three separate channels: luminance (brightness), saturation (colorful intensity), and hue (the actual color, such as red or blue). These channels can be used to effectively encode data based on whether it is categorical or ordered. Key points include the limitation of using a small number of discriminable color bins (6 to 12) and avoiding rainbow color maps for ordered data due to their non-linear perceptual interpretation. The video also stresses the complexity of using bivariate color maps and encourages utilizing other visual channels alongside color. When designing color palettes, one must consider whether the data is segmented or continuous, and choose appropriate sequential, diverging, or cyclic properties. Moreover, tools like Colorbrewer can aid in creating effective color palettes based on these principles. Lastly, the intricacies of using transparency as a channel, often confused with luminance and saturation, are discussed, advocating for careful consideration when employing color in visual data representation.

Mitbringsel

🎨 Decompose color into luminance, saturation, and hue for effective visualization.
🔍 Human perception prefers relative color comparisons, especially when colors are contiguous.
📊 Limit categorically colored bins to 6-12 for effective differentiation.
🌈 Avoid rainbow color maps for ordered data due to perceptual confusion.
🌀 Use cyclic color maps to emphasize the cyclical nature of data.
🚫 Be cautious using transparency as it intertwines with luminance and saturation.
📐 Consider using other visual channels along with color for better data encoding.
🎯 For small regions, use highly saturated colors to maintain visibility.
🔧 Tools like Colorbrewer assist in generating well-designed color palettes.
🖼️ Choose between sequential, diverging, or cyclic palettes based on data properties.

Zeitleiste

00:00:00 - 00:05:00
The segment begins with an exploration of color as a visual encoding channel, expanding beyond spatial arrangements. Color is broken down into three channels: luminance (brightness), saturation (colorfulness), and hue (identity channel). These channels serve different purposes, such as ordered and categorical attributes. Focusing on color channels, the importance of understanding and utilizing them correctly is highlighted, especially in visualization contexts where the number of discriminable bins is a consideration. Examples illustrate how misuse of color can lead to confusion, particularly with non-contiguous, small regions.
00:05:00 - 00:10:00
The discussion shifts to the proper use of categorical and ordered colors, emphasizing the limited discriminable bins usually available (6-12 bins). It advises on using additional visual channels and deliberate binning for categorical colors. The segment also critiques the common misuse of rainbow color ordering due to perceived ordering inconsistencies, explaining how luminance-ordered sequential colors like Viridis and magma are preferable for sequential data and how matching visual encoding to dataset characteristics is critical. It touches on issues like the intrinsic perceptual ordering of colors and the impact of size on color salience.
00:10:00 - 00:18:55
The final segment delves into color palette design and its considerations, such as categorical, sequential, and diverging color maps. It explains how sequential maps go from min to max values, while diverging maps emphasize a midpoint, often using neutral colors. It also covers the complexity of designing bivariate color maps for two attributes, especially when both possess multiple levels. The segment wraps with a focus on ensuring color palettes are perceptually linear, appropriately ordered by luminance, and colorblind safe, providing examples and caveats in the use of such color mappings.

Mind Map

Video-Fragen und Antworten

What are the three channels of color in visualization?
The three channels are luminance, saturation, and hue.
How many discriminable bins are recommended for categorical color?
Only between 6 and 12 bins are recommended.
Why should one avoid using rainbow color maps for ordered data?
Rainbows lack a perceptual intrinsic ordering and can cause nonlinear visual interpretation.
What is a useful rule of thumb for using categorical color in visualization?
Use only between 6 and 12 bins, considering the need for background and highlight colors.
How are sequential and diverging color maps different?
Sequential maps go from min to max, while diverging maps emphasize a semantically meaningful midpoint.
How does size affect color perception in visualization?
Smaller regions require highly saturated colors to be noticeable.
Why is transparency difficult to use for encoding data?
Transparency is not separable from luminance and saturation.
What should be remembered when using bivariate color maps?
Bivariate color maps with multiple levels in each direction are difficult to interpret.
What are some key considerations when designing color palettes?
Consider if the data is segmented or continuous, and choose sequential, diverging, or cyclic properties.
What tool is recommended for generating color palettes?
Colorbrewer is recommended for generating informed color palettes.

Weitere Video-Zusammenfassungen anzeigen

Erhalten Sie sofortigen Zugang zu kostenlosen YouTube-Videozusammenfassungen, die von AI unterstützt werden!

Untertitel

Automatisches Blättern:

00:00:01
Let's continue talking about Visualization Analysis and Design by diving into color.
00:00:07
So, when we think about ways of visually encoding so far, we've been pretty focused on spatial
00:00:15
arrangements of attributes. Now let's switch over and go beyond just spatial arrangement,
00:00:21
and think about mapping to some of the other visual channels, particularly color.
00:00:28
So what is going on with color? When I was talking about marks and channels before,
00:00:32
there were these three different things that had color in them. Let's think about that.
00:00:38
So, we need to decompose color, because the first rule of color is: don't talk about color.
00:00:44
Specifically don't just talk about color. It's extremely confusing
00:00:48
if we treat that as monolithic. What we really need to do is decompose this into
00:00:54
three channels that have different characteristics.
00:00:57
So two of these are magnitude channels that are good for ordered attributes. The
00:01:03
luminance channel is how bright something is - think of that as grayscale black and white.
00:01:08
And then the saturation channel is how colorful something is. So,
00:01:13
how much pink is there in between gray and light pink and fully saturated pink.
00:01:19
And then the third channel is an identity channel. And that's good for showing ordered
00:01:24
attributes. So color, specifically hue, is what we usually think of as color.
00:01:30
And so we can ask questions like what color is that? Is it red? Is it blue? Is it green?
00:01:35
So, we've got that identity versus magnitude, and that is what helps us think about color.
00:01:42
These channels have different properties. So what do they convey directly to the perceptual system?
00:01:47
How much information can they convey in terms of how many
00:01:51
discriminable bins can we use? And so we'll dive into that.
00:01:56
So, let's talk about color channels in visualization.
00:02:01
So, the question of whether we think of color as categorical
00:02:05
or ordered is going to depend. We can have it either way. Right,
00:02:09
in our upper left corner we're really focusing on the fact that there are four different years here,
00:02:14
and those different changes of hue for 2010 versus 2013 versus 2011 distinguishes them as categories.
00:02:22
And in contrast, in the lower left, we're much more emphasizing an order of these years,
00:02:28
as we go from light green, to darker green, to the darkest green there is from 2010 through to 2013.
00:02:34
And we're doing something similar on the upper right, where we've got a choropleth
00:02:39
map, where we're using color saturation within geographic regions to color code,
00:02:44
and then we're seeing the relationship between that and this bar chart.
00:02:48
And we obviously have redundantly encoded
00:02:51
the length of the bar, and the amount of green in that bar, the saturation.
00:02:57
So we can see examples that color really can be either one of these.
00:03:04
Now when we're thinking about categorical color, the thing that a lot of people unfortunately get
00:03:09
wrong is, they want to encode more levels of a categorical attribute
00:03:14
than there are available discriminable bins for categorical color.
00:03:20
So remember that we talked a lot when we were introducing marks and channels,
00:03:23
about the idea that human perception is completely built on relative comparisons.
00:03:29
So that's fantastic if there's colored blocks that are next to each other.
00:03:33
If they're contiguous, we have very high precision ability to discriminate.
00:03:38
Here we've got a 21 chromosomes on a mouse and we're color coding.
00:03:43
And even really subtle distinctions between greens, like in chromosomes six through
00:03:48
nine, we're able to tell the difference when they're right next to each other.
00:03:52
But, and here's something that's gone wrong, when they now try to map those
00:03:59
into small and scattered regions - right here what they've done is they've said okay
00:04:06
for a mouse chromosome where does it fall in the human chromosome - let's see how
00:04:10
we're doing when we have these absolute comparisons of small scattered regions.
00:04:14
Well, we've got tan, and green, brown, white, red, blue, now we're at the top of chromosome 2. Okay,
00:04:22
light blue, purple, wait is that yellow, is that the same tan we saw before ...
00:04:28
see a green ,is that the same green as before, maybe there's another one.
00:04:34
So, I'm starting to, I really haven't even run out of fingers but I'm starting
00:04:38
to lose the ability to discriminate these subtle differences in color because they're separated.
00:04:45
And so it turns out that if you've got non-contiguous small regions of color,
00:04:50
it's almost always fewer bins than people wish that they had available to them.
00:04:54
So the rule of thumb is only between 6 and 12 bins can you really count on.
00:04:59
Remembering that you need a color for the background, maybe you need a default color
00:05:03
for something, you might need a highlight color. So in general, it's a lot fewer bins than people
00:05:10
actually would like to use. So what can you do?
00:05:14
Well one thing you could remember is, you don't only have color. You have other visual channels.
00:05:20
If you do want to encode with categorical color, deliberately yourself, bin things
00:05:27
and then map those into color. Right now what's happening is, chromosomes
00:05:31
say 5 through 11, are all getting roughly mapped into the same bin by our eyes. It might be better
00:05:38
if we could do a more semantic meaningful binning rather than just having the luck of the draw in
00:05:43
terms of which things get binned involuntarily by us not being able to notice the difference.
00:05:52
Just soIi don't only pick on the genomics folks, here's another example
00:05:57
of people using more discriminable bins than we can really see. Notice how that light blue for
00:06:03
cancer is pretty hard to tell apart from the light blue for ear, nose, and throat.
00:06:09
So we really do have a limit as to the number of categories. Think carefully yourself
00:06:15
about re: transforming your data, deriving a new
00:06:20
categorical attribute with fewer bins if you insist on using categorical color.
00:06:26
This is also an issue for ordered color. How many bins can we discriminate? Often not quite as many
00:06:32
as people might like. This is a nice example from Gregor Aisch, showing how if we use the same basic
00:06:38
idea of a choropleth map, of color coding onto geographic regions, well, what do we notice?
00:06:44
When we just look at those legends across the top it's always quite clear what the
00:06:48
differences are between neighboring bins. But then, when we look at the scattering of
00:06:52
U.S. states - well great for two classes; we can definitely tell those apart. For three classes,
00:06:58
no problem at all. For four classes, still pretty good.
00:07:02
Now once we even start getting to five classes, it might be a little bit tricky to distinguish.
00:07:08
Is Maine the same color as Utah?Probably? But now once we start getting to six, seven,
00:07:15
and eight classes, it gets very difficult to tell apart when things are separate from each other.
00:07:22
Understanding if West Virginia and Arizona are the same color with the eight class one;
00:07:26
it's very difficult. So, remember that we have a
00:07:30
limited number of discriminable bins if things are separated as opposed to right next to each other.
00:07:38
The other thing we have to remember with ordered data is,
00:07:41
everyone wants to use rainbows, but they're really a poor default.
00:07:45
Now why is that? People get a little confused, I think, about the physics of reality, which is yes,
00:07:51
sunlight going through a prism will scatter into a rainbow. That's a physical fact.
00:07:56
But the biology of our eyes is not actually necessarily using
00:08:01
that information in the way people hope. It's not intrinsic to have the ordering
00:08:06
of the rainbow in terms of our perception. If I lock two people into separate rooms with
00:08:10
no way to communicate, and I give them four color chips of purple, and red, and orange,
00:08:16
and blue, are they going to put them in the same order if I ask them to do that? Probably
00:08:21
not. I sure wouldn't bet a thousand dollars. But if I had people in rooms and I said, "Okay,
00:08:27
what's the order in terms of if I got like a light gray, and a white, and a black, and a dark gray."
00:08:32
Would they put them in the same order? Surely yes. So remember we want this perceptual intrinsic
00:08:38
ordering, as opposed to anything that you have to learn. People can learn, but we want
00:08:43
with the perceptual stuff to make sure that it's something intrinsic to the human visual system.
00:08:48
But that's not the only problem - is the lack of intrinsic ordering - there's also a non-linearity.
00:08:54
I'm going to say here's two regions of that spectrum and notice how within the one on
00:09:00
the left I can go from red to orange to yellow. I can clearly see at least three different colors.
00:09:06
And in a region of the same size i see green, green, green, on the right.
00:09:10
Because of the way that the human eye responds to the visual spectrum it is not a linear situation.
00:09:19
But, what are some of the benefits of rainbows? It's not that they're all bad;
00:09:23
it's that we can actually have fine-grained structure be visible and enable. And in
00:09:28
this picture we can talk about the red parts versus the yellow parts versus the green parts.
00:09:32
That makes us focus on the fine grain. Now in contrast with the one on the bottom,
00:09:39
it can actually be difficult to tell what's going on. We'll come back to that one in a minute.
00:09:45
So let's compare a different color map for that same data set on the top.
00:09:51
And when we just have two hues going from purple through gray
00:09:55
into yellow we're seeing a really different sense of this dataset.
00:10:00
We are much more able to focus on large scale structure. So, it's harder to focus
00:10:05
on fine grain but it's easier to focus on large scale. That is then a choice of the designer.
00:10:12
Now here we see an example where the mystery is cleared up;
00:10:15
it's actually the coastline of Florida. And there's something else going on here
00:10:19
there's a very carefully chosen change of hue at that zero point, to really distinguish
00:10:25
the blues getting darker as we go down into the depths, and then the heights of the mountains.
00:10:31
Something careful was done here. The luminance is increasing, so we do have multiple colors - but
00:10:38
they're ordered by the luminance. So we're going from dark up to bright.
00:10:43
So that's actually a general principle of color map design. There's some nice ones.
00:10:48
Viridis and magma are great for sequential color, carefully designed and now deployed in many tools.
00:10:54
And in these, the luminance is monotonically increasing. It's the hues are ordered by
00:11:01
luminance. They have other nice properties. They're perceptually uniform. They do have
00:11:05
multiple colors in them. And as we'll talk about later they are colorblind safe.
00:11:12
So, these are both some useful color maps for sequential data.
00:11:17
Let me mention one thing: rainbows are not always bad if you have categorical attributes,
00:11:24
then you do want very, very bright colors
00:11:27
because the ability to perceive small colors is dependent on size.
00:11:31
So if you have little bits of color scattered around, we want saturated color. And of course the
00:11:37
most saturated colors we can get are those rainbow colors. So if we have a segmented color map,
00:11:43
where we're doing something categorical rather than a continuous one, then in fact rainbow colors
00:11:49
can be very good choices. So, it all depends on matching
00:11:53
the characteristics of your visual encoding to the characteristics of your data set, as always.
00:12:02
So I alluded to this idea that maybe there's some interaction between channels. We really
00:12:07
have to be careful about this with color. Color does not have separable characteristics.
00:12:13
In particular as I mentioned, size heavily affects the salience of color.
00:12:19
So and in particular, if you have small regions you need highly saturated
00:12:25
colors to be able to even notice them. That's why we have highly saturated red maps
00:12:31
on top of these larger background swaths of pale, more unsaturated, colors in the map on the left.
00:12:38
Now if you have large regions you typically would want those to be low saturation.
00:12:42
Notice how these big, big areas of highly saturated color really jump out at you and
00:12:47
they're quite jarring so we want to be aware of the size as we think about the saturation to use.
00:12:54
That can be tricky in a context where you don't know ahead of time how large
00:12:58
a region will be if you've got, of course, a data-driven situation. So there's nuance there.
00:13:06
Some key things to know about saturation and luminance. The most crucial is, they are
00:13:10
not separable from each other. If you encode with one, you cannot usefully encode with the other.
00:13:17
And moreover, they're not separable from transparency either. So these
00:13:21
three channels of saturation, and luminance, and transparency,
00:13:24
are all, you have to use only one of them. They they very much are not separable.
00:13:31
Here's just an example with the same tool. And this tool, by the way, is called colorbrewer. It's
00:13:36
a great tool for generating some color palettes that's quite informed by visualization design
00:13:43
principles. So notice how what I've done on the left here, is I've got fully saturated colors
00:13:53
that I've specified in the middle there, but
00:13:57
the transparency is reasonably high. But because I've got some transparency
00:14:04
now, these look unsaturated. And in contrast, when I have the
00:14:08
unsaturated colors on the right, and then I have the transparency all the way,
00:14:13
notice how I can't tell the difference. So I'm unable to tell the difference
00:14:17
of transparency versus saturation. So, key principle; definitely keep that in mind.
00:14:23
We're typically going to use transparency often in the design of making visual layers
00:14:29
as we will talk about later, but we don't usually try to explicitly encode
00:14:35
with transparency, because it is so hard to separate out from luminance and saturation.
00:14:44
In general, if you're going to have small separated regions, the safe thing to do is
00:14:47
to have only two bins. And, of course, remember to only use one of either saturation or luminance or
00:14:53
transparency. Absolute max for things like saturation coding for separated regions,
00:14:59
would be something like three to four bins. So for contiguous regions, you can have many
00:15:06
bins if you're going to be making distinctions of things right next to each other. But,
00:15:10
again, be careful not to try to use these three channels simultaneously.
00:15:18
Let's talk about color palettes. Now we've already started down this, with thinking
00:15:23
about things like categorical colors. These are all color. We're talking
00:15:29
right now about palettes for a single attribute univariate color palettes.
00:15:33
So we really want maximum distinguishability. That sometimes means we go for as saturated as we can,
00:15:39
although you could also have less saturated ones. Sometimes these are called qualitative or nominal,
00:15:45
particularly in the geographic literature if you're using tools built after that.
00:15:51
So if you have ordered data then we really distinguish between sequential and diverging.
00:15:57
So sequential is when you're going from min to max, and diverging is when there's some
00:16:01
semantically meaningful midpoint. Typically we're going to use a neutral color for that,
00:16:05
like white or yellow or gray, and then we'll pick a saturated color for each of these end points.
00:16:11
And we see a bunch of examples of these diverging palettes - some
00:16:14
with yellow, some with white midpoints. Now if you're doing something sequential,
00:16:19
then we've got this ramp going from a minimum to a maximum value,
00:16:23
where typically it's unsaturated on one side and then fully saturated or
00:16:28
fully bright versus dark, depending on whether we're thinking about this as
00:16:32
luminance or saturation. But those are quite hard to tell apart we want to pick just one typically.
00:16:38
So here's a few examples of where we're doing sequential color maps, and we have multiple hues,
00:16:45
and so we are being careful to order them by luminance as we talked about.
00:16:51
Finally there's one more property we can think about. Remember that you can
00:16:55
have ordered data that's cyclic, and you can emphasize the cyclic nature of that by having
00:17:00
many hues. So multi-hue maps are useful for trying to show the cyclic nature of the data.
00:17:09
So the design considerations for a color palette for a single attribute are, first of all,
00:17:15
are we looking at segmented discrete little boxes, or something that's continuous?
00:17:21
Are we looking at diverging or sequential or cyclic as the properties of the data set we want
00:17:26
to emphasize? Do we have one hue or two hues or multiple hues? Is it perceptually linear?
00:17:33
And have we ordered the hues by luminance? And as we'll get to later, is it colorblind safe?
00:17:42
Well, we had been talking about color for one attribute. What if we want to encode two
00:17:46
different attributes with color? These are what's called bivariate color maps.
00:17:52
Now there is a straightforward case where
00:17:55
we have two different attributes, but one of those attributes is binary. It's just on or off.
00:18:01
So here's an example of that where essentially we've got a variation in hue, and then we have the
00:18:09
low saturation and the high saturation version of that. So what's getting encoded with saturation
00:18:14
is just binary on or off. People are reasonably able to deal with those.
00:18:21
But in the cases where you actually have multiple levels in each of two different attributes,
00:18:28
it gets very tricky people do use these sometimes, but there's a fair amount of empirical evidence
00:18:34
that people find them difficult to interpret. Not impossible, but certainly difficult.
00:18:41
So be aware that it is a much more tricky design problem to have bivariate color maps with multiple
00:18:49
levels in each of the directions as opposed to just binary in one of those directions.