Solved – Alternative visualizations to 3D bar chart

data visualizationexcelggplot2

I have a dataset that consists of a numerical variable (height, y-axis). Each data point is replicated for an individual (1,2,3) in each treatment (A,B,C,D). Here is a terrible figure that I am looking to replace:

What other creative ways could I show this data? I have been playing around with facets in ggplot2, but couldn't get a layout I like. I am open to any suggestions. I would also like to add some error bars in there at some point, as the data here are means. Help make my data sexy!

Here is the data:

help_3D <- structure(list("one"=c(10,9,8,7), "two"=c(8,7,6,5),   "three"=c(8.9,8.7,8.5,8.4), treatment=c("A", "B", "C", "D")), .Name = c("one", "two", "three", "treatment"), row.names=c(NA, 15L),  class="data.frame")

Best Answer

One candidate is the dot chart ably and energetically promoted by W.S. Cleveland. Here's a Stata implementation:

Key points include

There is no absolute reason for lines to start at zero. Here it seems natural; in other cases it can seem superfluous.
Solid markers here draw attention to magnitudes. Whenever points might occlude or obscure each other, open markers may be better.
It's arbitrary which one categorical control nests inside another. Here treatments A B C D occur on the inside, which was found to show a simpler pattern. Another design has all treatments on the same line.

For other ideas and examples, see

Graph for relationship between two ordinal variables

Chart for visualizing multi-dimensional data

How to add a third variable to a bar plot?

Is there a better way than side-by-side barplots to compare binned data from different series

How to best visualize differences in many proportions across three groups?

In this case, there is a small functional difference between this display and similar bar charts, whether vertical or horizontal. The advantages of dot charts are more striking when each line contains two or more "dots" (more generally, markers or point symbols). Some of these threads above are especially pertinent here.

Note: Implemented in Stata with code

graph dot (asis) y, over(treatment) over(x) scheme(s1color) linetype(line) lines(lc(gs12) lw(vthin))

EDIT: Regardless of whether these are real data, a further possibility is just to shuffle the individuals 1, 2, 3. Unless you tell us otherwise, their identifiers are arbitrary; in terms of their response patterns 3 might be better placed between 1 and 2.

Related Solutions

Solved – Alternative graphics to “handle bar” plots

Thanks for all you answers. For completeness I thought I should include what I usually do. I tend to do a combination of the suggestions given: dots, boxplots (when n is large), and se (or sd) ranges.

(Removed by moderator because the site hosting the image no longer appears to work correctly.)

From the dot plot, it is clear that data is far more spread out the "handle bar" plots suggest. In fact, there is a negative value in A3!

I've made this answer a CW so I don't gain rep

Solved – Calculating standard error and attaching an error bar on ggplot2 bar chart

Here's an example from the ggplot2 homepage: https://ggplot2.tidyverse.org/reference/geom_errorbarh.html as others have mentioned in the comments, you have to calculate SE on your own and append this information to the data.frame

df <- data.frame( 
  trt = factor(c(1, 1, 2, 2)), 
  resp = c(1, 5, 3, 4), 
  group = factor(c(1, 2, 1, 2)), 
  se = c(0.1, 0.3, 0.3, 0.2) 
 ) 
df2 <- df[c(1,3),]
p <- ggplot(df, aes(fill=group, y=resp, x=trt)) 
p + geom_bar(position="dodge", stat="identity") 
dodge <- position_dodge(width=0.9) 
p + geom_bar(position=dodge, stat="identity") + geom_errorbar(aes(ymax = resp + se, ymin=resp - se), position=dodge, width=0.25)

As a pointer, SE @ 95% CI usually looks something like this:

df$se <- 1.96*(sd(your_data, na.rm=T)/sqrt(your_n))

Your upper and lower CI bounds will just be df$se +/- the response (as shown in the aes() for geom_errorbar(), above)

Best Answer

Related Solutions

Solved – Alternative graphics to “handle bar” plots

Solved – Calculating standard error and attaching an error bar on ggplot2 bar chart

Related Question