Returns a heatmap of the X data alongside the response values y. Here, the fill colors in the heatmap correspond to the values in X. If the response vector y is categorical, the rows/samples in X are grouped into their respective y categories. If the response vector is continuous, a scatter plot of the y values are plotted to the right of the heatmap. Furthermore, the rows and columns of X can be clustered via hierarchical clustering if desired.

plot_data_heatmap(
  X,
  y,
  subsample_rows = 1,
  subsample_cols = 1,
  clust_rows = TRUE,
  clust_cols = TRUE,
  ...
)

Arguments

X

Data matrix or data frame to use for heatmap.

y

A response vector.

subsample_rows

Proportion of rows to subsample and keep in plotting display. Default is 1, which does not perform any subsampling.

subsample_cols

Proportion of columns to subsample and keep in plotting display. Default is 1, which does not perform any subsampling.

clust_rows

Logical indicating whether or not to cluster the rows in X via hierarchical clustering.

clust_cols

Logical indicating whether or not to cluster the columns in X via hierarchical clustering.

...

Additional arguments to pass to plot_hclust_heatmap()

Value

A gglot object. Specifically, if y is a continuous response vector, then this function returns a `patchwork` object with two ggplots: one for the heatmap of X and another for the scatter plot of y values. If y is categorical, this function returns a single ggplot object with the heatmap of X with samples grouped by each category in y.