Learning Deep Low-Dimensional Models from High-Dimensional Data: From Theory to Practice

CVPR 2024 Tutorial

Date: Tuesday, June 18 (full day tutorial)

Location: Room 3 (Summit 442)

Overview

Over the past decade, the advent of machine learning and large-scale computing has immeasurably changed the ways we process, interpret, and predict with data in imaging and computer vision. The “traditional” approach to algorithm design, based around parametric models for specific structures of signals and measurements—say sparse and low-rank models—and the associated optimization toolkit, is now significantly enriched with data-driven learning-based techniques, where large-scale networks are pre-trained and then adapted to a variety of specific tasks. Nevertheless, the successes of both modern data-driven and classic model-based paradigms rely crucially on correctly identifying the low-dimensional structures present in real-world data, to the extent that we see the roles of learning and compression of data processing algorithms—whether explicit or implicit, as with deep networks—as inextricably linked.

As such, this tutorial provides a timely tutorial that uniquely bridges low-dimensional models with deep learning in imaging and vision. This tutorial will show how:

Low-dimensional models and principles provide a valuable lens for formulating problems and understanding the behavior of modern deep models in imaging and computer vision; and how
Ideas from low-dimensional models can provide valuable guidance for designing new parameter efficient, robust, and interpretable deep learning models for computer vision problems in practice.

We will begin by introducing fundamental low-dimensional models (e.g., basic sparse and low-rank models) with motivating computer vision applications. Based on these developments, we will discuss strong conceptual, algorithmic, and theoretical connections between low-dimensional structures and deep models, providing new perspectives to understand state-of-the-art deep models in terms of learned representations, generalizability, and transferability. Finally, we will demonstrate that these connections can lead to new principles for designing deep networks learning low-dimensional structures in computer vision, with both clear interpretability and practical benefits. We will conclude with a panel discussion with expert researchers from academia and industry on what role low-dimensional models can and should play in our current age of opaque large language models and foundation models for computer vision.

Speakers

Panelists

Schedule

The tutorial will take place on Tuesday, June 18th.

Lecture	Speaker	Time (PT)
Session 1: Understanding Low-Dimensional Representations & Learning in Deep Networks
Lecture 1-1: Introduction to Basic Low-Dimensional Models (Lecture Abstract) The first part will introduce fundamental properties and results for sensing, processing, analyzing, and learning low-dimensional structures from high-dimensional data. We will first discuss classical low-dimensional models, such as sparse coding and low-rank matrix sensing, and motivate these models by applications in computer vision. Based on convex relaxation, we will characterize the conditions, in terms of sample/data complexity, under which the learning problems of recovering such low-dimensional structures become tractable and can be solved efficiently, with guaranteed correctness or accuracy.	Yi Ma	9:00-10:00
Lecture 1-2: Understanding Low-Dimensional Representation via Neural Collapse (Lecture Abstract) Continuing, we focus on the strong conceptual connections between low-dimensional structures and deep models in terms of learned representation. We start with the introduction of an intriguing Neural Collapse phenomenon in the last-layer representation and its universality in deep network, and lays out the mathematical foundations of understanding its cause by studying its optimization landscapes. We then generalize and explain this phenomenon and its implications under data imbalancedness. Furthermore, we demonstrate the practical algorithmic implications of Neural Collapse on training deep neural networks.	Zhihui Zhu	10:00-11:00
Lecture 1-3: Invariant Low-Dimensional Subspaces of Learning Dynamics (Lecture Abstract) We show that low-dimensional structures also emerge in training dynamics of deep networks. Specifically, we show that the evolution of gradient descent only affects a minimal portion of singular vector spaces across all weight matrices. The analysis enables us to considerably improve training efficiency by taking advantage of the low-dimensional structure in learning dynamics. We can construct smaller, equivalent deep linear networks without sacrificing the benefits associated with the wider counterparts. Moreover, it allows us to better understand deep representation learning by elucidating the progressive feature compression and discrimination from shallow to deep layers.	Qing Qu	11:15-12:15
Session 2: Designing Deep Networks for Pursuing Low-Dimensional Structures
Lecture 2-1: Low-Dimensional Representation Learning for High-Dimensional Data via the Principle of Compression (Lecture Abstract) In this part, we will discuss the overall objective of high-dimensional data analysis, that is, learning and transforming the data distribution towards low-dimensional template distributions for downstream tasks (such as linear discriminative representations, expressive mixtures of semantically-meaningful incoherent subspaces), and how it connects to deep representation learning. In particular, we will introduce the principle of maximal coding rate reduction (MCR^2) for learning compact and structured representations. We will discuss how to use MCR^2 as a simple and principled objective to measure the goodness of learned representations. Furthermore, we will discuss the theoretical analysis of the coding reduction objective, including global optima properties and the optimization landscape.	Yi Ma	13:30-14:30
Lecture 2-2: White-Box Architecture Design via Unrolled Optimization and Compression (Lecture Abstract) In this part, we will focus on how to construct white-box deep neural network architectures using unrolled optimization. We will review classical methods such as sparse coding through dictionary learning as particular instantiations of this learning paradigm when the underlying signal model is linear or sparse, along with unrolled optimization as a design principle for white-box deep networks (e.g., LISTA network) that are interpretable ab initio. Next, we will discuss how to construct a white-box deep neural network architecture from the principle of data compression. We will show that the basic iterative gradient ascent scheme for optimizing the rate reduction objective leads to a multi-layer deep network, named ReduNet, which shares common characteristics of modern deep networks such as CNN and ResNet.	Yaodong Yu (Yuqian Zhang conflict)	14:45-15:45
Lecture 2-3: White-Box Transformers via Sparse Rate Reduction (Lecture Abstract) We demonstrate how combining sparse coding and rate reduction yields sparse linear discriminative representations using an objective called sparse rate reduction. We develop CRATE, a deep network architecture, by unrolling the optimization of this objective and parameterizing feature distribution in each layer. CRATE's operators are mathematically interpretable, with each layer representing an optimization step, making the network a transparent "white box". Remarkably, CRATE closely resembles the transformer architecture, suggesting that the interpretability gained from such networks might also improve our understanding of current, practical deep architectures. Experiments show that CRATE, despite its simplicity, indeed learns to compress and sparsify representations of large-scale real-world image and text datasets. It achieves performance very close to highly engineered transformer-based models. We will also discuss recent results on scaling up such white-box transformers as well as more efficient white-box architecture designs.	Sam Buchanan	15:45-16:45
Session 3: Panel Discussion		17:00-18:00

Materials

Slides for the tutorial are available at this Dropbox link.

Learning Deep Low-Dimensional Models from High-Dimensional Data: From Theory to Practice

CVPR 2024 Tutorial

Date: Tuesday, June 18 (full day tutorial)

Location: Room 3 (Summit 442)

Overview

Speakers

Sam Buchanan

Yi Ma

Qing Qu

Yaodong Yu

Yuqian Zhang

Zhihui Zhu

Panelists

Beidi Chen

Wuyang Chen

Mojan Javaheripi

Liyue Shen

Atlas Wang

Schedule

Materials